Enum rustybam::cli::Commands

source · [−]

pub enum Commands {
Show 14 variants    Stats {
        bam: String,
        qbed: bool,
        paf: bool,
    },
    BedLength {
        bed: String,
        readable: bool,
        column: Option<u8>,
    },
    Filter {
        paf: String,
        paired_len: u64,
        aln: u64,
        query: u64,
    },
    Invert {
        paf: String,
    },
    Liftover {
        paf: String,
        bed: String,
        qbed: bool,
        largest: bool,
    },
    TrimPaf {
        paf: String,
        match_score: i32,
        diff_score: i32,
        indel_score: i32,
    },
    Orient {
        paf: String,
        scaffold: bool,
        insert: u64,
    },
    BreakPaf {
        paf: String,
        max_size: u32,
    },
    PafToSam {
        paf: String,
    },
    FastxSplit {
        fastx: Vec<String>,
    },
    GetFasta {
        fasta: String,
        bed: String,
        strand: bool,
        name: bool,
    },
    Nucfreq {
        bam: String,
        region: Option<String>,
        bed: Option<String>,
        small: bool,
    },
    Repeat {
        fasta: String,
        min: usize,
    },
    Suns {
        fasta: String,
        kmer_size: usize,
        max_size: usize,
        validate: bool,
    },
}

Expand description

This structure contains all the subcommands for rustybam and their help descriptions.

Because of naming conventions for rust enums the commands names have different capitalization than on the command line. For example, the Liftover enum is invoked using rustybam liftover and the TrimPaf command with rustybam trim-paf.

Variants

`Stats`

Fields

bam: String

sam/bam/cram/file

qbed: bool

print query coordinates first

paf: bool

The input is paf format (must have cg tag with extended cigar or cs tag).

Get percent identity stats from a sam/bam/cram or PAF.

Requires =/X operations in the CIGAR string!

`BedLength`

Fields

bed: String

a bed file

readable: bool

make the output human readable (Mbp)

column: Option<u8>

count bases for each category in this column .

Count the number of bases in a bed file.

`Filter`

Fields

paf: String

PAF file from minimap2 or unimap. Must have the cg tag, and n matches will be zero unless the cigar uses =X.

paired_len: u64

minimum number of aligned bases across all alignments between a target and query in order to keep those records

aln: u64

minimum alignment length

query: u64

minimum query length

Filter PAF records in various ways.

`Invert`

Fields

paf: String

PAF file from minimap2 or unimap. Must have the cg tag, and n matches will be zero unless the cigar uses =X.

Invert the target and query sequences in a PAF along with the CIGAR string.

`Liftover`

Fields

paf: String

PAF file from minimap2 or unimap run with -c and –eqx [i.e. the PAF file must have the cg tag and use extended CIGAR opts (=/X)].

bed: String

BED file of reference regions to liftover to the query.

qbed: bool

Specifies that the BED file contains query coordinates to be lifted onto the reference (reverses direction of liftover).

Note, that this will make the query in the input PAF the target in the output PAF.

largest: bool

If multiple records overlap the same region in the return only the largest liftover. The default is to return all liftovers.

Liftover target sequence coordinates onto query sequence using a PAF.

This is a function for lifting over coordinates from a reference () to a query using a PAF file from minimap2 or unimap (note, you can use paftools.js sam2paf to convert SAM data to PAF format). The returned file is a PAF file that is trimmed to the regions in the bed file. Even the cigar in the returned PAF file is trimmed so it can be used downstream! Additionally, a tag with the format id:Z:<> is added to the PAF where <> is either the 4th column of the input bed file or if not present chr_start_end.

`TrimPaf`

Fields

paf: String

PAF file from minimap2 or unimap. Must have the cg tag, and n matches will be zero unless the cigar uses =X.

match_score: i32

Value added for a matching base.

diff_score: i32

Value subtracted for a mismatching base.

indel_score: i32

Value subtracted for an insertion or deletion.

Trim paf records that overlap in query sequence.

This idea is to mimic some of the trimming that happens in PAV to improve breakpoint detection. Starts with the largest overlap and iterates until no query overlaps remain.

`Orient`

Fields

paf: String

PAF file from minimap2 or unimap. Must have the cg tag, and n matches will be zero unless the cigar uses =X.

scaffold: bool

Generate ~1 query per target that scaffolds together all the records that map to that target sequence.

The order of the scaffold will be determined by the average target position across all the queries, and the name of the new scaffold will be

insert: u64

space to add between records

Orient paf records so that most of the bases are in the forward direction.

Optionally scaffold the queriers so that there is one query per target.

`BreakPaf`

Fields

paf: String

PAF file from minimap2 or unimap. Must have the cg tag, and n matches will be zero unless the cigar uses =X.

max_size: u32

maximum indel size to keep in the paf

Break PAF records with large indels into multiple records (useful for SafFire).

`PafToSam`

Fields

paf: String

PAF file from minimap2 or unimap.

Convert a PAF file into a SAM file. Warning, all alignments will be marked as primary!

`FastxSplit`

Fields

fastx: Vec<String>

List of fastx files to write to

Splits fastx from stdin into multiple files.

Specifically it reads fastx format (fastq, fasta, or mixed) from stdin and divides the records across multiple output files. Output files can be compressed by adding .gz, and the input can also be compressed or uncompressed.

`GetFasta`

Fields

fasta: String

fasta file to extract sequences from

bed: String

bed file of regions to extract

strand: bool

Reverse complement the sequence if the strand is “-”

name: bool

Add the name (4th column) to the header of the fasta output

Mimic bedtools getfasta but allow for bgzip in both bed and fasta inputs.

`Nucfreq`

Fields

bam: String

sam/bam/cram/file

region: Option<String>

print nucfreq info from the input region e.g “chr1:1-1000”

bed: Option<String>

print nucfreq info from regions in the bed file output is optionally tagged using the 4th column

small: bool

smaller output format

Get the frequencies of each bp at each position.

`Repeat`

Fields

fasta: String

a fasta file

min: usize

The smallest repeat length to report

Report the longest exact repeat length at every position in a fasta.

`Suns`

Fields

fasta: String

fasta file with the genome

kmer_size: usize

the size of the required unique kmer

max_size: usize

the maximum size SUN interval to report

validate: bool

confirm all the SUNs (very slow) only for debugging.

Extract the intervals in a genome (fasta) that are made up of SUNs.

Enum rustybam::cli::Commands

Variants

Stats

Fields

BedLength

Fields

Filter

Fields

Invert

Fields

Liftover

Fields

TrimPaf

Fields

Orient

Fields

BreakPaf

Fields

PafToSam

Fields

FastxSplit

Fields

GetFasta

Fields

Nucfreq

Fields

Repeat

Fields

Suns

Fields

Trait Implementations

impl Debug for Commands

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl FromArgMatches for Commands

fn from_arg_matches(__clap_arg_matches: &ArgMatches) -> Result<Self, Error>

fn update_from_arg_matches<'b>( &mut self, __clap_arg_matches: &ArgMatches) -> Result<(), Error>

impl Subcommand for Commands

fn augment_subcommands<'b>(__clap_app: App<'b>) -> App<'b>

fn augment_subcommands_for_update<'b>(__clap_app: App<'b>) -> App<'b>

fn has_subcommand(__clap_name: &str) -> bool

Auto Trait Implementations

impl RefUnwindSafe for Commands

impl Send for Commands

impl Sync for Commands

impl Unpin for Commands

impl UnwindSafe for Commands

Blanket Implementations

impl<T> Any for T where T: 'static + ?Sized,

pub fn type_id(&self) -> TypeId

impl<T> Borrow<T> for T where T: ?Sized,

pub fn borrow(&self) -> &T

impl<T> BorrowMut<T> for T where T: ?Sized,

pub fn borrow_mut(&mut self) -> &mut T

impl<T> From<T> for T

pub fn from(t: T) -> T

impl<T, U> Into<U> for T where U: From<T>,

pub fn into(self) -> U

impl<T> Pointable for T

pub const ALIGN: usize

type Init = T

pub unsafe fn init(init: <T as Pointable>::Init) -> usize

pub unsafe fn deref<'a>(ptr: usize) -> &'a T

pub unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

pub unsafe fn drop(ptr: usize)

impl<T> Same<T> for T

type Output = T

impl<SS, SP> SupersetOf<SS> for SP where SS: SubsetOf<SP>,

pub fn to_subset(&self) -> Option<SS>

pub fn is_in_subset(&self) -> bool

pub unsafe fn to_subset_unchecked(&self) -> SS

pub fn from_subset(element: &SS) -> SP

impl<T, U> TryFrom<U> for T where U: Into<T>,

type Error = Infallible

pub fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for T where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

pub fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

impl<V, T> VZip<V> for T where V: MultiLane<T>,

pub fn vzip(self) -> V

`Stats`

`BedLength`

`Filter`

`Invert`

`Liftover`

`TrimPaf`

`Orient`

`BreakPaf`

`PafToSam`

`FastxSplit`

`GetFasta`

`Nucfreq`

`Repeat`

`Suns`

fn update_from_arg_matches<'b>(
&mut self,
__clap_arg_matches: &ArgMatches
) -> Result<(), Error>

impl<T> Any for T where
T: 'static + ?Sized,

impl<T> Borrow<T> for T where
T: ?Sized,

impl<T> BorrowMut<T> for T where
T: ?Sized,

impl<T, U> Into<U> for T where
U: From<T>,

impl<SS, SP> SupersetOf<SS> for SP where
SS: SubsetOf<SP>,

impl<T, U> TryFrom<U> for T where
U: Into<T>,

impl<T, U> TryInto<U> for T where
U: TryFrom<T>,

impl<V, T> VZip<V> for T where
V: MultiLane<T>,