Expand description
Bioinformatics file formats
Provides support for common bioinformatics file formats:
- FASTA: Nucleotide and protein sequences
- FASTQ: Sequences with quality scores
- SAM/BAM: Sequence alignment data
- VCF: Variant Call Format for genomic variations Bioinformatics file format support
This module provides support for common bioinformatics file formats used in genomics, proteomics, and molecular biology research.
§Supported Formats
- FASTA: Standard format for nucleotide and protein sequences
- FASTQ: Sequences with per-base quality scores
- SAM/BAM: Sequence Alignment/Map format
- VCF: Variant Call Format for genomic variations
§Examples
use scirs2_io::formats::bioinformatics::{FastaReader, FastqReader};
// Read FASTA file
let mut reader = FastaReader::open("sequences.fasta")?;
for record in reader.records() {
let record = record?;
println!(">{}", record.id());
println!("{}", record.sequence());
}
// Read FASTQ file
let mut reader = FastqReader::open("reads.fastq")?;
for record in reader.records() {
let record = record?;
println!("@{}", record.id());
println!("{}", record.sequence());
println!("+");
println!("{}", record.quality());
}Modules§
- analysis
- Sequence analysis utilities
Structs§
- Fasta
Reader - FASTA file reader
- Fasta
Record - FASTA sequence record
- Fasta
Record Iterator - Iterator over FASTA records
- Fasta
Writer - FASTA file writer
- Fastq
Reader - FASTQ file reader
- Fastq
Record - FASTQ sequence record
- Fastq
Record Iterator - Iterator over FASTQ records
- Fastq
Writer - FASTQ file writer
Enums§
- Quality
Encoding - FASTQ quality encoding
Functions§
- count_
fastasequences - Count sequences in a FASTA file
- count_
fastqsequences - Count sequences in a FASTQ file