Skip to main content

Module parse

Module parse 

Source
Expand description

FASTA/FASTQ parsing with automatic decompression

Reads DNA sequences from FASTA or FASTQ files, with transparent gzip decompression. Validates DNA alphabet (A, C, G, T only).

Functionsยง

count_sequences
Count sequences and total bases in a file
parse_sequences
Parse a FASTA/FASTQ file and call a function for each valid DNA sequence
validate_dna_sequence
Validate that a sequence contains only valid DNA bases (A, C, G, T)