Seqsum
[!WARNING]
Seqsum was rewritten in Rust in 0.3.0. The original Python version of seqsum and how to use is archived in thepythonbranch. It remains available on PyPI.
Robust checksums for nucleotide sequences. Accepts one or more fast[a|q][.gz|.zst] files or standard input. Generates an aggregate checksum for each input file by default, similar to md5sum/sha256sum. Warnings are shown for duplicate sequences and within-collection checksum collisions at the selected bit depth. Sequences are uppercased before hashing with RapidHash (v3) and may be normalised (with -n) to use only ACGTN-. Read IDs and FASTQ base quality scores do not inform the checksum. Output is tab-delimited text to stdout.
By default, seqsum outputs one aggregate checksum per file. Use --individual (-i) for per-record checksums, or --all (-a) for both individual and aggregate checksums. These flags are mutually exclusive.
Install
Development
Command line usage
# Default: aggregate checksum per file
# Multiple files
# Stdin
|
# Individual per-record checksums
# All: individual checksums + aggregate
Built-in help