Crate sourmash

Compute, compare and search signatures for nucleotide (DNA/RNA) and protein sequences.

sourmash is a command-line tool and Python library for computing MinHash sketches from DNA sequences, comparing them to each other, and plotting the results. This makes it possible to estimate sequence similarity quickly and accurately, even between very large data sets.
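
To make the idea of a MinHash sketch concrete, here is a minimal, standard-library-only illustration of a bottom-k sketch and a Jaccard estimate. It is a sketch under stated assumptions, not this crate's implementation: the names `hash_kmer`, `sketch` and `jaccard` are hypothetical, `DefaultHasher` stands in for a MurmurHash-style function, and k-mers are not canonicalized against their reverse complement.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::BTreeSet;
use std::hash::{Hash, Hasher};

/// Hash one k-mer to a u64.
/// Assumption: DefaultHasher is only a stand-in for the hash a real tool uses.
fn hash_kmer(kmer: &[u8]) -> u64 {
    let mut hasher = DefaultHasher::new();
    kmer.hash(&mut hasher);
    hasher.finish()
}

/// Bottom-k sketch: keep the `num` smallest distinct k-mer hashes of `seq`.
fn sketch(seq: &[u8], ksize: usize, num: usize) -> BTreeSet<u64> {
    let mut mins = BTreeSet::new();
    if ksize == 0 || seq.len() < ksize {
        return mins;
    }
    for kmer in seq.windows(ksize) {
        mins.insert(hash_kmer(kmer));
        if mins.len() > num {
            // Evict the largest hash so at most `num` values are retained.
            let largest = *mins.iter().next_back().unwrap();
            mins.remove(&largest);
        }
    }
    mins
}

/// Estimate Jaccard similarity from two bottom-k sketches: take the `num`
/// smallest hashes of the union and count how many occur in both sketches.
fn jaccard(a: &BTreeSet<u64>, b: &BTreeSet<u64>, num: usize) -> f64 {
    // BTreeSet::union yields values in ascending order.
    let bottom: Vec<u64> = a.union(b).copied().take(num).collect();
    if bottom.is_empty() {
        return 0.0;
    }
    let shared = bottom
        .iter()
        .filter(|h| a.contains(*h) && b.contains(*h))
        .count();
    shared as f64 / bottom.len() as f64
}
```

Because only `num` hashes are kept per sequence, two sketches can be compared in time proportional to `num` rather than to the sequence lengths, which is what makes comparing very large data sets practical.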

sourmash can be used to quickly search large databases of genomes for matches to query genomes and metagenomes.
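
As a rough picture of what such a search involves, the following standard-library-only sketch scans a small in-memory collection and ranks entries by how much of the query they contain. Everything here is an illustrative assumption rather than the API of the `index` module: the names `scaled_sketch`, `containment` and `search` are hypothetical, the "keep hashes below `u64::MAX / scaled`" rule is one common way to build a fixed-fraction sketch, and `DefaultHasher` again stands in for the real hash function.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::HashSet;
use std::hash::{Hash, Hasher};

/// Keep every k-mer hash that falls at or below u64::MAX / scaled,
/// i.e. roughly one in `scaled` of all distinct k-mers (assumed scheme).
fn scaled_sketch(seq: &[u8], ksize: usize, scaled: u64) -> HashSet<u64> {
    let mut kept = HashSet::new();
    if ksize == 0 || scaled == 0 || seq.len() < ksize {
        return kept;
    }
    let max_hash = u64::MAX / scaled;
    for kmer in seq.windows(ksize) {
        let mut hasher = DefaultHasher::new();
        kmer.hash(&mut hasher);
        let h = hasher.finish();
        if h <= max_hash {
            kept.insert(h);
        }
    }
    kept
}

/// Fraction of the query's hashes that also occur in the subject.
fn containment(query: &HashSet<u64>, subject: &HashSet<u64>) -> f64 {
    if query.is_empty() {
        return 0.0;
    }
    query.intersection(subject).count() as f64 / query.len() as f64
}

/// Linear scan over a named collection of sketches, returning entries whose
/// containment of the query meets `threshold`, best match first.
fn search<'a>(
    query: &HashSet<u64>,
    db: &'a [(String, HashSet<u64>)],
    threshold: f64,
) -> Vec<(&'a str, f64)> {
    let mut hits: Vec<(&str, f64)> = db
        .iter()
        .map(|(name, sk)| (name.as_str(), containment(query, sk)))
        .filter(|(_, c)| *c >= threshold)
        .collect();
    hits.sort_by(|a, b| b.1.total_cmp(&a.1));
    hits
}
```

A linear scan like this costs one set intersection per database entry; dedicated indexing structures exist to avoid touching every entry for each query.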

sourmash also includes k-mer based taxonomic exploration and classification routines for genome and metagenome analysis. These routines can use the NCBI taxonomy but do not depend on it in any way. Documentation and further examples for each module can be found in the module descriptions below.

Re-exports§

pub use errors::SourmashError as Error;

Modules§

ani_utils
cmd
collection
encodings
errors
ffi: Foreign Function Interface for calling sourmash from a C API
index: Indexing structures for fast similarity search
manifest
prelude
selection
signature: Compressed representations of genomic data
sketch
storage

Functions§

_hash_murmur

Type Aliases§

Result
ScaledType