Crate kira_cdh_compat_clstr

CD-HIT-compatible .clstr writer/reader and a semantic diff helper.

Format notes

  • Each cluster starts with a header line of the form >Cluster N, where N is the cluster number.
  • Member lines follow. The first member is the representative and is marked with *.
  • We optionally emit lengths with units (e.g., 150nt or 300aa).
  • Parsers in the wild typically extract the member ID as the substring after > up to the first occurrence of three dots (...). We follow this convention.
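The ID-extraction convention above can be sketched with the standard library alone. This is an illustrative sketch, not this crate's parser: the helper names member_id and is_representative are hypothetical, and the sample member line layout (index, tab, length, then >id...) is an assumption based on typical CD-HIT output.

```rust
/// Extract the member ID from one .clstr member line: the text after `>`
/// up to the first `...`. Returns `None` if either delimiter is missing.
/// (Hypothetical helper for illustration; not part of this crate's API.)
fn member_id(line: &str) -> Option<&str> {
    // Skip past the first `>`; `>` is one byte, so `+ 1` stays on a char boundary.
    let start = line.find('>')? + 1;
    let rest = &line[start..];
    // The ID ends where the first run of three dots begins.
    let end = rest.find("...")?;
    Some(&rest[..end])
}

/// True if the member line ends with the representative marker `*`.
/// (Hypothetical helper for illustration.)
fn is_representative(line: &str) -> bool {
    line.trim_end().ends_with('*')
}
```

Under these assumptions, calling member_id on a line such as "0", a tab, then "150nt, >seq1... *" yields the ID seq1, and is_representative reports true for the same line. A cluster header such as >Cluster 0 contains no three-dot run, so member_id returns None for it.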

The writer here is intentionally small and conservative: it emits only the minimal fields required by most downstream tooling.
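To make the minimal layout concrete, here is a rough sketch of emitting one cluster by hand. It does not use or reproduce ClstrWriter: the function format_cluster and the Unit enum are hypothetical stand-ins, and the exact spacing (member index, tab, optional length with unit and comma, then >id...) is an assumption modeled on common CD-HIT output. Identity annotations that CD-HIT itself prints for non-representative members are deliberately omitted.

```rust
use std::fmt::Write;

/// Hypothetical unit type for this sketch; not the crate's `ClstrUnit`.
#[derive(Clone, Copy)]
enum Unit {
    Nt,
    Aa,
}

/// Format one cluster as text. `members` holds (id, length) pairs; the first
/// entry is treated as the representative. (Illustrative helper only.)
fn format_cluster(index: usize, members: &[(&str, usize)], unit: Option<Unit>) -> String {
    let mut out = String::new();
    // Header line: `>Cluster N`. Writing to a String cannot fail.
    writeln!(out, ">Cluster {}", index).unwrap();
    for (i, (id, len)) in members.iter().enumerate() {
        // Member index within the cluster, followed by a tab.
        write!(out, "{}\t", i).unwrap();
        // Optional length annotation, e.g. `150nt, ` or `300aa, `.
        if let Some(u) = unit {
            let suffix = match u {
                Unit::Nt => "nt",
                Unit::Aa => "aa",
            };
            write!(out, "{}{}, ", len, suffix).unwrap();
        }
        // The ID sits between `>` and `...` so common parsers can recover it.
        write!(out, ">{}...", id).unwrap();
        // Mark the first member as the representative.
        if i == 0 {
            out.push_str(" *");
        }
        out.push('\n');
    }
    out
}
```

With members seq1 (150) and seq2 (148), cluster number 0, and Unit::Nt, this produces a >Cluster 0 header followed by two member lines, the first ending in the * marker. Passing None for the unit drops the length annotation entirely.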

Structs

ClstrWriter
CD-HIT-compatible .clstr writer.

Enums

ClstrUnit
Unit for sequence length annotations in .clstr.

Functions

parse_clusters_from_reader
Parse clusters from any buffered reader.
read_clusters
Read clusters from a .clstr file (path).