biors-0.5.0 is not a library.
bio-rs
Rust tools for validating protein FASTA input and tokenizing FASTA records into
stable protein-20 token ids.
Features
- FASTA parsing for one or more protein sequences
protein-20residue validation- lowercase sequence normalization
- ambiguous residue reporting for
X,B,Z,J,U, andO - invalid residue reporting
- JSON array output from the CLI
Quickstart
Inspect FASTA records:
Tokenize FASTA records:
Tokenize FASTA records from stdin:
|
Tokenize a multi-record FASTA file:
Use the Rust library:
[]
= "0.5.0"
use ;
let tokenized = tokenize_fasta_records?;
let summary = summarize_tokenized_proteins;
assert_eq!;
assert_eq!;
# Ok::
JSON Contracts
tokenize always emits an array of records:
inspect always emits a summary object:
Checks
The check suite runs cargo fmt, native Rust checks, a biors-core
wasm32-unknown-unknown build check, tests, and cargo clippy with warnings
denied.
Run the Rust library example:
Workspace
packages/
rust/
biors/ CLI
biors-core/ FASTA parsing and tokenization library
examples/
multi.fasta
protein.fasta
Protein-20
A C D E F G H I K L M N P Q R S T V W Y
Token ids follow that order, starting at 0.
License
Dual licensed under MIT OR Apache-2.0.