slabs
Text chunking for RAG pipelines.
Dual-licensed under MIT or Apache-2.0.
Quickstart
[dependencies]
slabs = "0.1.0"
use ;
let chunker = prose;
let text = "Your long document here...";
let slabs = chunker.chunk;
for slab in slabs
Strategies
| Strategy | Use Case | Complexity |
|---|---|---|
| Fixed | Homogeneous content, baselines | $O(n)$ |
| Sentence | Prose, articles | $O(n)$ |
| Recursive | General-purpose | $O(n \log n)$ |
| Semantic | Topic coherence (semantic feature) | $O(nd)$ |
| Late | Contextual embeddings across chunk boundaries | Depends on base chunker |
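To make the Fixed row concrete, here is a minimal sketch of fixed-size chunking with overlap. It does not use the slabs API: the `Chunk` type and the `fixed_chunks` function are hypothetical names invented for illustration, and measuring size in characters (rather than bytes or tokens) is an assumption. It makes a single pass over the text, which is where the $O(n)$ cost comes from.

```rust
/// A slice of the source text plus its byte offsets.
/// Hypothetical type for illustration; not the crate's own chunk type.
#[derive(Debug, PartialEq)]
pub struct Chunk<'a> {
    pub text: &'a str,
    pub start: usize,
    pub end: usize,
}

/// Split `text` into windows of at most `size` characters, where each
/// window shares its first `overlap` characters with the previous one.
pub fn fixed_chunks(text: &str, size: usize, overlap: usize) -> Vec<Chunk<'_>> {
    assert!(size > 0, "size must be positive");
    assert!(overlap < size, "overlap must be smaller than size");

    // Byte offset of every character, plus the end of the string, so that
    // slicing never lands inside a multi-byte UTF-8 sequence.
    let mut bounds: Vec<usize> = text.char_indices().map(|(i, _)| i).collect();
    bounds.push(text.len());

    let n_chars = bounds.len() - 1;
    let step = size - overlap;
    let mut chunks = Vec::new();
    let mut start = 0;

    while start < n_chars {
        let end = (start + size).min(n_chars);
        chunks.push(Chunk {
            text: &text[bounds[start]..bounds[end]],
            start: bounds[start],
            end: bounds[end],
        });
        if end == n_chars {
            break; // the last window already reaches the end of the text
        }
        start += step;
    }
    chunks
}
```

Calling `fixed_chunks("abcdefghij", 4, 1)` yields the three windows `"abcd"`, `"defg"`, and `"ghij"`. The overlap is what keeps a sentence that straddles a boundary visible in both neighbouring chunks, at the cost of some duplicated text in the index.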
Features
| Feature | What it enables |
|---|---|
| semantic | Semantic chunker (requires fastembed, innr, textprep) |
| code | Code-aware chunker via tree-sitter (Rust, Python, TypeScript, Go) |
| cli | slabs CLI binary |
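Assuming these are ordinary Cargo features (the usual convention for optional functionality in a Rust crate), a dependency entry that opts into two of them might look like the fragment below. The feature names come from the table above and the version from the Quickstart. Whether any feature is enabled by default is not stated here, so check the crate's manifest.

```toml
[dependencies]
# Opt into the semantic and code-aware chunkers (feature names from the table above)
slabs = { version = "0.1.0", features = ["semantic", "code"] }
```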