Expand description
CHUNK_TEXT(text, chunk_size, overlap, strategy) — deterministic text splitting.
Splits a text string into overlapping chunks using one of three strategies:
character: split at character boundaries, respecting chunk_size and overlapsentence: split at sentence boundaries (.!?followed by whitespace)paragraph: split at double-newline boundaries
All operations are UTF-8 safe (split on char boundaries, not byte boundaries). Shared between Origin and Lite.
Structs§
- Text
Chunk - A single chunk produced by text splitting.
Enums§
- Chunk
Error - Error returned when chunk parameters are invalid.
- Chunk
Strategy - Chunking strategy.
Functions§
- chunk_
text - Split text into chunks using the specified strategy.