Crate wg_ragsmith

Source Discovery ─┬─► ingestion::fetch_html ──► DocumentCache
                  └─► ingestion::resume      ──┐
                                               │
Cached HTML ──► semantic_chunking::service ──► ChunkBatch
                                   │
                                   ├─► embeddings / segmenter helpers
                                   └─► cache & breakpoint strategies

ChunkBatch ──► ingestion::chunk_response_to_ingestion ──► stores::sqlite::SqliteChunkStore
            └─► downstream VectorStore implementations (future adapters)

Stored vectors ──► query utilities & RAG applications
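
The flow in the diagram can be sketched as a small, self-contained program. This is only an illustration of the stage ordering (cache, then chunk, then store). Every type and function below (`DocumentCache` as a plain map, `Chunk`, `chunk_document`, `InMemoryChunkStore`, `run_pipeline`) is a hypothetical stand-in and not the crate's actual API. Chunking here splits on blank lines rather than using embeddings or breakpoint strategies, and the store is a `Vec` rather than SQLite.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for the cache stage: maps a source URL to its raw text.
#[derive(Default)]
struct DocumentCache {
    pages: HashMap<String, String>,
}

impl DocumentCache {
    fn insert(&mut self, url: &str, body: &str) {
        self.pages.insert(url.to_string(), body.to_string());
    }
}

/// Hypothetical chunk record; the crate's real chunk types are not shown here.
#[derive(Debug, Clone, PartialEq)]
struct Chunk {
    source: String,
    index: usize,
    text: String,
}

/// Stand-in for the chunking stage: splits on blank lines and drops empty pieces.
fn chunk_document(source: &str, body: &str) -> Vec<Chunk> {
    body.split("\n\n")
        .map(str::trim)
        .filter(|piece| !piece.is_empty())
        .enumerate()
        .map(|(index, text)| Chunk {
            source: source.to_string(),
            index,
            text: text.to_string(),
        })
        .collect()
}

/// Stand-in for a chunk store: keeps rows in memory instead of a database.
#[derive(Default)]
struct InMemoryChunkStore {
    rows: Vec<Chunk>,
}

impl InMemoryChunkStore {
    /// Appends a batch and returns how many chunks were added.
    fn ingest(&mut self, batch: Vec<Chunk>) -> usize {
        let added = batch.len();
        self.rows.extend(batch);
        added
    }
}

/// Runs cache -> chunk -> store for every cached page, in sorted URL order.
fn run_pipeline(cache: &DocumentCache, store: &mut InMemoryChunkStore) -> usize {
    let mut urls: Vec<&String> = cache.pages.keys().collect();
    urls.sort();
    urls.into_iter()
        .map(|url| store.ingest(chunk_document(url, &cache.pages[url])))
        .sum()
}

fn main() {
    let mut cache = DocumentCache::default();
    cache.insert("https://example.com/a", "First paragraph.\n\nSecond paragraph.");
    let mut store = InMemoryChunkStore::default();
    let stored = run_pipeline(&cache, &mut store);
    println!("stored {stored} chunks");
}
```

Keeping each stage behind its own type mirrors the diagram's separation: a different store or chunking strategy could be swapped in without touching the other stages.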

Re-exports

pub use semantic_chunking::assembly;
pub use semantic_chunking::breakpoints;
pub use semantic_chunking::cache;
pub use semantic_chunking::config;
pub use semantic_chunking::embeddings;
pub use semantic_chunking::segmenter;
pub use semantic_chunking::service;
pub use semantic_chunking::tokenizer;
pub use semantic_chunking::types as chunk_types;
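
As a minimal sketch of what these re-exports mean for import paths, the program below rebuilds the same module shape in miniature. The outer module `ragsmith_like` and the items `Placeholder` and `DEFAULT_LIMIT` are hypothetical and exist only so the paths have something to name; they are not part of the crate. The `as chunk_types` alias presumably avoids a clash with the crate's own top-level `types` module.

```rust
// Stand-in for the crate root; the module contents are hypothetical.
mod ragsmith_like {
    pub mod semantic_chunking {
        pub mod types {
            /// Hypothetical item used only to compare the two paths.
            #[derive(Debug, Default, PartialEq)]
            pub struct Placeholder {
                pub id: u32,
            }
        }

        pub mod config {
            /// Hypothetical constant used only for illustration.
            pub const DEFAULT_LIMIT: usize = 512;
        }
    }

    // Same shape as the re-exports listed above.
    pub use self::semantic_chunking::config;
    pub use self::semantic_chunking::types as chunk_types;

    /// A separate top-level `types` module can coexist because of the alias.
    pub mod types {}
}

/// Builds the same value through the short and the long path and compares them.
fn same_item() -> bool {
    let short = ragsmith_like::chunk_types::Placeholder { id: 7 };
    let long = ragsmith_like::semantic_chunking::types::Placeholder { id: 7 };
    short == long
}

fn main() {
    // Both paths name the same struct, so the values compare equal.
    println!("same item: {}", same_item());
    println!("limit via short path: {}", ragsmith_like::config::DEFAULT_LIMIT);
}
```

Applied to this crate, the listing implies that `wg_ragsmith::service` and `wg_ragsmith::semantic_chunking::service` name the same module, and that `wg_ragsmith::chunk_types` is `wg_ragsmith::semantic_chunking::types` under a different name.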

Modules

ingestion
Ingestion utilities for turning external documents into chunked datasets.
semantic_chunking
Semantic chunking primitives for JSON and HTML sources.
stores
types