Frame Catalog - Vector Similarity Search and RAG Infrastructure
High-performance vector search, embeddings, and retrieval-augmented generation (RAG) for AI systems.
Extracted from the Frame microservices architecture.
Features
- HNSW Vector Search: Millisecond-scale similarity search using Hierarchical Navigable Small World graphs
- ONNX Embeddings: MiniLM-L6-v2 text-to-vector conversion (384-dimensional)
- Persistent Storage: SQLite-backed vector store with optional BytePunch compression
- RAG System: High-level document indexing and retrieval interface
- Event Database: Conversation and event storage with metadata
ONNX Model Setup
Required for the embeddings feature. Frame Catalog uses the MiniLM-L6-v2 ONNX model to generate 384-dimensional semantic embeddings.
Download Models
Run the provided download script to fetch the required ONNX model (~87MB). It downloads:

- `models/all-minilm-l6-v2.onnx` (87MB) - ONNX model
- `models/all-minilm-l6-v2-tokenizer.json` (456KB) - Tokenizer config
- `models/vocab.txt` (227KB) - BERT vocabulary
See models/README.md for manual download instructions and model details.
Note: Models are gitignored and must be downloaded separately. The onnx feature (enabled by default) requires these models.
Installation
```toml
[dependencies]
frame-catalog = "0.1.0"
```
Dependency Architecture
frame-catalog is the base layer for most Frame subsystems:
frame-catalog
└── (no Frame dependencies)
Used by: frame-thoughtchain, frame-sentinel, frame-presence, frame-identity, frame-mesh
Position in Frame ecosystem:
frame-catalog (base layer)
├→ frame-thoughtchain (reasoning)
├→ frame-sentinel (trust)
├→ frame-presence (sessions)
├→ frame-identity (biometrics) + frame-sentinel
└→ frame-mesh (distributed) + frame-presence
Quick Start
```rust
use frame_catalog::{
    DocumentChunk, EmbeddingGenerator, OnnxEmbeddingGenerator, VectorStore, VectorStoreConfig,
};

// NOTE: constructor arguments and DocumentChunk field names are sketched here;
// see the API Reference below for exact signatures.

// Create embedding generator (requires models/all-minilm-l6-v2.onnx)
let embedder = OnnxEmbeddingGenerator::new()?;

// Create vector store
let config = VectorStoreConfig::default();
let mut store = VectorStore::new(config)?;

// Index documents
let chunk = DocumentChunk { /* id, content, source, ... */ };
let embedding = embedder.generate(&chunk.content)?;
store.add_chunk(chunk, embedding)?;

// Search
let query_embedding = embedder.generate("your query")?;
let results = store.search(&query_embedding, 5)?;
for result in results {
    println!("{result:?}");
}
```
Modules
- vector_store (891 LOC) - HNSW similarity search with RwLock thread safety
- embeddings (275 LOC) - ONNX embedding generation + simple hash fallback
- persistent_store (324 LOC) - SQLite persistence with BytePunch/DataSpool support
- retrieval (189 LOC) - High-level RAG interface with automatic chunking
- database (582 LOC) - Event/conversation store for chat history
Performance
- Search latency: 0.5-2ms for 10K documents (HNSW ef=50, M=16)
- Embedding generation: ~10-50ms per text (MiniLM-L6-v2)
- Memory: ~6KB per document (384-dim float32 vectors + metadata)
- Throughput: ~1000 searches/sec single-threaded
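The per-document memory figure is dominated by the raw vector plus index and metadata overhead; a quick sanity check of the vector portion, using the sizes quoted in this README:

```rust
fn main() {
    // A 384-dimensional float32 embedding occupies 384 * 4 bytes.
    let dims = 384usize;
    let vector_bytes = dims * std::mem::size_of::<f32>();
    assert_eq!(vector_bytes, 1536);

    // The remainder of the quoted ~6KB/document budget goes to
    // HNSW graph links (M=16 neighbors) and chunk metadata.
    println!("raw vector: {vector_bytes} bytes");
}
```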
Configuration
```rust
use frame_catalog::VectorStoreConfig;

// Field names below are illustrative; check the VectorStoreConfig docs for exact names.
let config = VectorStoreConfig {
    dimension: 384,      // MiniLM-L6-v2 embedding size
    ef_search: 50,       // HNSW search breadth
    max_connections: 16, // HNSW M parameter
    ..VectorStoreConfig::default()
};
```
Feature Flags
```toml
[dependencies]
frame-catalog = { version = "0.1.0", features = ["full"] }
```

- `onnx` (default): ONNX Runtime embedding generation
- `persistence`: SQLite persistence with BytePunch/DataSpool
- `full`: All features enabled
API Reference
VectorStore
- `new(config) -> Result<Self>` - Create in-memory HNSW index
- `add_chunk(chunk, embedding) -> Result<usize>` - Add document with vector
- `search(embedding, top_k) -> Result<Vec<SearchResult>>` - Find similar documents
- `clear()` - Remove all documents
- `stats() -> VectorStoreStats` - Get index statistics
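The score attached to each `SearchResult` is assumed here to be cosine similarity, a common choice for HNSW text search (the crate may use a different metric). A self-contained sketch of that scoring function:

```rust
// Cosine similarity between two embedding vectors: dot(a, b) / (|a| * |b|).
fn cosine_similarity(a: &[f32], b: &[f32]) -> f32 {
    assert_eq!(a.len(), b.len());
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let norm_a: f32 = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let norm_b: f32 = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    if norm_a == 0.0 || norm_b == 0.0 {
        0.0
    } else {
        dot / (norm_a * norm_b)
    }
}

fn main() {
    let a = [1.0, 0.0, 0.0];
    let b = [1.0, 0.0, 0.0];
    let c = [0.0, 1.0, 0.0];
    assert!((cosine_similarity(&a, &b) - 1.0).abs() < 1e-6); // identical vectors -> 1.0
    assert!(cosine_similarity(&a, &c).abs() < 1e-6); // orthogonal vectors -> 0.0
}
```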
EmbeddingGenerator (trait)
- `generate(&self, text: &str) -> Result<Vec<f32>>` - Generate single embedding
- `generate_batch(&self, texts: &[&str]) -> Result<Vec<Vec<f32>>>` - Batch processing
- `dimension(&self) -> usize` - Get embedding dimension (384 for MiniLM)
Implementations:
- `OnnxEmbeddingGenerator` - MiniLM-L6-v2 via ONNX Runtime
- `SimpleEmbeddingGenerator` - Deterministic hash-based (testing only)
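To illustrate the trait shape, here is a minimal hash-based generator in the spirit of `SimpleEmbeddingGenerator`. The trait is simplified for a self-contained example (the real trait returns the crate's own `Result` type):

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

// Simplified stand-in for the crate's EmbeddingGenerator trait.
trait EmbeddingGenerator {
    fn generate(&self, text: &str) -> Result<Vec<f32>, String>;
    fn dimension(&self) -> usize;
}

// Deterministic hash-based embeddings: useful for tests without the ONNX model.
struct HashEmbedder {
    dim: usize,
}

impl EmbeddingGenerator for HashEmbedder {
    fn generate(&self, text: &str) -> Result<Vec<f32>, String> {
        let mut v = vec![0.0f32; self.dim];
        // Hash each (token, position) pair into a bucket of the output vector.
        for (i, token) in text.split_whitespace().enumerate() {
            let mut h = DefaultHasher::new();
            (token, i).hash(&mut h);
            let idx = (h.finish() as usize) % self.dim;
            v[idx] += 1.0;
        }
        Ok(v)
    }

    fn dimension(&self) -> usize {
        self.dim
    }
}

fn main() {
    let e = HashEmbedder { dim: 384 };
    let a = e.generate("hello world").unwrap();
    let b = e.generate("hello world").unwrap();
    assert_eq!(a, b); // deterministic: same text, same vector
    assert_eq!(a.len(), e.dimension());
}
```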
RetrievalSystem
- `new(embedder, config) -> Result<Self>` - Create RAG system
- `index_document(id, content, source) -> Result<()>` - Auto-chunk and index
- `retrieve(query, top_k) -> Result<Vec<SearchResult>>` - Search with embedding
- `clear()` - Remove all documents
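`index_document` auto-chunks content before embedding. A minimal sketch of fixed-size chunking with overlap (the word counts and overlap here are illustrative, not the crate's actual defaults):

```rust
// Split text into word-based chunks of `size` words, carrying `overlap`
// words of context from the end of each chunk into the next.
fn chunk_text(text: &str, size: usize, overlap: usize) -> Vec<String> {
    assert!(overlap < size);
    let words: Vec<&str> = text.split_whitespace().collect();
    let mut chunks = Vec::new();
    let mut start = 0;
    while start < words.len() {
        let end = (start + size).min(words.len());
        chunks.push(words[start..end].join(" "));
        if end == words.len() {
            break;
        }
        start = end - overlap; // step forward, keeping `overlap` words of context
    }
    chunks
}

fn main() {
    let chunks = chunk_text("a b c d e f g h", 4, 1);
    assert_eq!(chunks, vec!["a b c d", "d e f g", "g h"]);
}
```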
Database
- `new(path) -> Result<Self>` - Create/open SQLite database
- `create_conversation(id) -> Result<()>` - Start conversation
- `store_event(event) -> Result<()>` - Store timestamped event
- `search_conversation_history(id, embedding, top_k) -> Result<Vec<(Event, f32)>>` - Semantic search
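A minimal in-memory sketch of the conversation/event data shape (the real `Database` is SQLite-backed, and the `Event` fields here are illustrative, not the crate's actual type):

```rust
use std::collections::HashMap;

// Illustrative event record; the crate's Event type may differ.
#[derive(Debug, Clone)]
struct Event {
    conversation_id: String,
    timestamp_ms: u64,
    content: String,
}

// Conversations keyed by id, each holding an ordered list of events.
#[derive(Default)]
struct EventStore {
    conversations: HashMap<String, Vec<Event>>,
}

impl EventStore {
    fn create_conversation(&mut self, id: &str) {
        self.conversations.entry(id.to_string()).or_default();
    }

    fn store_event(&mut self, event: Event) {
        self.conversations
            .entry(event.conversation_id.clone())
            .or_default()
            .push(event);
    }
}

fn main() {
    let mut db = EventStore::default();
    db.create_conversation("c1");
    db.store_event(Event {
        conversation_id: "c1".into(),
        timestamp_ms: 0,
        content: "hello".into(),
    });
    assert_eq!(db.conversations["c1"].len(), 1);
}
```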
Testing
Run `cargo test` for the default suite. Tests marked `#[ignore]` require the ONNX model file (models/all-minilm-l6-v2.onnx) and can be run with `cargo test -- --ignored`.
Compatibility
- Rust Edition: 2021
- MSRV: 1.70+
- Platforms: All (ONNX runtime supports Windows/Linux/macOS)
Dependencies
- `hnsw_rs` (0.3) - HNSW implementation
- `ort` (2.0.0-rc.10) - ONNX Runtime
- `rusqlite` (0.31) - SQLite database
- `rust_tokenizers` (8.1) - BERT tokenization
- `ndarray` (0.15) - Array operations
- `bytepunch`, `dataspool` (optional) - Compression and bundling
License
MIT - See LICENSE for details.
Author
Magnus Trent magnus@blackfall.dev