Ragzilla

A Rust library providing tools for RAG (Retrieval-Augmented Generation) pipelines.

Features

Ragzilla provides several features that can be selectively enabled:

embedding: Generate embeddings using Gemini API
parsing: Parse PDFs using Mistral AI OCR API
transcription: Transcribe audio to text using OpenAI's GPT-4o API
all: Enable all features

Usage

Add ragzilla to your dependencies with the features you need:

[dependencies]
ragzilla = { version = "0.2.0", features = ["embedding", "parsing"] }

Embedding Example

use ragzilla::embedding;

async fn create_embedding() {
    let api_key = std::env::var("GEMINI_API_KEY").expect("GEMINI_API_KEY must be set");
    let text = "What is the meaning of life?";
    
    let embedding = embedding::embed(text, &api_key).await.unwrap();
    println!("Generated embedding with {} dimensions", embedding.len());
}

PDF Parsing Example

use ragzilla::parsing;

async fn parse_document() {
    let api_key = std::env::var("MISTRAL_API_KEY").expect("MISTRAL_API_KEY must be set");
    let document_url = "https://example.com/document.pdf";
    
    let chunks = parsing::parse_pdf(document_url, &api_key).await.unwrap();
    println!("Extracted {} pages from PDF", chunks.len());
}

Audio Transcription Example

use ragzilla::transcription;
use std::fs;

async fn transcribe_audio() {
    let api_key = std::env::var("OPENAI_API_KEY").expect("OPENAI_API_KEY must be set");
    let audio_data = fs::read("path/to/audio.mp3").expect("Could not read audio file");
    
    let text = transcription::transcribe(&audio_data, &api_key).await.unwrap();
    println!("Transcription: {}", text);
}

License

MIT