memvid-ask-model
LLM inference module for Memvid Q&A with local and cloud model support.
About
memvid-ask-model provides LLM inference for Memvid's Q&A functionality. It supports local inference through llama.cpp as well as cloud APIs from OpenAI, Anthropic (Claude), and Google (Gemini).
Features
- Local Inference - Built-in llama.cpp with the Phi-1.5 model (no API key needed)
- OpenAI - GPT-4 and GPT-3.5 support
- Anthropic - Claude models
- Google - Gemini models
- RAG Integration - Works directly with memvid-core search results
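The split between a keyless local backend and three cloud providers can be pictured with a small selector. This is only a sketch: the `Backend` enum, the `parse_backend` function, and the `"provider:model"` string format are hypothetical names chosen for illustration, not this crate's confirmed API.

```rust
/// Hypothetical backend selector; not the crate's actual type.
#[derive(Debug, PartialEq)]
enum Backend {
    /// Local llama.cpp inference, no API key required.
    Local,
    OpenAi { model: String },
    Anthropic { model: String },
    Google { model: String },
}

/// Parse an assumed "provider:model" spec. `None` selects the local model.
fn parse_backend(spec: Option<&str>) -> Result<Backend, String> {
    let spec = match spec {
        Some(s) => s,
        None => return Ok(Backend::Local),
    };
    // Split on the first ':' only, so model names may contain colons.
    let (provider, model) = spec
        .split_once(':')
        .ok_or_else(|| format!("expected \"provider:model\", got {:?}", spec))?;
    if model.is_empty() {
        return Err(format!("missing model name in {:?}", spec));
    }
    let model = model.to_string();
    match provider {
        "openai" => Ok(Backend::OpenAi { model }),
        "anthropic" => Ok(Backend::Anthropic { model }),
        "google" => Ok(Backend::Google { model }),
        other => Err(format!("unknown provider: {}", other)),
    }
}
```

Under these assumptions, `parse_backend(None)` selects the local model, while `parse_backend(Some("openai:gpt-4"))` selects a cloud backend. Returning a `Result` keeps a mistyped provider from silently falling back to local inference.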
Installation
[dependencies]
memvid-ask-model = "2.0.102"
Usage
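use memvid_ask_model::run_model_inference;
use memvid_core::Memvid;

// Get search results from memvid-core
let mem = Memvid::open(/* arguments elided */)?;
let hits = mem.find(/* arguments elided */)?;

// Run inference with the local model
let answer = run_model_inference(/* arguments elided */)?;

// Or use a cloud API
let answer = run_model_inference(/* arguments elided */)?;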
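To make the retrieval step concrete, here is a minimal sketch of how search hits might be folded into a prompt before inference. The `Hit` struct, its `text` and `score` fields, and `build_prompt` are hypothetical stand-ins; the real memvid-core hit type and the prompt this crate builds internally may differ.

```rust
/// Hypothetical stand-in for a memvid-core search hit.
struct Hit {
    text: String,
    score: f32,
}

/// Build a grounded prompt: the top-scoring passages, numbered, then the question.
fn build_prompt(question: &str, hits: &[Hit], max_hits: usize) -> String {
    // Rank by descending score without mutating the caller's slice.
    let mut ranked: Vec<&Hit> = hits.iter().collect();
    ranked.sort_by(|a, b| b.score.total_cmp(&a.score));

    let mut prompt = String::from("Use only the context provided.\n\nContext:\n");
    for (i, hit) in ranked.iter().take(max_hits).enumerate() {
        prompt.push_str(&format!("[{}] {}\n", i + 1, hit.text.trim()));
    }
    prompt.push_str(&format!("\nQuestion: {}\nAnswer:", question));
    prompt
}
```

Capping the passages with `max_hits` keeps the prompt within a model's context window, which matters most for a small local model.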
Environment Variables
For cloud models, set the API key for the chosen provider (OpenAI, Anthropic, or Google) as an environment variable before running inference.
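A minimal sketch of checking for a provider key at startup follows. The variable names (`OPENAI_API_KEY`, `ANTHROPIC_API_KEY`, `GEMINI_API_KEY`) follow each provider's common convention and are assumptions here, as are the helper names `api_key_var` and `require_api_key`; the names this crate actually reads are not confirmed by this document.

```rust
/// Assumed variable names, following each provider's common convention;
/// the names this crate actually reads are not confirmed by the source.
fn api_key_var(provider: &str) -> Option<&'static str> {
    match provider {
        "openai" => Some("OPENAI_API_KEY"),
        "anthropic" => Some("ANTHROPIC_API_KEY"),
        "google" => Some("GEMINI_API_KEY"),
        _ => None,
    }
}

/// Look up the key, returning a readable error when it is unset or blank.
fn require_api_key(provider: &str) -> Result<String, String> {
    let var = api_key_var(provider)
        .ok_or_else(|| format!("unknown provider: {}", provider))?;
    match std::env::var(var) {
        Ok(key) if !key.trim().is_empty() => Ok(key),
        _ => Err(format!("set {} to use the {} backend", var, provider)),
    }
}
```

Failing early with the variable name in the message is usually friendlier than surfacing an authentication error from the first API call.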
Documentation
License
Licensed under the Apache License, Version 2.0.