//! Local LLM Provider Module
//!
//! This module provides support for running local LLM models using llama.cpp.
//! It's designed for CPU-optimized inference with models like:
//! - LFM2 (Liquid Foundation Model 2) - Hybrid architecture with excellent CPU performance
//! - Granite 4.0 Nano - Transformer architecture optimized for CPU
//!
//! Key features:
//! - GGUF model format support
//! - CPU-optimized inference (no GPU required)
//! - Memory-efficient with small models (100MB - 2GB RAM)
//! - 32K context window support
//! - Streaming text generation
pub use *;
pub use *;
pub use *;