offline-intelligence 0.1.1

High-performance LLM inference engine with memory management. Cross-platform native library with bindings for Python, Java, C++, and JavaScript.