ruvllm 2.0.6

LLM serving runtime with Ruvector integration - Paged attention, KV cache, and SONA learning