yomo 0.4.0

A QUIC-based runtime for AI-LLM tool routing and serverless execution

YoMo

  • build:

    cargo build
    
  • use Ollama as the LLM provider:

    ollama pull qwen3.5
    
  • run the YoMo server:

    RUST_LOG=info ./target/debug/yomo serve
    
  • run a YoMo serverless tool:

    RUST_LOG=info ./target/debug/yomo run --name get-weather ./demo/go/get_weather
    
  • send a request:

    curl \
      --request POST \
      --url http://127.0.0.1:9001/v1/chat/completions \
      --header 'Content-Type: application/json' \
      --data '{
        "model": "qwen3.5",
        "messages": [
          {
            "role": "user",
            "content": "How is the weather in Beijing?"
          }
        ]
      }'
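
The same request can also be sent from a script. The following is a minimal sketch using only the Python standard library, assuming the server is listening on the address used in the curl command above. The helper names `build_chat_request` and `send_chat_request` are hypothetical and not part of YoMo, and the response is assumed to be a JSON body, which is printed as-is without relying on any particular fields.

```python
import json
import urllib.request

# Address and path copied from the curl command above.
YOMO_URL = "http://127.0.0.1:9001/v1/chat/completions"


def build_chat_request(model: str, prompt: str) -> urllib.request.Request:
    """Build the same POST request that the curl command sends.

    Hypothetical helper for illustration; not part of YoMo.
    """
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        YOMO_URL,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send_chat_request(req: urllib.request.Request, timeout: float = 60.0):
    """Send the request and decode the reply (assumed to be JSON)."""
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Requires the YoMo server and the get-weather tool to be running.
    request = build_chat_request("qwen3.5", "How is the weather in Beijing?")
    reply = send_chat_request(request)
    print(json.dumps(reply, indent=2, ensure_ascii=False))
```

Building the request separately from sending it makes it possible to inspect the payload without a running server.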