yomo 0.4.0

A QUIC-based runtime for AI-LLM tool routing and serverless execution
# YoMo

- build the project:

  ```
  cargo build
  ```

- use Ollama as the LLM provider and pull a model:

  ```
  ollama pull qwen3.5
  ```

- run the YoMo server:

  ```
  RUST_LOG=info ./target/debug/yomo serve
  ```

- run a YoMo serverless tool:

  ```
  RUST_LOG=info ./target/debug/yomo run --name get-weather ./demo/go/get_weather
  ```

- send a request:

  ```
  curl \
    --request POST \
    --url http://127.0.0.1:9001/v1/chat/completions \
    --header 'Content-Type: application/json' \
    --data '{
      "model": "qwen3.5",
      "messages": [
        {
          "role": "user",
          "content": "How is the weather in Beijing?"
        }
      ]
    }'
  ```
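
The same request can also be sent from a script. The following is a minimal sketch in Python using only the standard library. It reuses the URL, model name, and message from the curl example. The helper names `build_chat_request` and `send_chat_request` are hypothetical and not part of YoMo, and the script prints the raw JSON reply because the response shape is not described here.

```python
import json
import urllib.request

# Endpoint taken from the curl example above.
YOMO_URL = "http://127.0.0.1:9001/v1/chat/completions"


def build_chat_request(model: str, prompt: str) -> dict:
    """Build the same JSON body that the curl example sends (hypothetical helper)."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }


def send_chat_request(body: dict, url: str = YOMO_URL, timeout: float = 60.0) -> dict:
    """POST the body as JSON and return the decoded JSON reply (hypothetical helper)."""
    request = urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Requires the YoMo server and the get-weather tool from the steps above to be running.
    reply = send_chat_request(
        build_chat_request("qwen3.5", "How is the weather in Beijing?")
    )
    # The reply structure is not assumed here, so the whole document is printed.
    print(json.dumps(reply, indent=2, ensure_ascii=False))
```

Keeping the body construction separate from the network call makes it easy to change the model or prompt without touching the HTTP code.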