# First Server
Serve a model as an HTTP API:
```bash
apr serve model.gguf --port 8080
```
Then query it:
```bash
curl http://localhost:8080/v1/completions \
  -H "Content-Type: application/json" \
  -d '{"prompt": "What is 2+2?", "max_tokens": 32}'
```
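The same request can be made from Python. A minimal sketch using only the standard library, assuming the server from the previous step is running on port 8080 (the helper names here are illustrative, not part of the tool):

```python
import json
import urllib.request

# Assumes the server started above is listening locally on port 8080.
API_URL = "http://localhost:8080/v1/completions"

def build_request(prompt, max_tokens=32, url=API_URL):
    """Build the JSON POST request matching the curl example above."""
    body = json.dumps({"prompt": prompt, "max_tokens": max_tokens}).encode()
    return urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )

def complete(prompt, max_tokens=32):
    """Send the request and return the parsed JSON response."""
    with urllib.request.urlopen(build_request(prompt, max_tokens)) as resp:
        return json.load(resp)

# Usage (with the server running):
#   result = complete("What is 2+2?")
```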
## OpenAI-Compatible API
The server exposes OpenAI-compatible endpoints, including chat completions:
```bash
# Chat completions
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [{"role": "user", "content": "Hello"}],
    "max_tokens": 100
  }'
```
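Because the response follows the OpenAI chat-completion shape, pulling out the assistant's reply is a one-liner. A standard-library sketch, again assuming the local server on port 8080 (the function names are illustrative):

```python
import json
import urllib.request

def extract_reply(response):
    """Pull the assistant's text out of an OpenAI-style chat response."""
    return response["choices"][0]["message"]["content"]

def chat(messages, max_tokens=100, base_url="http://localhost:8080"):
    """POST to /v1/chat/completions and return the assistant's reply text."""
    body = json.dumps({"messages": messages, "max_tokens": max_tokens}).encode()
    req = urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=body,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return extract_reply(json.load(resp))

# Usage (with the server running):
#   reply = chat([{"role": "user", "content": "Hello"}])
```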