Expand description
Request-plane metrics for AddressedPushRouter. Used to pinpoint serialization vs transport roundtrip latency.
Statics§
- REQUEST_
PLANE_ INFLIGHT - Currently in-flight requests (incremented at generate() entry, decremented on stream complete).
- REQUEST_
PLANE_ QUEUE_ SECONDS - Time from generate() entry to send_request() (serialization + encoding + control message).
- REQUEST_
PLANE_ ROUNDTRIP_ TTFT_ SECONDS - Time from send_request() to first response item (transport roundtrip TTFT).
- REQUEST_
PLANE_ SEND_ SECONDS - Time for send_request() to complete (frontend view: network + queue + ack).
Functions§
- ensure_
request_ plane_ metrics_ registered - Register request-plane metrics with the given registry. Idempotent; only the first call registers.
- ensure_
request_ plane_ metrics_ registered_ prometheus - Register request-plane metrics with a raw Prometheus registry (e.g. for LLM HTTP service /metrics). Idempotent; only the first call registers. Call this when the service exposes /metrics from its own registry.