An HTTP client for use with Conjure servers.
This crate provides both asynchronous and blocking clients. The clients can either be used in Conjure-generated wrapper types or standalone for use with non-Conjure servers.
Configuration
While a conjure_runtime client can be built up programmatically, the more common approach is for the configuration
to be deserialized from a service's runtime-reloadable configuration file. The ServicesConfig supports
configuration for multiple downstream services, as well as allowing for both global and per-service configuration
overrides:
services:
auth-service:
uris:
- https://auth.my-network.com/auth-service
cache-service:
uris:
- https://cache-1.my-network.com/cache-service
- https://cache-2.my-network.com/cache-service
request-timeout: 10s
# options set at this level will apply as defaults to all configured services
security:
ca-file: var/security/ca.pem
Using the refreshable crate, a live-updating config file can be used with the ClientFactory type to create
live-updating clients for the configured services.
Usage
First construct a raw conjure_runtime::Client:
use ;
use SecurityConfig;
use PathBuf;
#
The client can then be used with Conjure-generated service interfaces:
use TestServiceAsyncClient;
use BearerToken;
# async
Or manually if a service does not provide a Conjure API:
# async
The blocking::Client's API is identical, with the exception that you don't .await on methods:
use TestServiceClient;
use BearerToken;
#
#
Behavior
conjure_runtime wraps the hyper HTTP library with opinionated behavior designed to more effectively communicate
between services in a distributed system. It is broadly designed to align with the dialogue Java library, though
it does differ in various ways.
Error Propagation
Servers should use the standard Conjure error format to propagate application-specific errors to callers. Non-QoS
(see below) errors received from the server are treated as fatal. By default, conjure_runtime will return a
conjure_error::Error that will generate a generic 500 Internal Server Error response. Its cause will be a
RemoteError object that contains the serialized Conjure error information sent by the server. The
Client::set_propagate_service_errors method can be used to change that behavior to instead transparently propagate
the error received from the server. Rather than producing a generic 500 response, the returned
conjure_error::Error will produce the same response the client received from the server.
Call Tracing
The client propagates trace information via the zipkin crate using the traditional X-B3-* HTTP headers. It
also creates local spans covering various stages of request processing:
conjure-runtime: get /widget-service/{widgetId}- The name of this span is built from the request's method and path pattern.conjure-runtime: attempt 1conjure-runtime: acquire-permit- If client QoS is enabled and the node selection strategy is notBalanced, this span covers the time spent acquiring a concurrency limiter permit.conjure-runtime: balanced-node-selection- If the node selection strategy isBalanced, this span covers the time spent selecting a node and (if client QoS is enabled) acquiring a concurrency limiter permit.conjure-runtime: wait-for-headers- This span is sent to the server, and lasts until the server sends the headers of the response.conjure-runtime: wait-for-body- This span is tracked along with the response body, and lasts until theResponseBodyobject is dropped. It is "detached" from the zipkin tracer so new spans created outside ofconjure-runtimewill not be parented to it, and can outlive the parentconjure-runtimespans. It will not be created if an IO error occurs before headers are received.conjure-runtime: backoff-with-jitter- If the request is retried, this span tracks the time spent waiting between attempts.conjure-runtime: attempt 2- ...
Quality of Service: Retry, Failover, Throttling, and Backpressure
The client treats certain HTTP errors specially. Servers can advertise an overloaded state via the 429 Too Many
Requests or 503 Service Unavailable status codes. Unlike other 4xx and 5xx status codes, these responses do not
cause the request to fail. Instead, conjure_runtime will throttle itself and retry the request. Requests are
retried a fixed number of times, with an exponentially growing backoff in between attempts. If a 429 response
contains a Retry-After header, its backoff will be used rather than the default. IO errors also trigger a retry.
A 503 response or IO error will also cause that host to be temporarily put on "cooldown" so it will not be used by other requests unless there is no other option.
Only some requests can be retried. By default, conjure_runtime will only retry requests with HTTP methods
identified as idempotent - GET, PUT, DELETE, HEAD, TRACE, and OPTIONS. Non-idempotent requests cannot be
safely retried to avoid the risk of unexpected behavior if the request ends up being applied twice. The
Client::set_assume_idempotent method can be used to override this behavior and have the client assume all requests
are idempotent. In addition, requests with streaming request bodies can only be retried if the body had either not
started to be written when the error occurred or if it was successfully reset for another attempt.
Metrics
Clients record metrics to both a standard MetricsRegistry and a conjure_runtime-specific HostMetricsRegistry.
Standard Metrics
client.request (service: <service_name>)- ATimerrecording request durations, tagged by service. Note that the requests timed by this metric are the user-percieved request, including any backoffs/retries/etc. It only records the time until response headers are received, not until the entire response body is read.client.request.error (service: <service_name>, reason: IOException)- AMetertracking the rate of IO errors, tagged by service. Like theclient.requestmetric, this tracks overall user-perceived request. Thereasontag has a value ofIOExceptionto align with [conjure-java-runtime]'s metric.tls.handshake (context: <service_name>, protocol: <protocol_version>, cipher: <cipher_name>)- AMetertracking the rate of TLS handshakes, tagged by the service, TLS protocol version (e.g.TLSv1.3), and cipher name (e.g.TLS_CHACHA20_POLY1305_SHA256).conjure-runtime.concurrencylimiter.max (service: <service_name>, hostIndex: <host_index>)- AGaugereporting the maximum number of concurrent requests which are currently permitted to be made to a specific host.conjure-runtime.concurrencylimiter.in-flight (service: <service_name>, hostIndex: <host_index>)- AGaugereporting the current number of requests being made to a specific host.
Host Metrics
The HostMetricsRegistry contains metrics for every host of every service being actively used by a
conjure_runtime client.