Struct WorkerSpec 

pub struct WorkerSpec {
    pub url: String,
    pub models: WorkerModels,
    pub worker_type: WorkerType,
    pub connection_mode: ConnectionMode,
    pub runtime_type: RuntimeType,
    pub provider: Option<ProviderType>,
    pub labels: HashMap<String, String>,
    pub priority: u32,
    pub cost: f32,
    pub api_key: Option<String>,
    pub bootstrap_port: Option<u16>,
    pub bootstrap_host: String,
    pub dp_base_url: Option<String>,
    pub dp_rank: Option<usize>,
    pub dp_size: Option<usize>,
    pub kv_connector: Option<String>,
    pub kv_role: Option<String>,
    pub kv_block_size: Option<usize>,
    pub health: HealthCheckUpdate,
    pub max_connection_attempts: u32,
    pub load_monitor_interval_secs: Option<u64>,
}
Core worker identity and configuration.

This is the single canonical representation of “what is a worker”. It is embedded via #[serde(flatten)] as the shared sub-struct in API requests, API responses, and internal runtime state.

Fields use #[serde(default)] so the same struct works for both input (partial config from user) and output (fully resolved state).

Fields§

§url: String

Worker URL.

§models: WorkerModels

Models this worker can serve.

§worker_type: WorkerType

Worker type: regular, prefill, or decode.

§connection_mode: ConnectionMode

Connection mode: http or grpc.

§runtime_type: RuntimeType

Runtime type: sglang, vllm, trtllm, or external.

§provider: Option<ProviderType>

External provider for API transformations. None means native/passthrough.

§labels: HashMap<String, String>

Additional labels/tags.

§priority: u32

Worker priority (higher = preferred).

§cost: f32

Worker cost factor (baseline = 1.0).
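
The page does not say how the router combines priority and cost. As a minimal sketch, assuming a policy of "highest priority wins, lower cost breaks ties", the two fields could be compared as below. The Candidate type, the preference and pick functions, and the tie-break rule are all hypothetical and are not part of this crate.

```rust
use std::cmp::Ordering;

/// Hypothetical, trimmed-down view of a worker: only the fields used for ranking.
#[derive(Debug, Clone, PartialEq)]
struct Candidate {
    url: String,
    priority: u32,
    cost: f32,
}

/// Orders candidates so that the preferred one compares greatest:
/// higher priority first, then lower cost as a tie-breaker (assumed policy).
fn preference(a: &Candidate, b: &Candidate) -> Ordering {
    a.priority
        .cmp(&b.priority)
        // Reversed operands: the cheaper candidate ranks higher.
        .then_with(|| b.cost.total_cmp(&a.cost))
}

/// Returns the most preferred candidate, or `None` for an empty slice.
fn pick(candidates: &[Candidate]) -> Option<&Candidate> {
    candidates.iter().max_by(|a, b| preference(a, b))
}
```

f32::total_cmp is used because f32 does not implement Ord; it gives a total order even if a cost is NaN.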

§api_key: Option<String>

Worker API key. Accepted on input, never included in responses.

§bootstrap_port: Option<u16>

Bootstrap port for prefill workers in PD (prefill/decode) disaggregated mode.

§bootstrap_host: String

Bootstrap hostname (derived from URL at construction time).
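
How the hostname is derived is not shown on this page. The sketch below is one plausible way to pull a host out of a worker URL using only the standard library. The host_from_url function is hypothetical, and it deliberately ignores userinfo (user:pass@host), which a real URL parser would handle.

```rust
/// Extracts the host portion of a URL-like string (illustrative helper, not the crate's code).
/// Handles `scheme://host:port/path`, bare `host:port`, and bracketed IPv6 literals.
fn host_from_url(url: &str) -> Option<&str> {
    // Drop the scheme if one is present.
    let rest = url.split_once("://").map_or(url, |(_, r)| r);
    // Keep only the authority: everything before the first '/', '?' or '#'.
    let authority = rest.split(|c| matches!(c, '/' | '?' | '#')).next()?;
    let host = if let Some(stripped) = authority.strip_prefix('[') {
        // Bracketed IPv6 literal such as "[::1]:50051".
        stripped.split_once(']')?.0
    } else {
        // Everything before the port separator.
        authority.split(':').next()?
    };
    if host.is_empty() {
        None
    } else {
        Some(host)
    }
}
```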

§dp_base_url: Option<String>

Base URL without the data-parallel (DP) rank suffix, for DP-aware workers. When set, url contains the rank-suffixed form ({base}@{rank}).

§dp_rank: Option<usize>

Data-parallel rank (None = not DP-aware).

§dp_size: Option<usize>

Total data-parallel group size (None = not DP-aware).
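
The three DP fields fit together through the {base}@{rank} URL form. The helpers below sketch how such URLs could be built and split again. The function names are hypothetical, and numbering ranks from 0 up to dp_size is an assumption rather than something this page states.

```rust
/// Builds the rank-suffixed URL `{base}@{rank}`.
fn dp_url(base: &str, rank: usize) -> String {
    format!("{base}@{rank}")
}

/// Splits a rank-suffixed URL back into `(base, rank)`.
/// Returns `None` when there is no '@' or the suffix does not parse as `usize`.
fn split_dp_url(url: &str) -> Option<(&str, usize)> {
    let (base, rank) = url.rsplit_once('@')?;
    let rank = rank.parse::<usize>().ok()?;
    Some((base, rank))
}

/// Expands one base URL into one URL per rank (ranks assumed to start at 0).
fn expand_dp(base: &str, dp_size: usize) -> Vec<String> {
    (0..dp_size).map(|rank| dp_url(base, rank)).collect()
}
```

Splitting at the last '@' keeps the sketch safe for URLs that contain an earlier '@'; a URL without a numeric suffix is simply treated as not DP-aware.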

§kv_connector: Option<String>

KV connector type (e.g. “MooncakeConnector”, “NixlConnector”).

§kv_role: Option<String>

KV role (e.g. “kv_producer”, “kv_consumer”, “kv_both”).

§kv_block_size: Option<usize>

KV cache block size (tokens per block) for event-driven routing. When set, overrides the router-level default for this worker’s model. Typically matches the backend engine’s page size (e.g. 16 for SGLang).
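
To make "tokens per block" and the override rule concrete, here is a small sketch. The constant ROUTER_DEFAULT_BLOCK_SIZE, its value of 16, both function names, and the choice to treat a zero override as unset are assumptions made for illustration.

```rust
/// Placeholder for the router-level default; the real value comes from router config.
const ROUTER_DEFAULT_BLOCK_SIZE: usize = 16;

/// Per-worker value wins when present; a zero override is ignored (assumed rule).
fn effective_block_size(per_worker: Option<usize>) -> usize {
    per_worker
        .filter(|&size| size > 0)
        .unwrap_or(ROUTER_DEFAULT_BLOCK_SIZE)
}

/// Splits a token count into (full blocks, leftover tokens) for the effective block size.
fn split_into_blocks(token_count: usize, per_worker: Option<usize>) -> (usize, usize) {
    let block_size = effective_block_size(per_worker);
    (token_count / block_size, token_count % block_size)
}
```

With these assumptions, a 100-token prefix covers 3 full blocks of 32 tokens with 4 tokens left over, or 6 full blocks of 16 tokens with 4 left over when no override is set.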

§health: HealthCheckUpdate

Per-worker health check overrides. The update is partial: only fields set to Some override the router defaults.
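
The partial-override pattern can be sketched as follows. The fields of HealthCheckUpdate are not listed on this page, so the HealthDefaults and HealthOverride types and all of their field names are hypothetical stand-ins.

```rust
/// Hypothetical router-wide health check settings (field names are illustrative).
#[derive(Debug, Clone, PartialEq)]
struct HealthDefaults {
    timeout_secs: u64,
    check_interval_secs: u64,
    failure_threshold: u32,
}

/// Hypothetical per-worker update: `None` means "keep the router default".
#[derive(Debug, Clone, Default, PartialEq)]
struct HealthOverride {
    timeout_secs: Option<u64>,
    check_interval_secs: Option<u64>,
    failure_threshold: Option<u32>,
}

/// Applies the override field by field on top of the defaults.
fn resolve(defaults: &HealthDefaults, over: &HealthOverride) -> HealthDefaults {
    HealthDefaults {
        timeout_secs: over.timeout_secs.unwrap_or(defaults.timeout_secs),
        check_interval_secs: over
            .check_interval_secs
            .unwrap_or(defaults.check_interval_secs),
        failure_threshold: over
            .failure_threshold
            .unwrap_or(defaults.failure_threshold),
    }
}
```

An all-None override leaves the defaults untouched, which is what lets the same shape serve as both partial input and resolved output.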

§max_connection_attempts: u32

Maximum connection attempts during worker registration (default: 20).

§load_monitor_interval_secs: Option<u64>

Per-worker load monitor interval override, in seconds. When set, workers in the same group use this interval for load polling. When unset, the global load_monitor_interval_secs from the router config applies.

Implementations§

impl WorkerSpec

pub fn new(url: impl Into<String>) -> Self

Create a new WorkerSpec with the given URL and sensible defaults.
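
The constructor's shape can be illustrated with a stand-in type, since the crate path for the real WorkerSpec is not given here. MiniSpec below is hypothetical and carries only three of the 21 fields. The default of 20 for max_connection_attempts comes from the field documentation above, while api_key starting as None is an assumption.

```rust
/// Stand-in for illustration only; not the real `WorkerSpec`.
#[derive(Debug, Clone, PartialEq)]
struct MiniSpec {
    url: String,
    api_key: Option<String>,
    max_connection_attempts: u32,
}

impl MiniSpec {
    /// `impl Into<String>` lets callers pass either `&str` or `String`.
    fn new(url: impl Into<String>) -> Self {
        Self {
            url: url.into(),
            api_key: None,               // assumed default
            max_connection_attempts: 20, // documented default
        }
    }
}
```

Because every field is public, a caller can start from new and adjust individual fields with struct update syntax, for example `MiniSpec { api_key: Some(key), ..MiniSpec::new(url) }`.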

Trait Implementations§

impl Clone for WorkerSpec

fn clone(&self) -> WorkerSpec

Returns a duplicate of the value.

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.

impl Debug for WorkerSpec

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

impl<'de> Deserialize<'de> for WorkerSpec

fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>
where __D: Deserializer<'de>,

Deserialize this value from the given Serde deserializer.

impl JsonSchema for WorkerSpec

fn schema_name() -> String

The name of the generated JSON Schema.

fn schema_id() -> Cow<'static, str>

Returns a string that uniquely identifies the schema produced by this type.

fn json_schema(generator: &mut SchemaGenerator) -> Schema

Generates a JSON Schema for this type.

fn is_referenceable() -> bool

Whether JSON Schemas generated for this type should be re-used where possible using the $ref keyword.

impl Serialize for WorkerSpec

fn serialize<__S>(&self, __serializer: __S) -> Result<__S::Ok, __S::Error>
where __S: Serializer,

Serialize this value into the given Serde serializer.

Auto Trait Implementations§

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self.

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value.

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest.

impl<T> DynClone for T
where T: Clone,

fn __clone_box(&self, _: Private) -> *mut ()

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper.

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning.

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning.

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

fn vzip(self) -> V

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper.

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper.

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,