Struct google_ml1::api::GoogleCloudMlV1__AutoScaling
Options for automatically scaling a model.
This type is not used in any activity; it is only used as part of another schema.
Fields
max_nodes: Option<i32>
The maximum number of nodes this model can scale to under load. The actual value will depend on resource quota and availability.
metrics: Option<Vec<GoogleCloudMlV1__MetricSpec>>
The MetricSpec entries that specify how to calculate the desired node count.
min_nodes: Option<i32>
Optional. The minimum number of nodes to allocate for this model. These nodes are always up, starting from the time the model is deployed. Therefore, the cost of operating this model will be at least rate * min_nodes * number of hours since last billing cycle, where rate is the cost per node-hour as documented in the pricing guide, even if no predictions are performed. There is additional cost for each prediction performed.
Unlike manual scaling, if the load gets too heavy for the nodes that are up, the service will automatically add nodes to handle the increased load as well as scale back as traffic drops, always maintaining at least min_nodes. You will be charged for the time in which additional nodes are used.
If min_nodes is not specified and AutoScaling is used with a legacy (MLS1) machine type, min_nodes defaults to 0, in which case, when traffic to a model stops (and after a cool-down period), nodes will be shut down and no charges will be incurred until traffic to the model resumes. If min_nodes is not specified and AutoScaling is used with a Compute Engine (N1) machine type, min_nodes defaults to 1. min_nodes must be at least 1 for use with a Compute Engine machine type.
You can set min_nodes when creating the model version, and you can also update min_nodes for an existing version:
update_body.json: { "autoScaling": { "minNodes": 5 } }
HTTP request: PATCH https://ml.googleapis.com/v1/{name=projects/*/models/*/versions/*}?update_mask=autoScaling.minNodes -d @./update_body.json
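The defaulting rules and the cost floor described for min_nodes can be sketched in plain Rust. This is an illustrative sketch only, not part of google_ml1: the MachineFamily enum and the functions effective_min_nodes and minimum_cost are hypothetical names, and the service itself applies these rules server-side.

```rust
/// Hypothetical stand-in for the two machine type families named in the
/// `min_nodes` description. Not a type from google_ml1.
#[derive(Clone, Copy, Debug, PartialEq)]
pub enum MachineFamily {
    /// Legacy (MLS1) machine types.
    LegacyMls1,
    /// Compute Engine (N1) machine types.
    ComputeEngineN1,
}

/// Resolve the value the description says applies for `min_nodes`.
///
/// - Unset on MLS1 defaults to 0.
/// - Unset on N1 defaults to 1.
/// - An explicit value below 1 on N1 is rejected, since the description
///   requires at least 1 for Compute Engine machine types.
pub fn effective_min_nodes(
    min_nodes: Option<i32>,
    family: MachineFamily,
) -> Result<i32, String> {
    match (min_nodes, family) {
        (None, MachineFamily::LegacyMls1) => Ok(0),
        (None, MachineFamily::ComputeEngineN1) => Ok(1),
        (Some(n), MachineFamily::ComputeEngineN1) if n < 1 => Err(format!(
            "min_nodes must be at least 1 for Compute Engine machine types, got {}",
            n
        )),
        (Some(n), _) => Ok(n),
    }
}

/// Lower bound on operating cost: rate * min_nodes * hours, where `rate`
/// is the cost per node-hour. Per-prediction charges and charges for
/// additional autoscaled nodes come on top of this floor.
pub fn minimum_cost(rate_per_node_hour: f64, min_nodes: i32, hours: f64) -> f64 {
    rate_per_node_hour * f64::from(min_nodes) * hours
}
```

For example, with a made-up rate of 0.5 per node-hour, 2 minimum nodes and 10 hours, minimum_cost(0.5, 2, 10.0) gives a floor of 10.0, while a min_nodes of 0 on an MLS1 machine type gives a floor of 0.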
Trait Implementations
impl Clone for GoogleCloudMlV1__AutoScaling
fn clone(&self) -> GoogleCloudMlV1__AutoScaling
pub fn clone_from(&mut self, source: &Self)
impl Debug for GoogleCloudMlV1__AutoScaling
impl Default for GoogleCloudMlV1__AutoScaling
impl<'de> Deserialize<'de> for GoogleCloudMlV1__AutoScaling
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error> where
__D: Deserializer<'de>,
impl Part for GoogleCloudMlV1__AutoScaling
impl Serialize for GoogleCloudMlV1__AutoScaling
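Because the struct implements Default, Clone and Serialize, a typical pattern is to set only the fields of interest, leave the rest as None, and let serialization produce the request body. The sketch below imitates that pattern with a hand-written stand-in so that it compiles without the crate or serde. AutoScalingSketch and update_body are hypothetical names, the metrics field is omitted, and the JSON keys are taken from the update_body.json example in the min_nodes description rather than from the crate's actual Serialize implementation.

```rust
/// Hypothetical stand-in mirroring two fields of
/// GoogleCloudMlV1__AutoScaling. Not the real type.
#[derive(Clone, Debug, Default, PartialEq)]
pub struct AutoScalingSketch {
    pub max_nodes: Option<i32>,
    pub min_nodes: Option<i32>,
}

/// Build a JSON update body that carries only `minNodes`, matching the
/// `autoScaling.minNodes` update mask from the min_nodes description.
/// Returns None when `min_nodes` is unset, since there is nothing to update.
pub fn update_body(scaling: &AutoScalingSketch) -> Option<String> {
    scaling
        .min_nodes
        // Doubled braces are literal braces in a format string.
        .map(|n| format!("{{\"autoScaling\":{{\"minNodes\":{}}}}}", n))
}
```

With AutoScalingSketch { min_nodes: Some(5), ..Default::default() }, update_body returns Some of the string {"autoScaling":{"minNodes":5}}. With the real type, the same struct-update syntax should apply, and a serde serializer would take the place of the hand-written formatting.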
Auto Trait Implementations
impl RefUnwindSafe for GoogleCloudMlV1__AutoScaling
impl Send for GoogleCloudMlV1__AutoScaling
impl Sync for GoogleCloudMlV1__AutoScaling
impl Unpin for GoogleCloudMlV1__AutoScaling
impl UnwindSafe for GoogleCloudMlV1__AutoScaling
Blanket Implementations
impl<T> Any for T where
T: 'static + ?Sized,
impl<T> Borrow<T> for T where
T: ?Sized,
impl<T> BorrowMut<T> for T where
T: ?Sized,
pub fn borrow_mut(&mut self) -> &mut T
impl<T> DeserializeOwned for T where
T: for<'de> Deserialize<'de>,
impl<T> From<T> for T
impl<T> Instrument for T
pub fn instrument(self, span: Span) -> Instrumented<Self>
pub fn in_current_span(self) -> Instrumented<Self>
impl<T, U> Into<U> for T where
U: From<T>,
impl<T> ToOwned for T where
T: Clone,
type Owned = T
The resulting type after obtaining ownership.
pub fn to_owned(&self) -> T
pub fn clone_into(&self, target: &mut T)
impl<T, U> TryFrom<U> for T where
U: Into<T>,
type Error = Infallible
The type returned in the event of a conversion error.
pub fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>
impl<T, U> TryInto<U> for T where
U: TryFrom<T>,