Struct aws_sdk_sagemaker::model::StoppingCondition [−][src]
#[non_exhaustive]pub struct StoppingCondition {
pub max_runtime_in_seconds: i32,
pub max_wait_time_in_seconds: Option<i32>,
}
Expand description
Specifies a limit to how long a model training job or model compilation job can run. It also specifies how long a managed spot training job has to complete. When the job reaches the time limit, Amazon SageMaker ends the training or compilation job. Use this API to cap model training costs.
To stop a training job, Amazon SageMaker sends the algorithm the SIGTERM
signal, which delays
job termination for 120 seconds. Algorithms can use this 120-second window to save the
model artifacts, so the results of training are not lost.
The training algorithms provided by Amazon SageMaker automatically save the intermediate results
of a model training job when possible. This attempt to save artifacts is only a best
effort case as model might not be in a state from which it can be saved. For example, if
training has just started, the model might not be ready to save. When saved, this
intermediate data is a valid model artifact. You can use it to create a model with
CreateModel
.
The Neural Topic Model (NTM) currently does not support saving intermediate model artifacts. When training NTMs, make sure that the maximum runtime is sufficient for the training job to complete.
Fields (Non-exhaustive)
This struct is marked as non-exhaustive
Struct { .. }
syntax; cannot be matched against without a wildcard ..
; and struct update syntax will not work.max_runtime_in_seconds: i32
The maximum length of time, in seconds, that a training or compilation job can run.
For compilation jobs, if the job does not complete during this time, you will
receive a TimeOut
error. We recommend starting with 900 seconds and increase as
necessary based on your model.
For all other jobs, if the job does not complete during this time, Amazon SageMaker ends the job. When
RetryStrategy
is specified in the job request,
MaxRuntimeInSeconds
specifies the maximum time for all of the attempts
in total, not each individual attempt. The default value is 1 day. The maximum value is 28 days.
max_wait_time_in_seconds: Option<i32>
The maximum length of time, in seconds, that a managed Spot training job has to
complete. It is the amount of time spent waiting for Spot capacity plus the amount of
time the job can run. It must be equal to or greater than
MaxRuntimeInSeconds
. If the job does not complete during this time,
Amazon SageMaker ends the job.
When RetryStrategy
is specified in the job request,
MaxWaitTimeInSeconds
specifies the maximum time for all of the attempts
in total, not each individual attempt.
Implementations
The maximum length of time, in seconds, that a training or compilation job can run.
For compilation jobs, if the job does not complete during this time, you will
receive a TimeOut
error. We recommend starting with 900 seconds and increase as
necessary based on your model.
For all other jobs, if the job does not complete during this time, Amazon SageMaker ends the job. When
RetryStrategy
is specified in the job request,
MaxRuntimeInSeconds
specifies the maximum time for all of the attempts
in total, not each individual attempt. The default value is 1 day. The maximum value is 28 days.
The maximum length of time, in seconds, that a managed Spot training job has to
complete. It is the amount of time spent waiting for Spot capacity plus the amount of
time the job can run. It must be equal to or greater than
MaxRuntimeInSeconds
. If the job does not complete during this time,
Amazon SageMaker ends the job.
When RetryStrategy
is specified in the job request,
MaxWaitTimeInSeconds
specifies the maximum time for all of the attempts
in total, not each individual attempt.
Creates a new builder-style object to manufacture StoppingCondition
Trait Implementations
This method tests for self
and other
values to be equal, and is used
by ==
. Read more
This method tests for !=
.
Auto Trait Implementations
impl RefUnwindSafe for StoppingCondition
impl Send for StoppingCondition
impl Sync for StoppingCondition
impl Unpin for StoppingCondition
impl UnwindSafe for StoppingCondition
Blanket Implementations
Mutably borrows from an owned value. Read more
Attaches the provided Subscriber
to this type, returning a
WithDispatch
wrapper. Read more
Attaches the current default Subscriber
to this type, returning a
WithDispatch
wrapper. Read more