pub struct FineTuneReinforcementHyperparameters {
pub batch_size: BatchSize,
pub learning_rate_multiplier: LearningRateMultiplier,
pub n_epochs: NEpochs,
pub reasoning_effort: FineTuneReasoningEffort,
pub compute_multiplier: ComputeMultiplier,
pub eval_interval: EvalInterval,
pub eval_samples: EvalSamples,
}Fields§
§batch_size: BatchSizeNumber of examples in each batch. A larger batch size means that model parameters are updated less frequently, but with lower variance.
learning_rate_multiplier: LearningRateMultiplierScaling factor for the learning rate. A smaller learning rate may be useful to avoid overfitting.
n_epochs: NEpochsThe number of epochs to train the model for. An epoch refers to one full cycle through the training dataset.
reasoning_effort: FineTuneReasoningEffortLevel of reasoning effort.
compute_multiplier: ComputeMultiplierMultiplier on amount of compute used for exploring search space during training.
eval_interval: EvalIntervalThe number of training steps between evaluation runs.
eval_samples: EvalSamplesNumber of evaluation samples to generate per training step.
Trait Implementations§
Source§impl Clone for FineTuneReinforcementHyperparameters
impl Clone for FineTuneReinforcementHyperparameters
Source§fn clone(&self) -> FineTuneReinforcementHyperparameters
fn clone(&self) -> FineTuneReinforcementHyperparameters
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreSource§impl<'de> Deserialize<'de> for FineTuneReinforcementHyperparameters
impl<'de> Deserialize<'de> for FineTuneReinforcementHyperparameters
Source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
Source§impl PartialEq for FineTuneReinforcementHyperparameters
impl PartialEq for FineTuneReinforcementHyperparameters
Source§fn eq(&self, other: &FineTuneReinforcementHyperparameters) -> bool
fn eq(&self, other: &FineTuneReinforcementHyperparameters) -> bool
Tests for
self and other values to be equal, and is used by ==.impl StructuralPartialEq for FineTuneReinforcementHyperparameters
Auto Trait Implementations§
impl Freeze for FineTuneReinforcementHyperparameters
impl RefUnwindSafe for FineTuneReinforcementHyperparameters
impl Send for FineTuneReinforcementHyperparameters
impl Sync for FineTuneReinforcementHyperparameters
impl Unpin for FineTuneReinforcementHyperparameters
impl UnwindSafe for FineTuneReinforcementHyperparameters
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more