pub struct FineTuneDpoMethodHyperparameters {
pub beta: Option<FineTuneDpoMethodHyperparametersBeta>,
pub batch_size: Option<FineTuneDpoMethodHyperparametersBatchSize>,
pub learning_rate_multiplier: Option<FineTuneDpoMethodHyperparametersLearningRateMultiplier>,
pub n_epochs: Option<FineTuneDpoMethodHyperparametersNEpochs>,
}
Expand description
The hyperparameters used for the fine-tuning job.
Fields§
§beta: Option<FineTuneDpoMethodHyperparametersBeta>
The beta value for the DPO method. A higher beta value will increase the weight of the penalty between the policy and reference model.
batch_size: Option<FineTuneDpoMethodHyperparametersBatchSize>
Number of examples in each batch. A larger batch size means that model parameters are updated less frequently, but with lower variance.
learning_rate_multiplier: Option<FineTuneDpoMethodHyperparametersLearningRateMultiplier>
Scaling factor for the learning rate. A smaller learning rate may be useful to avoid overfitting.
n_epochs: Option<FineTuneDpoMethodHyperparametersNEpochs>
The number of epochs to train the model for. An epoch refers to one full cycle through the training dataset.
Implementations§
Source§impl FineTuneDpoMethodHyperparameters
impl FineTuneDpoMethodHyperparameters
Sourcepub fn builder() -> FineTuneDpoMethodHyperparametersBuilder<((), (), (), ())>
pub fn builder() -> FineTuneDpoMethodHyperparametersBuilder<((), (), (), ())>
Create a builder for building FineTuneDpoMethodHyperparameters
.
On the builder, call .beta(...)
(optional), .batch_size(...)
(optional), .learning_rate_multiplier(...)
(optional), .n_epochs(...)
(optional) to set the values of the fields.
Finally, call .build()
to create the instance of FineTuneDpoMethodHyperparameters
.
Trait Implementations§
Source§impl Clone for FineTuneDpoMethodHyperparameters
impl Clone for FineTuneDpoMethodHyperparameters
Source§fn clone(&self) -> FineTuneDpoMethodHyperparameters
fn clone(&self) -> FineTuneDpoMethodHyperparameters
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
source
. Read moreSource§impl Default for FineTuneDpoMethodHyperparameters
impl Default for FineTuneDpoMethodHyperparameters
Source§fn default() -> FineTuneDpoMethodHyperparameters
fn default() -> FineTuneDpoMethodHyperparameters
Source§impl<'de> Deserialize<'de> for FineTuneDpoMethodHyperparameters
impl<'de> Deserialize<'de> for FineTuneDpoMethodHyperparameters
Source§fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>where
D: Deserializer<'de>,
fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>where
D: Deserializer<'de>,
Source§impl PartialEq for FineTuneDpoMethodHyperparameters
impl PartialEq for FineTuneDpoMethodHyperparameters
Source§fn eq(&self, other: &FineTuneDpoMethodHyperparameters) -> bool
fn eq(&self, other: &FineTuneDpoMethodHyperparameters) -> bool
self
and other
values to be equal, and is used by ==
.