pub struct FineTuneDPOHyperparameters {
pub beta: Beta,
pub batch_size: BatchSize,
pub learning_rate_multiplier: LearningRateMultiplier,
pub n_epochs: NEpochs,
}Fields§
§beta: BetaThe beta value for the DPO method. A higher beta value will increase the weight of the penalty between the policy and reference model.
batch_size: BatchSizeNumber of examples in each batch. A larger batch size means that model parameters are updated less frequently, but with lower variance.
learning_rate_multiplier: LearningRateMultiplierScaling factor for the learning rate. A smaller learning rate may be useful to avoid overfitting.
n_epochs: NEpochsThe number of epochs to train the model for. An epoch refers to one full cycle through the training dataset.
Trait Implementations§
Source§impl Clone for FineTuneDPOHyperparameters
impl Clone for FineTuneDPOHyperparameters
Source§fn clone(&self) -> FineTuneDPOHyperparameters
fn clone(&self) -> FineTuneDPOHyperparameters
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreSource§impl Debug for FineTuneDPOHyperparameters
impl Debug for FineTuneDPOHyperparameters
Source§impl Default for FineTuneDPOHyperparameters
impl Default for FineTuneDPOHyperparameters
Source§fn default() -> FineTuneDPOHyperparameters
fn default() -> FineTuneDPOHyperparameters
Returns the “default value” for a type. Read more
Source§impl<'de> Deserialize<'de> for FineTuneDPOHyperparameters
impl<'de> Deserialize<'de> for FineTuneDPOHyperparameters
Source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
impl StructuralPartialEq for FineTuneDPOHyperparameters
Auto Trait Implementations§
impl Freeze for FineTuneDPOHyperparameters
impl RefUnwindSafe for FineTuneDPOHyperparameters
impl Send for FineTuneDPOHyperparameters
impl Sync for FineTuneDPOHyperparameters
impl Unpin for FineTuneDPOHyperparameters
impl UnwindSafe for FineTuneDPOHyperparameters
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more