pub struct DPOHyperparameters {
pub beta: Beta,
pub batch_size: BatchSize,
pub learning_rate_multiplier: LearningRateMultiplier,
pub n_epochs: NEpochs,
}
Fields§
§beta: Beta
The beta value for the DPO method. A higher beta value will increase the weight of the penalty between the policy and reference model.
batch_size: BatchSize
Number of examples in each batch. A larger batch size means that model parameters are updated less frequently, but with lower variance.
learning_rate_multiplier: LearningRateMultiplier
Scaling factor for the learning rate. A smaller learning rate may be useful to avoid overfitting.
n_epochs: NEpochs
The number of epochs to train the model for. An epoch refers to one full cycle through the training dataset.
Trait Implementations§
Source§impl Clone for DPOHyperparameters
impl Clone for DPOHyperparameters
Source§fn clone(&self) -> DPOHyperparameters
fn clone(&self) -> DPOHyperparameters
Returns a copy of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source
. Read moreSource§impl Debug for DPOHyperparameters
impl Debug for DPOHyperparameters
Source§impl Default for DPOHyperparameters
impl Default for DPOHyperparameters
Source§fn default() -> DPOHyperparameters
fn default() -> DPOHyperparameters
Returns the “default value” for a type. Read more
Source§impl<'de> Deserialize<'de> for DPOHyperparameters
impl<'de> Deserialize<'de> for DPOHyperparameters
Source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
Source§impl PartialEq for DPOHyperparameters
impl PartialEq for DPOHyperparameters
Source§impl Serialize for DPOHyperparameters
impl Serialize for DPOHyperparameters
impl StructuralPartialEq for DPOHyperparameters
Auto Trait Implementations§
impl Freeze for DPOHyperparameters
impl RefUnwindSafe for DPOHyperparameters
impl Send for DPOHyperparameters
impl Sync for DPOHyperparameters
impl Unpin for DPOHyperparameters
impl UnwindSafe for DPOHyperparameters
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more