openai_openapi_types

Struct FineTuneReinforcementHyperparameters

pub struct FineTuneReinforcementHyperparameters {
    pub batch_size: Option<FineTuneReinforcementHyperparametersBatchSize>,
    pub learning_rate_multiplier: Option<FineTuneReinforcementHyperparametersLearningRateMultiplier>,
    pub n_epochs: Option<FineTuneReinforcementHyperparametersNEpochs>,
    pub reasoning_effort: Option<FineTuneReinforcementHyperparametersReasoningEffort>,
    pub compute_multiplier: Option<FineTuneReinforcementHyperparametersComputeMultiplier>,
    pub eval_interval: Option<FineTuneReinforcementHyperparametersEvalInterval>,
    pub eval_samples: Option<FineTuneReinforcementHyperparametersEvalSamples>,
}

The hyperparameters used for the reinforcement fine-tuning job.

Fields

batch_size: Option<FineTuneReinforcementHyperparametersBatchSize>

Number of examples in each batch. A larger batch size means that model parameters are updated less frequently, but with lower variance.

learning_rate_multiplier: Option<FineTuneReinforcementHyperparametersLearningRateMultiplier>

Scaling factor for the learning rate. A smaller learning rate may be useful to avoid overfitting.

n_epochs: Option<FineTuneReinforcementHyperparametersNEpochs>

The number of epochs to train the model for. An epoch refers to one full cycle through the training dataset.
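The relationship between batch size, epochs, and update frequency can be sketched with a little arithmetic. This is a minimal illustration that assumes one parameter update per batch and a final partial batch that still counts as an update; the functions `updates_per_epoch` and `total_updates` are hypothetical helpers, not part of this crate, and the service's actual training schedule may differ.

```rust
/// Number of parameter updates in one epoch, assuming one update per batch
/// and that a trailing partial batch still triggers an update.
/// (Hypothetical helper, not part of openai_openapi_types.)
pub fn updates_per_epoch(n_examples: u64, batch_size: u64) -> u64 {
    assert!(batch_size > 0, "batch_size must be positive");
    // Ceiling division: round up so the partial batch is counted.
    (n_examples + batch_size - 1) / batch_size
}

/// Total updates over the whole run: updates per epoch times number of epochs.
pub fn total_updates(n_examples: u64, batch_size: u64, n_epochs: u64) -> u64 {
    updates_per_epoch(n_examples, batch_size) * n_epochs
}
```

Under these assumptions, 1000 examples with a batch size of 8 give 125 updates per epoch, while a batch size of 32 gives only 32, which is the sense in which a larger batch updates the parameters less often.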

reasoning_effort: Option<FineTuneReinforcementHyperparametersReasoningEffort>

Level of reasoning effort.

compute_multiplier: Option<FineTuneReinforcementHyperparametersComputeMultiplier>

Multiplier on the amount of compute used for exploring the search space during training.

eval_interval: Option<FineTuneReinforcementHyperparametersEvalInterval>

The number of training steps between evaluation runs.

eval_samples: Option<FineTuneReinforcementHyperparametersEvalSamples>

Number of evaluation samples to generate per training step.

Implementations

impl FineTuneReinforcementHyperparameters

pub fn builder() -> FineTuneReinforcementHyperparametersBuilder<((), (), (), (), (), (), ())>

Creates a builder for FineTuneReinforcementHyperparameters. Every setter on the builder is optional: call any of .batch_size(...), .learning_rate_multiplier(...), .n_epochs(...), .reasoning_effort(...), .compute_multiplier(...), .eval_interval(...), and .eval_samples(...) to set the corresponding fields. Finally, call .build() to create the FineTuneReinforcementHyperparameters instance.
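The tuple of unit types in the builder's signature suggests a typestate builder, where the type parameter records which fields have been set. The following is a simplified sketch of that pattern, not the generated code: `SketchHyperparameters`, `SketchBuilder`, and `Slot` are hypothetical stand-ins, only two fields are modelled, and plain `u32` values replace the crate's wrapper types (whose shape and setter argument types are not shown on this page).

```rust
// Hypothetical two-field stand-in for the real struct.
#[derive(Debug, Default, Clone, Copy, PartialEq)]
pub struct SketchHyperparameters {
    pub batch_size: Option<u32>,
    pub n_epochs: Option<u32>,
}

// The type parameter tracks each field's state:
// `()` means "not set yet", `(Option<u32>,)` means "set".
pub struct SketchBuilder<Fields> {
    fields: Fields,
}

impl SketchHyperparameters {
    // Every slot starts out unset, mirroring the `((), (), ...)` in the real signature.
    pub fn builder() -> SketchBuilder<((), ())> {
        SketchBuilder { fields: ((), ()) }
    }
}

// `batch_size` is only callable while its slot is still `()`,
// so setting the same field twice is a compile error.
impl<N> SketchBuilder<((), N)> {
    pub fn batch_size(self, value: u32) -> SketchBuilder<((Option<u32>,), N)> {
        let ((), n) = self.fields;
        SketchBuilder { fields: ((Some(value),), n) }
    }
}

impl<B> SketchBuilder<(B, ())> {
    pub fn n_epochs(self, value: u32) -> SketchBuilder<(B, (Option<u32>,))> {
        let (b, ()) = self.fields;
        SketchBuilder { fields: (b, (Some(value),)) }
    }
}

// An optional slot resolves to its value if set, or to `None` if left as `()`.
pub trait Slot {
    fn into_value(self) -> Option<u32>;
}
impl Slot for () {
    fn into_value(self) -> Option<u32> {
        None
    }
}
impl Slot for (Option<u32>,) {
    fn into_value(self) -> Option<u32> {
        self.0
    }
}

// Because every field is optional, `build` is available in any state.
impl<B: Slot, N: Slot> SketchBuilder<(B, N)> {
    pub fn build(self) -> SketchHyperparameters {
        let (b, n) = self.fields;
        SketchHyperparameters {
            batch_size: b.into_value(),
            n_epochs: n.into_value(),
        }
    }
}
```

With this sketch, `SketchHyperparameters::builder().batch_size(8).build()` yields a value whose `n_epochs` is `None`, and setters can be chained in any order.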

Trait Implementations

impl Clone for FineTuneReinforcementHyperparameters

fn clone(&self) -> FineTuneReinforcementHyperparameters

Returns a duplicate of the value.

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.

impl Debug for FineTuneReinforcementHyperparameters

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

impl Default for FineTuneReinforcementHyperparameters

fn default() -> FineTuneReinforcementHyperparameters

Returns the “default value” for a type.
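Since every field is an Option and the struct implements Default, Clone, Copy, and PartialEq, a value can be built by overriding a few fields on top of the default. The sketch below assumes that Default leaves every field as None (as a derived implementation would; this page does not state it) and uses a hypothetical two-field stand-in, `DemoHyperparameters`, with plain `u32` values in place of the crate's wrapper types.

```rust
// Hypothetical stand-in; the derives mirror the traits listed on this page.
#[derive(Debug, Default, Clone, Copy, PartialEq)]
pub struct DemoHyperparameters {
    pub n_epochs: Option<u32>,
    pub eval_interval: Option<u32>,
}

/// Starts from the all-`None` default and overrides only `n_epochs`
/// using struct update syntax.
pub fn with_epochs(n_epochs: u32) -> DemoHyperparameters {
    DemoHyperparameters {
        n_epochs: Some(n_epochs),
        ..Default::default()
    }
}
```

Because the type is Copy, a value produced this way can be passed around and compared with == without explicit clones.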

impl<'de> Deserialize<'de> for FineTuneReinforcementHyperparameters

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>
where D: Deserializer<'de>,

Deserialize this value from the given Serde deserializer.

impl PartialEq for FineTuneReinforcementHyperparameters

fn eq(&self, other: &FineTuneReinforcementHyperparameters) -> bool

Tests for self and other values to be equal, and is used by ==.

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.

impl Serialize for FineTuneReinforcementHyperparameters

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>
where S: Serializer,

Serialize this value into the given Serde serializer.

impl Copy for FineTuneReinforcementHyperparameters

impl StructuralPartialEq for FineTuneReinforcementHyperparameters

Auto Trait Implementations

impl Freeze for FineTuneReinforcementHyperparameters

impl RefUnwindSafe for FineTuneReinforcementHyperparameters

impl Send for FineTuneReinforcementHyperparameters

impl Sync for FineTuneReinforcementHyperparameters

impl Unpin for FineTuneReinforcementHyperparameters

impl UnwindSafe for FineTuneReinforcementHyperparameters

Blanket Implementations

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self.

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value.

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)

Performs copy-assignment from self to dest.

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning.

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning.

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,