Struct rust_bert::gpt2::Gpt2Config [−][src]
pub struct Gpt2Config {Show 17 fields
pub attn_pdrop: Option<f64>,
pub embd_pdrop: Option<f64>,
pub hidden_dropout_prob: Option<f64>,
pub afn: Option<Activation>,
pub initializer_range: f64,
pub layer_norm_epsilon: f64,
pub n_ctx: i64,
pub n_embd: i64,
pub n_head: i64,
pub n_layer: i64,
pub n_positions: i64,
pub num_labels: Option<i64>,
pub output_past: Option<bool>,
pub output_attentions: Option<bool>,
pub output_hidden_states: Option<bool>,
pub resid_pdrop: Option<f64>,
pub vocab_size: i64,
}
Expand description
GPT2 model configuration
Defines the GPT2 model architecture (e.g. number of layers, hidden layer size, vocab size…). Shared between GPT and GPT2 models
Fields
attn_pdrop: Option<f64>
embd_pdrop: Option<f64>
afn: Option<Activation>
initializer_range: f64
layer_norm_epsilon: f64
n_ctx: i64
n_embd: i64
n_head: i64
n_layer: i64
n_positions: i64
num_labels: Option<i64>
output_past: Option<bool>
output_attentions: Option<bool>
resid_pdrop: Option<f64>
vocab_size: i64
Trait Implementations
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error> where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error> where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
Auto Trait Implementations
impl RefUnwindSafe for Gpt2Config
impl Send for Gpt2Config
impl Sync for Gpt2Config
impl Unpin for Gpt2Config
impl UnwindSafe for Gpt2Config
Blanket Implementations
Mutably borrows from an owned value. Read more
Instruments this type with the provided Span
, returning an
Instrumented
wrapper. Read more
type Output = T
type Output = T
Should always be Self