Struct TransformerDecoderLayer

pub struct TransformerDecoderLayer { /* private fields */ }

A single Transformer decoder layer.

Consists of three sublayers, applied in order:

1. Masked multi-head self-attention (causal)
2. Multi-head cross-attention over encoder output
3. Position-wise feedforward network

Each sublayer is wrapped in a residual connection and layer normalization.
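The data flow described above can be sketched in plain Rust. This is an illustration only, not this crate's implementation: it works on a single token vector, uses a layer norm without learned scale or shift, and takes the three sublayers as caller-supplied closures. The names `layer_norm`, `add_and_norm` and `decoder_layer_sketch` are hypothetical, and post-norm ordering is assumed.

```rust
/// Normalizes a vector to zero mean and unit variance (no learned scale/shift).
fn layer_norm(x: &[f32]) -> Vec<f32> {
    let n = x.len() as f32;
    let mean = x.iter().sum::<f32>() / n;
    let var = x.iter().map(|v| (v - mean).powi(2)).sum::<f32>() / n;
    let denom = (var + 1e-5).sqrt();
    x.iter().map(|v| (v - mean) / denom).collect()
}

/// Post-norm residual step: LayerNorm(x + sublayer_out).
fn add_and_norm(x: &[f32], sublayer_out: &[f32]) -> Vec<f32> {
    let summed: Vec<f32> = x.iter().zip(sublayer_out).map(|(a, b)| a + b).collect();
    layer_norm(&summed)
}

/// Chains the three sublayers for one token vector.
/// The closures are stand-ins for real attention and feedforward blocks.
fn decoder_layer_sketch(
    tgt: &[f32],
    memory: &[f32],
    self_attn: impl Fn(&[f32]) -> Vec<f32>,
    cross_attn: impl Fn(&[f32], &[f32]) -> Vec<f32>,
    feed_forward: impl Fn(&[f32]) -> Vec<f32>,
) -> Vec<f32> {
    // 1. Masked self-attention over the target.
    let x = add_and_norm(tgt, &self_attn(tgt));
    // 2. Cross-attention: queries from the target, keys/values from memory.
    let x = add_and_norm(&x, &cross_attn(&x, memory));
    // 3. Position-wise feedforward network.
    add_and_norm(&x, &feed_forward(&x))
}
```

Only the cross-attention step reads `memory`; the other two sublayers see the target stream alone.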

Shape

Target: (N, T, E)
Memory: (N, S, E)
Output: (N, T, E)
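The shape contract can be written down as a small check. The sketch below assumes the usual reading of the letters (N is batch size, T is target length, S is source length, E is the embedding dimension); the page does not define them. `Shape3` and `decoder_output_shape` are hypothetical names, not part of this crate.

```rust
/// Shape of a rank-3 tensor as (batch, sequence length, embedding dim).
type Shape3 = (usize, usize, usize);

/// Checks that target and memory are compatible and returns the output shape.
/// The output always has the target's shape; the memory length S may differ from T.
fn decoder_output_shape(tgt: Shape3, memory: Shape3) -> Result<Shape3, String> {
    let (n_t, t, e_t) = tgt;
    let (n_m, _s, e_m) = memory;
    if n_t != n_m {
        return Err(format!("batch size mismatch: {n_t} vs {n_m}"));
    }
    if e_t != e_m {
        return Err(format!("embedding dim mismatch: {e_t} vs {e_m}"));
    }
    Ok((n_t, t, e_t))
}
```

For example, a target of shape (2, 5, 16) with memory of shape (2, 9, 16) yields an output of shape (2, 5, 16).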

Implementations

impl TransformerDecoderLayer

pub fn new(d_model: usize, nhead: usize, dim_feedforward: usize) -> Self

Creates a new TransformerDecoderLayer using post-norm ordering (the default).

Arguments

d_model - Embedding dimension
nhead - Number of attention heads
dim_feedforward - Hidden dimension of feedforward network
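In the standard multi-head attention formulation, the embedding dimension is split evenly across heads, so each head works on d_model / nhead features. Whether this constructor enforces divisibility is not stated here, so the check below is an assumption. `head_dim` is a hypothetical helper.

```rust
/// Returns the per-head feature size, or None if d_model cannot be split evenly.
fn head_dim(d_model: usize, nhead: usize) -> Option<usize> {
    if nhead == 0 || d_model % nhead != 0 {
        None
    } else {
        Some(d_model / nhead)
    }
}
```

With d_model = 512 and nhead = 8, each head would see 64 features; with nhead = 7 the split is uneven and the helper returns None.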

pub fn new_with_pre_norm(d_model: usize, nhead: usize, dim_feedforward: usize, pre_norm: bool) -> Self

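Creates a TransformerDecoderLayer with configurable norm ordering. Pass pre_norm = true for pre-norm ordering; pre_norm = false gives the post-norm ordering used by new.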
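The two orderings differ only in where layer normalization sits relative to the residual addition. The sketch below uses the common definitions (pre-norm: x + Sublayer(LayerNorm(x)); post-norm: LayerNorm(x + Sublayer(x))) and assumes this crate follows them. `layer_norm` and `residual_sublayer` are hypothetical names, and the norm has no learned parameters.

```rust
/// Normalizes a vector to zero mean and unit variance (no learned scale/shift).
fn layer_norm(x: &[f32]) -> Vec<f32> {
    let n = x.len() as f32;
    let mean = x.iter().sum::<f32>() / n;
    let var = x.iter().map(|v| (v - mean).powi(2)).sum::<f32>() / n;
    let denom = (var + 1e-5).sqrt();
    x.iter().map(|v| (v - mean) / denom).collect()
}

/// Applies one residual sublayer in either ordering.
fn residual_sublayer(
    x: &[f32],
    sublayer: impl Fn(&[f32]) -> Vec<f32>,
    pre_norm: bool,
) -> Vec<f32> {
    if pre_norm {
        // Pre-norm: normalize the input to the sublayer, leave the residual path untouched.
        let out = sublayer(&layer_norm(x));
        x.iter().zip(&out).map(|(a, b)| a + b).collect()
    } else {
        // Post-norm: add the residual first, then normalize the sum.
        let out = sublayer(x);
        let summed: Vec<f32> = x.iter().zip(&out).map(|(a, b)| a + b).collect();
        layer_norm(&summed)
    }
}
```

With a sublayer that outputs zeros, the pre-norm branch returns its input unchanged, while the post-norm branch returns the normalized input. That identity path through the residual is the practical difference between the two.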

pub fn forward_with_memory(&self, tgt: &Variable, memory: &Variable, tgt_mask: Option<&Variable>, memory_mask: Option<&Variable>) -> Variable

Forward pass with encoder memory and optional masks.

Arguments

tgt - Target sequence (N, T, E)
memory - Encoder output (N, S, E)
tgt_mask - Optional causal mask for self-attention
memory_mask - Optional mask for cross-attention
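A causal mask restricts target position i to attending only to positions j <= i. The sketch below builds such a T x T mask as plain nested vectors, in both boolean and additive form. Which convention tgt_mask expects, and how to wrap the values in a Variable, is not stated on this page, so treat both as assumptions. `causal_mask` and `additive_mask` are hypothetical helpers.

```rust
/// Builds a T x T causal mask: entry [i][j] is true when position i may attend to position j.
fn causal_mask(t: usize) -> Vec<Vec<bool>> {
    (0..t).map(|i| (0..t).map(|j| j <= i).collect()).collect()
}

/// Converts the boolean mask to an additive one: 0.0 where allowed, -inf where blocked.
/// Adding this to attention scores before softmax zeroes out the blocked positions.
fn additive_mask(mask: &[Vec<bool>]) -> Vec<Vec<f32>> {
    mask.iter()
        .map(|row| {
            row.iter()
                .map(|&ok| if ok { 0.0 } else { f32::NEG_INFINITY })
                .collect()
        })
        .collect()
}
```

For T = 3 the boolean mask is lower-triangular: the first row allows only position 0, and the last row allows all three positions.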

pub fn d_model(&self) -> usize

Returns the model dimension.

Trait Implementations

impl Module for TransformerDecoderLayer

fn forward(&self, input: &Variable) -> Variable

Performs the forward pass.

fn parameters(&self) -> Vec<Parameter>

Returns all parameters of this module.

fn named_parameters(&self) -> HashMap<String, Parameter>

Returns named parameters of this module.

fn name(&self) -> &'static str

Returns the module name for debugging.

fn num_parameters(&self) -> usize

Returns the number of trainable parameters.

fn train(&mut self)

Sets the module to training mode.

fn eval(&mut self)

Sets the module to evaluation mode.

fn set_training(&mut self, _training: bool)

Sets the training mode.

fn is_training(&self) -> bool

Returns whether the module is in training mode.

fn zero_grad(&self)

Zeros all gradients of parameters.

fn to_device(&self, device: Device)

Moves all parameters to the specified device.

Auto Trait Implementations

impl !UnwindSafe for TransformerDecoderLayer

Blanket Implementations

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self.

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value.

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> IntoEither for T

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.
