pub struct TransformerDecoderLayer<F: Float + Debug + Send + Sync> { /* private fields */ }
Transformer decoder layer
Implements a single layer of the transformer decoder as described in “Attention Is All You Need” by Vaswani et al. It consists of masked multi-head self-attention, multi-head cross-attention over encoder output, and a position-wise feed-forward network, with residual connections and layer normalization.
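The residual-plus-normalization pattern and the masking described above can be sketched in plain Rust. This is an illustration only, not this crate's implementation: the helper names `layer_norm`, `residual_block`, and `causal_mask` are hypothetical, the post-norm ordering `LayerNorm(x + Sublayer(x))` is taken from the original paper and may differ from what this layer does internally, and the learned scale and shift of layer normalization are omitted.

```rust
/// Normalizes one feature vector to zero mean and unit variance.
/// `epsilon` keeps the division stable when the variance is close to zero.
/// (Hypothetical helper; learned scale and shift are omitted.)
fn layer_norm(x: &[f64], epsilon: f64) -> Vec<f64> {
    let n = x.len() as f64;
    let mean = x.iter().sum::<f64>() / n;
    let var = x.iter().map(|v| (v - mean).powi(2)).sum::<f64>() / n;
    let denom = (var + epsilon).sqrt();
    x.iter().map(|v| (v - mean) / denom).collect()
}

/// Applies one residual sublayer in post-norm order: LayerNorm(x + sublayer(x)).
/// The sublayer stands in for self-attention, cross-attention, or the
/// feed-forward network. (Assumed ordering, see the note above.)
fn residual_block<S>(x: &[f64], sublayer: S, epsilon: f64) -> Vec<f64>
where
    S: Fn(&[f64]) -> Vec<f64>,
{
    let y = sublayer(x);
    let summed: Vec<f64> = x.iter().zip(y.iter()).map(|(a, b)| a + b).collect();
    layer_norm(&summed, epsilon)
}

/// Builds a causal mask: position `i` may attend to position `j` only if `j <= i`.
fn causal_mask(seq_len: usize) -> Vec<Vec<bool>> {
    (0..seq_len)
        .map(|i| (0..seq_len).map(|j| j <= i).collect())
        .collect()
}
```

A decoder layer chains three such blocks, one per sublayer, and the causal mask is what makes the first attention sublayer "masked".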
Implementations§
impl<F: Float + Debug + ScalarOperand + Send + Sync + 'static> TransformerDecoderLayer<F>
pub fn new<R: Rng>(
    d_model: usize,
    n_heads: usize,
    d_ff: usize,
    dropout: f64,
    epsilon: f64,
    rng: &mut R,
) -> Result<Self>
Create a new transformer decoder layer
§Arguments
- d_model: Model embedding dimension
- n_heads: Number of attention heads
- d_ff: Feed-forward network hidden dimension
- dropout: Dropout rate (0 means no dropout)
- epsilon: Small constant for layer normalization
- rng: Random number generator for weight initialization
§Returns
- A new transformer decoder layer
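In standard multi-head attention each head operates on `d_model / n_heads` features, so `d_model` is conventionally a multiple of `n_heads`. The sketch below shows a pre-check a caller might run before constructing the layer. The `head_dim` helper is hypothetical and not part of this crate, and this page does not state whether `new` performs the same validation or returns an error in these cases.

```rust
/// Returns the per-head feature dimension, or an error message if the
/// arguments cannot be split evenly across heads.
/// (Hypothetical caller-side check; not this crate's API.)
fn head_dim(d_model: usize, n_heads: usize) -> Result<usize, String> {
    if n_heads == 0 {
        return Err("n_heads must be greater than zero".to_string());
    }
    if d_model % n_heads != 0 {
        return Err(format!(
            "d_model ({d_model}) is not divisible by n_heads ({n_heads})"
        ));
    }
    Ok(d_model / n_heads)
}
```

For example, `head_dim(512, 8)` yields `Ok(64)`, while `head_dim(512, 7)` yields an error.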
Trait Implementations§
impl<F: Float + Debug + ScalarOperand + Send + Sync + 'static> Layer<F> for TransformerDecoderLayer<F>
fn as_any_mut(&mut self) -> &mut dyn Any
Get the layer as a mutable dyn Any for downcasting
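Downcasting through `dyn Any` lets code that only holds a trait object recover the concrete layer type. The sketch below demonstrates the mechanism with standard-library types only. The `AnyLayer` trait, the `ToyLayer` struct, and the `enable_training` function are hypothetical stand-ins, not this crate's `Layer<F>` trait or `TransformerDecoderLayer<F>`.

```rust
use std::any::Any;

/// Hypothetical stand-in for a layer trait that exposes `as_any_mut`.
trait AnyLayer {
    fn as_any_mut(&mut self) -> &mut dyn Any;
}

/// Hypothetical concrete layer with one mutable field.
struct ToyLayer {
    training: bool,
}

impl AnyLayer for ToyLayer {
    fn as_any_mut(&mut self) -> &mut dyn Any {
        self
    }
}

/// Tries to recover a `ToyLayer` from a trait object and switch it to
/// training mode. Returns `false` if the object is some other type.
fn enable_training(layer: &mut dyn AnyLayer) -> bool {
    match layer.as_any_mut().downcast_mut::<ToyLayer>() {
        Some(toy) => {
            toy.training = true;
            true
        }
        None => false,
    }
}
```

`downcast_mut` returns `None` when the concrete type does not match, so a failed downcast is handled as an ordinary branch and does not panic.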
fn forward(&self, input: &Array<F, IxDyn>) -> Result<Array<F, IxDyn>>
Forward pass of the layer
fn backward(
    &self,
    input: &Array<F, IxDyn>,
    _grad_output: &Array<F, IxDyn>,
) -> Result<Array<F, IxDyn>>
Backward pass of the layer to compute gradients
fn update(&mut self, learning_rate: F) -> Result<()>
Update the layer parameters using the stored gradients and the given learning rate
fn gradients(&self) -> Vec<Array<F, IxDyn>>
Get the gradients of the layer parameters
fn set_gradients(&mut self, _gradients: &[Array<F, IxDyn>]) -> Result<()>
Set the gradients of the layer parameters
fn set_params(&mut self, _params: &[Array<F, IxDyn>]) -> Result<()>
Set the parameters of the layer
fn set_training(&mut self, _training: bool)
Set the layer to training mode (true) or evaluation mode (false)
fn is_training(&self) -> bool
Get the current training mode
fn layer_type(&self) -> &str
Get the type of the layer (e.g., “Dense”, “Conv2D”)
fn parameter_count(&self) -> usize
Get the number of trainable parameters in this layer
fn layer_description(&self) -> String
Get a detailed description of this layer
Auto Trait Implementations§
impl<F> !Freeze for TransformerDecoderLayer<F>
impl<F> !RefUnwindSafe for TransformerDecoderLayer<F>
impl<F> Send for TransformerDecoderLayer<F>
impl<F> !Sync for TransformerDecoderLayer<F>
impl<F> Unpin for TransformerDecoderLayer<F>
where
    F: Unpin,
impl<F> UnwindSafe for TransformerDecoderLayer<F>
where
    F: UnwindSafe + RefUnwindSafe,
Blanket Implementations§
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.