Struct MultiHeadAttention

pub struct MultiHeadAttention {
    pub d_model: usize,
    pub n_heads: usize,
    pub d_head: usize,
    pub w_q: Vec<f64>,
    pub w_k: Vec<f64>,
    pub w_v: Vec<f64>,
    pub w_o: Vec<f64>,
    pub b_o: Vec<f64>,
}

Multi-head attention module.

Applies learned linear projections to the input to obtain Q, K, and V, runs n_heads attention heads in parallel, then concatenates the head outputs and applies the output projection (W_O plus the bias b_o).

All weight matrices are stored flat row-major.
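To make the flat row-major layout concrete, here is a minimal sketch of indexing and multiplying such matrices. The helper names `at` and `matmul` are hypothetical and not part of this crate, and the `x · W` multiplication order is an assumption; the module may apply its weights in the transposed convention.

```rust
/// Flat row-major index of element (row, col) in a matrix with `n_cols` columns.
fn at(row: usize, col: usize, n_cols: usize) -> usize {
    row * n_cols + col
}

/// Hypothetical helper: computes `x · w` for flat row-major
/// x [rows × inner] and w [inner × cols], returning [rows × cols].
fn matmul(x: &[f64], w: &[f64], rows: usize, inner: usize, cols: usize) -> Vec<f64> {
    assert_eq!(x.len(), rows * inner);
    assert_eq!(w.len(), inner * cols);
    let mut out = vec![0.0; rows * cols];
    for r in 0..rows {
        for k in 0..inner {
            let xv = x[at(r, k, inner)];
            for c in 0..cols {
                out[at(r, c, cols)] += xv * w[at(k, c, cols)];
            }
        }
    }
    out
}
```

Under this layout, row `r` of a `[d_model × d_model]` matrix occupies the contiguous slice `r * d_model..(r + 1) * d_model` of the backing `Vec<f64>`.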

Fields§

§d_model: usize

Model dimensionality.

§n_heads: usize

Number of attention heads.

§d_head: usize

Dimensionality per head: d_model / n_heads.

§w_q: Vec<f64>

W_Q projection [d_model × d_model].

§w_k: Vec<f64>

W_K projection [d_model × d_model].

§w_v: Vec<f64>

W_V projection [d_model × d_model].

§w_o: Vec<f64>

W_O output projection [d_model × d_model].

§b_o: Vec<f64>

Output bias [d_model].
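The relationship between d_model, n_heads, and d_head can be sketched as follows. Both helpers are hypothetical. Whether the constructor rejects a d_model that is not divisible by n_heads is not stated here, and the contiguous per-head column layout is an assumption.

```rust
use std::ops::Range;

/// Hypothetical helper: per-head dimensionality, or `None` when
/// `d_model` cannot be split evenly across `n_heads`.
fn head_dim(d_model: usize, n_heads: usize) -> Option<usize> {
    if n_heads == 0 || d_model % n_heads != 0 {
        None
    } else {
        Some(d_model / n_heads)
    }
}

/// Hypothetical helper: columns that head `h` would occupy within a
/// d_model-wide row, assuming heads are laid out contiguously.
fn head_cols(h: usize, d_head: usize) -> Range<usize> {
    h * d_head..(h + 1) * d_head
}
```

For example, d_model = 512 with n_heads = 8 gives d_head = 64, and under the assumed layout head 2 would cover columns 128..192.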

Implementations§

impl MultiHeadAttention

pub fn new(d_model: usize, n_heads: usize) -> Self

Creates a new multi-head attention (MHA) module with zero-initialised projections.

pub fn init_identity(&mut self)

Initialise W_Q, W_K, W_V, W_O with identity-like weights for testing.
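What "identity-like" means exactly is not specified here. One plausible reading, shown purely as an assumption, is a plain identity matrix stored flat row-major; `identity_flat` is a hypothetical helper, not a function of this crate.

```rust
/// Hypothetical helper: an n × n identity matrix in flat row-major storage.
fn identity_flat(n: usize) -> Vec<f64> {
    let mut m = vec![0.0; n * n];
    for i in 0..n {
        // Diagonal element (i, i) lives at offset i * n + i.
        m[i * n + i] = 1.0;
    }
    m
}
```

With weights of this form each projection passes its input through unchanged, which makes the attention computation itself easier to check in tests.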

pub fn forward(&self, x: &[f64], seq_len: usize) -> Vec<f64>

Forward pass.

x has shape [seq_len × d_model] (flat row-major). Returns output of shape [seq_len × d_model].
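The following is a rough sketch of what a forward pass with these shapes could look like, assuming standard unmasked scaled dot-product attention, the `x · W` convention, and contiguous per-head columns. `mha_forward_sketch` and `matmul` are hypothetical stand-ins; the actual `forward` may differ in scaling, masking, or layout.

```rust
/// Computes `a · b` for flat row-major a [m × k] and b [k × n].
fn matmul(a: &[f64], b: &[f64], m: usize, k: usize, n: usize) -> Vec<f64> {
    let mut out = vec![0.0; m * n];
    for i in 0..m {
        for p in 0..k {
            let av = a[i * k + p];
            for j in 0..n {
                out[i * n + j] += av * b[p * n + j];
            }
        }
    }
    out
}

/// Hypothetical stand-in for `forward` (assumed formulation, not the crate's code).
fn mha_forward_sketch(
    x: &[f64], seq_len: usize, d_model: usize, n_heads: usize,
    w_q: &[f64], w_k: &[f64], w_v: &[f64], w_o: &[f64], b_o: &[f64],
) -> Vec<f64> {
    assert_eq!(x.len(), seq_len * d_model);
    assert_eq!(d_model % n_heads, 0);
    let d_head = d_model / n_heads;
    let scale = 1.0 / (d_head as f64).sqrt();

    // Project the input into queries, keys, and values: each [seq_len × d_model].
    let q = matmul(x, w_q, seq_len, d_model, d_model);
    let k = matmul(x, w_k, seq_len, d_model, d_model);
    let v = matmul(x, w_v, seq_len, d_model, d_model);

    // Each head writes its result into its own column block of `concat`.
    let mut concat = vec![0.0; seq_len * d_model];
    for h in 0..n_heads {
        let off = h * d_head;
        for i in 0..seq_len {
            // Scaled dot products of query i with every key j.
            let mut scores: Vec<f64> = (0..seq_len)
                .map(|j| {
                    (0..d_head)
                        .map(|c| q[i * d_model + off + c] * k[j * d_model + off + c])
                        .sum::<f64>()
                        * scale
                })
                .collect();
            // Numerically stable softmax over the scores.
            let max = scores.iter().cloned().fold(f64::NEG_INFINITY, f64::max);
            let mut denom = 0.0;
            for s in scores.iter_mut() {
                *s = (*s - max).exp();
                denom += *s;
            }
            // Weighted sum of the value rows.
            for j in 0..seq_len {
                let wgt = scores[j] / denom;
                for c in 0..d_head {
                    concat[i * d_model + off + c] += wgt * v[j * d_model + off + c];
                }
            }
        }
    }

    // Output projection plus bias, broadcast over the sequence.
    let mut out = matmul(&concat, w_o, seq_len, d_model, d_model);
    for i in 0..seq_len {
        for c in 0..d_model {
            out[i * d_model + c] += b_o[c];
        }
    }
    out
}
```

Under this formulation, all-zero projection weights make every output row equal to b_o, so a module with zero-initialised projections would return only its bias until the weights are set.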

pub fn num_params(&self) -> usize

Total number of trainable parameters.
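If the count covers exactly the fields listed above (four [d_model × d_model] matrices plus one [d_model] bias), the total would be 4 · d_model² + d_model. The helper below is a hypothetical cross-check under that assumption, not the crate's implementation.

```rust
/// Hypothetical cross-check: parameter count implied by the listed fields,
/// i.e. w_q, w_k, w_v, w_o (d_model × d_model each) plus b_o (d_model).
fn expected_num_params(d_model: usize) -> usize {
    4 * d_model * d_model + d_model
}
```

Note that n_heads does not affect this count, since the heads only partition the columns of the same projection matrices.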

Trait Implementations§

impl Clone for MultiHeadAttention

fn clone(&self) -> MultiHeadAttention

Returns a duplicate of the value.

1.0.0 (const: unstable)

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.

impl Debug for MultiHeadAttention

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

Auto Trait Implementations§

impl UnwindSafe for MultiHeadAttention

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self.

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value.

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)

Performs copy-assignment from self to dest.

impl<T> Downcast<T> for T

fn downcast(&self) -> &T

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> IntoEither for T

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.