SparseFfn

ruvector_sparse_inference::sparse

Struct SparseFfn

pub struct SparseFfn { /* private fields */ }

Expand description

Sparse Feed-Forward Network computation.

This implements a two-layer FFN that can compute using only a subset of neurons:

W1: [hidden_dim, input_dim] - first projection (row-major for neuron access)
W2_T: [hidden_dim, output_dim] - second projection TRANSPOSED (row-major for contiguous access)
Activation function applied between layers

The sparse forward pass:

Sparse first layer: only compute active neurons
Apply activation function
Sparse second layer: accumulate only active neuron contributions (now contiguous!)

§Performance Optimization

W2 is stored transposed so that accessing columns (by neuron index) becomes row access, which is contiguous in memory. This provides 15-25% speedup in the sparse accumulation step.

Implementations§

impl SparseFfn

pub fn new( input_dim: usize, hidden_dim: usize, output_dim: usize, activation: ActivationType, ) -> Result<Self>

Create a new sparse FFN with given dimensions.

pub fn from_weights( w1: Array2<f32>, w2: Array2<f32>, b1: Array1<f32>, b2: Array1<f32>, activation: ActivationType, ) -> Result<Self>

Create from existing weights.

pub fn input_dim(&self) -> usize

Get input dimension.

pub fn hidden_dim(&self) -> usize

Get hidden dimension.

pub fn output_dim(&self) -> usize

Get output dimension.

pub fn forward_sparse( &self, input: &[f32], active_neurons: &[usize], ) -> Result<Vec<f32>>

Compute FFN using only active neurons (sparse computation).

This is the main optimization: only compute activations for predicted neurons.

pub fn forward_dense(&self, input: &[f32]) -> Result<Vec<f32>>

Compute FFN using all neurons (dense computation).

This is the baseline for comparison and correctness checking.

Trait Implementations§

impl Clone for SparseFfn

fn clone(&self) -> SparseFfn

Returns a duplicate of the value. Read more

1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more

impl Debug for SparseFfn

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more

impl<'de> Deserialize<'de> for SparseFfn

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>
where __D: Deserializer<'de>,

Deserialize this value from the given Serde deserializer. Read more

impl FeedForward for SparseFfn

fn forward_sparse( &self, input: &[f32], active_neurons: &[usize], ) -> Result<Vec<f32>>

Sparse forward pass using only active neurons.

fn forward_dense(&self, input: &[f32]) -> Result<Vec<f32>>

Dense forward pass using all neurons.

impl Serialize for SparseFfn

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>
where S: Serializer,

Serialize this value into the given Serde serializer. Read more

Auto Trait Implementations§

impl Freeze for SparseFfn

impl RefUnwindSafe for SparseFfn

impl Send for SparseFfn

impl Sync for SparseFfn

impl Unpin for SparseFfn

impl UnwindSafe for SparseFfn

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)

Performs copy-assignment from self to dest. Read more

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

fn vzip(self) -> V

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,