Skip to main content

ElementwiseTemplate

Struct ElementwiseTemplate 

Source
pub struct ElementwiseTemplate {
    pub op: ElementwiseOp,
    pub precision: PtxType,
    pub target: SmVersion,
}
Expand description

Template for generating elementwise PTX kernels.

Combines an ElementwiseOp, a precision (PtxType), and a target architecture (SmVersion) to produce a complete PTX module string.

The generated kernel handles global thread indexing and bounds checking. For complex activations (GELU, sigmoid, SiLU), the template emits approximate PTX instruction sequences using ex2.approx and rcp.approx.

Fields§

§op: ElementwiseOp

The elementwise operation to generate.

§precision: PtxType

The data precision for computation (e.g., PtxType::F32).

§target: SmVersion

The target GPU architecture.

Implementations§

Source§

impl ElementwiseTemplate

Source

pub const fn new( op: ElementwiseOp, precision: PtxType, target: SmVersion, ) -> Self

Creates a new elementwise template with the given parameters.

Source

pub fn kernel_name(&self) -> String

Returns the kernel function name derived from the operation and precision.

The name follows the pattern elementwise_{op}_{type}, for example elementwise_add_f32 or elementwise_relu_f16.

Source

pub fn generate(&self) -> Result<String, PtxGenError>

Generates the complete PTX module text for this elementwise operation.

§Errors

Returns PtxGenError if the precision type is unsupported for the requested operation or if PTX text generation fails.

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.