Algorithm

Enum Algorithm 

#[repr(i32)]
pub enum Algorithm {
    AlgUnset = 0,
    AlgDotAnyF8AnyF8F32 = 1,
    AlgDotAnyF8AnyF8F32FastAccum = 2,
    AlgDotF16F16F16 = 3,
    AlgDotF16F16F32 = 4,
    AlgDotBf16Bf16Bf16 = 5,
    AlgDotBf16Bf16F32 = 6,
    AlgDotBf16Bf16F32X3 = 7,
    AlgDotBf16Bf16F32X6 = 8,
    AlgDotTf32Tf32F32 = 9,
    AlgDotTf32Tf32F32X3 = 10,
    AlgDotF32F32F32 = 11,
    AlgDotF64F64F64 = 12,
}

The algorithm used to evaluate the instruction.

The naming convention for the dot instruction is ALG_DOT_{A_TYPE}_{B_TYPE}_{ACCUM_TYPE}[_X{NUM_OPS}], where A_TYPE, B_TYPE and ACCUM_TYPE correspond to the types in the “primitive dot operations” (such as TensorCore operations) and NUM_OPS is the number of such operations used per “primitive tile”. When the NUM_OPS field is omitted, it is assumed to be 1. The types mentioned in the name are independent of the storage types.

In general, A_TYPE and B_TYPE are the precisions that the LHS and RHS of the operation are rounded to, and ACCUM_TYPE is the accumulation type. If a backend does not support the given algorithm, an error is raised. The Algorithm enum is intended to eventually replace the Precision enum.
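
The naming convention can be made concrete with a small parser. The sketch below is illustrative only: DotAlgorithmName and parse_dot_algorithm_name are hypothetical helpers that are not part of this crate, and the example names are derived from the convention rather than copied from the ProtoBuf definition. It assumes ANY_F8 is the only type token that contains an underscore, and it returns None for names carrying any other suffix.

```rust
/// Components of a dot algorithm name (hypothetical helper type, not part of the crate).
#[derive(Debug, PartialEq, Eq)]
pub struct DotAlgorithmName {
    pub a_type: String,
    pub b_type: String,
    pub accum_type: String,
    pub num_ops: u32,
}

/// Splits a name such as `ALG_DOT_BF16_BF16_F32_X3` into its components.
/// Returns `None` when the name does not fit the convention.
pub fn parse_dot_algorithm_name(name: &str) -> Option<DotAlgorithmName> {
    let rest = name.strip_prefix("ALG_DOT_")?;
    let mut tokens: Vec<&str> = rest.split('_').collect();

    // Optional trailing `X{NUM_OPS}`; NUM_OPS defaults to 1 when absent.
    let explicit_ops: Option<u32> = tokens
        .last()
        .and_then(|t| t.strip_prefix('X'))
        .and_then(|digits| digits.parse().ok());
    let num_ops = match explicit_ops {
        Some(n) => {
            tokens.pop();
            n
        }
        None => 1,
    };

    // Re-join the two-token type `ANY_F8` (assumed to be the only such type).
    let mut types: Vec<String> = Vec::new();
    let mut iter = tokens.into_iter();
    while let Some(tok) = iter.next() {
        if tok == "ANY" {
            types.push(format!("ANY_{}", iter.next()?));
        } else {
            types.push(tok.to_string());
        }
    }

    // Exactly A_TYPE, B_TYPE and ACCUM_TYPE must remain.
    if types.len() != 3 {
        return None;
    }
    let mut parts = types.into_iter();
    Some(DotAlgorithmName {
        a_type: parts.next()?,
        b_type: parts.next()?,
        accum_type: parts.next()?,
        num_ops,
    })
}
```

Under these assumptions, a name without the _X suffix parses with num_ops set to 1, matching the rule that an omitted NUM_OPS means a single primitive operation.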

Variants

AlgUnset = 0

If the algorithm is ALG_UNSET, we will decide the algorithm based on the operand_precision values (for now).

AlgDotAnyF8AnyF8F32 = 1

The storage type can be any 8-bit floating point type.

AlgDotAnyF8AnyF8F32FastAccum = 2

The storage type can be any 8-bit floating point type. Intermediate results will not periodically be promoted to a higher precision. This corresponds to CUBLASLT_MATMUL_DESC_FAST_ACCUM. Triton’s maxNumImpreciseAcc=32 setting may be similar.

AlgDotF16F16F16 = 3

AlgDotF16F16F32 = 4

AlgDotBf16Bf16Bf16 = 5

AlgDotBf16Bf16F32 = 6

AlgDotBf16Bf16F32X3 = 7

An algorithm which uses 3 BF16_BF16_F32 matmuls to achieve better precision.
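
To see why three lower-precision products can recover precision, here is a scalar sketch of one common splitting scheme. That backends do something along these lines is an assumption; the description above only states that three BF16_BF16_F32 matmuls are used. BF16 values are emulated by truncating an f32 to its top 16 bits (real hardware typically rounds to nearest), and all function names are hypothetical.

```rust
/// Emulates BF16 by keeping only the top 16 bits of an f32 (truncation, not rounding).
pub fn to_bf16(x: f32) -> f32 {
    f32::from_bits(x.to_bits() & 0xFFFF_0000)
}

/// Splits `x` into a BF16 high part and a BF16 low part holding the residual.
pub fn split_hi_lo(x: f32) -> (f32, f32) {
    let hi = to_bf16(x);
    let lo = to_bf16(x - hi);
    (hi, lo)
}

/// One BF16 x BF16 product accumulated in f32.
pub fn mul_bf16_x1(a: f32, b: f32) -> f32 {
    to_bf16(a) * to_bf16(b)
}

/// Three BF16 x BF16 products accumulated in f32; the lo * lo term is dropped.
pub fn mul_bf16_x3(a: f32, b: f32) -> f32 {
    let (a_hi, a_lo) = split_hi_lo(a);
    let (b_hi, b_lo) = split_hi_lo(b);
    a_hi * b_hi + (a_hi * b_lo + a_lo * b_hi)
}
```

For inputs such as 1.2345678 and 3.1415927, the three-product version lands much closer to the full-precision product than the single product does, because the two cross terms restore most of the bits lost when each operand is reduced to BF16.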

AlgDotBf16Bf16F32X6 = 8

An algorithm which uses 6 BF16_BF16_F32 matmuls to achieve better precision (similar to F32).

AlgDotTf32Tf32F32 = 9

AlgDotTf32Tf32F32X3 = 10

An algorithm which uses 3 TF32_TF32_F32 matmuls to achieve better precision (similar to F32).

AlgDotF32F32F32 = 11

AlgDotF64F64F64 = 12

Implementations

impl Algorithm

pub fn is_valid(value: i32) -> bool

Returns true if value is a variant of Algorithm.

pub fn from_i32(value: i32) -> Option<Algorithm>

Deprecated: Use the TryFrom<i32> implementation instead

Converts an i32 to an Algorithm, or None if value is not a valid variant.
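
As a sketch of the recommended replacement, the snippet below decodes a raw i32 through TryFrom<i32> instead of from_i32. Because the crate path is not given here, it defines a stand-in Algorithm reduced to three variants and a stand-in UnknownEnumValue; in real code both would come from the crate and its dependencies. decode_algorithm is a hypothetical helper.

```rust
use std::convert::TryFrom;

/// Stand-in for the documented enum, reduced to three variants (illustration only).
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
#[repr(i32)]
pub enum Algorithm {
    AlgUnset = 0,
    AlgDotF16F16F32 = 4,
    AlgDotF64F64F64 = 12,
}

/// Stand-in for the error type named in the `TryFrom<i32>` impl.
#[derive(Debug, PartialEq, Eq)]
pub struct UnknownEnumValue(pub i32);

impl TryFrom<i32> for Algorithm {
    type Error = UnknownEnumValue;

    fn try_from(value: i32) -> Result<Self, Self::Error> {
        match value {
            0 => Ok(Algorithm::AlgUnset),
            4 => Ok(Algorithm::AlgDotF16F16F32),
            12 => Ok(Algorithm::AlgDotF64F64F64),
            other => Err(UnknownEnumValue(other)),
        }
    }
}

impl From<Algorithm> for i32 {
    fn from(value: Algorithm) -> i32 {
        value as i32
    }
}

/// Hypothetical helper: decode a raw value, turning unknown values into a message.
pub fn decode_algorithm(raw: i32) -> Result<Algorithm, String> {
    Algorithm::try_from(raw)
        .map_err(|UnknownEnumValue(v)| format!("unknown Algorithm value: {}", v))
}
```

Compared with from_i32, the Result-based conversion keeps the offending value in the error, which makes it easier to report which raw value could not be decoded.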

impl Algorithm

pub fn as_str_name(&self) -> &'static str

String value of the enum field names used in the ProtoBuf definition.

The values are not transformed in any way and thus are considered stable (if the ProtoBuf definition does not change) and safe for programmatic use.

pub fn from_str_name(value: &str) -> Option<Self>

Creates an enum from field names used in the ProtoBuf definition.
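
The two name conversions are meant to round-trip, which the following sketch illustrates. It uses a stand-in Algorithm with three variants rather than the crate's type; apart from ALG_UNSET, the string names are assumed from the naming convention, not copied from the ProtoBuf definition. parse_algorithm is a hypothetical helper.

```rust
/// Stand-in for the documented enum, reduced to three variants (illustration only).
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
#[repr(i32)]
pub enum Algorithm {
    AlgUnset = 0,
    AlgDotBf16Bf16F32 = 6,
    AlgDotF32F32F32 = 11,
}

impl Algorithm {
    /// ProtoBuf field name for this variant (names assumed from the naming convention).
    pub fn as_str_name(&self) -> &'static str {
        match *self {
            Algorithm::AlgUnset => "ALG_UNSET",
            Algorithm::AlgDotBf16Bf16F32 => "ALG_DOT_BF16_BF16_F32",
            Algorithm::AlgDotF32F32F32 => "ALG_DOT_F32_F32_F32",
        }
    }

    /// Inverse of `as_str_name`; unknown names yield `None`.
    pub fn from_str_name(value: &str) -> Option<Self> {
        match value {
            "ALG_UNSET" => Some(Algorithm::AlgUnset),
            "ALG_DOT_BF16_BF16_F32" => Some(Algorithm::AlgDotBf16Bf16F32),
            "ALG_DOT_F32_F32_F32" => Some(Algorithm::AlgDotF32F32F32),
            _ => None,
        }
    }
}

/// Hypothetical helper: parse a configuration string, reporting unknown names.
pub fn parse_algorithm(name: &str) -> Result<Algorithm, String> {
    Algorithm::from_str_name(name)
        .ok_or_else(|| format!("unknown algorithm name: {}", name))
}
```

Note that the lookup is by ProtoBuf field name, so a Rust variant name such as AlgUnset is not accepted as input.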

Trait Implementations

impl Clone for Algorithm

fn clone(&self) -> Algorithm

Returns a duplicate of the value.

Since Rust 1.0.0

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.

impl Debug for Algorithm

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

impl Default for Algorithm

fn default() -> Algorithm

Returns the “default value” for a type.

impl From<Algorithm> for i32

fn from(value: Algorithm) -> i32

Converts to this type from the input type.

impl Hash for Algorithm

fn hash<__H: Hasher>(&self, state: &mut __H)

Feeds this value into the given Hasher.

Since Rust 1.3.0

fn hash_slice<H>(data: &[Self], state: &mut H)
where H: Hasher, Self: Sized,

Feeds a slice of this type into the given Hasher.

impl Ord for Algorithm

fn cmp(&self, other: &Algorithm) -> Ordering

This method returns an Ordering between self and other.

Since Rust 1.21.0

fn max(self, other: Self) -> Self
where Self: Sized,

Compares and returns the maximum of two values.

Since Rust 1.21.0

fn min(self, other: Self) -> Self
where Self: Sized,

Compares and returns the minimum of two values.

Since Rust 1.50.0

fn clamp(self, min: Self, max: Self) -> Self
where Self: Sized,

Restrict a value to a certain interval.

impl PartialEq for Algorithm

fn eq(&self, other: &Algorithm) -> bool

Tests for self and other values to be equal, and is used by ==.

Since Rust 1.0.0

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.

impl PartialOrd for Algorithm

fn partial_cmp(&self, other: &Algorithm) -> Option<Ordering>

This method returns an ordering between self and other values if one exists.

Since Rust 1.0.0

fn lt(&self, other: &Rhs) -> bool

Tests less than (for self and other) and is used by the < operator.

Since Rust 1.0.0

fn le(&self, other: &Rhs) -> bool

Tests less than or equal to (for self and other) and is used by the <= operator.

Since Rust 1.0.0

fn gt(&self, other: &Rhs) -> bool

Tests greater than (for self and other) and is used by the > operator.

Since Rust 1.0.0

fn ge(&self, other: &Rhs) -> bool

Tests greater than or equal to (for self and other) and is used by the >= operator.

impl TryFrom<i32> for Algorithm

type Error = UnknownEnumValue

The type returned in the event of a conversion error.

fn try_from(value: i32) -> Result<Algorithm, UnknownEnumValue>

Performs the conversion.

impl Copy for Algorithm

impl Eq for Algorithm

impl StructuralPartialEq for Algorithm

Auto Trait Implementations

Blanket Implementations

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self.

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value.

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest.

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning.

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning.

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.