Enum ShapeLayoutKind

#[non_exhaustive]
#[repr(u16)]
pub enum ShapeLayoutKind {
    Pad = 0,
    Concat = 1,
    Permute = 2,
    Repeat = 3,
    Flip = 4,
    Roll = 5,
    Meshgrid = 6,
    Fill = 7,
    WriteSlice = 8,
    Contiguize = 9,
    Triu = 10,
    Tril = 11,
}

Shape / layout op discriminant — Category N.

Tags the kernel SKU for telemetry / autotuner-cache keys. Each variant has its own Plan type today (PadPlan, ConcatPlan, …) because their descriptor / args shapes differ enough that one ShapeLayoutPlan<T, N> doesn’t fit. The enum exists so all of them populate KernelSku::op from a shared discriminant space.
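
As an illustrative sketch of the shared discriminant space: because the enum is `#[repr(u16)]`, casting a variant with `as u16` yields the numeric value listed in the declaration. The enum below is a local mirror so the snippet compiles on its own, and `op_code` is a hypothetical helper; how `KernelSku::op` is actually populated is not shown on this page.

```rust
// Local mirror of the documented enum so the sketch is self-contained.
// In real code, import the enum from the crate that defines it.
#[non_exhaustive]
#[repr(u16)]
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub enum ShapeLayoutKind {
    Pad = 0,
    Concat = 1,
    Permute = 2,
    Repeat = 3,
    Flip = 4,
    Roll = 5,
    Meshgrid = 6,
    Fill = 7,
    WriteSlice = 8,
    Contiguize = 9,
    Triu = 10,
    Tril = 11,
}

/// Hypothetical helper: the raw `u16` discriminant of a variant,
/// suitable for embedding in a telemetry record or cache key.
pub fn op_code(kind: ShapeLayoutKind) -> u16 {
    kind as u16
}
```

For example, `op_code(ShapeLayoutKind::WriteSlice)` evaluates to `8`.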

Variants (Non-exhaustive)

This enum is marked as non-exhaustive

Non-exhaustive enums could have additional variants added in future. Therefore, when matching against variants of non-exhaustive enums, an extra wildcard arm must be added to account for any future variants.
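
A minimal sketch of the wildcard-arm requirement. The enum is a local mirror so the snippet compiles on its own, and `is_reserved` is a hypothetical helper that returns `true` for the variants whose descriptions on this page say "Reserved". Treating unknown future variants as not reserved is an assumption of this sketch.

```rust
// Local mirror of the documented enum so the sketch is self-contained.
#[non_exhaustive]
#[repr(u16)]
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub enum ShapeLayoutKind {
    Pad = 0,
    Concat = 1,
    Permute = 2,
    Repeat = 3,
    Flip = 4,
    Roll = 5,
    Meshgrid = 6,
    Fill = 7,
    WriteSlice = 8,
    Contiguize = 9,
    Triu = 10,
    Tril = 11,
}

/// Hypothetical helper: true for variants documented as "Reserved".
pub fn is_reserved(kind: ShapeLayoutKind) -> bool {
    match kind {
        ShapeLayoutKind::Concat
        | ShapeLayoutKind::Permute
        | ShapeLayoutKind::Repeat
        | ShapeLayoutKind::Flip
        | ShapeLayoutKind::Roll
        | ShapeLayoutKind::Meshgrid => true,
        // Downstream crates must include this arm because the enum is
        // #[non_exhaustive]; inside the defining crate it is optional.
        _ => false,
    }
}
```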

Pad = 0

F.pad(x, pad, mode='constant', value=v) — Phase 3 trailblazer.
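
To make the constant-mode semantics concrete, here is a CPU reference sketch for the rank-1 case only. `pad_constant_1d` is a hypothetical name, not part of this crate, and the sketch does not cover the negative padding amounts that `F.pad` accepts.

```rust
/// Hypothetical rank-1 reference for constant-mode padding:
/// `left` copies of `value`, then `x`, then `right` copies of `value`.
pub fn pad_constant_1d(x: &[f32], left: usize, right: usize, value: f32) -> Vec<f32> {
    let mut out = Vec::with_capacity(left + x.len() + right);
    out.extend(std::iter::repeat(value).take(left));
    out.extend_from_slice(x);
    out.extend(std::iter::repeat(value).take(right));
    out
}
```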

Concat = 1

torch.cat(tensors, dim) — variable-arity input. Reserved.

Permute = 2

Materialized torch.permute(x, dims) (strided-view materialization when needed). Reserved.

Repeat = 3

x.repeat(...) / torch.tile(x, ...). Reserved.

Flip = 4

torch.flip(x, dims) — reverse along axes. Reserved.

Roll = 5

torch.roll(x, shifts, dims) — shift along axes. Reserved.
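
As a reference for the shift semantics, a rank-1 CPU sketch: the element at index `i` moves to index `(i + shift) mod n`, so elements pushed past the end wrap around to the front. `roll_1d` is a hypothetical name, and this variant is marked Reserved, so no kernel behavior is implied.

```rust
/// Hypothetical rank-1 reference for a cyclic shift.
/// A negative `shift` rolls toward the front.
pub fn roll_1d(x: &[i32], shift: isize) -> Vec<i32> {
    let n = x.len();
    if n == 0 {
        return Vec::new();
    }
    // Normalize the shift into 0..n so negative values wrap correctly.
    let s = shift.rem_euclid(n as isize) as usize;
    let mut out = vec![0; n];
    for (i, &v) in x.iter().enumerate() {
        out[(i + s) % n] = v;
    }
    out
}
```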

Meshgrid = 6

torch.meshgrid(*tensors) — N rank-1 → N rank-N. Reserved.

Fill = 7

torch.full(shape, value) / Tensor.fill_(value) — fill every element of an output tensor with a scalar constant. Wired from fuel-cuda-kernels/fill.cu.

WriteSlice = 8

dest[start_0..end_0, ..., start_{N-1}..end_{N-1}] = source (assign, not accumulate). Per-axis range write. Phase 13.1 trailblazer — driven by Fuel team’s persistent KV-cache append (autoregressive decoding). See baracuda_kernels::WriteSlicePlan.
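
The per-axis range write can be sketched on the CPU for the rank-2, row-major case. `write_slice_2d` and its parameters are hypothetical and are not the `WriteSlicePlan` API; the sketch assumes `source` is contiguous with shape `(rows.len(), cols.len())`.

```rust
use std::ops::Range;

/// Hypothetical rank-2 reference: dest[rows, cols] = source (assign, not accumulate).
/// `dest` is row-major with `dest_cols` columns.
pub fn write_slice_2d(
    dest: &mut [f32],
    dest_cols: usize,
    rows: Range<usize>,
    cols: Range<usize>,
    source: &[f32],
) {
    let width = cols.len();
    assert_eq!(source.len(), rows.len() * width, "source shape mismatch");
    assert!(cols.end <= dest_cols, "column range out of bounds");
    assert!(rows.end * dest_cols <= dest.len(), "row range out of bounds");
    for (src_row, r) in rows.enumerate() {
        // Overwrite one destination row segment with one source row.
        let d = r * dest_cols + cols.start;
        dest[d..d + width].copy_from_slice(&source[src_row * width..(src_row + 1) * width]);
    }
}
```

In a KV-cache append, the row range would presumably cover the newly generated positions while the other axes span their full extent; that mapping is an assumption here, not something this page specifies.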

Contiguize = 9

Strided→contiguous materialization (torch.Tensor.contiguous). Phase 13.2: closes the D2H→CPU contiguize→H2D fallback cliff for non-contiguous CUDA inputs. Byte-level dtype-agnostic (sizeof-templated kernel) covering every byte-aligned dtype; nibble (S4 / U4) shipped behind a documented innermost-stride constraint. See baracuda_kernels::ContiguizePlan.
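
What strided-to-contiguous materialization computes can be shown with a rank-2 CPU sketch. `contiguize_2d` is a hypothetical name, not the `ContiguizePlan` API; strides are in elements, and the function is generic over `T: Copy` to echo the dtype-agnostic copy described above.

```rust
/// Hypothetical rank-2 reference: gather a strided view into a fresh
/// row-major buffer. `strides` are measured in elements, not bytes.
pub fn contiguize_2d<T: Copy>(src: &[T], shape: [usize; 2], strides: [usize; 2]) -> Vec<T> {
    let mut out = Vec::with_capacity(shape[0] * shape[1]);
    for i in 0..shape[0] {
        for j in 0..shape[1] {
            out.push(src[i * strides[0] + j * strides[1]]);
        }
    }
    out
}
```

For instance, the transposed view of a 2x3 row-major buffer has shape `[3, 2]` and strides `[1, 3]`; materializing it produces the 3x2 transpose in row-major order.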

Triu = 10

torch.triu(input, diagonal) — keep upper triangular part of the last two dims of input; zero everything below the diagonal-th diagonal. Batch dims (anything before the last two) are independently masked. Phase 13.4 trailblazer — driven by Fuel team’s CPU-only triu/tril gap. See baracuda_kernels::TriuPlan.

Tril = 11

torch.tril(input, diagonal) — keep lower triangular part of the last two dims of input; zero everything above the diagonal-th diagonal. Sibling of Self::Triu with the predicate flipped. See baracuda_kernels::TrilPlan.
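
The triangular predicates can be sketched for a single rank-2 matrix: the upper-triangular mask keeps element `(i, j)` when `j - i >= diagonal`, and the lower-triangular mask keeps it when `j - i <= diagonal`. The function names below are hypothetical and are not the `TriuPlan` / `TrilPlan` API; batch dims would apply the same mask to each matrix independently.

```rust
/// Shared helper: copy elements for which `keep(i, j)` holds, zero the rest.
fn mask_2d(x: &[f32], rows: usize, cols: usize, keep: impl Fn(i64, i64) -> bool) -> Vec<f32> {
    assert_eq!(x.len(), rows * cols, "shape mismatch");
    let mut out = vec![0.0; x.len()];
    for i in 0..rows {
        for j in 0..cols {
            if keep(i as i64, j as i64) {
                out[i * cols + j] = x[i * cols + j];
            }
        }
    }
    out
}

/// Hypothetical reference: keep the upper triangle of a row-major matrix.
pub fn triu_2d(x: &[f32], rows: usize, cols: usize, diagonal: i64) -> Vec<f32> {
    mask_2d(x, rows, cols, |i, j| j - i >= diagonal)
}

/// Hypothetical reference: same loop with the predicate flipped.
pub fn tril_2d(x: &[f32], rows: usize, cols: usize, diagonal: i64) -> Vec<f32> {
    mask_2d(x, rows, cols, |i, j| j - i <= diagonal)
}
```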

Trait Implementations

impl Clone for ShapeLayoutKind

fn clone(&self) -> ShapeLayoutKind

fn clone_from(&mut self, source: &Self)

impl Copy for ShapeLayoutKind

impl Debug for ShapeLayoutKind

fn fmt(&self, f: &mut Formatter<'_>) -> Result<(), Error>

impl Eq for ShapeLayoutKind

impl Hash for ShapeLayoutKind

fn hash<__H>(&self, state: &mut __H) where __H: Hasher,

fn hash_slice<H>(data: &[Self], state: &mut H) where H: Hasher, Self: Sized,

impl PartialEq for ShapeLayoutKind

fn eq(&self, other: &ShapeLayoutKind) -> bool

fn ne(&self, other: &Rhs) -> bool

impl StructuralPartialEq for ShapeLayoutKind
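
The `Copy`, `Eq`, and `Hash` implementations listed here make the enum usable directly as a map key, which fits the telemetry and cache-key role described in the enum's summary. A minimal sketch, assuming a local mirror of the enum and a hypothetical `count_by_kind` helper:

```rust
use std::collections::HashMap;

// Local mirror of the documented enum so the sketch is self-contained.
#[non_exhaustive]
#[repr(u16)]
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
pub enum ShapeLayoutKind {
    Pad = 0,
    Concat = 1,
    Permute = 2,
    Repeat = 3,
    Flip = 4,
    Roll = 5,
    Meshgrid = 6,
    Fill = 7,
    WriteSlice = 8,
    Contiguize = 9,
    Triu = 10,
    Tril = 11,
}

/// Hypothetical telemetry helper: count how often each kind appears.
pub fn count_by_kind(kinds: &[ShapeLayoutKind]) -> HashMap<ShapeLayoutKind, u32> {
    let mut counts = HashMap::new();
    for &kind in kinds {
        *counts.entry(kind).or_insert(0) += 1;
    }
    counts
}
```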

Auto Trait Implementations

impl Freeze for ShapeLayoutKind

impl RefUnwindSafe for ShapeLayoutKind

impl Send for ShapeLayoutKind

impl Sync for ShapeLayoutKind

impl Unpin for ShapeLayoutKind

impl UnsafeUnpin for ShapeLayoutKind

impl UnwindSafe for ShapeLayoutKind

Blanket Implementations

impl<T> Any for T where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for T where T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for T where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> CloneToUninit for T where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Into<U> for T where U: From<T>,

fn into(self) -> U

impl<T> ToOwned for T where T: Clone,

type Owned = T

fn to_owned(&self) -> T

fn clone_into(&self, target: &mut T)

impl<T, U> TryFrom<U> for T where U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for T where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>
