Structs§
- ColMajor
Tiling Order - Tiles laid out in column-major order.
- ColMajor
Tiling Order Expand - Contiguous
Tiling Layout - Each tile is stored contiguously in shared memory. Global memory loads may require remapping to match this layout.
- Filled
Stage - Filled
Stage Expand - Filled
Stage Family - NoEvent
- Use when there is no event listening to do
- NoEvent
Expand - NoTiling
Layout - Dummy tiling layout that panics if it’s used. Can be used when the reader is known to be a
FillReader - NumStages
- Number of stages in one shared memory, i.e. buffers for double buffering
- Ordered
Tiling Order - Tiles are laid out in column-major order across a fixed number of rows, with all tiles from those rows placed contiguously side by side.
- Ordered
Tiling Order Expand - Partition
Scheduler - Schedules global indices for M, N, and K axes in a partitioned matmul.
Internally uses an
AxisSchedulerper axis. - Plane
Matmul Family - Plane Matmul family for any precision
- Plane
Partitioner - Defines how to partition across planes
- RowMajor
Tiling Order - Tiles laid out in row-major order.
- RowMajor
Tiling Order Expand - Stage
Memory Config - Strided
Stage - Wrapper over the shared memory used for staging, abstracting its layout
- Strided
Stage Expand - Strided
Stage Family - Strided
Tiling Layout - Tiles follow a strided layout that often mirrors global memory layout. Not all tiles are contiguous in shared memory, but mapping is more direct.
- Unit
Matmul Family - Unit Matmul family for any precision
- Unit
Partitioner - Defines how to partition across units
Enums§
- Partition
Buffering - Partition
Scheduler Scheme - Defines how partition indices are scheduled across axes.
- Stage
Event - Events that occur during the process of loading tiles to registers and executing inner Tile Matmuls
- Tiling
Order Enum - Enum for the available traits
Traits§
- Stage
- Stage that can be divided into tiles, with the same kind used by the tile matmul readers.
- Stage
Config - Configuration for the Stage matmul (SMM) level
- Stage
Event Listener - Function that is called at each StageEvent
- Stage
Family - Stage family for any precision
- Stage
Matmul - Provides matrix multiplication operations at the stage level.
- Stage
Matmul Family - A family of StageMatmul implementations that operate with any precision.
- Stage
Partitioner - Defines how the stage is partitioned among compute primitives (e.g., units or planes). Controls global writeback and and compute indexing.
- Tiling
Layout - Describes how tiles are arranged in shared memory.
- Tiling
Order - Determines the order in which tiles are stored in shared memory, if TilingLayout is contiguous
- Tiling
Validation