Module batch

Source
Expand description

Executes multiple independent global matmuls with optional broadcasting.

Structs§

ColMajorGlobalPartitionMatmul
Iterates on global matmuls in a col major fashion
HypercubeConfig
Determines how to launch the hypercube, i.e. anything relevant to CubeCount and where a Cube at a cube position should work Similar to [HyperCubeSelection] but injected in kernel as comptime struct
HypercubeSelection
Determines how to launch the hypercube, i.e. anything relevant to CubeCount and where a Cube at a cube position should work
PartitionedBatchMatmulFamily
Simple partitioned batch matmul family for any precision
RowMajorGlobalPartitionMatmul
Iterates on global matmuls in a row major fashion

Enums§

CubeCountInput
CubeCountPlan stripped of non-essential runtime information
CubeCountInputArgs
CubeCountPlanSelection
Front-facing configuration when crafting a MatmulSelection Allows choosing a strategy before knowing actual values
GlobalOrderSelection
Used to create [GlobalOrder].
SmAllocation
Controls how Streaming Multiprocessors (SMs) are assigned cubes.

Traits§

BatchConfig
Configuration for the batch matmul level.
BatchMatmul
Provides matrix multiplication operations at the batch level.
BatchMatmulFamily
A family of matmuls working with any precision.