Module batch

Expand description

Executes multiple independent global matmuls with optional broadcasting.

Structs§

ColMajorGlobalPartitionMatmul: Iterates on global matmuls in a col major fashion
HypercubeConfig: Determines how to launch the hypercube, i.e. anything relevant to CubeCount and where a Cube at a cube position should work Similar to [HyperCubeSelection] but injected in kernel as comptime struct
HypercubeSelection: Determines how to launch the hypercube, i.e. anything relevant to CubeCount and where a Cube at a cube position should work
PartitionedBatchMatmulFamily: Simple partitioned batch matmul family for any precision
RowMajorGlobalPartitionMatmul: Iterates on global matmuls in a row major fashion

CubeCountInput: CubeCountPlan stripped of non-essential runtime information
CubeCountInputArgs
CubeCountPlanSelection: Front-facing configuration when crafting a MatmulSelection Allows choosing a strategy before knowing actual values
GlobalOrderSelection: Used to create [GlobalOrder].
SmAllocation: Controls how Streaming Multiprocessors (SMs) are assigned cubes.