Module tma

Expand description

TMA-related runtime types

Enums§

OobFill: What value to use when filling out of bounds values
TensorMapFormat: Format of [TensorMap]
TensorMapInterleave: Interleave setting for [TensorMap]
TensorMapPrefetch: Additional prefetching to perform during load Specifies L2 fetch size which indicates the byte granularity at which L2 requests are filled from DRAM
TensorMapSwizzle: Data are organized in a specific order in global memory; however, this may not match the order in which the application accesses data in shared memory. This difference in data organization may cause bank conflicts when shared memory is accessed. In order to avoid this problem, data can be loaded to shared memory with shuffling across shared memory banks. When interleave is TensorMapInterleave::B32, swizzle must be TensorMapSwizzle::B32. Other interleave modes can have any swizzling pattern.