Module tma

Source
Expand description

TMA-related runtime types

Enumsยง

OobFill
What value to use when filling out of bounds values
TensorMapFormat
Format of [TensorMap]
TensorMapInterleave
Interleave setting for [TensorMap]
TensorMapPrefetch
Additional prefetching to perform during load Specifies L2 fetch size which indicates the byte granularity at which L2 requests are filled from DRAM
TensorMapSwizzle
Data are organized in a specific order in global memory; however, this may not match the order in which the application accesses data in shared memory. This difference in data organization may cause bank conflicts when shared memory is accessed. In order to avoid this problem, data can be loaded to shared memory with shuffling across shared memory banks. When interleave is TensorMapInterleave::B32, swizzle must be TensorMapSwizzle::B32. Other interleave modes can have any swizzling pattern.