Structs
- BlockScaledLayout
- BlockScaledLayoutCompilationArg
- BlockScaledLayoutExpand
- BlockScaledLayoutLaunch
- PerTensorLayout
- PerTensorLayoutCompilationArg
- PerTensorLayoutExpand
- PerTensorLayoutLaunch
Enums
- ScalesLayout - Layout for quantization scales. It is indexed by quant element index and returns the corresponding scale based on the quantization type.
- ScalesLayoutArgs
- ScalesLayoutCompilationArg
- ScalesLayoutExpand
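
The ScalesLayout description above (element index in, scale out) can be sketched in plain Rust. This is only an illustration, not the crate's implementation: `ScaleIndexing`, `scale_index`, and `dequantize` are hypothetical names, and the block variant assumes a one-dimensional block of contiguous elements, which may be simpler than what BlockScaledLayout actually does.

```rust
/// Hypothetical stand-in for a scales layout; not the crate's actual type.
#[derive(Clone, Copy, Debug)]
enum ScaleIndexing {
    /// One scale shared by every element of the tensor.
    PerTensor,
    /// One scale per contiguous run of `block_size` elements
    /// (assumption: 1-D blocks, for illustration only).
    Block { block_size: usize },
}

impl ScaleIndexing {
    /// Maps the index of a quantized element to the index of its scale.
    fn scale_index(self, elem_index: usize) -> usize {
        match self {
            ScaleIndexing::PerTensor => 0,
            ScaleIndexing::Block { block_size } => elem_index / block_size,
        }
    }
}

/// Dequantizes `values` by multiplying each element with the scale
/// selected by `layout`.
fn dequantize(values: &[i8], scales: &[f32], layout: ScaleIndexing) -> Vec<f32> {
    values
        .iter()
        .enumerate()
        .map(|(i, &q)| q as f32 * scales[layout.scale_index(i)])
        .collect()
}
```

With `ScaleIndexing::PerTensor` every element reads `scales[0]`; with `ScaleIndexing::Block { block_size: 2 }` elements 0 and 1 share `scales[0]`, elements 2 and 3 share `scales[1]`, and so on.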
Functions
- scales_layout
- scales_view - Create a scales view from the values and scales handle, line size and quantization scheme. values should be the quantized tensor, and will be adjusted by num_quants.
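
One plausible reading of "adjusted by num_quants" is that several quantized elements are packed into each stored value, so the stored shape has to be scaled up to obtain the logical element count. The sketch below illustrates that reading only. The helper names are hypothetical, and the choice to scale the innermost dimension is an assumption rather than something the documentation states.

```rust
/// Hypothetical helper: how many quantized elements fit in one stored value,
/// e.g. four 8-bit quants in a 32-bit word.
fn num_quants(store_bits: usize, value_bits: usize) -> usize {
    store_bits / value_bits
}

/// Scales the innermost dimension of the stored shape by `num_quants`
/// to obtain the logical (per-element) shape.
/// Assumption: packing happens along the last dimension.
fn logical_shape(stored_shape: &[usize], num_quants: usize) -> Vec<usize> {
    let mut shape = stored_shape.to_vec();
    if let Some(last) = shape.last_mut() {
        *last *= num_quants;
    }
    shape
}
```

Under these assumptions, a tensor stored as `[2, 8]` 32-bit words holding 8-bit quants has a logical shape of `[2, 32]`.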
Type Aliases
- ScalesView - [TensorView] with a linear layout inferred from the shape/strides at launch. Useful for elementwise kernels.
- ScalesViewLaunch - Launch type for [LinearTensorView].
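
A linear layout of the kind described for ScalesView lets an elementwise kernel address a strided tensor with a single flat index. The following is a minimal sketch of that mapping, assuming row-major ordering of the flat index. `linear_to_offset` is a hypothetical helper, not part of the crate.

```rust
/// Converts a flat (row-major) element index into a memory offset
/// using the tensor's shape and strides.
fn linear_to_offset(mut index: usize, shape: &[usize], strides: &[usize]) -> usize {
    let mut offset = 0;
    // Peel off coordinates from the innermost dimension outward.
    for d in (0..shape.len()).rev() {
        let coord = index % shape[d];
        index /= shape[d];
        offset += coord * strides[d];
    }
    offset
}
```

For a contiguous `[2, 3]` tensor with strides `[3, 1]` the offset equals the flat index. For the same shape with strides `[1, 2]`, flat index 4 (coordinates `(1, 1)`) lands at offset 3.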