pub struct KernelProfile {
    pub tile_m: u32,
    pub tile_n: u32,
    pub tile_k: u32,
    pub unroll_factor: u32,
    pub use_shared_memory: bool,
    pub register_target: u32,
    pub split_k: u32,
}
Mutable kernel configuration that the profile-guided optimizer adjusts.
Downstream builders read these fields after optimization to generate the final PTX code.
Fields
tile_m: u32
Tile size in the M dimension.
tile_n: u32
Tile size in the N dimension.
tile_k: u32
Tile size in the K dimension.
unroll_factor: u32
Loop unroll factor.
use_shared_memory: bool
Whether shared memory staging is enabled.
register_target: u32
Target register count per thread (0 = no constraint).
split_k: u32
Number of split-K slices (1 = disabled).
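The sentinel values documented above (`register_target == 0` for "no constraint", `split_k == 1` for "disabled") can be decoded as in the following minimal sketch. The struct here is a local stand-in that mirrors the documented fields so the example compiles on its own. The helpers `register_limit`, `split_k_enabled`, and `output_tile_elems` are hypothetical names introduced for illustration and are not part of this crate's API.

```rust
// Local stand-in mirroring the documented fields (assumption: the real
// type is imported from the crate instead of being redefined like this).
#[derive(Clone, Debug)]
pub struct KernelProfile {
    pub tile_m: u32,
    pub tile_n: u32,
    pub tile_k: u32,
    pub unroll_factor: u32,
    pub use_shared_memory: bool,
    pub register_target: u32,
    pub split_k: u32,
}

/// Hypothetical helper: maps the `0 = no constraint` sentinel to `None`.
fn register_limit(p: &KernelProfile) -> Option<u32> {
    if p.register_target == 0 {
        None
    } else {
        Some(p.register_target)
    }
}

/// Hypothetical helper: split-K is treated as active only above one slice.
fn split_k_enabled(p: &KernelProfile) -> bool {
    p.split_k > 1
}

/// Hypothetical helper: number of elements in one M x N output tile.
fn output_tile_elems(p: &KernelProfile) -> u32 {
    p.tile_m * p.tile_n
}

fn main() {
    // Illustrative values only; they are not the crate's defaults.
    let p = KernelProfile {
        tile_m: 128,
        tile_n: 64,
        tile_k: 32,
        unroll_factor: 4,
        use_shared_memory: true,
        register_target: 0,
        split_k: 1,
    };

    println!("register limit: {:?}", register_limit(&p));
    println!("split-K enabled: {}", split_k_enabled(&p));
    println!("output tile elements: {}", output_tile_elems(&p));
}
```

Converting the sentinels to `Option<u32>` and `bool` at the point of use keeps the "0" and "1" special cases out of downstream arithmetic.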
Implementations
impl KernelProfile
Trait Implementations
impl Clone for KernelProfile
fn clone(&self) -> KernelProfile
Returns a duplicate of the value.
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from source.
impl Debug for KernelProfile
impl Default for KernelProfile
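Because the type implements `Clone`, `Debug`, and `Default`, a tuning step can snapshot a profile, mutate it, and roll back if the change is rejected. The sketch below illustrates that pattern under stated assumptions. The struct is a local stand-in, the values in its `Default` impl are invented for the example (the crate's actual defaults are not documented here), and `try_larger_tile_k` is a hypothetical function, not the optimizer's real interface.

```rust
// Local stand-in mirroring the documented fields and trait impls.
#[derive(Clone, Debug)]
pub struct KernelProfile {
    pub tile_m: u32,
    pub tile_n: u32,
    pub tile_k: u32,
    pub unroll_factor: u32,
    pub use_shared_memory: bool,
    pub register_target: u32,
    pub split_k: u32,
}

impl Default for KernelProfile {
    fn default() -> Self {
        // Assumed values for illustration; the real defaults may differ.
        KernelProfile {
            tile_m: 64,
            tile_n: 64,
            tile_k: 16,
            unroll_factor: 1,
            use_shared_memory: true,
            register_target: 0, // 0 = no constraint
            split_k: 1,         // 1 = disabled
        }
    }
}

/// Hypothetical tuning step: doubles `tile_k` and keeps the change only if
/// `accept` approves the candidate; otherwise restores the snapshot.
fn try_larger_tile_k(
    profile: &mut KernelProfile,
    accept: impl Fn(&KernelProfile) -> bool,
) -> bool {
    let snapshot = profile.clone();
    profile.tile_k *= 2;
    if accept(&*profile) {
        true
    } else {
        // Roll back using copy-assignment from the saved snapshot.
        profile.clone_from(&snapshot);
        false
    }
}

fn main() {
    let mut profile = KernelProfile::default();

    // Rejected: the candidate tile_k (32) exceeds the limit of 16.
    let kept = try_larger_tile_k(&mut profile, |c| c.tile_k <= 16);
    println!("kept = {kept}, profile = {profile:?}");

    // Accepted: the candidate tile_k (32) is within the limit of 64.
    let kept = try_larger_tile_k(&mut profile, |c| c.tile_k <= 64);
    println!("kept = {kept}, profile = {profile:?}");
}
```

Only traits listed on this page are relied on. The stand-in deliberately does not derive `PartialEq`, since the page does not list it for `KernelProfile`.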
Auto Trait Implementations
impl Freeze for KernelProfile
impl RefUnwindSafe for KernelProfile
impl Send for KernelProfile
impl Sync for KernelProfile
impl Unpin for KernelProfile
impl UnsafeUnpin for KernelProfile
impl UnwindSafe for KernelProfile
Blanket Implementations
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.