pub fn resolved_i8_dot_kernel() -> I8DotKernelExpand description
Return the cached INT8 dot-product kernel.
Callers that invoke INT8 dot product in a tight loop can hoist this call outside the loop so the OnceLock check runs once, not per-iteration.