macro_rules! time_brick { ($profiler:expr, $name:expr, $elements:expr, $body:block) => { ... }; }
Macro for convenient brick timing with automatic sync.
time_brick!(profiler, "RmsNorm", 1, { rmsnorm_kernel.launch(); stream.synchronize(); // REQUIRED for GPU });