zer-prof
Host-side NVTX profiling annotations for the zer entity-resolution library.
Provides trace!, trace_cuda!, and trace_vulkan! macros that wrap code blocks with RAII NVTX ranges visible in Nsight Systems (nsys). All macros are zero-cost no-ops when no feature flag is enabled.
- Documentation: docs.zal-analytics.ch
- Website: www.zal-analytics.ch
- Support & feedback: info@zal-analytics.ch
Feature flags
| Flag | Effect |
|---|---|
cuda |
Activates NVTX; trace_cuda! emits named ranges |
vulkan |
Activates NVTX; trace_vulkan! emits named ranges |
avx2 |
Activates NVTX for AVX2 SIMD profiling |
cpu |
Activates NVTX for CPU-path profiling |
nvtx |
Standalone NVTX activation, no compute backend |
| (none) | All macros expand to bare blocks, zero overhead |
Usage
init; // call once in main()
let result = trace!;
Filter by CUDA regions in Nsight Compute:
License
Apache-2.0 · GitHub