Module utils

Expand description

Utility functions for cache optimization

Functionsยง

align_to_cache_line
Align size to cache line boundary
optimal_thread_count
Calculate optimal number of threads based on data size and cache
prefetch_read
Prefetch data at the given address (hint to CPU)
transpose_blocked
Transpose matrix with cache blocking