pub fn byte_histogram_simd(data: &[u8]) -> [u32; 256]
SIMD-accelerated histogram using scatter/gather approach.
This uses multiple histogram arrays to reduce memory conflicts and processes data in parallel lanes.