pub fn parallel_histogram(
data: &[f64],
lo: f64,
hi: f64,
n_bins: usize,
n_workers: usize,
) -> Vec<u64>Expand description
Parallel histogram reduce: split data into n_workers chunks, compute
a partial histogram per chunk (in parallel), then merge all partial
histograms serially. Mirrors the GPU pattern of per-work-group private
histograms followed by a reduction pass.