Expand description
GPU reduction and scan operations (CPU mock implementation).
Provides parallel-style reductions (sum, max, min, dot product), exclusive prefix scan, radix sort (counting sort mock), and histogram.
Functionsยง
- gpu_dot
- Compute the dot product of two equal-length slices (parallel mock).
- gpu_
histogram - Compute a histogram of the input data over
num_binsequal-width bins spanning[min_val, max_val). - gpu_max
- Compute the maximum element in a slice (parallel reduction mock).
- gpu_min
- Compute the minimum element in a slice (parallel reduction mock).
- gpu_
prefix_ sum - Compute the exclusive prefix scan (prefix sum) of a slice.
- gpu_
sort_ radix - Sort a slice of
u64values using a radix sort (counting sort) mock. - gpu_sum
- Compute the sum of all elements in a slice (parallel reduction mock).