Expand description
GPU/CPU compute kernels for physics simulation.
This module groups all low-level compute kernels. Each sub-module exposes a CPU-mock implementation that mirrors a GPU kernel in its data layout and dispatch model, but executes on the CPU using Rayon for parallelism.
Modules§
- broadphase
- Broadphase AABB kernels for parallel overlap detection.
- group_
sizes - Typical work-group sizes used by NVIDIA/AMD GPUs.
- md_
force - Molecular dynamics force kernels.
- rigid
- Auto-generated module structure
- sph
- SPH (Smoothed Particle Hydrodynamics) compute kernels.
Structs§
- Dispatch
Dims - Describes the 3-D work-group dispatch dimensions for a kernel launch.
- Kernel
Perf Counters - Lightweight performance counters attached to a single kernel invocation.
Enums§
- Kernel
Family - Identifier for a built-in kernel family.
Functions§
- dispatch_
size_ 1d - Compute the 1-D dispatch size needed to cover
nitems withgroup_sizethreads per work-group. - smem_
bytes_ matmul - Calculate the shared-memory footprint (bytes) for a tiled matrix-multiply
kernel with tiles of size
tile×tileofT-sized elements. - workgroup_
barrier - Simulated GPU barrier: in CPU mock this is a no-op but documents synchronisation points for future GPU backend porting.