pub fn register_gpu_module(m: &Bound<'_, PyModule>) -> PyResult<()>
Register all GPU-dispatch functions in the parent Python module.