Crate runmat_accelerate

RunMat Accelerate: GPU Acceleration Abstraction Layer

Goals:

Provide a backend-agnostic API surface that maps RunMat operations to GPU kernels.
Support multiple backends via features (CUDA, ROCm, Metal, Vulkan, OpenCL, wgpu).
Allow zero-copy interop with runmat-builtins::Matrix where possible.
Defer actual kernel authoring to backend crates/modules; this crate defines traits and wiring.
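As a rough illustration of the "traits and wiring" goal, the sketch below shows one way a backend-agnostic interface with a CPU fallback could be laid out. Every name in it (`Backend`, `CpuBackend`, `UnavailableGpu`, `select_backend`) is hypothetical and chosen for this example only; none is taken from this crate's actual API.

```rust
// Hypothetical sketch: none of these names come from runmat_accelerate.

/// Minimal backend interface: a backend reports whether it can be used
/// and knows how to add two equally sized buffers element-wise.
trait Backend {
    fn name(&self) -> &'static str;
    fn is_available(&self) -> bool;
    fn add(&self, a: &[f64], b: &[f64]) -> Vec<f64>;
}

/// Reference CPU backend; always available.
struct CpuBackend;

impl Backend for CpuBackend {
    fn name(&self) -> &'static str {
        "cpu"
    }
    fn is_available(&self) -> bool {
        true
    }
    fn add(&self, a: &[f64], b: &[f64]) -> Vec<f64> {
        a.iter().zip(b).map(|(x, y)| x + y).collect()
    }
}

/// Stand-in for a GPU backend whose device is missing on this machine.
struct UnavailableGpu;

impl Backend for UnavailableGpu {
    fn name(&self) -> &'static str {
        "gpu-stub"
    }
    fn is_available(&self) -> bool {
        false
    }
    fn add(&self, _a: &[f64], _b: &[f64]) -> Vec<f64> {
        unreachable!("this backend is never selected while unavailable")
    }
}

/// Return the first available backend, in preference order.
fn select_backend(candidates: Vec<Box<dyn Backend>>) -> Option<Box<dyn Backend>> {
    candidates.into_iter().find(|b| b.is_available())
}
```

Callers in this sketch only see `dyn Backend`, so a new backend can be added behind a Cargo feature without touching call sites. Listing the CPU backend last in the candidate vector gives a fallback when no device is present.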

Re-exports

AccelerateInitOptions: Initialization options for selecting and configuring the acceleration provider.
Accelerator: High-level façade for accelerated operations, falling back to runmat-runtime.
AutoOffloadOptions: Configuration passed to the native auto-offload planner.
DeviceInfo: Device descriptor used for selection and capabilities query.
Planner: Decides whether to execute on the CPU or on a selected backend. It will eventually consult sizes, heuristics, and device availability.
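The CPU-versus-backend decision described for the planner can be pictured with a simple size threshold. The following is a minimal sketch under stated assumptions: `SizePlanner`, `Target`, and the `min_gpu_elements` field are hypothetical names invented for this example, and the crate's real planner may weigh entirely different inputs.

```rust
// Hypothetical sketch: these types are illustrative, not the crate's API.

/// Where an operation should run.
#[derive(Debug, PartialEq)]
enum Target {
    Cpu,
    Gpu,
}

/// Toy planner: offload only when a device exists and the operand is
/// large enough to plausibly amortize transfer overhead.
struct SizePlanner {
    /// Assumed tuning knob: smallest element count worth offloading.
    min_gpu_elements: usize,
    /// Whether any accelerator was detected.
    gpu_available: bool,
}

impl SizePlanner {
    fn choose(&self, rows: usize, cols: usize) -> Target {
        // saturating_mul avoids overflow for extreme dimensions.
        let elements = rows.saturating_mul(cols);
        if self.gpu_available && elements >= self.min_gpu_elements {
            Target::Gpu
        } else {
            Target::Cpu
        }
    }
}
```

With `min_gpu_elements` set to 1000 and a device available, a 10×10 matrix would stay on the CPU in this sketch while a 100×100 matrix would be offloaded; with no device, everything stays on the CPU.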

AccelPowerPreference: Power preference used when initializing a WGPU backend.
AccelerateProviderPreference: Preferred acceleration provider selection.
AutoOffloadLogLevel: Logging verbosity for auto-offload promotion decisions.
DeviceKind: High-level device kind. Concrete selection is provided by backend.
ExecutionTarget
ReductionAxes

AccelerateBackend: Core backend interface that concrete backends must implement.
BufferHandle: Abstract buffer that may reside on device or be host-pinned.
DeviceMatrix: Abstract matrix allocated on a device backend.

configure_auto_offload
initialize_acceleration_provider: Initialize the acceleration provider using default options.
initialize_acceleration_provider_with: Initialize the global acceleration provider using the supplied options.
value_is_all_keyword