Crate runmat_accelerate

RunMat Accelerate: GPU Acceleration Abstraction Layer

Goals:

Provide a backend-agnostic API surface that maps RunMat operations to GPU kernels.
Support multiple backends via Cargo features (CUDA, ROCm, Metal, Vulkan, OpenCL, wgpu).
Allow zero-copy interop with runmat-builtins::Matrix where possible.
Defer actual kernel authoring to backend crates/modules; this crate defines traits and wiring.

This is scaffolding only; implementations will land after interpreter/JIT semantics are complete.
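
As an illustration of the first goal, a backend-agnostic surface might look like the minimal sketch below. It is not this crate's API: `ElementwiseBackend`, `HostBackend`, and `add_with` are hypothetical names chosen for the example, and the host implementation stands in for what a GPU backend would do with a kernel launch.

```rust
// Hypothetical sketch: none of these names or signatures come from runmat_accelerate.

/// Operations a backend must provide. Callers never name a concrete GPU API.
pub trait ElementwiseBackend {
    /// Human-readable backend name, for example "host" or "wgpu".
    fn name(&self) -> &str;

    /// Element-wise sum of two equally sized slices.
    fn add(&self, a: &[f64], b: &[f64]) -> Result<Vec<f64>, String>;
}

/// Reference backend that runs on the CPU. A GPU backend would dispatch a kernel here.
pub struct HostBackend;

impl ElementwiseBackend for HostBackend {
    fn name(&self) -> &str {
        "host"
    }

    fn add(&self, a: &[f64], b: &[f64]) -> Result<Vec<f64>, String> {
        if a.len() != b.len() {
            return Err(format!("length mismatch: {} vs {}", a.len(), b.len()));
        }
        Ok(a.iter().zip(b).map(|(x, y)| x + y).collect())
    }
}

/// Code written against the trait works unchanged with any backend.
pub fn add_with(
    backend: &dyn ElementwiseBackend,
    a: &[f64],
    b: &[f64],
) -> Result<Vec<f64>, String> {
    backend.add(a, b)
}
```

Calling `add_with(&HostBackend, &[1.0, 2.0], &[3.0, 4.0])` returns the element-wise sum. Swapping in another implementor of the trait would change where the work runs without touching the call site.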

Modules§

simple_provider

Structs§

Accelerator: High-level façade for accelerated operations, falling back to runmat-runtime.
DeviceInfo: Device descriptor used for selection and capabilities query.
Planner: Decides whether to execute on the CPU or on a selected backend. It will eventually consult operand sizes, heuristics, and device availability.
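
The planning and fallback behaviour described for these structs could be sketched roughly as follows. Everything here is an assumption made for illustration: `Target`, `SketchPlanner`, its fields, the element-count threshold, and `scale_with_fallback` are hypothetical and do not reflect the actual `Planner`, `ExecutionTarget`, or `Accelerator` definitions, which this page does not show.

```rust
// Hypothetical sketch: names, fields, and the size heuristic are assumptions.

/// Where an operation should run.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Target {
    Cpu,
    Gpu,
}

pub struct SketchPlanner {
    /// Whether a usable device was discovered.
    pub gpu_available: bool,
    /// Minimum element count at which offloading is assumed to pay off.
    pub min_gpu_elements: usize,
}

impl SketchPlanner {
    /// Picks the GPU only when a device exists and the operand is large enough.
    pub fn choose(&self, rows: usize, cols: usize) -> Target {
        let elements = rows.saturating_mul(cols);
        if self.gpu_available && elements >= self.min_gpu_elements {
            Target::Gpu
        } else {
            Target::Cpu
        }
    }
}

/// Façade-style helper: try the GPU path when planned, otherwise (or on failure)
/// fall back to a plain CPU loop. Returns the target that actually ran.
pub fn scale_with_fallback(
    planner: &SketchPlanner,
    data: &[f64],
    rows: usize,
    cols: usize,
    factor: f64,
    gpu_kernel: impl Fn(&[f64], f64) -> Option<Vec<f64>>,
) -> (Target, Vec<f64>) {
    if planner.choose(rows, cols) == Target::Gpu {
        if let Some(out) = gpu_kernel(data, factor) {
            return (Target::Gpu, out);
        }
    }
    (Target::Cpu, data.iter().map(|x| x * factor).collect())
}
```

Returning the target alongside the result keeps the fallback observable, which can help when tuning the threshold. A real planner would likely weigh transfer cost and operation type as well as size.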

Enums§

DeviceKind: High-level device kind. Concrete selection is provided by the backend.
ExecutionTarget

Traits§

AccelerateBackend: Core backend interface that concrete backends must implement.
BufferHandle: Abstract buffer that may reside on device or be host-pinned.
DeviceMatrix: Abstract matrix allocated on a device backend.
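
To make the buffer and matrix abstractions concrete, here is a minimal sketch of how such traits might be shaped and implemented for host memory. The trait and method names (`SketchBuffer`, `SketchMatrix`, `element_count`, `is_host_visible`, `to_host`, `upload`) are hypothetical. The real `BufferHandle` and `DeviceMatrix` signatures are not listed on this page and may differ.

```rust
// Hypothetical sketch: trait shapes are assumptions, not the crate's definitions.

/// A block of storage that may live on a device or in host memory.
pub trait SketchBuffer {
    /// Number of f64 elements held by the buffer.
    fn element_count(&self) -> usize;
    /// True when the host can read the storage without a device transfer.
    fn is_host_visible(&self) -> bool;
}

/// A 2-D matrix backed by some buffer.
pub trait SketchMatrix {
    fn rows(&self) -> usize;
    fn cols(&self) -> usize;
    /// Copies the contents back into host memory.
    fn to_host(&self) -> Vec<f64>;
}

/// Host-memory buffer used as a stand-in for a device allocation.
pub struct HostBuffer {
    data: Vec<f64>,
}

impl SketchBuffer for HostBuffer {
    fn element_count(&self) -> usize {
        self.data.len()
    }
    fn is_host_visible(&self) -> bool {
        true
    }
}

pub struct HostMatrix {
    rows: usize,
    cols: usize,
    buffer: HostBuffer,
}

impl HostMatrix {
    /// "Uploads" data by copying it, after checking that the shape matches.
    pub fn upload(rows: usize, cols: usize, data: &[f64]) -> Result<Self, String> {
        match rows.checked_mul(cols) {
            Some(n) if n == data.len() => Ok(HostMatrix {
                rows,
                cols,
                buffer: HostBuffer { data: data.to_vec() },
            }),
            _ => Err(format!(
                "shape {}x{} does not match {} elements",
                rows,
                cols,
                data.len()
            )),
        }
    }
}

impl SketchMatrix for HostMatrix {
    fn rows(&self) -> usize {
        self.rows
    }
    fn cols(&self) -> usize {
        self.cols
    }
    fn to_host(&self) -> Vec<f64> {
        self.buffer.data.clone()
    }
}
```

A device-backed implementor would replace the copy in `upload` and the clone in `to_host` with transfers. A zero-copy variant could instead borrow host-visible storage when `is_host_visible` reports true.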