Crate runmat_accelerate

Crate runmat_accelerate 

Source
Expand description

RunMat Accelerate: GPU Acceleration Abstraction Layer

Goals:

  • Provide a backend-agnostic API surface that maps RunMat operations to GPU kernels.
  • Support multiple backends via features (CUDA, ROCm, Metal, Vulkan, OpenCL, wgpu).
  • Allow zero-copy interop with runmat-builtins::Matrix where possible.
  • Defer actual kernel authoring to backend crates/modules; this crate defines traits and wiring.

This is scaffolding only; implementations will land after interpreter/JIT semantics are complete.

Modules§

simple_provider

Structs§

Accelerator
High-level façade for accelerated operations, falling back to runmat-runtime.
DeviceInfo
Device descriptor used for selection and capabilities query.
Planner
Planner determines whether to execute on CPU or a selected backend. This will eventually consult sizes, heuristics, and device availability.

Enums§

DeviceKind
High-level device kind. Concrete selection is provided by backend.
ExecutionTarget

Traits§

AccelerateBackend
Core backend interface that concrete backends must implement.
BufferHandle
Abstract buffer that may reside on device or be host-pinned.
DeviceMatrix
Abstract matrix allocated on a device backend.