pub struct CudaGemmDispatch;Trait Implementations§
Source§impl GpuGemmDispatch for CudaGemmDispatch
impl GpuGemmDispatch for CudaGemmDispatch
fn try_fast_atb( &self, a: ArrayView2<'_, f64>, b: ArrayView2<'_, f64>, ) -> Option<Array2<f64>>
fn try_fast_ab( &self, a: ArrayView2<'_, f64>, b: ArrayView2<'_, f64>, ) -> Option<Array2<f64>>
fn try_fast_av( &self, a: ArrayView2<'_, f64>, v: ArrayView1<'_, f64>, ) -> Option<Array1<f64>>
fn try_fast_atv( &self, a: ArrayView2<'_, f64>, v: ArrayView1<'_, f64>, ) -> Option<Array1<f64>>
fn try_fast_xt_diag_x( &self, x: ArrayView2<'_, f64>, w: ArrayView1<'_, f64>, ) -> Option<Array2<f64>>
fn try_fast_xt_diag_y( &self, x: ArrayView2<'_, f64>, w: ArrayView1<'_, f64>, y: ArrayView2<'_, f64>, ) -> Option<Array2<f64>>
fn try_fast_joint_hessian_2x2( &self, x_a: ArrayView2<'_, f64>, x_b: ArrayView2<'_, f64>, w_aa: ArrayView1<'_, f64>, w_ab: ArrayView1<'_, f64>, w_bb: ArrayView1<'_, f64>, ) -> Option<Array2<f64>>
Source§fn device_count(&self) -> usize
fn device_count(&self) -> usize
Number of usable GPU devices in the runtime pool (
0 when no GPU
runtime is available). Geometry’s multi-GPU row-tiling only engages when
this exceeds 1.Source§fn try_fast_ab_broadcast_b_batched(
&self,
a3: ArrayView3<'_, f64>,
b: ArrayView2<'_, f64>,
) -> Option<Array3<f64>>
fn try_fast_ab_broadcast_b_batched( &self, a3: ArrayView3<'_, f64>, b: ArrayView2<'_, f64>, ) -> Option<Array3<f64>>
Broadcast-
B strided-batched GEMM: each tiles × rows × k slab of a3
is multiplied by the shared k × n b, yielding a tiles × rows × n
batch. Returns None when the workload is below the multi-GPU floor or
the runtime declines, so the caller falls back to the single-device GEMM.Auto Trait Implementations§
impl Freeze for CudaGemmDispatch
impl RefUnwindSafe for CudaGemmDispatch
impl Send for CudaGemmDispatch
impl Sync for CudaGemmDispatch
impl Unpin for CudaGemmDispatch
impl UnsafeUnpin for CudaGemmDispatch
impl UnwindSafe for CudaGemmDispatch
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
impl<ST, DT> CastableFrom<ST, Initialized, Initialized> for DT
impl<ST, DT> CastableFrom<ST, Uninit, Uninit> for DT
Source§impl<T> DistributionExt for Twhere
T: ?Sized,
impl<T> DistributionExt for Twhere
T: ?Sized,
impl<T, U> Imply<T> for U
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more