Expand description
cuSOLVER-backed GPU linear algebra: SVD, Cholesky, QR, Solve.
Each operation follows the cuSOLVER pattern:
- Query workspace size via
*_bufferSize. - Allocate workspace + output buffers on the device.
- Call the cuSOLVER routine.
- Check
devInfo— non-zero means the operation failed (singular matrix, etc.).
All functions operate on column-major data because cuSOLVER (LAPACK-style) uses column-major layout. The caller is responsible for transposing row-major tensors before calling and transposing outputs back.
Functions§
- gpu_
cholesky_ f32 - Compute the Cholesky decomposition of an n-by-n SPD matrix (row-major f32).
- gpu_
cholesky_ f64 - Compute the Cholesky decomposition of an n-by-n SPD matrix (row-major f64).
- gpu_
qr_ f32 - Compute the reduced QR decomposition of an m-by-n matrix (row-major f32).
- gpu_
qr_ f64 - Compute the reduced QR decomposition of an m-by-n matrix (row-major f64).
- gpu_
solve_ f32 - Solve A * X = B for X where A is n-by-n and B is n-by-nrhs (row-major f32).
- gpu_
solve_ f64 - Solve A * X = B for X where A is n-by-n and B is n-by-nrhs (row-major f64).
- gpu_
svd_ f32 - Compute the thin SVD of an m-by-n matrix (row-major f32).
- gpu_
svd_ f64 - Compute the thin SVD of an m-by-n matrix (row-major f64).