Module launch


Kernel launch API.

Implements cudaLaunchKernel, cudaFuncGetAttributes, and cudaFuncSetAttribute on top of the CUDA Driver API.

§Design

In the CUDA Runtime, kernels are typically invoked with the <<<...>>> syntax, which NVCC rewrites into cudaLaunchKernel calls. Because OxiCUDA never uses NVCC, callers must work with the driver-level module and function handles directly. This module therefore exposes a slightly lower-level surface that accepts a CudaFunction instead of a raw symbol pointer.
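Without the <<<...>>> rewrite, the caller has to assemble the kernel argument list by hand. In the CUDA convention, cudaLaunchKernel and the driver's cuLaunchKernel take an array of pointers, one per kernel parameter, each pointing at the argument's value. The sketch below shows only that packing step in plain Rust. The kernel `saxpy(a, x, y, n)` and the helper `pack_saxpy_args` are hypothetical, and the exact parameter type that this module's launch_kernel expects is not shown on this page, so treat this as an illustration of the convention rather than of the crate's API.

```rust
use std::ffi::c_void;

/// Packs the arguments of a hypothetical kernel `saxpy(a, x, y, n)` into the
/// pointer-per-parameter array that the CUDA launch functions expect.
///
/// `x` and `y` stand in for device pointers, represented here as plain `u64`
/// addresses (an assumption made for illustration only).
fn pack_saxpy_args(
    a: &mut f32,
    x: &mut u64,
    y: &mut u64,
    n: &mut u32,
) -> [*mut c_void; 4] {
    [
        // Each entry points AT the argument; it is not the argument itself.
        a as *mut f32 as *mut c_void,
        x as *mut u64 as *mut c_void,
        y as *mut u64 as *mut c_void,
        n as *mut u32 as *mut c_void,
    ]
}
```

The pointed-to values must stay alive until the launch call returns, which is why the helper borrows them instead of taking them by value.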

Structs§

Dim3
3-D grid / block dimensions for kernel launches.
FuncAttributes
Attributes of a compiled kernel function.
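As a rough illustration of how 3-D launch dimensions are usually derived, the sketch below computes a 1-D grid size that covers `n` elements for a given block size. `LaunchDim` is a hypothetical stand-in written for this example; the crate's own Dim3 may use different field names or constructors.

```rust
/// Hypothetical stand-in for a 3-D launch dimension (not the crate's `Dim3`).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct LaunchDim {
    x: u32,
    y: u32,
    z: u32,
}

impl LaunchDim {
    /// A 1-D dimension: `x` items along the first axis, 1 along the others.
    fn linear(x: u32) -> Self {
        Self { x, y: 1, z: 1 }
    }

    /// Total number of blocks (or threads) described by this dimension.
    fn volume(&self) -> u64 {
        u64::from(self.x) * u64::from(self.y) * u64::from(self.z)
    }
}

/// Smallest 1-D grid whose blocks together cover at least `n` elements.
/// Only `block.x` is considered, so this assumes a 1-D block.
fn grid_for(n: u32, block: LaunchDim) -> LaunchDim {
    assert!(block.x > 0, "block size must be non-zero");
    // Ceiling division written so that it cannot overflow.
    let blocks = n / block.x + u32::from(n % block.x != 0);
    LaunchDim::linear(blocks)
}
```

Because the grid is rounded up, the last block may contain threads past the end of the data, so a kernel launched this way normally needs a bounds check on its global index.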

Enums§

FuncAttribute
Attribute selector for cudaFuncSetAttribute.

Functions§

func_get_attributes
Query attributes of a compiled kernel.
func_set_attribute
Set a kernel attribute.
launch_kernel
Launch a CUDA kernel.
module_get_function
Get a function handle by name from a loaded module.
module_load_ptx
Load a PTX module from a null-terminated byte string.
module_unload
Unload a previously loaded module.
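Since module_load_ptx is described as taking a null-terminated byte string, PTX text held in an ordinary Rust string needs a trailing NUL before it is passed in. A minimal sketch of that conversion using the standard library follows; the helper name `ptx_with_nul` is hypothetical, and the assumption that the loader accepts a byte buffer of this shape comes only from the one-line description above.

```rust
use std::ffi::{CString, NulError};

/// Returns the PTX text as bytes with exactly one trailing NUL.
///
/// Fails if the input already contains an interior NUL byte, which would
/// otherwise make a C-style reader silently truncate the module source.
fn ptx_with_nul(ptx: &str) -> Result<Vec<u8>, NulError> {
    Ok(CString::new(ptx)?.into_bytes_with_nul())
}
```

Going through CString rather than pushing a 0 byte manually means interior NULs are reported as an error instead of being passed on to the driver.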

Type Aliases§

CudaFunction
A compiled GPU kernel function (alias for the driver’s CUfunction).
CudaModule
A compiled GPU module (alias for the driver’s CUmodule).