Skip to main content

StreamCapture

oxicuda_driver::graph

Struct StreamCapture

pub struct StreamCapture { /* private fields */ }

Expand description

Records GPU operations submitted to a stream into a Graph.

Stream capture intercepts operations that would normally be submitted to a CUDA stream and instead records them as graph nodes. The captured operations can then be replayed efficiently via GraphExec.

§Usage

let mut capture = StreamCapture::begin(&stream)?;

capture.record_kernel("my_kernel", (4, 1, 1), (256, 1, 1), 0);
capture.record_memcpy(MemcpyDirection::DeviceToHost, 1024);

let graph = capture.end()?;
assert_eq!(graph.node_count(), 2);

Implementations§

impl StreamCapture

pub fn begin(_stream: &Stream) -> CudaResult<Self>

Begins capturing operations on the given stream.

On a real CUDA system, this would call cuStreamBeginCapture(stream, CU_STREAM_CAPTURE_MODE_GLOBAL).

§Errors

Returns CudaError::NotInitialized if the CUDA driver is not available.

pub fn record_kernel( &mut self, function_name: &str, grid: (u32, u32, u32), block: (u32, u32, u32), shared_mem: u32, )

Records a kernel launch operation in the capture.

§Parameters

function_name - Name of the kernel function.
grid - Grid dimensions (x, y, z).
block - Block dimensions (x, y, z).
shared_mem - Dynamic shared memory in bytes.

pub fn record_memcpy(&mut self, direction: MemcpyDirection, size: usize)

Records a memory copy operation in the capture.

§Parameters

direction - Direction of the memory copy.
size - Size of the transfer in bytes.

pub fn record_memset(&mut self, size: usize, value: u8)

Records a memset operation in the capture.

§Parameters

size - Number of bytes to set.
value - Byte value to fill with.

pub fn recorded_count(&self) -> usize

Returns the number of operations recorded so far.

pub fn is_active(&self) -> bool

Returns whether the capture is still active.

pub fn end(self) -> CudaResult<Graph>

Ends the capture and returns the resulting Graph.

On a real CUDA system, this would call cuStreamEndCapture and return the captured graph handle.

The captured nodes are connected in a linear chain (each node depends on the previous one) to preserve the order in which operations were recorded.

§Errors

Returns CudaError::StreamCaptureUnmatched if the capture was already ended.

Auto Trait Implementations§

impl Freeze for StreamCapture

impl RefUnwindSafe for StreamCapture

impl Send for StreamCapture

impl Sync for StreamCapture

impl Unpin for StreamCapture

impl UnsafeUnpin for StreamCapture

impl UnwindSafe for StreamCapture

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more