Struct GraphExec 

pub struct GraphExec { /* private fields */ }
An instantiated, executable graph.

Created by Graph::instantiate, a GraphExec holds a snapshot of the graph and a pre-computed execution order.

§Driver backing

When a CUDA driver is available, instantiate builds a genuine CUgraph (cuGraphCreate + one cuGraphAdd*Node per in-memory node, with the dependency DAG wired through real CUgraphNode edges) and finalises it into a CUgraphExec via cuGraphInstantiate. In that case launch issues a real cuGraphLaunch.

The in-memory GraphNode representation stores only an operation specification (kernel name, copy direction/size, memset size/value); it carries no resolved CUfunction or device pointers. Every node is therefore added through cuGraphAddEmptyNode: the resulting driver graph reproduces the node count and dependency topology exactly, and executes on the GPU as a DAG of synchronisation barriers that perform no kernel, copy, or memset work. The per-node dispatch in Graph::build_driver_graph is structured so that kernel / memcpy / memset nodes that gain concrete device operands can be promoted to cuGraphAddKernelNode / cuGraphAddMemcpyNode / cuGraphAddMemsetNode without further restructuring.
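The dispatch described above can be pictured with a small self-contained model. This is only a sketch: NodeOp, DriverNodeKind, driver_kind, and the has_device_operands flag are hypothetical stand-ins invented for illustration, not the crate's actual types or the body of Graph::build_driver_graph. It shows how a single match can fall back to an empty node today while leaving an obvious place to promote nodes later.

```rust
/// Hypothetical stand-in for the operation specification stored by an
/// in-memory node. The real GraphNode layout is not shown in these docs.
#[allow(dead_code)]
#[derive(Debug, Clone, PartialEq)]
enum NodeOp {
    Kernel { name: String },
    Memcpy { bytes: usize },
    Memset { bytes: usize, value: u8 },
}

/// Which CUDA driver call a node would be added with.
#[allow(dead_code)]
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum DriverNodeKind {
    Empty,  // cuGraphAddEmptyNode
    Kernel, // cuGraphAddKernelNode
    Memcpy, // cuGraphAddMemcpyNode
    Memset, // cuGraphAddMemsetNode
}

/// Picks the driver node kind for one node.
///
/// `has_device_operands` is an assumed flag meaning "a resolved CUfunction
/// or device pointers are available". Without them, every operation falls
/// back to an empty node, which preserves ordering but does no work.
#[allow(dead_code)]
fn driver_kind(op: &NodeOp, has_device_operands: bool) -> DriverNodeKind {
    match op {
        NodeOp::Kernel { .. } if has_device_operands => DriverNodeKind::Kernel,
        NodeOp::Memcpy { .. } if has_device_operands => DriverNodeKind::Memcpy,
        NodeOp::Memset { .. } if has_device_operands => DriverNodeKind::Memset,
        // No concrete operands: keep the node as a synchronisation barrier.
        _ => DriverNodeKind::Empty,
    }
}
```

In this model, promoting a node type only changes which match arm fires; the node count and the dependency edges are untouched.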

On macOS (or any host without a CUDA driver), no driver handles are created; the graph is still validated (topological sort) and launch returns CudaError::NotInitialized.

Implementations§

impl GraphExec

pub fn launch(&self, stream: &Stream) -> CudaResult<()>

Launches the executable graph on the given stream.

When this GraphExec is backed by a real CUgraphExec, this issues cuGraphLaunch(hGraphExec, hStream), submitting the entire graph to the stream in a single driver call. Otherwise it returns the driver-load error (CudaError::NotInitialized on a host without a CUDA driver).

§Errors

Returns CudaError::NotInitialized on a host without a CUDA driver.

pub fn graph(&self) -> &Graph

Returns a reference to the underlying graph.

pub fn execution_order(&self) -> &[usize]

Returns the pre-computed execution order (topological sort).
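As an illustration of what a pre-computed topological order looks like, here is a minimal sketch using Kahn's algorithm. The function name topological_order and the edge-list input are assumptions made for this example; the crate's own sort and its tie-breaking between independent nodes may differ.

```rust
use std::collections::VecDeque;

/// Returns a topological order of nodes `0..node_count`, or `None` if the
/// edges contain a cycle. Each edge `(from, to)` means `from` must run
/// before `to`. Panics if an edge refers to an index >= `node_count`.
#[allow(dead_code)]
fn topological_order(node_count: usize, edges: &[(usize, usize)]) -> Option<Vec<usize>> {
    let mut successors = vec![Vec::new(); node_count];
    let mut in_degree = vec![0usize; node_count];
    for &(from, to) in edges {
        successors[from].push(to);
        in_degree[to] += 1;
    }

    // Start with every node that has no unmet dependencies.
    let mut ready: VecDeque<usize> =
        (0..node_count).filter(|&n| in_degree[n] == 0).collect();
    let mut order = Vec::with_capacity(node_count);

    while let Some(node) = ready.pop_front() {
        order.push(node);
        for &next in &successors[node] {
            in_degree[next] -= 1;
            if in_degree[next] == 0 {
                ready.push_back(next);
            }
        }
    }

    // Nodes left unvisited can only be part of a cycle.
    if order.len() == node_count {
        Some(order)
    } else {
        None
    }
}
```

For a diamond-shaped graph with edges 0→1, 0→2, 1→3, 2→3, this sketch yields the order 0, 1, 2, 3; a two-node cycle yields None, which corresponds to a graph that would fail validation.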

pub fn node_count(&self) -> usize

Returns the total number of nodes that would be executed.

pub fn is_driver_backed(&self) -> bool

Returns true if this GraphExec is backed by a real, live CUgraphExec driver handle (as opposed to a CPU-side-only graph).
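The relationship between is_driver_backed and launch can be sketched with a self-contained model. Everything here is a hypothetical stand-in: ExecModel, its driver_handle field, and the one-variant CudaError are invented for illustration and are not the crate's real definitions. The model only mirrors the documented behaviour that a launch without a driver handle fails with NotInitialized.

```rust
/// Hypothetical stand-in for the crate's error type; only the variant
/// named in the documentation is modelled.
#[allow(dead_code)]
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum CudaError {
    NotInitialized,
}

#[allow(dead_code)]
type CudaResult<T> = Result<T, CudaError>;

/// Hypothetical model of an executable graph's backing state.
/// `Some(_)` stands for a live CUgraphExec handle, `None` for a
/// CPU-side-only graph.
#[allow(dead_code)]
struct ExecModel {
    driver_handle: Option<u64>,
}

#[allow(dead_code)]
impl ExecModel {
    /// True when a driver handle is present.
    fn is_driver_backed(&self) -> bool {
        self.driver_handle.is_some()
    }

    /// Models launch: succeeds only when driver-backed.
    fn launch(&self) -> CudaResult<()> {
        match self.driver_handle {
            // A real implementation would call cuGraphLaunch here.
            Some(_) => Ok(()),
            None => Err(CudaError::NotInitialized),
        }
    }
}
```

Under these assumptions, a caller that wants to skip GPU work on a driverless host can branch on is_driver_backed first, or simply call launch and treat NotInitialized as "no driver available".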

Trait Implementations§

impl Debug for GraphExec

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

impl Drop for GraphExec

fn drop(&mut self)

Executes the destructor for this type.

fn pin_drop(self: Pin<&mut Self>)

🔬This is a nightly-only experimental API. (pin_ergonomics)
Executes the destructor for this type; unlike Drop::drop, it requires self to be pinned.
