Skip to main content

CutlassActor

Struct CutlassActor 

Source
pub struct CutlassActor { /* private fields */ }
Expand description

Host-side actor. Holds an Arc<CutlassInner> so messages can be processed from a worker thread without locking the actor itself after construction.

Implementations§

Source§

impl CutlassActor

Source

pub fn new(plan_cache_capacity: usize) -> Self

Examples found in repository?
examples/cutlass_gemm_fp8.rs (line 11)
10fn main() {
11    let actor = CutlassActor::new(16);
12    let req = GemmRequest::<F8E4m3>::new(GemmShape::new(4096, 4096, 4096), SmArch::Sm90a)
13        .with_epilogue(GemmEpilogue::LinearReLU {
14            alpha: 1.0,
15            beta: 0.0,
16        });
17
18    println!("plan key: {:?}", req.plan_key());
19    let (src, name) = req.render_cu();
20    println!("kernel:   {name}");
21    println!("--- generated .cu ---");
22    println!("{src}");
23
24    actor.handle(CutlassMsg::Gemm(Box::new(req.clone())));
25    actor.handle(CutlassMsg::Gemm(Box::new(req)));
26
27    println!("dispatched: {}", actor.inner().dispatched());
28    println!("plan cache len: {}", actor.inner().plan_cache.len());
29}
Source

pub fn prebuilt_active() -> bool

true when the crate was built with cutlass-prebuilt and nvcc was found at build time, so libatomr_cutlass_prebuilt.a is statically linked into the binary. Phase 6.1 ships one canonical GEMM placeholder cell (proves the wiring); Phase 6.2 expands the cell matrix and routes hits through the prebuilt symbol table before falling back to NVRTC.

Source

pub fn inner(&self) -> Arc<CutlassInner>

Examples found in repository?
examples/cutlass_gemm_fp8.rs (line 27)
10fn main() {
11    let actor = CutlassActor::new(16);
12    let req = GemmRequest::<F8E4m3>::new(GemmShape::new(4096, 4096, 4096), SmArch::Sm90a)
13        .with_epilogue(GemmEpilogue::LinearReLU {
14            alpha: 1.0,
15            beta: 0.0,
16        });
17
18    println!("plan key: {:?}", req.plan_key());
19    let (src, name) = req.render_cu();
20    println!("kernel:   {name}");
21    println!("--- generated .cu ---");
22    println!("{src}");
23
24    actor.handle(CutlassMsg::Gemm(Box::new(req.clone())));
25    actor.handle(CutlassMsg::Gemm(Box::new(req)));
26
27    println!("dispatched: {}", actor.inner().dispatched());
28    println!("plan cache len: {}", actor.inner().plan_cache.len());
29}
Source

pub fn handle(&self, msg: CutlassMsg)

Synchronously process a message. The real production path runs through atomr_core::actor::Actor::handle; this method is the host-only fast path that the unit tests exercise.

Examples found in repository?
examples/cutlass_gemm_fp8.rs (line 24)
10fn main() {
11    let actor = CutlassActor::new(16);
12    let req = GemmRequest::<F8E4m3>::new(GemmShape::new(4096, 4096, 4096), SmArch::Sm90a)
13        .with_epilogue(GemmEpilogue::LinearReLU {
14            alpha: 1.0,
15            beta: 0.0,
16        });
17
18    println!("plan key: {:?}", req.plan_key());
19    let (src, name) = req.render_cu();
20    println!("kernel:   {name}");
21    println!("--- generated .cu ---");
22    println!("{src}");
23
24    actor.handle(CutlassMsg::Gemm(Box::new(req.clone())));
25    actor.handle(CutlassMsg::Gemm(Box::new(req)));
26
27    println!("dispatched: {}", actor.inner().dispatched());
28    println!("plan cache len: {}", actor.inner().plan_cache.len());
29}

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T> Instrument for T

Source§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more
Source§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<T> WithSubscriber for T

Source§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more
Source§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more