Module collective

Async collective operations: AllReduce via dedicated comm threads.

Structs

AllReduceHandle
Non-blocking AllReduce handle.
CollectiveBarrier
Cross-rank barrier for summing f32 vectors.
CommRequest
Request from the compute thread to the comm thread.
CommThread
Dedicated communication thread for async AllReduce.
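
As a rough illustration of what a cross-rank barrier that sums f32 vectors can look like, here is a minimal in-process sketch built on `std::sync::Mutex` and `Condvar`. The names `SumBarrier`, `SumState`, and `all_reduce` are hypothetical and are not taken from this module; the sketch also assumes that "ranks" are threads in one process, whereas the real `CollectiveBarrier` may work quite differently.

```rust
use std::sync::{Condvar, Mutex};

/// Hypothetical shared state; not the module's actual layout.
struct SumState {
    acc: Vec<f32>,    // running element-wise sum for the current round
    arrived: usize,   // ranks that have contributed in this round
    generation: u64,  // bumped each time a round completes
    result: Vec<f32>, // sum published by the most recent round
}

/// Hypothetical barrier: every rank contributes a vector and
/// every rank receives the element-wise sum.
struct SumBarrier {
    ranks: usize,
    state: Mutex<SumState>,
    cv: Condvar,
}

impl SumBarrier {
    fn new(ranks: usize, len: usize) -> Self {
        SumBarrier {
            ranks,
            state: Mutex::new(SumState {
                acc: vec![0.0; len],
                arrived: 0,
                generation: 0,
                result: Vec::new(),
            }),
            cv: Condvar::new(),
        }
    }

    /// Add `local` to the shared accumulator, then block until all
    /// ranks have arrived. Returns the summed vector.
    fn all_reduce(&self, local: &[f32]) -> Vec<f32> {
        let mut st = self.state.lock().unwrap();
        assert_eq!(local.len(), st.acc.len(), "vector length mismatch");
        for (a, x) in st.acc.iter_mut().zip(local) {
            *a += *x;
        }
        st.arrived += 1;

        if st.arrived == self.ranks {
            // Last rank to arrive: publish the sum and reset for the next round.
            let len = st.acc.len();
            let total = std::mem::replace(&mut st.acc, vec![0.0; len]);
            st.result = total.clone();
            st.arrived = 0;
            st.generation += 1;
            self.cv.notify_all();
            return total;
        }

        // Earlier ranks wait for the generation counter to change,
        // which also guards against spurious wakeups.
        let entered = st.generation;
        while st.generation == entered {
            st = self.cv.wait(st).unwrap();
        }
        st.result.clone()
    }
}
```

The generation counter lets the same barrier be reused across rounds: a waiter only leaves once the round it entered has completed.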

Functions

async_allreduce
Submit a non-blocking AllReduce via the comm thread.
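
The general pattern behind a non-blocking AllReduce on a dedicated comm thread can be sketched with standard-library channels. Everything below is an assumption for illustration: `SketchRequest`, `SketchHandle`, `spawn_comm_thread`, and `submit_allreduce` are hypothetical names, not the signatures of `CommRequest`, `AllReduceHandle`, `CommThread`, or `async_allreduce`. The reduction is also simulated in-process by summing per-rank buffers passed in the request, where a real implementation would exchange data across ranks.

```rust
use std::sync::mpsc::{channel, Receiver, Sender};
use std::thread::{self, JoinHandle};

/// Hypothetical request: per-rank buffers to reduce, plus a reply channel.
struct SketchRequest {
    inputs: Vec<Vec<f32>>,
    reply: Sender<Vec<f32>>,
}

/// Hypothetical handle held by the compute thread.
struct SketchHandle {
    result: Receiver<Vec<f32>>,
}

impl SketchHandle {
    /// Block until the comm thread has finished the reduction.
    fn wait(self) -> Vec<f32> {
        self.result
            .recv()
            .expect("comm thread exited before replying")
    }
}

/// Spawn a comm thread that serves requests until every sender is dropped.
fn spawn_comm_thread() -> (Sender<SketchRequest>, JoinHandle<()>) {
    let (tx, rx) = channel::<SketchRequest>();
    let worker = thread::spawn(move || {
        for req in rx {
            // Element-wise sum over all buffers (stand-in for a real AllReduce).
            let len = req.inputs.first().map_or(0, |v| v.len());
            let mut sum = vec![0.0f32; len];
            for buf in &req.inputs {
                for (acc, x) in sum.iter_mut().zip(buf) {
                    *acc += *x;
                }
            }
            // The caller may have dropped its handle; ignore a failed send.
            let _ = req.reply.send(sum);
        }
    });
    (tx, worker)
}

/// Queue a request and return immediately with a handle to the result.
fn submit_allreduce(comm: &Sender<SketchRequest>, inputs: Vec<Vec<f32>>) -> SketchHandle {
    let (reply, result) = channel();
    comm.send(SketchRequest { inputs, reply })
        .expect("comm thread is not running");
    SketchHandle { result }
}
```

In this sketch the compute thread calls `submit_allreduce`, continues with other work, and calls `wait()` on the handle only when it needs the reduced values. Dropping the request sender ends the comm thread's loop so it can be joined.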