Safe Rust wrappers for NVIDIA NCCL.
v0.1 covers the communicator (single-process multi-GPU via
ncclCommInitAll, multi-process via ncclCommInitRank with a shared
UniqueId) and two collectives, all_reduce and broadcast, which is
enough for synchronous data-parallel training.
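
To make the role of the two collectives concrete, the following is a host-side reference model of what they compute. It is a sketch only: the function names `all_reduce_sum` and `broadcast_from` are hypothetical, the buffers are plain `Vec<f32>` values standing in for per-rank device memory, and none of this is the crate's API or how NCCL implements the operations.

```rust
/// Reference semantics of a sum all-reduce (hypothetical helper, CPU only):
/// after the call, every rank's buffer holds the element-wise sum of the
/// buffers contributed by all ranks.
fn all_reduce_sum(buffers: &mut [Vec<f32>]) {
    let len = buffers.first().map_or(0, |b| b.len());
    assert!(
        buffers.iter().all(|b| b.len() == len),
        "all ranks must contribute buffers of equal length"
    );

    // Accumulate the element-wise sum over every rank's buffer.
    let mut sum = vec![0.0f32; len];
    for buf in buffers.iter() {
        for (acc, x) in sum.iter_mut().zip(buf) {
            *acc += *x;
        }
    }

    // Every rank ends up with the same reduced result.
    for buf in buffers.iter_mut() {
        buf.copy_from_slice(&sum);
    }
}

/// Reference semantics of a broadcast (hypothetical helper, CPU only):
/// after the call, every rank's buffer is a copy of the root rank's buffer.
fn broadcast_from(buffers: &mut [Vec<f32>], root: usize) {
    let src = buffers[root].clone();
    for buf in buffers.iter_mut() {
        buf.clone_from(&src);
    }
}
```

In a synchronous data-parallel step, a broadcast of this shape would typically distribute the initial parameters from one rank, and a sum all-reduce over the per-rank gradients (followed by division by the number of ranks) would give every rank the same averaged gradient.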
NCCL is a Linux library; Windows support is experimental and has no
general distribution. On hosts without NCCL, [Communicator::init_all]
returns LoaderError::LibraryNotFound, so callers can fall back to
single-device execution.
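
The fallback described above might be structured as in the sketch below. Only the name `LoaderError::LibraryNotFound` comes from the text; the `Other` variant, the `Mode` enum, and `select_mode` are hypothetical stand-ins, and the initializer is passed as a closure because the actual signature of `Communicator::init_all` is not assumed here.

```rust
/// Stand-in for the crate's loader error. Only `LibraryNotFound` is named
/// in the documentation; `Other` is a hypothetical catch-all.
#[derive(Debug, PartialEq)]
enum LoaderError {
    LibraryNotFound,
    Other(String),
}

/// Hypothetical execution mode chosen by the caller.
#[derive(Debug, PartialEq)]
enum Mode<C> {
    /// NCCL loaded; `C` stands for whatever the initializer returns.
    MultiGpu(C),
    /// NCCL is absent on this host; run on a single device.
    SingleDevice,
}

/// Runs the initializer and downgrades to single-device execution only
/// when the library is missing. Any other error is propagated, since it
/// likely indicates a real misconfiguration rather than an absent NCCL.
fn select_mode<C>(
    init: impl FnOnce() -> Result<C, LoaderError>,
) -> Result<Mode<C>, LoaderError> {
    match init() {
        Ok(comms) => Ok(Mode::MultiGpu(comms)),
        Err(LoaderError::LibraryNotFound) => Ok(Mode::SingleDevice),
        Err(other) => Err(other),
    }
}
```

A caller would pass a closure that invokes the real initializer. Matching on the specific variant, rather than treating every error as "no NCCL", keeps genuine failures visible.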