Common executor utilities, extracted from code that was duplicated across the Qwen3, Qwen2, and Llama executors.
Structs
- GenericKvCacheHandle - Generic KV cache handle usable by any model architecture.
Functions
- default_executor_status - Default executor status (all executors return the same thing).
- tensor_to_tokens - Extract token IDs from a TensorRef.
- tokens_to_tensor - Convert token IDs to a candle Tensor on the target device.
- wrap_tensor - Wrap a candle Tensor as a TensorRef.
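
The token conversion helpers listed above form a round trip: token IDs go into a tensor, and token IDs come back out. This index does not show the real signatures, so the sketch below only illustrates the idea using the standard library. Everything in it is hypothetical: `SketchTensor` stands in for the candle Tensor and TensorRef types, the `_sketch` function names are invented for illustration, and the `[1, seq_len]` batch-of-one shape is an assumption.

```rust
/// Hypothetical stand-in for a tensor: a flat buffer plus its shape.
/// The real helpers operate on candle Tensors and TensorRef values.
#[derive(Debug, Clone, PartialEq)]
struct SketchTensor {
    data: Vec<u32>,
    shape: Vec<usize>,
}

/// Pack token IDs into a tensor of shape [1, seq_len].
/// The batch-of-one layout is an assumption, not taken from the crate.
fn tokens_to_tensor_sketch(tokens: &[u32]) -> SketchTensor {
    SketchTensor {
        data: tokens.to_vec(),
        shape: vec![1, tokens.len()],
    }
}

/// Recover token IDs, rejecting a buffer whose length disagrees with its shape.
fn tensor_to_tokens_sketch(tensor: &SketchTensor) -> Result<Vec<u32>, String> {
    let expected: usize = tensor.shape.iter().product();
    if expected != tensor.data.len() {
        return Err(format!(
            "shape {:?} implies {} elements, found {}",
            tensor.shape,
            expected,
            tensor.data.len()
        ));
    }
    Ok(tensor.data.clone())
}
```

Returning a `Result` from the extraction direction is a design choice of this sketch: a tensor handed back by an executor may not have the expected layout, so a recoverable error is friendlier to callers than a panic.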