Expand description
Utilities to simplify running auto-regressive RTen models such as transformer decoders.
For working examples, see the examples in the rten-examples
crate which import rten_generate
.
Re-exports§
pub use generator::Generator;
pub use generator::GeneratorConfig;
pub use generator::GeneratorError;
pub use generator::GeneratorUtils;
pub use generator::ModelInputsConfig;
Modules§
- Filters for processing model outputs prior to sampling.
- Tools to run the generation loop for an auto-regressive model.
- Record timing metrics during generation.
- Abstraction over
rten::Model
for querying and executing ML models. - Samplers which select a token from model outputs.
- Iterator adapters to decode token IDs into text using
rten-text
.