Crate rten_generate

Utilities to simplify running auto-regressive RTen models such as transformer decoders.

For working examples, see the programs in the rten-examples crate that import rten_generate.

Re-exports

Modules

  • Filters for processing model outputs prior to sampling.
  • Tools to run the generation loop for an auto-regressive model.
  • Record timing metrics during generation.
  • Abstraction over rten::Model for querying and executing ML models.
  • Samplers which select a token from model outputs.
  • Iterator adapters to decode token IDs into text using rten-text.
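
To make the roles of the modules above concrete, here is a minimal, self-contained sketch of an auto-regressive generation loop: run the model, filter its outputs, sample a token, and repeat. It deliberately does not use the rten_generate API. Every name in it (`toy_logits`, `suppress_token`, `argmax`, `generate`) is hypothetical, and the "model" is a toy function that simply prefers the token after the last one.

```rust
/// Hypothetical stand-in for a model: returns one logit per vocabulary
/// entry, given the tokens seen so far. It favors `last + 1`, wrapping
/// around at `vocab_size`.
fn toy_logits(tokens: &[u32], vocab_size: usize) -> Vec<f32> {
    let last = *tokens.last().unwrap_or(&0) as usize;
    (0..vocab_size)
        .map(|id| if id == (last + 1) % vocab_size { 1.0 } else { 0.0 })
        .collect()
}

/// Filter step (illustrative): mask one token so it can never be sampled.
fn suppress_token(logits: &mut [f32], banned: u32) {
    if let Some(logit) = logits.get_mut(banned as usize) {
        *logit = f32::NEG_INFINITY;
    }
}

/// Sampler step (illustrative): greedy selection of the highest logit.
/// Ties resolve to the lowest token ID.
fn argmax(logits: &[f32]) -> u32 {
    let mut best = 0;
    for (i, &value) in logits.iter().enumerate() {
        if value > logits[best] {
            best = i;
        }
    }
    best as u32
}

/// Generation loop: model -> filter -> sampler, repeated until the
/// end-of-sequence token is sampled or `max_new` tokens are produced.
/// Returns only the newly generated tokens.
fn generate(
    prompt: &[u32],
    vocab_size: usize,
    eos: u32,
    max_new: usize,
    banned: Option<u32>,
) -> Vec<u32> {
    let mut tokens = prompt.to_vec();
    let mut generated = Vec::new();
    for _ in 0..max_new {
        let mut logits = toy_logits(&tokens, vocab_size);
        if let Some(id) = banned {
            suppress_token(&mut logits, id);
        }
        let next = argmax(&logits);
        if next == eos {
            break;
        }
        tokens.push(next);
        generated.push(next);
    }
    generated
}
```

Calling `generate(&[0], 5, 4, 10, None)` with this toy model produces tokens 1, 2, 3 and then stops when token 4 (the end-of-sequence ID in this sketch) is sampled. A real decoder would also cache intermediate state between steps and decode token IDs to text, which this sketch leaves out.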