pub struct ReformerModelWithLMHead { /* private fields */ }
Expand description

Reformer Model for text generation

Reformer model with a vocabulary decoding head It is made of the following blocks:

  • reformer: ReformerModel Base Reformer model
  • lm_head: ReformerLMHead projecting hidden states to the vocabulary dimension

Implementations

Build a new ReformerModelWithLMHead

Arguments
  • p - Variable store path for the root of the BART model
  • config - ReformerConfig object defining the model architecture
Example
use rust_bert::reformer::{ReformerConfig, ReformerModelWithLMHead};
use rust_bert::Config;
use std::path::Path;
use tch::{nn, Device};

let config_path = Path::new("path/to/config.json");
let device = Device::Cpu;
let p = nn::VarStore::new(device);
let config = ReformerConfig::from_file(config_path);
let reformer_model: ReformerModelWithLMHead =
    ReformerModelWithLMHead::new(&p.root(), &config).unwrap();

Forward pass through the model

Arguments
  • input_ids - Optional input tensor of shape (batch size, sequence_length). Must be provided when no pre-computed embeddings are given.
  • position_ids - Optional input tensor of shape (batch size, sequence_length). If not provided will be calculated on the fly starting from position 0.
  • input_embeds - Optional input tensor of shape (batch size, sequence_length, embeddings_dim). Must be provided when no input ids are given.
  • attention_mask - Optional attention mask of shape (batch size, sequence_length). Positions with a mask with value 0 will be masked.
  • num_hashes - Optional specification of the number of hashes to use. If not provided will use the value provided in the model configuration.
  • old_layer_states - Optional cached input (Option<Vec<Option<LayerState>>>) containing previous values for the cached states and buckets.
  • train - boolean flag to turn on/off the dropout layers in the model. Should be set to false for inference.
Returns
  • ReformerLMModelOutput containing:
    • logits - Tensor of shape (batch size, sequence_length, vocab_size) representing the logits for each vocabulary item
    • all_hidden_states - Option<Vec<Tensor>> of length n_layers with shape (batch size, sequence_length, hidden_size)
    • all_attentions - Option<Vec<Tensor>> of length n_layers with shape (batch size, sequence_length, hidden_size)
    • cache - Option<Vec<Option<LayerState>>> of length n_layer containing values for the states and buckets for future use.
Example
use rust_bert::reformer::{ReformerConfig, ReformerModelWithLMHead};
let (batch_size, sequence_length) = (64, 128);
let input_tensor = Tensor::rand(&[batch_size, sequence_length], (Int64, device));
let input_positions = Tensor::arange(sequence_length, (Kind::Int64, device)).unsqueeze(0).expand(&[batch_size, sequence_length], true);
let attention_mask = Tensor::ones(&[batch_size, sequence_length], (Int64, device));

let model_output = no_grad(|| {
    reformer_model.forward_t(
        Some(&input_tensor),
        Some(&input_positions),
        None,
        Some(&attention_mask),
        Some(4),
        None,
        false,
    )
});

Trait Implementations

Forward pass through the model. Example provided for GPT2. Read more

Generate text based on a vector of promp texts. Read more

Generate token indices without decoding (useful for token-level operations before returning final text or as validation step during training). Read more

Generate token indices given a list of indices (useful when the input has been pre-tokenized). Returns a list of output tokens that need to be decoded using a tokenizer. Read more

Returns a reference to the text generator’s tokenizer Read more

Auto Trait Implementations

Blanket Implementations

Gets the TypeId of self. Read more

Immutably borrows from an owned value. Read more

Mutably borrows from an owned value. Read more

Returns the argument unchanged.

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more

Instruments this type with the current Span, returning an Instrumented wrapper. Read more

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

The alignment of pointer.

The type for initializers.

Initializes a with the given initializer. Read more

Dereferences the given pointer. Read more

Mutably dereferences the given pointer. Read more

Drops the object pointed to by the given pointer. Read more

Should always be Self

The type returned in the event of a conversion error.

Performs the conversion.

The type returned in the event of a conversion error.

Performs the conversion.

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more