Expand description

FNet, Mixing Tokens with Fourier Transforms (Lee-Thorp et al.)

Implementation of the FNet language model (https://arxiv.org/abs/2105.03824 Lee-Thorp, Ainslie, Eckstein, Ontanon, 2021). The base model is implemented in the fnet_model::FNetModel struct. Several language model heads have also been implemented, including:

  • Masked language model: fnet_model::FNetForMaskedLM
  • Question answering: fnet_model::FNetForQuestionAnswering
  • Sequence classification: fnet_model::FNetForSequenceClassification
  • Token classification (e.g. NER, POS tagging): fnet_model::FNetForTokenClassification

Model set-up and pre-trained weights loading

The example below illustrate a FNet Masked language model example, the structure is similar for other models. All models expect the following resources:

  • Configuration file expected to have a structure following the Transformers library
  • Model weights are expected to have a structure and parameter names following the Transformers library. A conversion using the Python utility scripts is required to convert the .bin weights to the .ot format.
  • FNetTokenizer using a spiece.model SentencePiece (BPE) model file Pretrained models are available and can be downloaded using RemoteResources.
use tch::{nn, Device};
use rust_bert::fnet::{FNetConfig, FNetForMaskedLM};
use rust_bert::resources::{LocalResource, ResourceProvider};
use rust_bert::Config;
use rust_tokenizers::tokenizer::{BertTokenizer, FNetTokenizer};

let config_resource = LocalResource {
    local_path: PathBuf::from("path/to/config.json"),
};
let vocab_resource = LocalResource {
    local_path: PathBuf::from("path/to/spiece.model"),
};
let weights_resource = LocalResource {
    local_path: PathBuf::from("path/to/model.ot"),
};
let config_path = config_resource.get_local_path()?;
let vocab_path = vocab_resource.get_local_path()?;
let weights_path = weights_resource.get_local_path()?;
let device = Device::cuda_if_available();
let mut vs = nn::VarStore::new(device);
let tokenizer: FNetTokenizer =
    FNetTokenizer::from_file(vocab_path.to_str().unwrap(), true, true)?;
let config = FNetConfig::from_file(config_path);
let bert_model = FNetForMaskedLM::new(&vs.root(), &config);
vs.load(weights_path)?;

Structs

FNet model configuration

FNet Pretrained model config files

FNet for masked language model

FNet for multiple choices

FNet for question answering

FNet for sequence classification

FNet for token classification (e.g. NER, POS)

Container for the FNet masked LM model output.

FNet Base model

Container for the FNet model output.

FNet Pretrained model weight files

Container for the FNet question answering model output.

Container for the FNet sequence classification model output.

FNet Pretrained model vocab files

Type Definitions

Container for the FNet token classification model output.