Expand description
Readers and writers for text formats.
This module provides two readers/writers:
ReadText/WriteText: word embeddings in text format. In this format, each line contains a word followed by its embedding. The word and the embedding vector components are separated by a space. This format is used by GloVe.ReadTextDims/WriteTextDims: this format is the same as (1), but the data is preceded by a line with the shape of the embedding matrix. This format is used by word2vec’s text output.
For example:
use std::fs::File;
use std::io::BufReader;
use rust2vec::prelude::*;
let mut reader = BufReader::new(File::open("testdata/similarity.txt").unwrap());
// Read the embeddings. The second arguments specifies whether
// the embeddings should be normalized to unit vectors.
let embeddings = Embeddings::read_text_dims(&mut reader, true)
.unwrap();
// Look up an embedding.
let embedding = embeddings.embedding("Berlin");Traits§
- Read
Text - Method to construct
Embeddingsfrom a text file. - Read
Text Dims - Method to construct
Embeddingsfrom a text file with dimensions. - Write
Text - Method to write
Embeddingsto a text file. - Write
Text Dims - Method to write
Embeddingsto a text file.