[][src]Function tokenizations::get_charmap

pub fn get_charmap(a: &str, b: &str) -> (CharMap, CharMap)

Returns character mappings c_a2b (from a to b) and c_b2a (from b to a) based on shortest edit script (SES).

a and b can be noisy. For example, bar and bår can be compared.

Examples

Basic usage:

use tokenizations::get_charmap;
let a = "bar";
let b = "bår";
let (c_a2b, c_b2a) = get_charmap(a, b);
assert_eq!(c_a2b, vec![Some(0), Some(1), Some(2)]);
assert_eq!(c_b2a, vec![Some(0), Some(1), Some(2)]);