pub fn tokenize(s: &str) -> Vec<String>
Lowercase, split on non-alphanumerics, drop tokens shorter than 2 chars.