pub fn split_compound_word(word: &str, vocab: &HashSet<String>) -> Vec<String>
Attempts to split a compound word into its constituent parts using a vocabulary. Returns the original word if it cannot be split.
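The description does not say how the split is found, so the following is only a minimal sketch of one way a function with this signature could behave. It assumes that a valid split must cover the whole word with vocabulary entries, that longer prefixes are tried first with backtracking, and that an unsplittable word comes back as a single-element vector. The helper `split_parts` is hypothetical and not part of the source.

```rust
use std::collections::HashSet;

/// Sketch only: longest-prefix-first split with backtracking.
/// This strategy is an assumption; the source does not specify one.
pub fn split_compound_word(word: &str, vocab: &HashSet<String>) -> Vec<String> {
    match split_parts(word, vocab) {
        // Only treat the result as a split when it has at least two parts.
        Some(parts) if parts.len() > 1 => parts,
        // Otherwise fall back to the original word, as the description states.
        _ => vec![word.to_string()],
    }
}

/// Hypothetical helper: returns the parts if `rest` can be fully covered by
/// vocabulary entries, or `None` if no such cover exists.
fn split_parts(rest: &str, vocab: &HashSet<String>) -> Option<Vec<String>> {
    if rest.is_empty() {
        return Some(Vec::new());
    }
    // End offsets of every prefix, always on a UTF-8 character boundary.
    let ends: Vec<usize> = rest
        .char_indices()
        .map(|(i, c)| i + c.len_utf8())
        .collect();
    // Try the longest prefix first, backtracking to shorter ones on failure.
    for &end in ends.iter().rev() {
        let (head, tail) = rest.split_at(end);
        if vocab.contains(head) {
            if let Some(mut parts) = split_parts(tail, vocab) {
                parts.insert(0, head.to_string());
                return Some(parts);
            }
        }
    }
    None
}
```

Under these assumptions, a word that is itself a vocabulary entry is returned whole, because the longest matching prefix is the word itself. The recursion is not memoized, so it is best suited to short words; a dynamic-programming variant would be a reasonable alternative for long inputs.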