pub fn bytes_to_unicode() -> (HashMap<u8, char>, HashMap<char, u8>)
Create byte to Unicode character mapping.
GPT-2 uses a specific mapping to avoid issues with certain bytes.