OpenJTalk dictionary download manager.
Automatically searches for or downloads the OpenJTalk UTF-8 dictionary
(MeCab binary format), mirroring the C++ openjtalk_dictionary_manager.c behavior.
Note: This module downloads the OpenJTalk MeCab-format dictionary, which is
used by the C++ and C# implementations. The Rust jpreprocess library uses a
different binary format (lindera). When the naist-jdic feature is enabled
(default), jpreprocess bundles its own dictionary and this module is not used
for Japanese phonemization. This module is primarily used by the C# CLI’s
DictionaryManager equivalent.
§Dictionary search order
- OPENJTALK_DICTIONARY_PATH environment variable
- Executable-relative: <exe_dir>/../share/open_jtalk/dic
- System paths (platform-dependent)
- Data directory: <data_dir>/open_jtalk_dic_utf_8-1.11
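The search order above can be sketched roughly as follows. This is a minimal illustration, not the module's actual code: `candidate_paths` and `first_existing` are hypothetical helpers, the environment value and directories are passed in as parameters, and the platform-dependent system paths are left to the caller because the documentation does not list them.

```rust
use std::path::{Path, PathBuf};

/// Hypothetical helper: build the candidate list in the documented order.
/// `env_override` stands in for the value of OPENJTALK_DICTIONARY_PATH.
fn candidate_paths(
    env_override: Option<&str>,
    exe_dir: &Path,
    system_paths: &[PathBuf],
    data_dir: &Path,
) -> Vec<PathBuf> {
    let mut candidates = Vec::new();

    // 1. Explicit override from the environment (assumed: empty values are ignored).
    if let Some(p) = env_override {
        if !p.is_empty() {
            candidates.push(PathBuf::from(p));
        }
    }

    // 2. Executable-relative: <exe_dir>/../share/open_jtalk/dic
    candidates.push(exe_dir.join("..").join("share").join("open_jtalk").join("dic"));

    // 3. Platform-dependent system paths, supplied by the caller.
    candidates.extend(system_paths.iter().cloned());

    // 4. Data directory: <data_dir>/open_jtalk_dic_utf_8-1.11
    candidates.push(data_dir.join("open_jtalk_dic_utf_8-1.11"));

    candidates
}

/// Hypothetical helper: return the first candidate that exists as a directory.
fn first_existing(candidates: &[PathBuf]) -> Option<PathBuf> {
    candidates.iter().find(|p| p.is_dir()).cloned()
}
```

In a real caller, `env_override` would typically come from `std::env::var("OPENJTALK_DICTIONARY_PATH").ok()` and `exe_dir` from the parent of `std::env::current_exe()`. Keeping them as parameters makes the ordering easy to test without touching the process environment.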
§Control flags
- PIPER_OFFLINE_MODE=1: disable all downloads
- PIPER_AUTO_DOWNLOAD_DICT=0: disable dictionary auto-download
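One way to read the two control flags is as a single gate in front of the download step. The sketch below is an assumption about how they might combine, not the module's implementation: `download_allowed` and `download_allowed_from_env` are hypothetical names, and it assumes that only the exact values shown above ("1" and "0") have an effect and that unset variables leave downloads enabled.

```rust
/// Hypothetical helper: decide whether a dictionary download may be attempted.
/// The arguments stand in for the raw values of PIPER_OFFLINE_MODE and
/// PIPER_AUTO_DOWNLOAD_DICT (None when the variable is unset).
fn download_allowed(offline_mode: Option<&str>, auto_download: Option<&str>) -> bool {
    // PIPER_OFFLINE_MODE=1 disables all downloads.
    let offline = offline_mode == Some("1");
    // PIPER_AUTO_DOWNLOAD_DICT=0 disables dictionary auto-download only.
    let auto_disabled = auto_download == Some("0");
    !offline && !auto_disabled
}

/// Hypothetical wrapper that reads the flags from the process environment.
fn download_allowed_from_env() -> bool {
    let offline = std::env::var("PIPER_OFFLINE_MODE").ok();
    let auto = std::env::var("PIPER_AUTO_DOWNLOAD_DICT").ok();
    download_allowed(offline.as_deref(), auto.as_deref())
}
```

When the gate returns false, a caller would presumably fall back to the search-only path and report that no dictionary was found instead of reaching for the network.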
Functions§
- ensure_dictionary: Ensure the OpenJTalk dictionary is available, downloading if necessary.
- find_dictionary: Search for an existing OpenJTalk dictionary without downloading.