# TOCFL
The Test of Chinese as a Foreign Language (TOCFL) (Chinese: 華語文能力測驗; pinyin: Huáyǔwén Nénglì Cèyàn) is a standardized test of Taiwanese Mandarin language
proficiency for non-native speakers, including foreign students.
While there are many vocabulary lists available online, a lot of them are either incomplete / outdated or behind paywalls.
This repo provides a dataset based on (linked from the official TOCFL website):
[coct.naer.edu.tw/download/tech_report](https://coct.naer.edu.tw/download/tech_report/)
[Excel Sheet](https://coct.naer.edu.tw/download/tech_report/%E8%87%BA%E7%81%A3%E8%8F%AF%E8%AA%9E%E6%96%87%E8%83%BD%E5%8A%9B%E5%9F%BA%E6%BA%96%E8%A9%9E%E8%AA%9E%E8%A1%A8_111-11-14.xlsx)
### Vocabulary
Taiwan Chinese Language Proficiency Benchmark Vocabulary List_111-11-14.xlsx
The vocabulary list is great, it gives frequency for written AND spoken.
It also provides pinyin to differentiate same char with different meaning pronounciation.
### Characters
Taiwan Chinese Language Proficiency Benchmark Chinese Character List_111-09-20.xlsx
# Other
https://github.com/tomcumming/tocfl-word-list also provides TOCFL lists, but seems to be incomplete (or outdated).
The source used to compile the list is not entirely clear.