This directory contains language-specific data files. Most importantly, you will find here:
- A list of unique characters for the target language (e.g. English) in
data/alphabet.txt - A binary n-gram language model compiled by
kenlmindata/lm/lm.binary - A trie model compiled by
generate_trie.cppindata/lm/trie
For more information on how to create these resources, see data/lm/README.md