wiki:LanguageModelSources

Version 24 (modified by kmaclean, 12 years ago) (diff)

--

Natural Language Toolkit (NLTK)

ARPA

Possible sources of written data (written corpora) for the creation of Language Models

Other Sources but with Licensing Restrictions

Multilingual Copora