From: Language Model Adaptation Using Machine-Translated Text for Resource-Deficient Languages
Corpus set    Sentences    Words     Unique words
              62962        440347    3396
              62996        406814    7312