Table 2 Translated datasets.

From: Language Model Adaptation Using Machine-Translated Text for Resource-Deficient Languages

Corpus set    Sentences    Words     Unique words
              62962        440347    3396
              62996        406814    7312