Skip to main content

Table 2 Translated datasets.

From: Language Model Adaptation Using Machine-Translated Text for Resource-Deficient Languages

Corpus set

Sentences

Words

Unique words

62962

440347

3396

62996

406814

7312