Skip to main content

Table 5 Some attributes of the phonetically balanced Icelandic text corpus.

From: Language Model Adaptation Using Machine-Translated Text for Resource-Deficient Languages

Attribute Text corpus
No. of sentences 290
No. of words 1375
No. of phones 8407
PB unit Biphone
No. of unique PB units 916
Average no. of words/sentence 4.7
Average no. of phones/word 6.1