Table 5 Some attributes of the phonetically balanced Icelandic text corpus.

From: Language Model Adaptation Using Machine-Translated Text for Resource-Deficient Languages

Attribute                        Text corpus
No. of sentences                 290
No. of words                     1375
No. of phones                    8407
PB unit                          Biphone
No. of unique PB units           916
Average no. of words/sentence    4.7
Average no. of phones/word       6.1
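
The two averages in the table follow directly from the raw counts. As a quick consistency check, the short sketch below recomputes them; the variable names are illustrative and not taken from the article, and rounding to one decimal place is assumed from the way the table reports the values.

```python
# Raw counts as reported in Table 5.
num_sentences = 290
num_words = 1375
num_phones = 8407

# Derived averages (variable names are illustrative, not from the article).
words_per_sentence = num_words / num_sentences
phones_per_word = num_phones / num_words

# One decimal place is assumed, matching the precision used in the table.
print(f"Average no. of words/sentence: {words_per_sentence:.1f}")  # 4.7
print(f"Average no. of phones/word:    {phones_per_word:.1f}")     # 6.1
```

Both recomputed values agree with the reported averages, so the counts and averages in the table are mutually consistent.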