Skip to main content

Advertisement

Table 14 Frequency of the three-phoneme sequence occurrence in Polish—comparison to the results published in the literature [45]

From: Statistical analysis of orthographic and phonemic language corpus for word-based and phoneme-based Polish language modelling

No. Obtained results Triphone [SAMPA] Results in [45]
  f(q i−2,q i−1,q i ) [%] [q i−2][q i−1][q i ] f(q i−2,q i−1,q i ) [%]
1 0.67353 [v][j][e] 0.3159
2 0.62188 [e][g][o] 0.3655
3 0.56670 [o][v][a] 0.3801
4 0.52262 [s][t][a] 0.3287
5 0.51677 [p][S][e] 0.2969
6 0.36109 [m][j][e] 0.2503
7 0.34557 [e][s][t] 0.1734
8 0.33317 [o][n][ts] 0.1749
9 0.32041 [p][r][a] 0.1681
10 0.31920 [o][s’][ts’] 0.1533
11 0.31277 [j][o][n] 0.189
12 0.28832 [j][o][e ] 0.143
13 0.27851 [p][S][I] 0.118
14 0.27638 [k][t][u] 0.1311
15 0.27428 [p][r][o] 0.1807
16 0.26887 [t][u][r] 0.1448
17 0.25608 [n][I][x] 0.1673
18 0.25453 [o][v][j] 0.1404
19 0.25330 [j][e][s] n.a.
20 0.25049 [o][s][t] 0.1785
21 0.24914 [p][j][e] 0.120
22 0.24394 [e][n][t] 0.1842
23 0.23760 [a][j][o] 0.126
24 0.23674 [a][l][e] 0.116
25 0.22874 [s’][ts’][i] 0.130
26 0.22739 [p][o][v] 0.122
27 0.22454 [a][n’][e] 0.1586
28 0.21298 [s][p][o] 0.1627
29 0.20990 [o][v][e] 0.1712