Skip to main content

Advertisement

Table 13 Frequency of the two-phoneme sequence occurrence in Polish—comparison to the results published in the literature [45]

From: Statistical analysis of orthographic and phonemic language corpus for word-based and phoneme-based Polish language modelling

No. Obtained results Diphone [SAMPA] Results in [45]
  f(q i−1,q i ) [%] [q i−1][q i ] f(q i−1,q i ) [%]
1 2.09086 [j][e] 1.7253
2 1.48817 [n][a] 1.1632
3 1.40880 [n’][e] 0.8438
4 1.40198 [s][t] 1.0791
5 1.38280 [p][o] 1.0479
6 1.25078 [o][v] 1.1829
7 1.08491 [r][a] 0.9189
8 1.03023 [o][n] 0.8756
9 1.01037 [r][o] 0.9155
10 0.99573 [v][a] 0.8012
11 0.94847 [t][a] 0.8035
12 0.88593 [k][o] 0.7337
13 0.84639 [j][a] 0.6367
14 0.79998 [o][e ] 0.506
15 0.79985 [v][j] 0.442
16 0.79298 [d][o] 0.6459
17 0.76749 [e][j] 0.6620
18 0.76340 [a][w] 0.5595
19 0.75699 [t][e] 0.6229
20 0.74761 [t][o] 0.5814
21 0.72197 [z][a] 0.497
22 0.71902 [e][m] 0.60411
23 0.71384 [g][o] 0.515
24 0.69492 [e][n] 0.6768
25 0.68893 [S][e] 0.456
26 0.68631 [k][a] 0.540
27 0.67323 [n][e] 0.60803
28 0.67050 [v][I] 0.526
29 0.66791 [l][i] 0.58227