Skip to main content

Table 14 Frequency of the three-phoneme sequence occurrence in Polish—comparison to the results published in the literature [45]

From: Statistical analysis of orthographic and phonemic language corpus for word-based and phoneme-based Polish language modelling

No.

Obtained results

Triphone [SAMPA]

Results in [45]

 

f(q i−2,q i−1,q i ) [%]

[q i−2][q i−1][q i ]

f(q i−2,q i−1,q i ) [%]

1

0.67353

[v][j][e]

0.3159

2

0.62188

[e][g][o]

0.3655

3

0.56670

[o][v][a]

0.3801

4

0.52262

[s][t][a]

0.3287

5

0.51677

[p][S][e]

0.2969

6

0.36109

[m][j][e]

0.2503

7

0.34557

[e][s][t]

0.1734

8

0.33317

[o][n][ts]

0.1749

9

0.32041

[p][r][a]

0.1681

10

0.31920

[o][s’][ts’]

0.1533

11

0.31277

[j][o][n]

0.189

12

0.28832

[j][o][e ∼]

0.143

13

0.27851

[p][S][I]

0.118

14

0.27638

[k][t][u]

0.1311

15

0.27428

[p][r][o]

0.1807

16

0.26887

[t][u][r]

0.1448

17

0.25608

[n][I][x]

0.1673

18

0.25453

[o][v][j]

0.1404

19

0.25330

[j][e][s]

n.a.

20

0.25049

[o][s][t]

0.1785

21

0.24914

[p][j][e]

0.120

22

0.24394

[e][n][t]

0.1842

23

0.23760

[a][j][o]

0.126

24

0.23674

[a][l][e]

0.116

25

0.22874

[s’][ts’][i]

0.130

26

0.22739

[p][o][v]

0.122

27

0.22454

[a][n’][e]

0.1586

28

0.21298

[s][p][o]

0.1627

29

0.20990

[o][v][e]

0.1712

⋯

⋯

⋯

⋯