
Table 13 Frequency of two-phoneme sequences (diphones) in Polish: comparison with results published in the literature [45]

From: Statistical analysis of orthographic and phonemic language corpus for word-based and phoneme-based Polish language modelling

| No. | Obtained results: f(q_{i−1}, q_i) [%] | Diphone [SAMPA]: [q_{i−1}][q_i] | Results in [45]: f(q_{i−1}, q_i) [%] |
|---|---|---|---|
| 1 | 2.09086 | [j][e] | 1.7253 |
| 2 | 1.48817 | [n][a] | 1.1632 |
| 3 | 1.40880 | [n’][e] | 0.8438 |
| 4 | 1.40198 | [s][t] | 1.0791 |
| 5 | 1.38280 | [p][o] | 1.0479 |
| 6 | 1.25078 | [o][v] | 1.1829 |
| 7 | 1.08491 | [r][a] | 0.9189 |
| 8 | 1.03023 | [o][n] | 0.8756 |
| 9 | 1.01037 | [r][o] | 0.9155 |
| 10 | 0.99573 | [v][a] | 0.8012 |
| 11 | 0.94847 | [t][a] | 0.8035 |
| 12 | 0.88593 | [k][o] | 0.7337 |
| 13 | 0.84639 | [j][a] | 0.6367 |
| 14 | 0.79998 | [o][e~] | 0.506 |
| 15 | 0.79985 | [v][j] | 0.442 |
| 16 | 0.79298 | [d][o] | 0.6459 |
| 17 | 0.76749 | [e][j] | 0.6620 |
| 18 | 0.76340 | [a][w] | 0.5595 |
| 19 | 0.75699 | [t][e] | 0.6229 |
| 20 | 0.74761 | [t][o] | 0.5814 |
| 21 | 0.72197 | [z][a] | 0.497 |
| 22 | 0.71902 | [e][m] | 0.60411 |
| 23 | 0.71384 | [g][o] | 0.515 |
| 24 | 0.69492 | [e][n] | 0.6768 |
| 25 | 0.68893 | [S][e] | 0.456 |
| 26 | 0.68631 | [k][a] | 0.540 |
| 27 | 0.67323 | [n][e] | 0.60803 |
| 28 | 0.67050 | [v][I] | 0.526 |
| 29 | 0.66791 | [l][i] | 0.58227 |
| ⋯ | ⋯ | ⋯ | ⋯ |
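The table does not say how f(q_{i−1}, q_i) is normalized. As a minimal sketch, the Python below assumes it is the relative frequency of a diphone: the count of one adjacent phoneme pair divided by the total number of adjacent pairs, expressed as a percentage. The function name `diphone_frequencies` and the toy phoneme sequence are hypothetical and are not taken from the article's corpus. The sketch also ignores word and sentence boundaries, which the original study may have treated differently.

```python
from collections import Counter


def diphone_frequencies(phonemes):
    """Map each adjacent pair (q_prev, q_cur) to its share of all pairs, in percent.

    Assumption: frequency = pair count / total adjacent pairs * 100.
    """
    pairs = list(zip(phonemes, phonemes[1:]))
    if not pairs:
        return {}
    counts = Counter(pairs)
    total = len(pairs)
    return {pair: 100.0 * n / total for pair, n in counts.items()}


# Hypothetical toy transcription in SAMPA-like symbols (not corpus data).
sample = ["j", "e", "s", "t", "e", "m"]
freqs = diphone_frequencies(sample)

# Print diphones from most to least frequent, in the table's bracket notation.
for (prev, cur), pct in sorted(freqs.items(), key=lambda kv: -kv[1]):
    print(f"[{prev}][{cur}] {pct:.5f}")
```

Run over a full phonemic corpus, the same counting would give a ranked list comparable in form to the table above. The absolute values depend on the corpus and on the transcription rules used.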