Skip to main content

Advertisement

Table 3 Average tonal syllable recognition rate (%) after eigenphone-based speaker adaptation using elastic net

From: Speaker adaptation based on regularized speaker-dependent eigenphone matrix estimation

λ 2 Number of adaptation sentences
  1 2 4 6 8 10
10 52.27 55.98 58.10 59.19 60.22 61.08
  (0.67) (0.48) (0.33) (0.24) (0.20) (0.16)
40 52.27 55.98 58.14 59.17 60.18 61.08
  (0.67) (0.48) (0.33) (0.24) (0.20) (0.16)
80 52.22 55.96 58.12 59.17 60.20 61.04
  (0.67) (0.48) (0.33) (0.24) (0.20) (0.16)
120 52.22 55.98 58.16 59.17 60.16 61.08
  (0.67) (0.48) (0.33) (0.24) (0.20) (0.16)
1,000 52.31 55.98 58.02 59.13 60.13 60.97
  (0.67) (0.48) (0.33) (0.24) (0.20) (0.16)
2,000 52.35 55.98 58.02 59.13 60.16 60.97
  (0.67) (0.48) (0.33) (0.24) (0.20) (0.16)
  1. The number of eigenphones (N) was fixed to 100. λ1=10, λ3=0, and λ2 was varied between 10 and 2,000. The average overall sparsity is shown in parentheses.