Skip to main content

Advertisement

Table 3 Performances of the children test set "CH1" (with breakup for different pitch groups based on original average pitch values) with and without pitch-normalization. The quantity in bracket shows the number of utterances in that group. The 95% confidence interval for the performance is 0.39 (for the 250 Hz, 250–300 Hz, and 300 Hz pitch groups the confidence interval turns out to be 0.39, 0.79, and 3.37, resp.).

From: Exploring the Effect of Differences in the Acoustic Correlates of Adults' and Children's Speech in the Context of Automatic Speech Recognition

Condition WER (%)
  All Values (7,772) 250 Hz (5,224) 250–300 Hz (2,346) 300 Hz (202)
Baseline 11.37 6.54 17.47 39.03
Norm. 9.64 6.02 14.24 30.11