Table 8 Experimental results by using combined training data (three known positions) on the second testing scheme

From: Single-channel dereverberation by feature mapping using cascade neural networks for robust distant speaker identification and speech recognition

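Values are speaker identification rates (%).

| Method | Dataset | P01 | P03 | P05 | P02 | P04 | Avg. known | Avg. unknown | Avg. all |
|---|---|---|---|---|---|---|---|---|---|
| Prop. (24 NNs) + CMN | combd 1s.15u | 89.7 | 92.4 | 94.6 | 96.9 | 91.0 | 92.2 | 94.0 | 92.9 |
| Prop. (24 NNs) + CMN | combd 3s.45u | 91.5 | 92.8 | 95.8 | 99.0 | 92.7 | 93.4 | 95.8 | 94.4 |
| Prop. (12 NNs) + CMN | combd 1s.15u | 89.5 | 92.4 | 93.6 | 96.1 | 91.6 | 91.8 | 93.9 | 92.6 |
| Prop. (12 NNs) + CMN | combd 3s.45u | 90.5 | 92.2 | 94.5 | 98.0 | 92.5 | 92.4 | 95.3 | 93.5 |
| Prop. (6 NNs) + CMN | combd 1s.15u | 90.5 | 90.8 | 93.7 | 94.8 | 93.5 | 91.7 | 94.2 | 92.7 |
| Prop. (6 NNs) + CMN | combd 3s.45u | 91.5 | 92.0 | 95.0 | 96.0 | 93.0 | 92.8 | 94.5 | 93.5 |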
  1. The known environments are P01, P03, and P05; the unknown environments are P02 and P04. The experiments used skip1 8-1-0 frame selection. Bold text marks the best average performance for each training data number.
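
The three average columns can be reproduced from the per-position rates. The sketch below is a minimal illustration, assuming each average is an unweighted arithmetic mean over the positions named in the footnote; the source does not state how the averages were computed, and the names `KNOWN`, `UNKNOWN`, and `summarize` are hypothetical.

```python
from statistics import mean

# Position grouping taken from the table footnote.
KNOWN = ("P01", "P03", "P05")
UNKNOWN = ("P02", "P04")


def summarize(rates):
    """Return (avg_known, avg_unknown, avg_all) for one table row.

    Assumption: every average is an unweighted mean of the
    per-position speaker identification rates (%).
    """
    avg_known = mean(rates[p] for p in KNOWN)
    avg_unknown = mean(rates[p] for p in UNKNOWN)
    avg_all = mean(rates[p] for p in KNOWN + UNKNOWN)
    return avg_known, avg_unknown, avg_all


# Per-position rates for Prop. (24 NNs) + CMN, 1s.15u, copied from the table.
row = {"P01": 89.7, "P03": 92.4, "P05": 94.6, "P02": 96.9, "P04": 91.0}

avg_known, avg_unknown, avg_all = summarize(row)
print(f"known={avg_known:.2f} unknown={avg_unknown:.2f} all={avg_all:.2f}")
```

Under this assumption the computed means agree with the reported averages to within rounding. Because the published averages may have been derived from unrounded rates, a difference of up to about 0.05 points is to be expected.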