Table 7 Results on the simulated REVERB-Eval set for different noise types at different initial SNRs

From: Progressive loss functions for speech enhancement with deep neural networks

Input SNR (dB) Babble Cafe Music Traffic Tram Average
CNN (estimated SNR / LLR)
0 12.74 / 0.84 20.00 / 0.77 13.24 / 0.93 29.68 / 0.71 26.09 / 0.76 20.35 / 0.80
5 29.24 / 0.64 34.23 / 0.60 26.62 / 0.66 42.24 / 0.56 42.28 / 0.59 34.92 / 0.61
10 38.65 / 0.54 40.57 / 0.53 32.45 / 0.54 42.83 / 0.51 42.10 / 0.53 39.32 / 0.53
15 37.14 / 0.51 37.31 / 0.51 33.95 / 0.51 38.47 / 0.50 37.78 / 0.51 36.93 / 0.51
20 35.86 / 0.50 35.76 / 0.50 34.71 / 0.50 35.95 / 0.50 35.88 / 0.50 35.63 / 0.50
25 35.61 / 0.50 35.59 / 0.50 35.37 / 0.50 35.65 / 0.50 35.63 / 0.50 35.57 / 0.50
P-CNN with WP (estimated SNR / LLR)
0 10.45 / 0.93 15.38 / 0.90 11.11 / 1.06 17.56 / 0.89 16.35 / 0.92 14.17 / 0.94
5 19.18 / 0.83 22.31 / 0.82 18.73 / 0.89 24.51 / 0.81 24.01 / 0.82 21.75 / 0.84
10 24.06 / 0.80 25.32 / 0.80 23.09 / 0.81 26.33 / 0.79 26.00 / 0.80 24.96 / 0.80
15 25.41 / 0.79 25.71 / 0.79 24.77 / 0.79 26.12 / 0.79 25.81 / 0.79 25.57 / 0.79
20 25.66 / 0.79 25.69 / 0.79 25.34 / 0.79 25.74 / 0.79 25.68 / 0.79 25.62 / 0.79
25 25.69 / 0.79 25.70 / 0.79 25.61 / 0.79 25.73 / 0.79 25.70 / 0.79 25.68 / 0.79
P-CNN with UP (estimated SNR / LLR)
0 14.34 / 0.84 34.01 / 0.78 15.58 / 0.99 31.38 / 0.72 41.89 / 0.78 27.44 / 0.82
5 34.57 / 0.65 51.18 / 0.63 34.37 / 0.71 46.58 / 0.61 55.58 / 0.62 44.46 / 0.64
10 49.16 / 0.57 54.36 / 0.56 47.56 / 0.57 53.47 / 0.55 55.58 / 0.56 52.03 / 0.56
15 52.56 / 0.54 53.52 / 0.54 50.77 / 0.54 54.54 / 0.54 54.40 / 0.54 53.16 / 0.54
20 52.19 / 0.53 52.28 / 0.53 51.36 / 0.53 52.42 / 0.53 52.46 / 0.53 52.14 / 0.53
25 51.99 / 0.53 51.95 / 0.53 51.78 / 0.53 51.98 / 0.53 51.94 / 0.53 51.93 / 0.53
ResNet (estimated SNR / LLR)
0 12.94 / 0.79 27.71 / 0.73 15.39 / 0.84 27.58 / 0.67 32.64 / 0.73 23.25 / 0.75
5 26.44 / 0.59 38.18 / 0.55 27.15 / 0.59 35.72 / 0.53 43.03 / 0.55 34.11 / 0.56
10 34.51 / 0.51 37.63 / 0.50 31.89 / 0.50 36.97 / 0.49 39.25 / 0.49 36.05 / 0.50
15 34.96 / 0.48 35.46 / 0.48 33.55 / 0.48 36.04 / 0.47 35.94 / 0.48 35.19 / 0.48
20 34.41 / 0.47 34.45 / 0.47 33.86 / 0.47 34.50 / 0.47 34.51 / 0.47 34.35 / 0.47
25 34.21 / 0.47 34.24 / 0.47 34.07 / 0.47 34.26 / 0.47 34.23 / 0.47 34.20 / 0.47
P-ResNet with WP (estimated SNR / LLR)
0 15.22 / 0.86 32.58 / 0.79 14.14 / 1.02 29.04 / 0.75 39.48 / 0.79 26.09 / 0.84
5 38.97 / 0.64 57.25 / 0.60 36.74 / 0.70 51.67 / 0.59 62.66 / 0.61 49.46 / 0.63
10 53.23 / 0.52 57.93 / 0.51 49.75 / 0.53 58.03 / 0.50 59.30 / 0.51 55.65 / 0.51
15 54.03 / 0.49 55.02 / 0.48 51.38 / 0.49 56.45 / 0.48 55.36 / 0.48 54.45 / 0.48
20 53.71 / 0.48 53.73 / 0.48 52.31 / 0.48 54.07 / 0.48 53.80 / 0.48 53.52 / 0.48
25 53.60 / 0.48 53.58 / 0.48 53.28 / 0.48 53.68 / 0.48 53.56 / 0.48 53.54 / 0.48
P-ResNet with UP (estimated SNR / LLR)
0 10.65 / 0.86 20.78 / 0.82 11.41 / 0.99 20.74 / 0.77 22.91 / 0.83 17.30 / 0.85
5 24.49 / 0.65 34.68 / 0.62 23.54 / 0.69 31.38 / 0.59 38.09 / 0.62 30.44 / 0.63
10 38.09 / 0.54 40.87 / 0.53 34.71 / 0.54 40.11 / 0.51 42.19 / 0.52 39.19 / 0.53
15 41.02 / 0.50 41.27 / 0.50 38.70 / 0.50 42.06 / 0.49 41.65 / 0.50 40.94 / 0.50
20 40.53 / 0.49 40.41 / 0.49 39.40 / 0.49 40.56 / 0.49 40.50 / 0.49 40.28 / 0.49
25 40.26 / 0.49 40.25 / 0.49 40.02 / 0.49 40.31 / 0.49 40.28 / 0.49 40.22 / 0.49
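
The Average column appears to be the unweighted arithmetic mean of the five noise columns, computed separately for the estimated SNR and for the LLR. The sketch below checks this for one row (the CNN block at 0 dB input SNR). The mean-based reading is an assumption inferred from the numbers, not something the table states, and `row_average` is a hypothetical helper name used only for illustration.

```python
# Per-noise (estimated SNR, LLR) pairs copied from the CNN block, input SNR 0.
row = {
    "Babble": (12.74, 0.84),
    "Cafe": (20.00, 0.77),
    "Music": (13.24, 0.93),
    "Traffic": (29.68, 0.71),
    "Tram": (26.09, 0.76),
}


def row_average(cells):
    """Return the unweighted mean SNR and mean LLR across the noise types."""
    n = len(cells)
    mean_snr = sum(snr for snr, _ in cells.values()) / n
    mean_llr = sum(llr for _, llr in cells.values()) / n
    return mean_snr, mean_llr


mean_snr, mean_llr = row_average(row)
# Format the result the same way the table does: "SNR / LLR", two decimals each.
print(f"{mean_snr:.2f} / {mean_llr:.2f}")
```

For this row the result agrees with the reported Average entry of 20.35 / 0.80; the same check can be repeated for any other row by swapping in its values.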