From: Progressive loss functions for speech enhancement with deep neural networks
SNRInput | Babble | Cafe | Music | Traffic | Tram | Average |
---|---|---|---|---|---|---|
CNN (estimated SNR / LLR) | ||||||
0 | 12.74 / 0.84 | 20.00 / 0.77 | 13.24 / 0.93 | 29.68 / 0.71 | 26.09 / 0.76 | 20.35 / 0.80 |
5 | 29.24 / 0.64 | 34.23 / 0.60 | 26.62 / 0.66 | 42.24 / 0.56 | 42.28 / 0.59 | 34.92 / 0.61 |
10 | 38.65 / 0.54 | 40.57 / 0.53 | 32.45 / 0.54 | 42.83 / 0.51 | 42.10 / 0.53 | 39.32 / 0.53 |
15 | 37.14 / 0.51 | 37.31 / 0.51 | 33.95 / 0.51 | 38.47 / 0.50 | 37.78 / 0.51 | 36.93 / 0.51 |
20 | 35.86 / 0.50 | 35.76 / 0.50 | 34.71 / 0.50 | 35.95 / 0.50 | 35.88 / 0.50 | 35.63 / 0.50 |
25 | 35.61 / 0.50 | 35.59 / 0.50 | 35.37 / 0.50 | 35.65 / 0.50 | 35.63 / 0.50 | 35.57 / 0.50 |
P-CNN with WP (estimated SNR / LLR) | ||||||
0 | 10.45 / 0.93 | 15.38 / 0.90 | 11.11 / 1.06 | 17.56 / 0.89 | 16.35 / 0.92 | 14.17 / 0.94 |
5 | 19.18 / 0.83 | 22.31 / 0.82 | 18.73 / 0.89 | 24.51 / 0.81 | 24.01 / 0.82 | 21.75 / 0.84 |
10 | 24.06 / 0.80 | 25.32 / 0.80 | 23.09 / 0.81 | 26.33 / 0.79 | 26.00 / 0.80 | 24.96 / 0.80 |
15 | 25.41 / 0.79 | 25.71 / 0.79 | 24.77 / 0.79 | 26.12 / 0.79 | 25.81 / 0.79 | 25.57 / 0.79 |
20 | 25.66 / 0.79 | 25.69 / 0.79 | 25.34 / 0.79 | 25.74 / 0.79 | 25.68 / 0.79 | 25.62 / 0.79 |
25 | 25.69 / 0.79 | 25.70 / 0.79 | 25.61 / 0.79 | 25.73 / 0.79 | 25.70 / 0.79 | 25.68 / 0.79 |
P-CNN with UP (estimated SNR / LLR) | ||||||
0 | 14.34 / 0.84 | 34.01 / 0.78 | 15.58 / 0.99 | 31.38 / 0.72 | 41.89 / 0.78 | 27.44 / 0.82 |
5 | 34.57 / 0.65 | 51.18 / 0.63 | 34.37 / 0.71 | 46.58 / 0.61 | 55.58 / 0.62 | 44.46 / 0.64 |
10 | 49.16 / 0.57 | 54.36 / 0.56 | 47.56 / 0.57 | 53.47 / 0.55 | 55.58 / 0.56 | 52.03 / 0.56 |
15 | 52.56 / 0.54 | 53.52 / 0.54 | 50.77 / 0.54 | 54.54 / 0.54 | 54.40 / 0.54 | 53.16 / 0.54 |
20 | 52.19 / 0.53 | 52.28 / 0.53 | 51.36 / 0.53 | 52.42 / 0.53 | 52.46 / 0.53 | 52.14 / 0.53 |
25 | 51.99 / 0.53 | 51.95 / 0.53 | 51.78 / 0.53 | 51.98 / 0.53 | 51.94 / 0.53 | 51.93 / 0.53 |
ResNet (estimated SNR / LLR) | ||||||
0 | 12.94 / 0.79 | 27.71 / 0.73 | 15.39 / 0.84 | 27.58 / 0.67 | 32.64 / 0.73 | 23.25 / 0.75 |
5 | 26.44 / 0.59 | 38.18 / 0.55 | 27.15 / 0.59 | 35.72 / 0.53 | 43.03 / 0.55 | 34.11 / 0.56 |
10 | 34.51 / 0.51 | 37.63 / 0.50 | 31.89 / 0.50 | 36.97 / 0.49 | 39.25 / 0.49 | 36.05 / 0.50 |
15 | 34.96 / 0.48 | 35.46 / 0.48 | 33.55 / 0.48 | 36.04 / 0.47 | 35.94 / 0.48 | 35.19 / 0.48 |
20 | 34.41 / 0.47 | 34.45 / 0.47 | 33.86 / 0.47 | 34.50 / 0.47 | 34.51 / 0.47 | 34.35 / 0.47 |
25 | 34.21 / 0.47 | 34.24 / 0.47 | 34.07 / 0.47 | 34.26 / 0.47 | 34.23 / 0.47 | 34.20 / 0.47 |
P-ResNet with WP (estimated SNR / LLR) | ||||||
0 | 15.22 / 0.86 | 32.58 / 0.79 | 14.14 / 1.02 | 29.04 / 0.75 | 39.48 / 0.79 | 26.09 / 0.84 |
5 | 38.97 / 0.64 | 57.25 / 0.60 | 36.74 / 0.70 | 51.67 / 0.59 | 62.66 / 0.61 | 49.46 / 0.63 |
10 | 53.23 / 0.52 | 57.93 / 0.51 | 49.75 / 0.53 | 58.03 / 0.50 | 59.30 / 0.51 | 55.65 / 0.51 |
15 | 54.03 / 0.49 | 55.02 / 0.48 | 51.38 / 0.49 | 56.45 / 0.48 | 55.36 / 0.48 | 54.45 / 0.48 |
20 | 53.71 / 0.48 | 53.73 / 0.48 | 52.31 / 0.48 | 54.07 / 0.48 | 53.80 / 0.48 | 53.52 / 0.48 |
25 | 53.60 / 0.48 | 53.58 / 0.48 | 53.28 / 0.48 | 53.68 / 0.48 | 53.56 / 0.48 | 53.54 / 0.48 |
P-ResNet with UP (estimated SNR / LLR) | ||||||
0 | 10.65 / 0.86 | 20.78 / 0.82 | 11.41 / 0.99 | 20.74 / 0.77 | 22.91 / 0.83 | 17.30 / 0.85 |
5 | 24.49 / 0.65 | 34.68 / 0.62 | 23.54 / 0.69 | 31.38 / 0.59 | 38.09 / 0.62 | 30.44 / 0.63 |
10 | 38.09 / 0.54 | 40.87 / 0.53 | 34.71 / 0.54 | 40.11 / 0.51 | 42.19 / 0.52 | 39.19 / 0.53 |
15 | 41.02 / 0.50 | 41.27 / 0.50 | 38.70 / 0.50 | 42.06 / 0.49 | 41.65 / 0.50 | 40.94 / 0.50 |
20 | 40.53 / 0.49 | 40.41 / 0.49 | 39.40 / 0.49 | 40.56 / 0.49 | 40.50 / 0.49 | 40.28 / 0.49 |
25 | 40.26 / 0.49 | 40.25 / 0.49 | 40.02 / 0.49 | 40.31 / 0.49 | 40.28 / 0.49 | 40.22 / 0.49 |