From: Nonlinear residual echo suppression based on dual-stream DPRNN
Echo | Model | PESQ | SDR (dB) | STOI |
---|---|---|---|---|
Artificial speech | LAEC | 1.48 | −2.60 | 0.622 |
Artificial speech | Time | 2.61 | 12.3 | 0.866 |
Artificial speech | Time_1 | 2.56 | 12.2 | 0.865 |
Artificial speech | Time_2 | 2.57 | 12.1 | 0.864 |
Artificial speech | TF | 2.75 | 12.4 | 0.880 |
Artificial speech | TF_1 | 2.70 | 12.4 | 0.875 |
Artificial speech | TF_2 | 2.69 | 12.3 | 0.875 |
Artificial music | LAEC | 1.48 | −2.90 | 0.634 |
Artificial music | Time | 2.50 | 11.5 | 0.842 |
Artificial music | Time_1 | 2.44 | 11.4 | 0.841 |
Artificial music | Time_2 | 2.46 | 11.3 | 0.841 |
Artificial music | TF | 2.62 | 11.4 | 0.857 |
Artificial music | TF_1 | 2.58 | 11.3 | 0.853 |
Artificial music | TF_2 | 2.57 | 11.3 | 0.852 |
ER speech | LAEC | 1.61 | −2.05 | 0.697 |
ER speech | Time | 2.68 | 11.7 | 0.892 |
ER speech | Time_1 | 2.70 | 12.0 | 0.894 |
ER speech | Time_2 | 2.75 | 12.5 | 0.899 |
ER speech | TF | 2.77 | 11.3 | 0.904 |
ER speech | TF_1 | 2.80 | 11.9 | 0.905 |
ER speech | TF_2 | 2.88 | 12.4 | 0.912 |
ER music | LAEC | 1.70 | −1.12 | 0.730 |
ER music | Time | 2.75 | 12.6 | 0.900 |
ER music | Time_1 | 2.76 | 12.8 | 0.901 |
ER music | Time_2 | 2.80 | 13.0 | 0.906 |
ER music | TF | 2.79 | 11.9 | 0.907 |
ER music | TF_1 | 2.83 | 12.3 | 0.908 |
ER music | TF_2 | 2.91 | 12.6 | 0.914 |
LL speech | LAEC | 1.95 | 1.67 | 0.806 |
LL speech | Time | 3.00 | 15.6 | 0.932 |
LL speech | Time_1 | 3.00 | 15.8 | 0.933 |
LL speech | Time_2 | 3.03 | 16.1 | 0.935 |
LL speech | TF | 3.02 | 15.3 | 0.938 |
LL speech | TF_1 | 3.08 | 15.8 | 0.939 |
LL speech | TF_2 | 3.13 | 16.1 | 0.943 |
LL music | LAEC | 1.97 | 2.16 | 0.820 |
LL music | Time | 3.07 | 16.0 | 0.935 |
LL music | Time_1 | 3.03 | 16.1 | 0.936 |
LL music | Time_2 | 3.04 | 16.2 | 0.937 |
LL music | TF | 3.12 | 15.8 | 0.944 |
LL music | TF_1 | 3.17 | 16.1 | 0.944 |
LL music | TF_2 | 3.18 | 16.2 | 0.946 |