Skip to main content

Table 2 Reconstruction error of audio source separation using magnitude spectrograms as input

From: Learning long-term filter banks for audio source separation and audio scene classification

Init Method Re_toep Re_inv
   M/V =0.1 M/V =1 M/V =10 M/V =0.1 M/V =1 M/V =10
Null [47] 2.58 0.99 0.033 2.58 0.99 0.033
CNN-1layer [22] 2.83 0.96 0.047 2.83 0.96 0.047
GaussLTFB 2.49 0.94 0.037 2.60 0.95 0.034
Random FullLTFB 2.77 1.12 0.080 2.85 1.03 0.043
Identity FullLTFB 2.50 0.94 0.037 2.82 0.95 0.034
  1. M/V represents the energy ratio between music and voice