EURASIP Journal on Audio, Speech, and Music Processing

Table 2 Reconstruction error of audio source separation using magnitude spectrograms as input

From: Learning long-term filter banks for audio source separation and audio scene classification

Init	Method	Re_toep			Re_inv
		M/V =0.1	M/V =1	M/V =10	M/V =0.1	M/V =1	M/V =10
–	Null [47]	2.58	0.99	0.033	2.58	0.99	0.033
–	CNN-1layer [22]	2.83	0.96	0.047	2.83	0.96	0.047
–	GaussLTFB	2.49	0.94	0.037	2.60	0.95	0.034
Random	FullLTFB	2.77	1.12	0.080	2.85	1.03	0.043
Identity	FullLTFB	2.50	0.94	0.037	2.82	0.95	0.034

M/V represents the energy ratio between music and voice

Back to article page