EURASIP Journal on Audio, Speech, and Music Processing

Table 3 Average performance comparison with related works on LITIS Rouen dataset and DCASE2016 dataset

From: Learning long-term filter banks for audio source separation and audio scene classification

Method	DCASE2016 (%)		LITIS Rouen (%)
	Error	F-measure	Error	F-measure
TriFB-Null	23.12	76.08	3.76	96.19
GaussFB-Null	22.69	76.56	3.48	96.44
CNN-multilayer [50]	26.45	72.44	4.00	95.80
CNN-1layer [22]	23.29	75.82	2.97	96.91
RNN-Gam [26]	–	–	3.4	–
CNN-Gam [24]	–	–	4.2	–
MFCC-GMM [49]	27.5	–	–	–
DNN-CQT [51]	–	78.1	–	96.6
DNN-Mel [53]	23.6	–	–	–
CNN-Mel [54]	24.0	–	–	–

Back to article page