Table 3 Average performance comparison with related works on LITIS Rouen dataset and DCASE2016 dataset

From: Learning long-term filter banks for audio source separation and audio scene classification

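| Method | DCASE2016 Error (%) | DCASE2016 F-measure (%) | LITIS Rouen Error (%) | LITIS Rouen F-measure (%) |
|---|---|---|---|---|
| TriFB-Null | 23.12 | 76.08 | 3.76 | 96.19 |
| GaussFB-Null | 22.69 | 76.56 | 3.48 | 96.44 |
| CNN-multilayer [50] | 26.45 | 72.44 | 4.00 | 95.80 |
| CNN-1layer [22] | 23.29 | 75.82 | 2.97 | 96.91 |
| RNN-Gam [26] | – | – | 3.4 | – |
| CNN-Gam [24] | – | – | 4.2 | – |
| MFCC-GMM [49] | 27.5 | – | – | – |
| DNN-CQT [51] | – | 78.1 | – | 96.6 |
| DNN-Mel [53] | 23.6 | – | – | – |
| CNN-Mel [54] | 24.0 | – | – | – |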