Exploring convolutional, recurrent, and hybrid deep neural networks for speech and music detection in a large audio dataset

EURASIP Journal on Audio, Speech, and Music Processing

Table 7 Comparison of the event detection results obtained by the best single-task networks and the best double-task network