Fig. 7From: Exploring convolutional, recurrent, and hybrid deep neural networks for speech and music detection in a large audio datasetConfusion matrix of the best network setting for music event detection (CNN 7×7 with L=6 and N=128). The percentages indicate the total amount of test segments in each possible real class-predicted class combinationBack to article page