Fig. 7
From: Exploiting spectro-temporal locality in deep learning based acoustic event detection
Averaged frame-score (percentage of correctly classified frames) by parameter as described in subsection 5.2: spectrogram patch length (a), number of convolutional filters to be trained upon (b), filter shape (c), and max-pooling scheme (d)