Fig. 2From: Exploiting spectro-temporal locality in deep learning based acoustic event detectionMagnified log power spectrogram regions for “steps” (a) and “phone ring” (b) sounds for high-time resolution (10 ms frame length, (a.1) and (b.1)) and high-frequency resolution (90 ms frame length, (a.2) and (b.2))Back to article page