Fig. 1
From: Learning long-term filter banks for audio source separation and audio scene classification

Spectrogram examples of the “cafe” scene. a, b Two audio fragments randomly selected from the “cafe” scene. c The average energy distribution of the two examples along the frequency direction. d The temporal coherence of the two examples in different frequency bins