Fig. 4From: Learning long-term filter banks for audio source separation and audio scene classificationModel architecture of long-term filter banks. Each row of the spectrogram is convolved by a filter bank with individual width. In this sketch map, time durations of the filter banks in the highest four frequency bins are 3, 2, 1 and 2 framesBack to article page