EURASIP Journal on Audio, Speech, and Music Processing

Table 4 PSDS values 1\(\times\)CRNN models with different modulation cutoffs, analyzed for short and long sound events. Sound events are grouped based on mode of the duration, where long events are Blender, Shaver, Frying, Water, and Vacuum and short events are Alarm, Cat, Dishes, Dog, and Speech

From: Multi-rate modulation encoding via unsupervised learning for audio event detection

Cutoff freqs	PSDS1		PSDS2
Cutoff freqs	Short	Long	Short	Long
\(f_c\) = 0.8 Hz	0.250	0.477	0.548	0.656
\(f_c\) = 2.4 Hz	0.262	0.485	0.544	0.664
\(f_c\) = 4 Hz	0.262	0.474	0.544	0.644

Back to article page