Skip to main content

Table 4 PSDS values 1\(\times\)CRNN models with different modulation cutoffs, analyzed for short and long sound events. Sound events are grouped based on mode of the duration, where long events are Blender, Shaver, Frying, Water, and Vacuum and short events are Alarm, Cat, Dishes, Dog, and Speech

From: Multi-rate modulation encoding via unsupervised learning for audio event detection

Cutoff freqs

PSDS1

PSDS2

Short

Long

Short

Long

\(f_c\) = 0.8 Hz

0.250

0.477

0.548

0.656

\(f_c\) = 2.4 Hz

0.262

0.485

0.544

0.664

\(f_c\) = 4 Hz

0.262

0.474

0.544

0.644