Skip to main content

Table 1 PSDS values for different systems. Average (avg) and standard deviations (std) are computed from 5 iterations of each model

From: Multi-rate modulation encoding via unsupervised learning for audio event detection

Model

PSDS1

PSDS2

DCASE2023 baseline

0.365 ± 0.010

0.581 ± 0.003

3\(\times\)CRNN random-init

0.363 ± 0.004

0.594 ± 0.007

3\(\times\)CRNN VAE-init

0.374 ± 0.003

0.607 ± 0.009

3\(\times\)CRNN ModVAE-init

0.375 \(\varvec{\pm }\) 0.006

0.627 \(\varvec{\pm }\) 0.005