Deep multiple instance learning for foreground speech localization in ambient audio from wearable devices

EURASIP Journal on Audio, Speech, and Music Processing

Table 4 Foreground localization results on frame (instance) level approaches on DS