Fig. 3
From: Deep multiple instance learning for foreground speech localization in ambient audio from wearable devices

Visualization of different pooling methods for foreground localization using gentle-aligned regions as labels. Top: ground-truth regions of foreground and background speech. Bottom: foreground posteriors of different models.