Room-localized speech activity detection in multi-microphone smart homes

EURASIP Journal on Audio, Speech, and Music Processing

Table 7 Room-independent SAD results on the DIRHA-sim test set, employing all available microphones (\(| \mathcal {M}_{\text {all}} |\!=\,\)40) or the reduced setups of Fig. 12

In all cases, HMM-based Viterbi decoding and “w-sum” decision fusion are used, where the combined log-likelihoods result from microphone-specific GMMs (left) or a GMM trained on a single microphone (right)