Skip to main content
Fig. 3 | EURASIP Journal on Audio, Speech, and Music Processing

Fig. 3

From: AUC optimization for deep learning-based voice activity detection

Fig. 3

Relative AUC improvement of the proposed methods over the competitive methods, when MRCG is used as the acoustic feature. The terms “EN” and “CH” are short for English and Chinese respectively. The term “NN” is short for neural networks. a Feedforward neural network is used as the basic deep model; the evaluation is conducted on the English Noisy-CHiME-4 dataset. b Feedforward neural network is used; the evaluation is conducted on the Chinese Noisy-THCHS-30 dataset. c BLSTM is used; the evaluation is conducted on the English Noisy-CHiME-4 dataset. d BLSTM is used; the evaluation is conducted on the Chinese Noisy-THCHS-30 dataset

Back to article page