TY - JOUR AU - Rabiner, L. R. AU - Sambur, M. R. PY - 1975 DA - 1975// TI - An algorithm for determining the endpoints of isolated utterances JO - The Bell System Technical Journal VL - 54 UR - https://doi.org/10.1002/j.1538-7305.1975.tb02840.x DO - 10.1002/j.1538-7305.1975.tb02840.x ID - Rabiner1975 ER - TY - JOUR AU - Ghosh, P. K. AU - Tsiartas, A. AU - Narayanan, S. PY - 2011 DA - 2011// TI - Robust voice activity detection using long-term signal variability JO - IEEE Transactions on Audio, Speech and Language Processing VL - 19 UR - https://doi.org/10.1109/TASL.2010.2052803 DO - 10.1109/TASL.2010.2052803 ID - Ghosh2011 ER - TY - BOOK AU - Datao, Y. AU - Jiqing, H. AU - Guibin, Z. AU - Tieran, Z. PY - 2012 DA - 2012// TI - Sparse power spectrum based robust voice activity detector PB - IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) CY - Kyoto ID - Datao2012 ER - TY - STD TI - W Hongzhi, X Yuchao, L Meijing, Study on the MFCC similarity-based voice activity detection algorithm (International Conference on Artificial Intelligence, Management Science and Electronic Commerce (AIMSEC), 2011) ID - ref4 ER - TY - BOOK AU - Martin, G. AU - Abeer, A. AU - Dan, E. PY - 2013 DA - 2013// TI - All for one: feature combination for highly channel-degraded speech activity detection PB - INTERSPEECH CY - Lyon ID - Martin2013 ER - TY - STD TI - T Kristjansson, S Deligne, P Olsen, Voicing features for robust speech detection (INTERSPEECH, 2005), pp. 369–372 ID - ref6 ER - TY - JOUR AU - Ahmadi, S. AU - Spanias, A. S. PY - 1999 DA - 1999// TI - Cepstrum-based pitch detection using a new statistical V/UV classification algorithm JO - IEEE Transactions on Speech Audio Processing VL - 7 UR - https://doi.org/10.1109/89.759042 DO - 10.1109/89.759042 ID - Ahmadi1999 ER - TY - JOUR AU - Wu, B. F. AU - Wang, K. C. PY - 2005 DA - 2005// TI - Robust endpoint detection algorithm based on the adaptive band partitioning spectral entropy in adverse environments JO - IEEE Transactions Speech Audio Processing VL - 13 UR - https://doi.org/10.1109/TSA.2005.851909 DO - 10.1109/TSA.2005.851909 ID - Wu2005 ER - TY - STD TI - Z Tuske, P Mihajlik, Z Tobler, T Fegyo, Robust voice activity detection based on the entropy of noise-suppressed spectrum (INTERSPEECH, 2005) ID - ref9 ER - TY - STD TI - L. N. Tan, B. J. Borgstrom, and A. Alwan, Voice activity detection using harmonic frequency components in likelihood ratio test (IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2010) ID - ref10 ER - TY - JOUR AU - Ramirez, J. AU - Segura, J. C. AU - Benitez, C. AU - Torre, A. AU - Rubio, A. PY - 2004 DA - 2004// TI - Efficient voice activity detection algorithms using long-term speech information JO - Speech Communication VL - 42 UR - https://doi.org/10.1016/j.specom.2003.10.002 DO - 10.1016/j.specom.2003.10.002 ID - Ramirez2004 ER - TY - JOUR AU - Manohar, K. AU - Rao, P. PY - 2006 DA - 2006// TI - Speech enhancement in nonstationary noise environments using noise properties JO - Speech Communication VL - 48 UR - https://doi.org/10.1016/j.specom.2005.08.002 DO - 10.1016/j.specom.2005.08.002 ID - Manohar2006 ER - TY - STD TI - M Muller, Information retrieval for music and motion (Springer Verlag, 2007) ID - ref13 ER - TY - STD TI - M Meinard, E Sebastian, Chroma Toolbox: MATLAB implementations for extracting variants of chroma-based audio features, in Proceedings of the 12th International Conference on Music Information Retrieval (ISMIR) (2011) ID - ref14 ER - TY - JOUR AU - Bartsch, M. A. AU - Wakefield, G. H. PY - 2005 DA - 2005// TI - Audio thumbnailing of popular music using chroma-based representations JO - IEEE Transactions on Multimedia VL - 7 UR - https://doi.org/10.1109/TMM.2004.840597 DO - 10.1109/TMM.2004.840597 ID - Bartsch2005 ER - TY - STD TI - EH Berger, LH Royster, DP Driscoll, JD Royster, M Layne, The Noise Manual, 5th edn. (American Industrial Hygiene Association, 2003) ID - ref16 ER - TY - BOOK AU - Rodman, J. PY - 2003 DA - 2003// TI - “The effect of bandwidth on speech intelligibility”, White paper PB - POLYCOM Inc. CY - USA ID - Rodman2003 ER - TY - JOUR AU - Gerkmann, T. AU - Hendriks, R. C. PY - 2012 DA - 2012// TI - Unbiased MMSE-based noise power estimation with low complexity and low tracking delay JO - IEEE Transactions on Audio, Speech and Language Processing VL - 20 UR - https://doi.org/10.1109/TASL.2011.2180896 DO - 10.1109/TASL.2011.2180896 ID - Gerkmann2012 ER - TY - STD TI - JS Garofolo, LF Lamel, WM Fisher et al., DARPA TIMIT acoustic phonetic continuous speech corpus CDROM (NIST, 1993) ID - ref19 ER - TY - JOUR AU - Varga, A. AU - Steeneken, H. J. M. PY - 1993 DA - 1993// TI - Assessment for automatic speech recognition: Ii. NOISEX-92: a database and an experiment to study the effect of additive noise on speech recognition systems JO - Speech Communication VL - 12 UR - https://doi.org/10.1016/0167-6393(93)90095-3 DO - 10.1016/0167-6393(93)90095-3 ID - Varga1993 ER - TY - JOUR AU - Sohn, J. AU - Kim, N. S. AU - Sung, W. PY - 1999 DA - 1999// TI - A statistical model-based voice activity detection JO - IEEE Signal Processing Letter VL - 6 UR - https://doi.org/10.1109/97.736233 DO - 10.1109/97.736233 ID - Sohn1999 ER - TY - STD TI - M Yanna, A Nishihara, Efficient voice activity detection algorithm using long-term spectral flatness measure. EURASIP Journal on Audio, Speech and Music Processing, 21 (2013) ID - ref22 ER -