TY - JOUR AU - Boll, S. F. PY - 1979 DA - 1979// TI - Suppression of acoustic noise in speech using spectral subtraction JO - IEEE Trans Acoust Speech Signal Process VL - 27 UR - https://doi.org/10.1109/TASSP.1979.1163209 DO - 10.1109/TASSP.1979.1163209 ID - Boll1979 ER - TY - STD TI - H. M. Goodarzi, S. Seyedtabaii, “Speech enhancement using spectral subtraction based on a modified noise minimum statistics estimation,” International Joint Conference on INC, IMS and IDC, 2009, Seoul, South Korea. DOI: https://doi.org/10.1109/NCM.2009.272 UR - https://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=5331298 ID - ref2 ER - TY - BOOK AU - Loizou, Philipos C. PY - 2007 DA - 2007// TI - Speech Enhancement UR - https://doi.org/10.1201/9781420015836 DO - 10.1201/9781420015836 ID - Loizou2007 ER - TY - JOUR AU - Ephraim, Y. AU - Malah, D. PY - 1985 DA - 1985// TI - Speech enhancement using a minimum mean-square error log-spectral amplitude estimator JO - IEEE Trans. Acoust. Speech Signal Process VL - 33 UR - https://doi.org/10.1109/TASSP.1985.1164550 DO - 10.1109/TASSP.1985.1164550 ID - Ephraim1985 ER - TY - JOUR AU - Ephraim, Y. AU - Malah, D. PY - 1984 DA - 1984// TI - Speech enhancement using a minimum mean square error short-time spectral amplitude estimator JO - IEEE Trans Acoust Speech Signal Process VL - ASSP-32 UR - https://doi.org/10.1109/TASSP.1984.1164453 DO - 10.1109/TASSP.1984.1164453 ID - Ephraim1984 ER - TY - JOUR AU - Srinivasan, S. AU - Samuelsson, J. AU - Kleijn, W. B. PY - 2007 DA - 2007// TI - Codebook-based Bayesian speech enhancement for nonstationary environments JO - IEEE Trans Audio Speech Lang Process VL - 15 UR - https://doi.org/10.1109/tasl.2006.881696 DO - 10.1109/tasl.2006.881696 ID - Srinivasan2007 ER - TY - JOUR AU - Erkelens, J. S. AU - Heusdens, R. 
PY - 2008 DA - 2008// TI - Tracking of nonstationary noise based on data-driven recursive noise power estimation JO - IEEE Trans Audio Speech Lang Process VL - 16 UR - https://doi.org/10.1109/tasl.2008.2001108 DO - 10.1109/tasl.2008.2001108 ID - Erkelens2008 ER - TY - JOUR AU - He, Q. AU - Bao, C. C. AU - Bao, F. PY - 2017 DA - 2017// TI - Multiplicative update of auto-regressive gains for codebook-based speech enhancement JO - IEEE Trans. Audio Speech Lang Process VL - 25 UR - https://doi.org/10.1109/TASLP.2016.2636445 DO - 10.1109/TASLP.2016.2636445 ID - He2017 ER - TY - JOUR AU - Martin, R. PY - 2001 DA - 2001// TI - Noise power spectral density estimation based on optimal smoothing and minimum statistics JO - IEEE Trans Speech Audio Process VL - 9 UR - https://doi.org/10.1109/89.928915 DO - 10.1109/89.928915 ID - Martin2001 ER - TY - JOUR AU - Zhao, D. Y. AU - Kleijn, W. B. PY - 2007 DA - 2007// TI - HMM-based gain modeling for enhancement of speech in noise JO - IEEE Trans Audio Speech Lang Process VL - 15 UR - https://doi.org/10.1109/TASL.2006.885256 DO - 10.1109/TASL.2006.885256 ID - Zhao2007 ER - TY - JOUR AU - Srinivasan, S. AU - Samuelsson, J. AU - Kleijn, W. B. PY - 2006 DA - 2006// TI - Codebook driven short term predictor parameter estimation for speech enhancement JO - IEEE Trans Audio Speech Lang Process VL - 14 UR - https://doi.org/10.1109/TSA.2005.854113 DO - 10.1109/TSA.2005.854113 ID - Srinivasan2006 ER - TY - STD TI - X. Y. Wang and C. C. Bao, “Speech enhancement using a joint MAP estimation of LP parameters.” Int. Conf. signal process., comm., comput., 2015. DOI: https://doi.org/10.1109/ICSPCC.2015.7338863 ID - ref12 ER - TY - JOUR AU - Linde, Y. AU - Buzo, A. AU - Gray, R. M. PY - 1980 DA - 1980// TI - An algorithm for vector quantization design JO - IEEE Trans Commun VL - C-28 UR - https://doi.org/10.1109/tcom.1980.1094577 DO - 10.1109/tcom.1980.1094577 ID - Linde1980 ER - TY - JOUR AU - Reddy, A. AU - Raj, B. 
PY - 2007 DA - 2007// TI - Soft mask methods for single-channel speaker separation JO - IEEE Trans Audio Speech Lang Process VL - 15 UR - https://doi.org/10.1109/TASL.2007.901310 DO - 10.1109/TASL.2007.901310 ID - Reddy2007 ER - TY - JOUR AU - Radfar, Mohammad H. AU - Dansereau, Richard M. PY - 2007 DA - 2007// TI - Single-Channel Speech Separation Using Soft Mask Filtering JO - IEEE Transactions on Audio, Speech and Language Processing VL - 15 UR - https://doi.org/10.1109/TASL.2007.904233 DO - 10.1109/TASL.2007.904233 ID - Radfar2007 ER - TY - JOUR AU - Hu, K. AU - Wang, D. L. PY - 2013 DA - 2013// TI - An iterative model-based approach to cochannel speech separation JO - EURASIP J Audio Speech Music Process VL - 14 UR - https://doi.org/10.1186/1687-4722-2013-14 DO - 10.1186/1687-4722-2013-14 ID - Hu2013 ER - TY - STD TI - Z. Wang, X. Wang, X. Li, Q. Fu, and Y. Yan, “Oracle performance investigation of the ideal masks,” in IWAENC, pp. 1-5, 2016. DOI: https://doi.org/10.1109/IWAENC.2016.7602888 ID - ref17 ER - TY - STD TI - B. Yan, C. Bao, Z. Bai, “DNN-based speech enhancement via integrating NMF and CASA,” International Conference on Audio, Language and Image Processing (ICALIP), 2018. DOI: https://doi.org/10.1109/ICALIP.2018.8455780 ID - ref18 ER - TY - JOUR AU - Xu, Y. AU - Du, J. AU - Dai, L. AU - Lee, C. PY - 2015 DA - 2015// TI - A regression approach to speech enhancement based on deep neural networks JO - IEEE Trans Audio Speech Lang Process VL - 23 UR - https://doi.org/10.1109/TASLP.2014.2364452 DO - 10.1109/TASLP.2014.2364452 ID - Xu2015 ER - TY - STD TI - D. S. Williamson, Y. X. Wang, and D. L. Wang, “Complex ratio masking for joint enhancement of magnitude and phase,” in Proc. ICASSP, pp. 5220-5224, 2016. DOI: https://doi.org/10.1109/ICASSP.2016.7472673 ID - ref20 ER - TY - JOUR AU - Williamson, Donald S. AU - Wang, Yuxuan AU - Wang, DeLiang 
PY - 2016 DA - 2016// TI - Complex Ratio Masking for Monaural Speech Separation JO - IEEE/ACM Transactions on Audio, Speech, and Language Processing VL - 24 UR - https://doi.org/10.1109/TASLP.2015.2512042 DO - 10.1109/TASLP.2015.2512042 ID - Williamson2016 ER - TY - CHAP AU - Geravanchizadeh, Masoud AU - Ahmadnia, Reza PY - 2014 DA - 2014// TI - Monaural Speech Enhancement Based on Multi-threshold Masking BT - Blind Source Separation PB - Springer Berlin Heidelberg CY - Berlin, Heidelberg UR - https://doi.org/10.1007/978-3-642-55016-4_13 DO - 10.1007/978-3-642-55016-4_13 ID - Geravanchizadeh2014 ER - TY - JOUR AU - Wang, Y. X. AU - Narayanan, A. AU - Wang, D. L. PY - 2014 DA - 2014// TI - On training targets for supervised speech separation JO - IEEE Trans Audio Speech Lang Process VL - 22 UR - https://doi.org/10.1109/taslp.2014.2352935 DO - 10.1109/taslp.2014.2352935 ID - Wang2014 ER - TY - JOUR AU - Chen, J. AU - Wang, Y. AU - Yoho, S. E. AU - Wang, D. L. AU - Healy, E. W. PY - 2016 DA - 2016// TI - Large-scale training to increase speech intelligibility for hearing-impaired listeners in novel noises JO - J Acoust Soc Am VL - 139 UR - https://doi.org/10.1121/1.4948445 DO - 10.1121/1.4948445 ID - Chen2016 ER - TY - JOUR AU - Wang, DeLiang AU - Chen, Jitong PY - 2018 DA - 2018// TI - Supervised Speech Separation Based on Deep Learning: An Overview JO - IEEE/ACM Transactions on Audio, Speech, and Language Processing VL - 26 UR - https://doi.org/10.1109/TASLP.2018.2842159 DO - 10.1109/TASLP.2018.2842159 ID - Wang2018 ER - TY - STD TI - N. Chen, C. C. Bao and F. Deng, “Speech enhancement with binaural cues derived from a priori codebook,” in Proc. ISCSLP, 2016. DOI: https://doi.org/10.1109/ISCSLP.2016.7918377 ID - ref26 ER - TY - STD TI - N. Chen, C. C. Bao and X. Y. Wang, “Speech enhancement based on binaural cues,” in Proc. APSIPA, 2017. DOI: https://doi.org/10.1109/APSIPA.2017.8282017 ID - ref27 ER - TY - JOUR AU - May, T. 
AU - Par, S. AU - Kohlrausch, A. PY - 2012 DA - 2012// TI - A binaural scene analyzer for joint localization and recognition of speakers in the presence of interfering noise sources and reverberation JO - IEEE Trans Audio Speech Lang Process VL - 20 UR - https://doi.org/10.1109/tasl.2012.2193391 DO - 10.1109/tasl.2012.2193391 ID - May2012 ER - TY - STD TI - Y. Jiang and R. S. Liu. “Binaural deep neural network for robust speech enhancement.” in Proc. IEEE Int. Conf. Signal, Process., Communications, Computing, pp:692-695, 2014. DOI: https://doi.org/10.1109/ICSPCC.2014.6986284 ID - ref29 ER - TY - JOUR AU - Jiang, Y. AU - Wang, D. L. AU - Liu, R. S. AU - Feng, Z. M. PY - 2014 DA - 2014// TI - Binaural classification for reverberant speech segregation using deep neural networks JO - IEEE Trans Audio Speech Lang Process VL - 22 UR - https://doi.org/10.1109/TASLP.2014.2361023 DO - 10.1109/TASLP.2014.2361023 ID - Jiang2014 ER - TY - JOUR AU - Chandna, S. AU - Wang, W. PY - 2018 DA - 2018// TI - Bootstrap averaging for model-based source separation in reverberant conditions JO - IEEE/ACM Trans Audio Speech Lang Process VL - 26 UR - https://doi.org/10.1109/TASLP.2018.2797425 DO - 10.1109/TASLP.2018.2797425 ID - Chandna2018 ER - TY - STD TI - A. Zermini, Q. Liu, Y. Xu, M. D. Plumbley, D. Betts, and W. Wang, "Binaural and log-power spectra features with deep neural networks for speech-noise separation", in Proc. IEEE 19th International Workshop on Multimedia Signal Processing (MMSP 2017), Luton, UK, October 16-18, 2017. DOI: https://doi.org/10.1109/MMSP.2017.8122280 ID - ref32 ER - TY - STD TI - Y. Yu, W. Wang, and P. Han, "Localization based stereo speech source separation using probabilistic time-frequency masking and deep neural networks", EURASIP Journal on Audio Speech and Music Processing, 2016:7, 18 pages, DOI https://doi.org/10.1186/s13636-016-0085-x, 2016. ID - ref33 ER - TY - JOUR AU - Alinaghi, A. AU - Jackson, P. AU - Liu, Q. AU - Wang, W. 
PY - 2014 DA - 2014// TI - Joint mixing vector and binaural model based stereo source separation JO - IEEE/ACM Trans Audio Speech Lang Process VL - 22 UR - https://doi.org/10.1109/TASLP.2014.2320637 DO - 10.1109/TASLP.2014.2320637 ID - Alinaghi2014 ER - TY - STD TI - A. Alinaghi, W. Wang, and P. Jackson, "Integrating binaural cues and blind source separation method for separating reverberant speech mixtures," in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2011), pp. 209-212, Prague, Czech Republic, May 22-27, 2011. DOI: https://doi.org/10.1109/ICASSP.2011.5946377 ID - ref35 ER - TY - STD TI - A. Alinaghi, W. Wang, and P.J.B. Jackson, "Spatial and coherence cues based time-frequency masking for binaural reverberant speech separation", in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2013), pp. 684-688, Vancouver, Canada, May 26-31, 2013. DOI: https://doi.org/10.1109/ICASSP.2013.6637735 ID - ref36 ER - TY - STD TI - A. Alinaghi, P Jackson, and W. Wang, "Comparison between the statistical cues in BSS techniques and binaural cues in CASA approaches for reverberant speech separation", in Proc. IET International Conference on Intelligent Signal Processing (ISP 2013), London, UK, December 3-4, 2013. DOI: https://doi.org/10.1049/cp.2013.2076 ID - ref37 ER - TY - STD TI - Q. Liu, W. Wang, P. Jackson, and Y. Tang, "A perceptually-weighted deep neural network for monaural speech enhancement in various background noise conditions", in Proc. European Signal Processing Conference (EUSIPCO 2017), Kos Island, Greece, August 28- September 2, 2017. DOI: https://doi.org/10.23919/EUSIPCO.2017.8081412 ID - ref38 ER - TY - JOUR AU - Liu, Qingju AU - Wang, Wenwu AU - Jackson, Philip 
PY - 2012 DA - 2012// TI - Use of bimodal coherence to resolve the permutation problem in convolutive BSS JO - Signal Processing VL - 92 UR - https://doi.org/10.1016/j.sigpro.2011.11.007 DO - 10.1016/j.sigpro.2011.11.007 ID - Liu2012 ER - TY - STD TI - C. Faller, F. Baumgarte, “Binaural cue coding: a novel and efficient representation of spectral audio.” IEEE ICASSP, Orlando, Florida, USA, pp. 1841-1844, 2002. DOI: https://doi.org/10.1109/ICASSP.2002.5744983 ID - ref40 ER - TY - JOUR AU - Baumgarte, F. AU - Faller, C. PY - 2003 DA - 2003// TI - Binaural cue coding-part I: psychoacoustic fundamentals and design principles JO - IEEE Transactions on Speech and Audio Processing VL - 11 UR - https://doi.org/10.1109/TSA.2003.818109 DO - 10.1109/TSA.2003.818109 ID - Baumgarte2003 ER - TY - JOUR AU - Faller, C. AU - Baumgarte, F. PY - 2003 DA - 2003// TI - Binaural cue coding-part II: schemes and applications JO - IEEE Transactions on Speech and Audio Processing VL - 11 UR - https://doi.org/10.1109/TSA.2003.818108 DO - 10.1109/TSA.2003.818108 ID - Faller2003 ER - TY - STD TI - Y. Zhang, R. Hu. “Speech wideband extension based on Gaussian mixture model.” Acta Acustica, vol. 34, no. 5, pp. 471-480, 2009. ISSN: 03710025 ID - ref43 ER - TY - JOUR AU - Liang, Shan AU - Liu, Wenju AU - Jiang, Wei AU - Xue, Wei PY - 2013 DA - 2013// TI - The optimal ratio time-frequency mask for speech separation in terms of the signal-to-noise ratio JO - The Journal of the Acoustical Society of America VL - 134 UR - https://doi.org/10.1121/1.4824632 DO - 10.1121/1.4824632 ID - Liang2013 ER - TY - JOUR AU - Liang, S. AU - Liu, W. J. AU - Jiang, W. AU - Xue, W. PY - 2014 DA - 2014// TI - The analysis of the simplification from the ideal ratio to binary mask in signal-to-noise ratio sense JO - Speech Comm VL - 59 UR - https://doi.org/10.1016/j.specom.2013.12.002 DO - 10.1016/j.specom.2013.12.002 ID - Liang2014 ER - TY - JOUR AU - Lu, Y. AU - Loizou, P. 
PY - 2008 DA - 2008// TI - A geometric approach to spectral subtraction JO - Speech Comm VL - 55 UR - https://doi.org/10.1016/j.specom.2008.01.003 DO - 10.1016/j.specom.2008.01.003 ID - Lu2008 ER - TY - JOUR AU - Bao, F. AU - Abdulla, W. H. PY - 2019 DA - 2019// TI - A new ratio mask representation for CASA-based speech enhancement JO - IEEE Trans Audio Speech Lang Process VL - 27 UR - https://doi.org/10.1109/TASLP.2018.2868407 DO - 10.1109/TASLP.2018.2868407 ID - Bao2019 ER - TY - JOUR AU - Bao, F. AU - Abdulla, W. H. PY - 2018 DA - 2018// TI - A new IBM estimation method based on convex optimization for CASA JO - Speech Comm VL - 97 UR - https://doi.org/10.1016/j.specom.2018.01.002 DO - 10.1016/j.specom.2018.01.002 ID - Bao2018 ER - TY - JOUR AU - Gao, B. AU - Woo, W. L. AU - Dlay, S. S. PY - 2013 DA - 2013// TI - Unsupervised single-channel separation of nonstationary signals using gammatone filter-bank and itakura-saito nonnegative matrix two-dimensional factorizations JO - IEEE Trans Circuits Syst I VL - 60 UR - https://doi.org/10.1109/tcsi.2012.2215735 DO - 10.1109/tcsi.2012.2215735 ID - Gao2013 ER - TY - JOUR AU - Narayanan, A. AU - Wang, D. L. PY - 2012 DA - 2012// TI - A CASA-based system for long-term SNR estimation JO - IEEE Trans Audio Speech Lang Process VL - 20 UR - https://doi.org/10.1109/TASL.2012.2205242 DO - 10.1109/TASL.2012.2205242 ID - Narayanan2012 ER - TY - JOUR AU - Chen, J. AU - Wang, Y. AU - Wang, D. L. PY - 2014 DA - 2014// TI - A feature study for classification-based speech separation at low signal-to-noise ratios JO - IEEE/ACM Trans Audio Speech Lang Process VL - 22 UR - https://doi.org/10.1109/TASLP.2014.2359159 DO - 10.1109/TASLP.2014.2359159 ID - Chen2014 ER - TY - JOUR AU - Deng, F. AU - Bao, F. AU - Bao, C. C. 
PY - 2014 DA - 2014// TI - Speech enhancement using generalized weighted β-order spectral amplitude estimator JO - Speech Commun VL - 59 UR - https://doi.org/10.1016/j.specom.2014.01.002 DO - 10.1016/j.specom.2014.01.002 ID - Deng2014 ER - TY - JOUR AU - Cohen, I. AU - Berdugo, B. PY - 2002 DA - 2002// TI - Noise estimation by minima controlled recursive averaging for robust speech enhancement JO - IEEE Signal Process Lett VL - 9 UR - https://doi.org/10.1109/97.988717 DO - 10.1109/97.988717 ID - Cohen2002 ER - TY - STD TI - J. Taghia, N. Mohammadiha, J. Sang, et al. "An evaluation of noise power spectral density estimation algorithms in adverse acoustic environments." 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011. DOI: https://doi.org/10.1109/ICASSP.2011.5947389 ID - ref54 ER - TY - JOUR AU - Zue, Victor AU - Seneff, Stephanie AU - Glass, James PY - 1990 DA - 1990// TI - Speech database development at MIT: Timit and beyond JO - Speech Communication VL - 9 UR - https://doi.org/10.1016/0167-6393(90)90010-7 DO - 10.1016/0167-6393(90)90010-7 ID - Zue1990 ER - TY - JOUR AU - Varga, A. AU - Steeneken, H. J. PY - 1993 DA - 1993// TI - Assessment for automatic speech recognition: II. NOISEX-92: a database and an experiment to study the effect of additive noise on speech recognition systems JO - Speech Commun VL - 12 UR - https://doi.org/10.1016/0167-6393(93)90095-3 DO - 10.1016/0167-6393(93)90095-3 ID - Varga1993 ER - TY - STD TI - N. Fan, J. Rosca, R. Balan. "Speech noise estimation using enhanced minima controlled recursive averaging," 2007 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2007. DOI: https://doi.org/10.1109/ICASSP.2007.366979 ID - ref57 ER - TY - STD TI - Antony WR, John GB, Michael PH, Andries PH. 
“Perceptual evaluation of speech quality (PESQ), an objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs,” IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2001. https://doi.org/10.1109/ICASSP.2001.941023. ID - ref58 ER - TY - STD TI - S. Rangachari, P. C. Loizou, Y. Hu. "A noise estimation algorithm with rapid adaptation for highly nonstationary environments," 2004 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2004. DOI: https://doi.org/10.1109/ICASSP.2004.1325983 ID - ref59 ER - TY - JOUR AU - Taal, C. H. AU - Hendriks, R. C. AU - Heusdens, R. PY - 2011 DA - 2011// TI - An algorithm for intelligibility prediction of time–frequency weighted noisy speech JO - IEEE Trans Audio Speech Lang Process VL - 19 UR - https://doi.org/10.1109/tasl.2011.2114881 DO - 10.1109/tasl.2011.2114881 ID - Taal2011 ER - TY - JOUR AU - Bao, Feng AU - Abdulla, Waleed H. PY - 2018 DA - 2018// TI - A new time-frequency binary mask estimation method based on convex optimization of speech power JO - Speech Communication VL - 97 UR - https://doi.org/10.1016/j.specom.2018.01.002 DO - 10.1016/j.specom.2018.01.002 ID - Bao2018 ER -