TY - STD TI - DD Lee, HS Seung, in Proc. Neural. Inf. Process. Syst, 13. Algorithms for non-negative matrix factorization, (2001), pp. 556–562. ID - ref1 ER - TY - JOUR AU - Virtanen, T. PY - 2007 DA - 2007// TI - Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria JO - IEEE Trans. Audio, Speech and Lang. Process VL - 15 UR - https://doi.org/10.1109/TASL.2006.885253 DO - 10.1109/TASL.2006.885253 ID - Virtanen2007 ER - TY - STD TI - MN Schmidt, RK Olsson, in Proc. INTERSPEECH. Single-channel speech separation using sparse non-negative matrix factorization, (2006), pp. 2614–2617. ID - ref3 ER - TY - JOUR AU - Gemmeke, J. F. AU - Viratnen, T. AU - Hurmalainen, A. PY - 2011 DA - 2011// TI - Exemplar-based sparse representations for noise robust automatic speech recognition JO - IEEE Trans. Audio, Speech and Lang. Process VL - 19 UR - https://doi.org/10.1109/TASL.2011.2112350 DO - 10.1109/TASL.2011.2112350 ID - Gemmeke2011 ER - TY - STD TI - R Takashima, T Takiguchi, Y Ariki, in Proc. SLT. Exemplar-based voice conversion in noisy environment, (2012), pp. 313–317. ID - ref5 ER - TY - JOUR AU - Helander, E. AU - Virtanen, T. AU - Nurminen, J. AU - Gabbouj, M. PY - 2010 DA - 2010// TI - Voice conversion using partial least squares regression JO - IEEE Trans. On Audio, Speech, Lang. Process VL - 18 UR - https://doi.org/10.1109/TASL.2010.2041699 DO - 10.1109/TASL.2010.2041699 ID - Helander2010 ER - TY - STD TI - CH Lee, CH Wu, in Proc. INTERSPEECH. MAP-based adaptation for speech conversion using adaptation data selection and non-parallel training, (2006), pp. 2254–2257. ID - ref7 ER - TY - JOUR AU - Mouchtaris, A. AU - der Spiegel, J. V. AU - Mueller, P. PY - 2006 DA - 2006// TI - Nonparallel training for voice conversion based on a parameter adaptation approach JO - IEEE Trans. Audio, Speech, and Lang. Processing VL - 14 UR - https://doi.org/10.1109/TSA.2005.857790 DO - 10.1109/TSA.2005.857790 ID - Mouchtaris2006 ER - TY - STD TI - T Toda, Y Ohtani, K Shikano, in Proc. INTERSPEECH. Eigenvoice conversion based on Gaussian mixture model, (2006), pp. 2446–2449. ID - ref9 ER - TY - STD TI - D Saito, K Yamamoto, N Minematsu, K Hirose, in Proc. INTERSPEECH. One-to-many voice conversion based on tensor representation of speaker space, (2011), pp. 653–656. ID - ref10 ER - TY - STD TI - EM Grais, H Erdogan, in Proc. INTERSPEECH. Adaptation of speaker-specic bases in non-negative matrix factorization for single channel speech-music separation, (2011), pp. 569–572. ID - ref11 ER - TY - JOUR AU - Stylianou, Y. AU - Cappe, O. AU - Moilines, E. PY - 1998 DA - 1998// TI - Continuous probabilistic transform for voice conversion JO - IEEE. Trans. Speech and Audio Processing VL - 6 UR - https://doi.org/10.1109/89.661472 DO - 10.1109/89.661472 ID - Stylianou1998 ER - TY - JOUR AU - Toda, T. AU - Black, A. AU - Tokuda, K. PY - 2007 DA - 2007// TI - Voice conversion based on maximum likelihood estimation of spectral parameter trajectory JO - IEEE Trans. Audio, Speech and Lang. Process VL - 15 UR - https://doi.org/10.1109/TASL.2007.907344 DO - 10.1109/TASL.2007.907344 ID - Toda2007 ER - TY - JOUR AU - Takashima, R. AU - Takiguchi, T. AU - Ariki, Y. PY - 2013 DA - 2013// TI - Exemplar-based voice conversion using sparse representation in noisy environments JO - IEICE Trans. Fundam. Electron. Commun. Comp. Sci VL - E96-A UR - https://doi.org/10.1587/transfun.E96.A.1946 DO - 10.1587/transfun.E96.A.1946 ID - Takashima2013 ER - TY - STD TI - K Masaka, R Aihara, T Takiguchi, Y Ariki, in Proc. INTERSPEECH. Multimodal exemplar-based voice conversion using lip features in noisy environments, (2014), pp. 1159–1163. ID - ref15 ER - TY - STD TI - R Aihara, T Nakashika, T Takiguchi, Y Ariki, in Proc. ICASSP. Voice conversion based on non-negative matrix factorization using phoneme-categorized dictionary, (2014), pp. 7894–7898. ID - ref16 ER - TY - JOUR AU - Wu, Z. AU - Virtanen, T. AU - Chng, E. S. AU - Li, H. PY - 2014 DA - 2014// TI - Exemplar-based sparse representation with residual compensation for voice conversion JO - IEEE Trans. Audio, Speech and Lang. Process VL - 22 UR - https://doi.org/10.1109/TASLP.2014.2333242 DO - 10.1109/TASLP.2014.2333242 ID - Wu2014 ER - TY - STD TI - R Aihara, R Takashima, T Takiguchi, Y Ariki, A preliminary demonstration of exemplar-based voice conversion for articulation disorders using an individuality-preserving dictionary. EURASIP J. Audio, Speech, and Music Process. 2014(5) (2014). doi:http://dx.doi.org/10.1186/1687-4722-2014-5. UR - http://dx.doi.org/10.1186/1687-4722-2014-5 ID - ref18 ER - TY - STD TI - R Aihara, T Takiguchi, Y Ariki, in Proc. ICASSP. Activity-mapping non-negative matrix factorization for exemplar-based voice conversion, (2015), pp. 4899–4903. ID - ref19 ER - TY - JOUR AU - Kurematsu, A. AU - Takeda, K. AU - Sagisaka, Y. AU - Katagiri, S. AU - Kuwabara, H. AU - Shikano, K. PY - 1990 DA - 1990// TI - ATR Japanese speech database as a tool of speech recognition and synthesis JO - Speech Communication VL - 9 UR - https://doi.org/10.1016/0167-6393(90)90011-W DO - 10.1016/0167-6393(90)90011-W ID - Kurematsu1990 ER - TY - JOUR AU - Kitaoka, N. AU - Yamada, T. AU - Tsuge, S. AU - Miyajima, C. AU - Yamamoto, K. AU - Nishiura, T. AU - Nakayama, M. AU - Denda, Y. AU - Fujimoto, M. AU - Takiguchi, T. AU - Tamura, S. AU - Matsuda, S. AU - Ogawa, T. AU - Kuroiwa, S. AU - Takeda, K. AU - Nakamura, S. PY - 2009 DA - 2009// TI - CENSREC-1-C: An evaluation framework for voice activity detection under noisy environments JO - Acoustical Science and Technology VL - 30 UR - https://doi.org/10.1250/ast.30.363 DO - 10.1250/ast.30.363 ID - Kitaoka2009 ER - TY - STD TI - H Kawahara, H Matsui, in Proc. ICASSP, I. Auditory morphing based on an elastic perceptual distance metric in an interference-free time-frequency representation, (2003), pp. 256–259. ID - ref22 ER - TY - STD TI - T En-Najjary, O Roec, T Chonavel, in Proc. ICSLP. A voice conversion method based on joint pitch and spectral envelope transformation, (2004), pp. 199–203. ID - ref23 ER - TY - STD TI - INTERNATIONAL TELECOMMUNICATION UNION, Methods for objective and subjective assessment of quality. ITU-T Recommendation, 800 (2003). ID - ref24 ER - TY - JOUR AU - Aihara, R. AU - Takashima, R. AU - Takiguchi, T. AU - Ariki, Y. PY - 2014 DA - 2014// TI - Noise-robust voice conversion based on sparse spectral mapping using non-negative matrix factorization JO - IEICE Trans. Inf. Syst VL - E97-D UR - https://doi.org/10.1587/transinf.E97.D.1411 DO - 10.1587/transinf.E97.D.1411 ID - Aihara2014 ER - TY - STD TI - C Veaux, X Robet, in Proc. INTERSPEECH. Intonation conversion from neutral to expressive speech, (2011), pp. 2765–2768. ID - ref26 ER -