TY - STD TI - NIST, TREC NIST Evaluations. Accessed 6 Aug 2014. [http://www.itl.nist.gov/iad/mig//tests/sdr/] UR - http://www.itl.nist.gov/iad/mig//tests/sdr/ ID - ref1 ER - TY - STD TI - S Galliano, E Geoffrois, D Mostefa, in Interspeech, Lisbon, 4–8 Sept 2005. The ESTER phase II evaluation campaign for the rich transcription of French broadcast news, pp. 3–6. ID - ref2 ER - TY - STD TI - J Zibert, F Mihelic, J Martens, H Meinedo, J Neto, L Docio, C Garcia-Mateo, P David, et al., in Interspeech, Lisbon, 4–8 Sept 2005. The COST278 broadcast news segmentation and speaker clustering evaluation-overview, methodology, systems, results. ID - ref3 ER - TY - JOUR AU - Lavner, Y. AU - Ruinskiy, D. PY - 2009 DA - 2009// TI - A decision-tree-based algorithm for speech/music classification and segmentation JO - EURASIP J. Audio Speech Music Process VL - 2009 UR - https://doi.org/10.1155/2009/239892 DO - 10.1155/2009/239892 ID - Lavner2009 ER - TY - STD TI - S Imai, in IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, Boston, 14–16 Apr 1983. Cepstral analysis synthesis on the mel frequency scale, pp. 93–96. ID - ref5 ER - TY - STD TI - R Vergin, D O’Shaughnessy, V Gupta, in IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, vol. 1, Atlanta, 7–10 May 1996. Compensated mel frequency cepstrum coefficients, pp. 323–326. ID - ref6 ER - TY - JOUR AU - Vergin, R. PY - 1999 DA - 1999// TI - Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition JO - IEEE Trans. Speech Audio Process VL - 7 UR - https://doi.org/10.1109/89.784104 DO - 10.1109/89.784104 ID - Vergin1999 ER - TY - STD TI - E Wong, S Sridharan, in International Symposium on Intelligent Multimedia, Video and Speech Processing, Kowloon Shangri-La, Hong Kong, 2–4 May 2001.
Comparison of linear prediction cepstrum coefficients and mel-frequency cepstrum coefficients for language identification, pp. 95–98. ID - ref8 ER - TY - BOOK AU - Hasan, M. AU - Jamil, M. AU - Rahman, M. PY - 2004 DA - 2004// TI - International Conference on Computer and Electrical Engineering ID - Hasan2004 ER - TY - JOUR AU - Dhanalakshmi, P. AU - Palanivel, S. AU - Ramalingam, V. PY - 2011 DA - 2011// TI - Classification of audio signals using AANN and GMM JO - Appl. Soft Comput VL - 11 UR - https://doi.org/10.1016/j.asoc.2009.12.033 DO - 10.1016/j.asoc.2009.12.033 ID - Dhanalakshmi2011 ER - TY - JOUR AU - Xie, L. AU - Fu, Z.-H. AU - Feng, W. AU - Luo, Y. PY - 2011 DA - 2011// TI - Pitch-density-based features and an SVM binary tree approach for multi-class audio classification in broadcast news JO - Multimed. Syst VL - 17 UR - https://doi.org/10.1007/s00530-010-0205-x DO - 10.1007/s00530-010-0205-x ID - Xie2011 ER - TY - STD TI - J Saunders, in IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, Atlanta, 7–10 May 1996. Real-time discrimination of broadcast speech/music, pp. 993–996. ID - ref12 ER - TY - JOUR AU - Li, D. AU - Sethi, I. AU - Dimitrova, N. AU - McGee, T. PY - 2001 DA - 2001// TI - Classification of general audio data for content-based retrieval JO - Pattern Recogn. Lett VL - 22 UR - https://doi.org/10.1016/S0167-8655(00)00119-7 DO - 10.1016/S0167-8655(00)00119-7 ID - Li2001 ER - TY - JOUR AU - Lu, L. AU - Zhang, H. AU - Jiang, H. PY - 2002 DA - 2002// TI - Content analysis for audio classification and segmentation JO - IEEE Trans. Speech Audio Process VL - 10 UR - https://doi.org/10.1109/TSA.2002.804546 DO - 10.1109/TSA.2002.804546 ID - Lu2002 ER - TY - STD TI - TL Nwe, H Li, in IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 2, Philadelphia, 18–23 Mar 2005. Broadcast news segmentation by audio type analysis, p. 1065. ID - ref15 ER - TY - BOOK AU - Hauptmann, A.
AU - Baron, R. AU - Chen, M. PY - 2003 DA - 2003// TI - Proc. TRECVID ID - Hauptmann2003 ER - TY - STD TI - S Dharanipragada, M Franz, in DARPA Broadcast News Workshop, Herndon, 28 Feb–3 Mar 1999. Story segmentation and topic detection in the broadcast news domain, pp. 1–4. ID - ref17 ER - TY - JOUR AU - Gallardo-Antolín, A. AU - Montero, J. PY - 2010 DA - 2010// TI - Histogram equalization-based features for speech, music, and song discrimination JO - IEEE Signal Process. Lett VL - 17 UR - https://doi.org/10.1109/LSP.2010.2049877 DO - 10.1109/LSP.2010.2049877 ID - Gallardo-Antolín2010 ER - TY - JOUR AU - Butko, T. AU - Nadeu, C. PY - 2011 DA - 2011// TI - Audio segmentation of broadcast news in the Albayzin-2010 evaluation: overview, results, and discussion JO - EURASIP J. Audio Speech Music Process VL - 2011 UR - https://doi.org/10.1186/1687-4722-2011-1 DO - 10.1186/1687-4722-2011-1 ID - Butko2011 ER - TY - JOUR AU - Markaki, M. AU - Stylianou, Y. PY - 2011 DA - 2011// TI - Discrimination of speech from nonspeeech in broadcast news based on modulation frequency features JO - Speech Commun VL - 53 UR - https://doi.org/10.1016/j.specom.2010.08.007 DO - 10.1016/j.specom.2010.08.007 ID - Markaki2011 ER - TY - JOUR AU - Huang, R. AU - Hansen, J. PY - 2006 DA - 2006// TI - Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora JO - IEEE Trans. Audio Speech Lang. Process VL - 14 UR - https://doi.org/10.1109/TSA.2005.858057 DO - 10.1109/TSA.2005.858057 ID - Huang2006 ER - TY - JOUR AU - Nguyen, N. AU - Haque, M. AU - Kim, C.-H. AU - Kim, J. PY - 2011 DA - 2011// TI - Audio segmentation and classification using a temporally weighted fuzzy C-means algorithm JO - Adv. Neural Netw VL - 6676 ID - Nguyen2011 ER - TY - BOOK AU - Chen, S. S. AU - Gopalakrishnan, P. S. PY - 1998 DA - 1998// TI - Proc. DARPA Broadcast News Workshop ID - Chen1998 ER - TY - JOUR AU - Wu, C.-H. AU - Chiu, Y.
PY - 2006 DA - 2006// TI - Automatic segmentation and identification of mixed-language speech using delta-BIC and LSA-based GMMs JO - IEEE Trans. Audio Speech Lang. Process VL - 14 UR - https://doi.org/10.1109/TSA.2005.852992 DO - 10.1109/TSA.2005.852992 ID - Wu2006 ER - TY - JOUR AU - Kotti, M. AU - Benetos, E. AU - Kotropoulos, C. PY - 2008 DA - 2008// TI - Computationally efficient and robust BIC-based speaker segmentation JO - IEEE Trans. Audio Speech Lang. Process VL - 16 UR - https://doi.org/10.1109/TASL.2008.925152 DO - 10.1109/TASL.2008.925152 ID - Kotti2008 ER - TY - JOUR AU - Wu, C.-H. AU - Hsieh, C.-H. PY - 2006 DA - 2006// TI - Multiple change-point audio segmentation and classification using an MDL-based Gaussian model JO - IEEE Trans. Audio Speech Lang. Process VL - 14 UR - https://doi.org/10.1109/TSA.2005.852988 DO - 10.1109/TSA.2005.852988 ID - Wu2006 ER - TY - BOOK AU - Misra, A. PY - 2012 DA - 2012// TI - Proc. Interspeech, Speech/nonspeech segmentation in web videos ID - Misra2012 ER - TY - JOUR AU - Lu, L. AU - Zhang, H.-J. AU - Li, S. Z. PY - 2003 DA - 2003// TI - Content-based audio classification and segmentation by using support vector machines JO - Multimed. Syst VL - 8 UR - https://doi.org/10.1007/s00530-002-0065-0 DO - 10.1007/s00530-002-0065-0 ID - Lu2003 ER - TY - STD TI - H Aronowitz, in IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, Honolulu, 15–20 Apr 2007. Segmental modeling for audio segmentation, pp. 393–396. ID - ref29 ER - TY - STD TI - J Foote, in American Association for Artificial Intelligence: Intelligent Integration and Use of Text, Image, Video, and Audio Corpora, Stanford, March 1997. A similarity measure for automatic audio classification. ID - ref30 ER - TY - STD TI - A Gallardo, R San Segundo, in II Iberian SLTech, Vigo, 10–12 Nov 2010. UPM-UC3M system for music and speech segmentation, pp. 421–424. ID - ref31 ER - TY - BOOK AU - Castan, D. AU - Vaquero, C.
AU - Ortega, A. AU - Martínez, D. AU - Lleida, E. PY - 2011 DA - 2011// TI - Proc. Interspeech ID - Castan2011 ER - TY - STD TI - T Butko, CN Camprubí, H Schulz, in II Iberian SLTech, Vigo, 10–12 Nov 2010. Albayzin-2010 audio segmentation evaluation: evaluation setup and results, pp. 305–308. ID - ref33 ER - TY - JOUR AU - Kenny, P. AU - Boulianne, G. AU - Dumouchel, P. PY - 2005 DA - 2005// TI - Eigenvoice modeling with sparse training data JO - IEEE Trans. Speech Audio Process VL - 13 UR - https://doi.org/10.1109/TSA.2004.840940 DO - 10.1109/TSA.2004.840940 ID - Kenny2005 ER - TY - STD TI - P Kenny, Joint factor analysis of speaker and session variability: theory and algorithms, 1–17 (2006). Accessed 6 Aug 2014. [http://www.crim.ca/perso/patrick.kenny] UR - http://www.crim.ca/perso/patrick.kenny ID - ref35 ER - TY - JOUR AU - Kenny, P. AU - Boulianne, G. AU - Ouellet, P. AU - Dumouchel, P. PY - 2007 DA - 2007// TI - Joint factor analysis versus eigenchannels in speaker recognition JO - IEEE Trans. Audio Speech Lang. Process VL - 15 UR - https://doi.org/10.1109/TASL.2006.881693 DO - 10.1109/TASL.2006.881693 ID - Kenny2007 ER - TY - STD TI - C Vaquero, A Ortega, J Villalba, A Miguel, E Lleida, in Proc Interspeech 2010, vol. 2010, Makuhari, 26–30 Sept 2010. Confidence measures for speaker segmentation and their relation to speaker verification, pp. 2310–2313. ID - ref37 ER - TY - STD TI - C Vaquero, A Ortega, E Lleida, in IEEE International Conference on Acoustics, Speech and Signal Processing, Prague, 22–27 May 2011. Intra-session variability compensation and a hypothesis generation and selection strategy for speaker segmentation, pp. 3–6. ID - ref38 ER - TY - STD TI - N Brummer, A Strasheim, V Hubeika, P Matějka, L Burget, O Glembek, in Proc Interspeech, Brighton, 6–10 Sept 2009. Discriminative acoustic language recognition via channel-compensated GMM statistics, pp. 2187–2190. ID - ref39 ER - TY - BOOK AU - Castan, D. AU - Ortega, A. AU - Miguel, A. AU - Lleida, E.
PY - 2012 DA - 2012// TI - Proc. SLAM Workshop ID - Castan2012 ER - TY - STD TI - D Castan, A Ortega, J Villalba, E Lleida, in IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, Vancouver, 26–31 May 2013. Segmentation-by-classification system based on factor analysis. ID - ref41 ER - TY - STD TI - NIST, The 2009 (RT-09) Rich Transcription Meeting Recognition Evaluation Plan (Melbourne, 28–29 May 2009). ID - ref42 ER - TY - JOUR AU - Reynolds, D. AU - Quatieri, T. F. AU - Dunn, R. B. PY - 2000 DA - 2000// TI - Speaker verification using adapted Gaussian mixture models JO - Digit. Signal Process VL - 10 UR - https://doi.org/10.1006/dspr.1999.0361 DO - 10.1006/dspr.1999.0361 ID - Reynolds2000 ER - TY - STD TI - CM Bishop, Pattern Recognition and Machine Learning, vol. 4 Computers - Springer, Aug 17, 2006. ID - ref44 ER - TY - JOUR AU - Kenny, P. AU - Reynolds, D. AU - Castaldo, F. PY - 2010 DA - 2010// TI - Diarization of telephone conversations using factor analysis JO - IEEE J. Selected Topics Signal Process VL - 4 UR - https://doi.org/10.1109/JSTSP.2010.2081790 DO - 10.1109/JSTSP.2010.2081790 ID - Kenny2010 ER - TY - JOUR AU - Li, H. AU - Ma, B. AU - Lee, K. PY - 2013 DA - 2013// TI - Spoken language recognition: from fundamentals to practice JO - Proceedings of IEEE VL - 101 UR - https://doi.org/10.1109/JPROC.2012.2237151 DO - 10.1109/JPROC.2012.2237151 ID - Li2013 ER - TY - JOUR AU - Castaldo, F. AU - Colibro, D. AU - Dalmasso, E. AU - Laface, P. AU - Vair, C. PY - 2007 DA - 2007// TI - Compensation of nuisance factors for speaker and language recognition JO - IEEE Trans. Audio Speech Lang. Process VL - 15 UR - https://doi.org/10.1109/TASL.2007.901823 DO - 10.1109/TASL.2007.901823 ID - Castaldo2007 ER - TY - JOUR AU - Vogt, R. AU - Sridharan, S. PY - 2008 DA - 2008// TI - Explicit modelling of session variability for speaker verification JO - Comput.
Speech Lang VL - 22 UR - https://doi.org/10.1016/j.csl.2007.05.003 DO - 10.1016/j.csl.2007.05.003 ID - Vogt2008 ER - TY - BOOK AU - Castan, D. AU - Ortega, A. AU - Lleida, E. PY - 2012 DA - 2012// TI - Proc. III Iberian SLTech ID - Castan2012 ER - TY - STD TI - O Glembek, L Burget, N Dehak, N Brummer, P Kenny, in IEEE International Conference on Acoustics, Speech and Signal Processing, Taipei, 19–24 Apr 2009. Comparison of scoring methods used in speaker recognition with joint factor analysis, pp. 4057–4060. ID - ref50 ER - TY - STD TI - P Kenny, G Boulianne, P Ouellet, P Dumouchel, in IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 1, Philadelphia, 18–23 Mar 2005. Factor analysis simplified, pp. 637–640. ID - ref51 ER - TY - JOUR AU - Kittler, J. PY - 1998 DA - 1998// TI - Combining classifiers: a theoretical framework JO - Pattern Anal. Appl VL - 1 UR - https://doi.org/10.1007/BF01238023 DO - 10.1007/BF01238023 ID - Kittler1998 ER - TY - BOOK AU - Brummer, N. PY - 2010 DA - 2010// TI - Measuring, refining and calibrating speaker and language information extracted from speech PhD thesis ID - Brummer2010 ER - TY - STD TI - V Hubeika, A Strasheim, in Odyssey, Brno, 28 June–1 July 2010. Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system, pp. 215–221. ID - ref54 ER - TY - BOOK AU - Martínez, D. AU - Miguel, A. AU - Ortega, A. AU - Lleida, E. PY - 2011 DA - 2011// TI - Proc. Interspeech ID - Martínez2011 ER -