- Research Article
- Open Access
The Effect of Listener Accent Background on Accent Perception and Comprehension
EURASIP Journal on Audio, Speech, and Music Processing volume 2007, Article number: 076030 (2007)
Variability of speaker accent is a challenge for effective human communication as well as speech technology including automatic speech recognition and accent identification. The motivation of this study is to contribute to a deeper understanding of accent variation across speakers from a cognitive perspective. The goal is to provide perceptual assessment of accent variation in native and English. The main focus is to investigate how listener's accent background affects accent perception and comprehensibility. The results from perceptual experiments show that the listeners' accent background impacts their ability to categorize accents. Speaker accent type affects perceptual accent classification. The interaction between listener accent background and speaker accent type is significant for both accent perception and speech comprehension. In addition, the results indicate that the comprehensibility of the speech contributes to accent perception. The outcomes point to the complex nature of accent perception, and provide a foundation for further investigation on the involvement of cognitive processing for accent perception. These findings contribute to a richer understanding of the cognitive aspects of accent variation, and its application for speech technology.
Crystal D: A Dictionary of Linguistics and Phonetics. Blackwell, Malden, Mass, USA; 1997.
Markman AB, Ross BH: Category use and category learning. Psychological Bulletin 2003,129(4):592-613.
Ross BH: The use of categories affects classification. Journal of Memory and Language 1997,37(2):240-267. 10.1006/jmla.1997.2515
Lucy JA, Gaskins S: Grammatical categories and the development of classification preferences: a comparative approach. In Language Acquisition and Conceptual Development. Edited by: Levinson SC, Bowerman M. Cambridge University Press, New York, NY, USA; 2001:257-283.
Lucy JA, Gaskins S: Interaction of language type and referent type in the development of nonverbal classification preferences. In Language in Mind. Edited by: Gentner D, Goldin-Meadow S. MIT Press, Cambridge, Mass, USA; 2003:465-492.
Yoshida H, Smith LB: Shifting ontological boundaries: how Japanese- and English-speaking children generalize names for animals and artifacts. Developmental Science 2003,6(1):1-17. 10.1111/1467-7687.00247_1
Sandhofer CM, Smith LB: Perceptual complexity and form class cues in novel word extension tasks: how 4-year-old children interpret adjectives and count nouns. Developmental Science 2004,7(3):378-388. 10.1111/j.1467-7687.2004.00354.x
Clopper CG: Linguistic experience and the perceptual classification of dialect variation, Ph.D. dissertation, Indiana University, Bloomington, Ind, USA, 2004.
Jilka M: The contribution of intonation to the perception of foreign accent, Doctoral dissertation, Arbeiten des Instituts f¨urMaschinelle Sprachverarbeitung (AIMS),University of Stuttgart, Stuttgart, Germany, 2000.
Munro M: Non-segmental factors in foreign accent: ratings of filtered speech. Studies in Second Language Acquisition 1995,17(1):17-34. 10.1017/S0272263100013735
Carmichael L: Measurable degrees of foreign accent: a correlational study of production, perception, and acquisition, Unpublished M.A. thesis, University of Washington, Seattle, Wash, USA, 2000.
Flege JE: The production of "new" and "similar" phones in a foreign language: evidence for the effect of equivalence classification. Journal of Phonetics 1978, 15: 47-65.
Flege JE, Munro MJ, MacKay IRA: Factors affecting strength of perceived foreign accent in a second language. Journal of the Acoustical Society of America 1995,97(5):3125-3134. 10.1121/1.413041
Magen HS: The perception of foreign-accented speech. Journal of Phonetics 1998,26(4):381-400. 10.1006/jpho.1998.0081
Davies C, Tyler A: Demystifying cross-cultural (mis) communication: improving performance through balanced feedback of international teaching assistants. 1994.
Hahn L: Native speakers' reaction to non-native stress in English discourse, Unpublished doctoral dissertation, University of Illinois, Urbana-Champaign, Ill, USA, 1999.
Pickering L: The analysis of prosodic systems in the classroom discourse of NS and NNS teaching assistants, Doctoral dissertation. University of Florida, Gainesville, Fla, USA, 1999.
Pickering L, Wiltshire C: Pitch accent in Indian-English teaching discourse. World Englishes 2000,19(2):173-183. 10.1111/1467-971X.00167
Pickering L: The role of tone choice in improving ITA communication in the classroom. TESOL Quarterly 2001,35(2):233-255. 10.2307/3587647
Tyler A, Jeffries A, Davies C: The effect of discourse structuring devices on listener perceptions of coherence in non-native university teachers' spoken discourse. World Englishes 1988,7(2):101-110. 10.1111/j.1467-971X.1988.tb00223.x
Wennerstrom A: Intonational meaning in English discourse: a study of non-native speakers. Applied Linguistics 1994,15(4):399-420. 10.1093/applin/15.4.399
Wennerstrom A: Intonation as cohesion in academic discourse: a study of Chinese speakers of English. Studies in Second Language Acquisition 1998,20(1):1-25.
Wennerstrom A: The role of intonation in second language fluency. In Perspectives on Fluency. The University of Michigan Press, Ann Arbor, Mich, USA; 2000:102-127.
Wennerstrom A: The Music of Everyday Speech: Prosody and Discourse Analysis. Oxford University Press, New York, NY, USA; 2001.
Angkititrakul P, Hansen JHL: Advances in phone-based modeling for automatic accent classification. IEEE Transactions on Audio, Speech and Language Processing 2006,14(2):634-646.
Byrne W, Knodt E, Khudanpur S, Bernstein J: Is automatic speech recognition ready for non-native speech? A data collection effort and initial experiments in modeling conversational hispanic english. Proceedings of Conference on Speech Technology in Language Learning (ESCA-ITR &), 1998, Marholmen, Sweden
Faria A: Accent classification for speech recognition. Proceedings of the 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI '05), July 2005, Edinburgh, UK
Ikeno A, Pellom B, Cer D, et al.: Issues in recognition of Spanish-accented spontaneous English. Proceedings of Workshop on Spontaneous Speech Processing and Recognition (ISCA '03), 2003, Tokyo, Japan 63-66.
Tomokiyo LM, Waibel A: Adaptation methods for non-native speech. Proceedings of Multilinguality in Spoken Language Processing, September 2001, Aalborg, Denmark
Ward W, Krech H, Yu X, et al.: Lexicon adaptation for LCVSR: speaker idiosyncracies, non-native speakers, and pronunciation choice. Proceedings of Workshop on Pronunciation Modeling and Lexicon Adaptation for Spoken Language Technology (PLMA '02), 2002, Estes Park, Colo, USA 83-88.
Hansen JHL, Huang R, Mangalath P, Zhou B, Seadle M, Deller JR Jr.: SPEECHFIND: spoken document retrieval for a national gallery of the spoken word. Proceedings of the 6th Nordic Signal Processing Simposium (NORSIG '04), April 2004, Espoo, Finland 1-4.
Hansen JHL, Yapanel U, Huang R, Ikeno A: Dialect analysis and modeling for automatic classification. Proceedings of International Conference on Spoken Language Processing (ICSLP '04), October 2004, Jeju Island, Korea 1-4.
Tjalve M, Huckvale M: Pronunciation variation modelling using accent features. Proceedings of the 9th European Conference on Speech Communication and Technology (InterSpeech '05), September 2005, Lisbon, Portugal 1341-1344.
Clopper CG, Pisoni DB: Effects of talker variability on perceptual learning of dialects. Language and Speech 2004,47(3):207-239. 10.1177/00238309040470030101
Clopper CG, Pisoni DB: Homebodies and army brats: some effects of early linguistic experience and residential history on dialect categorization. Language Variation and Change 2004, 16: 31-48.
Clopper CG, Pisoni DB: Some acoustic cues for the perceptual categorization of American English regional dialects. Journal of Phonetics 2004,32(1):111-140. 10.1016/S0095-4470(03)00009-3
Evans BG, Iverson P: Vowel normalization for accent: an investigation of best exemplar locations in northern and southern British English sentences. Journal of the Acoustical Society of America 2004,115(1):352-361. 10.1121/1.1635413
Labov W, Sharon A: Understanding birmingham. In Language Variety in the South Revisited. Edited by: Bernstein C, Nunnally T, Sabino R. University of Alabama Press, Tuscaloosa, Ala, USA; 1997:508-573.
van Heuven VJ, van Leyden K: A contrastive acoustical investigation of Orkney and Shetland intonation. In Proceedings of the 15th International Congress of Phonetic Sciences, 2003, Barcelona, Spain Edited by: Solé MJ, Recasens D, Romero J. 805-808.
Grabe E, Post B, Nolan F: The IViE corpus. Department of Linguistics, University of Cambridge, 2001 http://www.phon.ox.ac.uk/~esther/ivyweb
Bent T, Bradlow AR: The interlanguage speech intelligibility benefit. Journal of the Acoustical Society of America 2003,114(3):1600-1610. 10.1121/1.1603234
Ikeno A, Hansen JHL: Perceptual recognition cues in native English accent variation: "listener accent, perceived accent, and comprehension". Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '06), May 2006, Toulouse, France 1: 401-404.
Angkititrakul P, Hansen JHL, Baghaii S: Cluster-dependent modeling and confidence measure processing for in-set/out-of-set speaker identification. Proceedings of International Conference on Spoken Language Processing (ICSLP '04), October 2004, Jeju Island, South Korea 1-4.
Huggins AWF, Patel Y: The use of shibboleth words for automatically classifying speakers by dialect. Proceedings of the International Conference on Spoken Language Processing (ICSLP '96), October 1996, Philadelphia, Pa, USA 4: 2017-2020.
Nolan F: Intonation in speaker identification: an experiment on pitch alignment features. Forensic Linguistics 2002,9(1):1-21. 10.1558/sll.2002.9.1.1
About this article
Cite this article
Ikeno, A., Hansen, J.H. The Effect of Listener Accent Background on Accent Perception and Comprehension. J AUDIO SPEECH MUSIC PROC. 2007, 076030 (2007) doi:10.1155/2007/76030
- Speech Recognition
- Complex Nature
- Cognitive Processing
- Human Communication