Skip to main content

The Effect of Listener Accent Background on Accent Perception and Comprehension


Variability of speaker accent is a challenge for effective human communication as well as speech technology including automatic speech recognition and accent identification. The motivation of this study is to contribute to a deeper understanding of accent variation across speakers from a cognitive perspective. The goal is to provide perceptual assessment of accent variation in native and English. The main focus is to investigate how listener's accent background affects accent perception and comprehensibility. The results from perceptual experiments show that the listeners' accent background impacts their ability to categorize accents. Speaker accent type affects perceptual accent classification. The interaction between listener accent background and speaker accent type is significant for both accent perception and speech comprehension. In addition, the results indicate that the comprehensibility of the speech contributes to accent perception. The outcomes point to the complex nature of accent perception, and provide a foundation for further investigation on the involvement of cognitive processing for accent perception. These findings contribute to a richer understanding of the cognitive aspects of accent variation, and its application for speech technology.



  1. 1.

    Crystal D: A Dictionary of Linguistics and Phonetics. Blackwell, Malden, Mass, USA; 1997.

    Google Scholar 

  2. 2.

    Markman AB, Ross BH: Category use and category learning. Psychological Bulletin 2003,129(4):592-613.

    Article  Google Scholar 

  3. 3.

    Ross BH: The use of categories affects classification. Journal of Memory and Language 1997,37(2):240-267. 10.1006/jmla.1997.2515

    Article  Google Scholar 

  4. 4.

    Lucy JA, Gaskins S: Grammatical categories and the development of classification preferences: a comparative approach. In Language Acquisition and Conceptual Development. Edited by: Levinson SC, Bowerman M. Cambridge University Press, New York, NY, USA; 2001:257-283.

    Google Scholar 

  5. 5.

    Lucy JA, Gaskins S: Interaction of language type and referent type in the development of nonverbal classification preferences. In Language in Mind. Edited by: Gentner D, Goldin-Meadow S. MIT Press, Cambridge, Mass, USA; 2003:465-492.

    Google Scholar 

  6. 6.

    Yoshida H, Smith LB: Shifting ontological boundaries: how Japanese- and English-speaking children generalize names for animals and artifacts. Developmental Science 2003,6(1):1-17. 10.1111/1467-7687.00247_1

    Article  Google Scholar 

  7. 7.

    Sandhofer CM, Smith LB: Perceptual complexity and form class cues in novel word extension tasks: how 4-year-old children interpret adjectives and count nouns. Developmental Science 2004,7(3):378-388. 10.1111/j.1467-7687.2004.00354.x

    Article  Google Scholar 

  8. 8.

    Clopper CG: Linguistic experience and the perceptual classification of dialect variation, Ph.D. dissertation, Indiana University, Bloomington, Ind, USA, 2004.

    Google Scholar 

  9. 9.

    Jilka M: The contribution of intonation to the perception of foreign accent, Doctoral dissertation, Arbeiten des Instituts f¨urMaschinelle Sprachverarbeitung (AIMS),University of Stuttgart, Stuttgart, Germany, 2000.

    Google Scholar 

  10. 10.

    Munro M: Non-segmental factors in foreign accent: ratings of filtered speech. Studies in Second Language Acquisition 1995,17(1):17-34. 10.1017/S0272263100013735

    Article  Google Scholar 

  11. 11.

    Carmichael L: Measurable degrees of foreign accent: a correlational study of production, perception, and acquisition, Unpublished M.A. thesis, University of Washington, Seattle, Wash, USA, 2000.

    Google Scholar 

  12. 12.

    Flege JE: The production of "new" and "similar" phones in a foreign language: evidence for the effect of equivalence classification. Journal of Phonetics 1978, 15: 47-65.

    Google Scholar 

  13. 13.

    Flege JE, Munro MJ, MacKay IRA: Factors affecting strength of perceived foreign accent in a second language. Journal of the Acoustical Society of America 1995,97(5):3125-3134. 10.1121/1.413041

    Article  Google Scholar 

  14. 14.

    Magen HS: The perception of foreign-accented speech. Journal of Phonetics 1998,26(4):381-400. 10.1006/jpho.1998.0081

    Article  Google Scholar 

  15. 15.

    Davies C, Tyler A: Demystifying cross-cultural (mis) communication: improving performance through balanced feedback of international teaching assistants. 1994.

    Google Scholar 

  16. 16.

    Hahn L: Native speakers' reaction to non-native stress in English discourse, Unpublished doctoral dissertation, University of Illinois, Urbana-Champaign, Ill, USA, 1999.

    Google Scholar 

  17. 17.

    Pickering L: The analysis of prosodic systems in the classroom discourse of NS and NNS teaching assistants, Doctoral dissertation. University of Florida, Gainesville, Fla, USA, 1999.

    Google Scholar 

  18. 18.

    Pickering L, Wiltshire C: Pitch accent in Indian-English teaching discourse. World Englishes 2000,19(2):173-183. 10.1111/1467-971X.00167

    Article  Google Scholar 

  19. 19.

    Pickering L: The role of tone choice in improving ITA communication in the classroom. TESOL Quarterly 2001,35(2):233-255. 10.2307/3587647

    MathSciNet  Article  Google Scholar 

  20. 20.

    Tyler A, Jeffries A, Davies C: The effect of discourse structuring devices on listener perceptions of coherence in non-native university teachers' spoken discourse. World Englishes 1988,7(2):101-110. 10.1111/j.1467-971X.1988.tb00223.x

    Article  Google Scholar 

  21. 21.

    Wennerstrom A: Intonational meaning in English discourse: a study of non-native speakers. Applied Linguistics 1994,15(4):399-420. 10.1093/applin/15.4.399

    Article  Google Scholar 

  22. 22.

    Wennerstrom A: Intonation as cohesion in academic discourse: a study of Chinese speakers of English. Studies in Second Language Acquisition 1998,20(1):1-25.

    Article  Google Scholar 

  23. 23.

    Wennerstrom A: The role of intonation in second language fluency. In Perspectives on Fluency. The University of Michigan Press, Ann Arbor, Mich, USA; 2000:102-127.

    Google Scholar 

  24. 24.

    Wennerstrom A: The Music of Everyday Speech: Prosody and Discourse Analysis. Oxford University Press, New York, NY, USA; 2001.

    Google Scholar 

  25. 25.

    Angkititrakul P, Hansen JHL: Advances in phone-based modeling for automatic accent classification. IEEE Transactions on Audio, Speech and Language Processing 2006,14(2):634-646.

    Article  Google Scholar 

  26. 26.

    Byrne W, Knodt E, Khudanpur S, Bernstein J: Is automatic speech recognition ready for non-native speech? A data collection effort and initial experiments in modeling conversational hispanic english. Proceedings of Conference on Speech Technology in Language Learning (ESCA-ITR &), 1998, Marholmen, Sweden

    Google Scholar 

  27. 27.

    Faria A: Accent classification for speech recognition. Proceedings of the 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI '05), July 2005, Edinburgh, UK

    Google Scholar 

  28. 28.

    Ikeno A, Pellom B, Cer D, et al.: Issues in recognition of Spanish-accented spontaneous English. Proceedings of Workshop on Spontaneous Speech Processing and Recognition (ISCA '03), 2003, Tokyo, Japan 63-66.

    Google Scholar 

  29. 29.

    Tomokiyo LM, Waibel A: Adaptation methods for non-native speech. Proceedings of Multilinguality in Spoken Language Processing, September 2001, Aalborg, Denmark

    Google Scholar 

  30. 30.

    Ward W, Krech H, Yu X, et al.: Lexicon adaptation for LCVSR: speaker idiosyncracies, non-native speakers, and pronunciation choice. Proceedings of Workshop on Pronunciation Modeling and Lexicon Adaptation for Spoken Language Technology (PLMA '02), 2002, Estes Park, Colo, USA 83-88.

    Google Scholar 

  31. 31.

    Hansen JHL, Huang R, Mangalath P, Zhou B, Seadle M, Deller JR Jr.: SPEECHFIND: spoken document retrieval for a national gallery of the spoken word. Proceedings of the 6th Nordic Signal Processing Simposium (NORSIG '04), April 2004, Espoo, Finland 1-4.

    Google Scholar 

  32. 32.

    Hansen JHL, Yapanel U, Huang R, Ikeno A: Dialect analysis and modeling for automatic classification. Proceedings of International Conference on Spoken Language Processing (ICSLP '04), October 2004, Jeju Island, Korea 1-4.

    Google Scholar 

  33. 33.

    Tjalve M, Huckvale M: Pronunciation variation modelling using accent features. Proceedings of the 9th European Conference on Speech Communication and Technology (InterSpeech '05), September 2005, Lisbon, Portugal 1341-1344.

    Google Scholar 

  34. 34.

    Clopper CG, Pisoni DB: Effects of talker variability on perceptual learning of dialects. Language and Speech 2004,47(3):207-239. 10.1177/00238309040470030101

    Article  Google Scholar 

  35. 35.

    Clopper CG, Pisoni DB: Homebodies and army brats: some effects of early linguistic experience and residential history on dialect categorization. Language Variation and Change 2004, 16: 31-48.

    Article  Google Scholar 

  36. 36.

    Clopper CG, Pisoni DB: Some acoustic cues for the perceptual categorization of American English regional dialects. Journal of Phonetics 2004,32(1):111-140. 10.1016/S0095-4470(03)00009-3

    Article  Google Scholar 

  37. 37.

    Evans BG, Iverson P: Vowel normalization for accent: an investigation of best exemplar locations in northern and southern British English sentences. Journal of the Acoustical Society of America 2004,115(1):352-361. 10.1121/1.1635413

    Article  Google Scholar 

  38. 38.

    Labov W, Sharon A: Understanding birmingham. In Language Variety in the South Revisited. Edited by: Bernstein C, Nunnally T, Sabino R. University of Alabama Press, Tuscaloosa, Ala, USA; 1997:508-573.

    Google Scholar 

  39. 39.

    van Heuven VJ, van Leyden K: A contrastive acoustical investigation of Orkney and Shetland intonation. In Proceedings of the 15th International Congress of Phonetic Sciences, 2003, Barcelona, Spain Edited by: Solé MJ, Recasens D, Romero J. 805-808.

    Google Scholar 

  40. 40.

    Grabe E, Post B, Nolan F: The IViE corpus. Department of Linguistics, University of Cambridge, 2001

    Google Scholar 

  41. 41.

    Bent T, Bradlow AR: The interlanguage speech intelligibility benefit. Journal of the Acoustical Society of America 2003,114(3):1600-1610. 10.1121/1.1603234

    Article  Google Scholar 

  42. 42.

    Ikeno A, Hansen JHL: Perceptual recognition cues in native English accent variation: "listener accent, perceived accent, and comprehension". Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '06), May 2006, Toulouse, France 1: 401-404.

    Google Scholar 

  43. 43.

    Angkititrakul P, Hansen JHL, Baghaii S: Cluster-dependent modeling and confidence measure processing for in-set/out-of-set speaker identification. Proceedings of International Conference on Spoken Language Processing (ICSLP '04), October 2004, Jeju Island, South Korea 1-4.

    Google Scholar 

  44. 44.

    Huggins AWF, Patel Y: The use of shibboleth words for automatically classifying speakers by dialect. Proceedings of the International Conference on Spoken Language Processing (ICSLP '96), October 1996, Philadelphia, Pa, USA 4: 2017-2020.

    Article  Google Scholar 

  45. 45.

    Nolan F: Intonation in speaker identification: an experiment on pitch alignment features. Forensic Linguistics 2002,9(1):1-21. 10.1558/sll.2002.9.1.1

    Google Scholar 

Download references

Author information



Corresponding author

Correspondence to Ayako Ikeno.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Ikeno, A., Hansen, J.H. The Effect of Listener Accent Background on Accent Perception and Comprehension. J AUDIO SPEECH MUSIC PROC. 2007, 076030 (2007).

Download citation


  • Acoustics
  • Speech Recognition
  • Complex Nature
  • Cognitive Processing
  • Human Communication