Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition

Ito, Akinori; Kajiura, Yasutomo; Suzuki, Motoyuki; Makino, Shozo

doi:10.1155/2009/140575

Research Article
Open access
Published: 14 December 2009

Akinori Ito¹, Yasutomo Kajiura¹, Motoyuki Suzuki² & Shozo Makino¹

EURASIP Journal on Audio, Speech, and Music Processing volume 2009, Article number: 140575 (2009)

Abstract

We are developing a method of Web-based unsupervised language model adaptation for the recognition of spoken documents. The proposed method chooses keywords from the preliminary recognition result and retrieves Web documents using the chosen keywords. A problem is that the selected keywords tend to contain misrecognized words. The proposed method introduces two new ideas for avoiding the effects of keywords derived from misrecognized words. The first idea is to compose multiple queries from the selected keyword candidates so that misrecognized words and correct words do not fall into the same query. The second idea is to determine the number of Web documents downloaded for each query according to the "query relevance." By combining these two ideas, we can alleviate the adverse effect of misrecognized keywords, because fewer Web documents are downloaded for queries that contain misrecognized keywords. Finally, we examine a method of determining the number of iterative adaptations based on the recognition likelihood. Experiments have shown that the proposed stopping criterion can determine a nearly optimal number of iterations. In the final experiment, the word accuracy without adaptation (55.29%) was improved to 60.38%, which was 1.13 points better than the result of the conventional unsupervised adaptation method (59.25%).
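
As a rough illustration of the second idea and of the stopping criterion, the following Python sketch splits a fixed download budget across queries in proportion to a relevance score, and picks the adaptation iteration at which the recognition likelihood stops rising. It is a sketch under stated assumptions, not the authors' implementation: the abstract does not say how query relevance is computed or how it maps to a document count, so the proportional rule, the "stop at first non-increase" rule, and the names allocate_downloads and best_iteration are all hypothetical.

```python
def allocate_downloads(relevance, total_docs):
    """Split a download budget across queries in proportion to relevance.

    `relevance` maps each query to a non-negative score. The proportional
    rule is an assumption for illustration; the abstract only states that
    the document count depends on the query relevance.
    """
    total = sum(relevance.values())
    if total <= 0:
        # No query looks relevant: download nothing.
        return {query: 0 for query in relevance}
    return {
        query: int(total_docs * score / total)
        for query, score in relevance.items()
    }


def best_iteration(likelihoods):
    """Return the index of the last iteration before the likelihood stops rising.

    `likelihoods[i]` is the recognition likelihood after i adaptation
    iterations (index 0 means no adaptation). Stopping at the first
    non-increase is an assumed rule, not taken from the article.
    """
    best = 0
    for i in range(1, len(likelihoods)):
        if likelihoods[i] <= likelihoods[best]:
            break
        best = i
    return best
```

With this rule, a query whose keywords are likely misrecognized receives a low score and therefore contributes few documents to the adaptation corpus, which is the effect the abstract describes.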

Publisher note

To access the full article, please see PDF.

Author information

Authors and Affiliations

Graduate School of Engineering, Tohoku University, 6-6-05 Aramaki aza Aoba, Sendai, 980-8579, Japan
Akinori Ito, Yasutomo Kajiura & Shozo Makino
Institute of Technology and Science, University of Tokushima, 2-1 Minamijosanjima-cho, Tokushima, Tokushima, 770-8506, Japan
Motoyuki Suzuki

Corresponding author

Correspondence to Akinori Ito.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Ito, A., Kajiura, Y., Suzuki, M. et al. Automatic Query Generation and Query Relevance Measurement for Unsupervised Language Model Adaptation of Speech Recognition. J AUDIO SPEECH MUSIC PROC. 2009, 140575 (2009). https://doi.org/10.1155/2009/140575

Received: 03 December 2008
Revised: 20 May 2009
Accepted: 25 October 2009
Published: 14 December 2009
DOI: https://doi.org/10.1155/2009/140575
