TY - JOUR AU - Egorova, E. AU - Serrano, J. L. PY - 2016 DA - 2016// TI - Semi-supervised training of language model on spanish conversational telephone speech data JO - Procedia Comput. Sci VL - 81 UR - https://doi.org/10.1016/j.procs.2016.04.038 DO - 10.1016/j.procs.2016.04.038 ID - Egorova2016 ER - TY - STD TI - S Novotney, R Schwartz, J Ma, in Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference On. Unsupervised acoustic and language model training with small amounts of labelled data (IEEE, 2009), pp. 4297–4300. https://scholar.google.co.th/scholar?hl=th&as_sdt=0%2C5&q=Unsupervised+acoustic+and+language+model+training+with+small+amounts+of+labelled+data&btnG=. UR - https://scholar.google.co.th/scholar?hl=th&as_sdt=0%2C5&q=Unsupervised+acoustic+and+language+model+training+with+small+amounts+of+labelled+data&btnG= ID - ref2 ER - TY - JOUR AU - Yu, K. AU - Gales, M. AU - Wang, L. AU - Woodland, P. C. PY - 2010 DA - 2010// TI - Unsupervised training and directed manual transcription for lvcsr JO - Speech Commun VL - 52 UR - https://doi.org/10.1016/j.specom.2010.02.014 DO - 10.1016/j.specom.2010.02.014 ID - Yu2010 ER - TY - STD TI - J Gao, J Goodman, M Li, K-F Lee, Toward a unified approach to statistical language modeling for chinese. 1(1), 3–33 (2002). https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Toward+a+unified+approach+to+statistical+language+modeling+for+chinese&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Toward+a+unified+approach+to+statistical+language+modeling+for+chinese&btnG= ID - ref4 ER - TY - STD TI - T Misu, T Kawahara, in Interspeech. A bootstrapping approach for developing language model of new spoken dialogue systems by selecting web text, (2006), pp. 9–13. https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=A+bootstrapping+approach+for+developing+language+model+of+new+spoken+dialogue+systems+by+selecting+web+text&btnG=.
UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=A+bootstrapping+approach+for+developing+language+model+of+new+spoken+dialogue+systems+by+selecting+web+text&btnG= ID - ref5 ER - TY - STD TI - RC Moore, W Lewis, in Proceedings of the ACL 2010 Conference Short Papers. Intelligent selection of language model training data, (2010), pp. 220–224. https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Intelligent+selection+of+language+model+training+data&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Intelligent+selection+of+language+model+training+data&btnG= ID - ref6 ER - TY - STD TI - A Axelrod, X He, J Gao, in Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP ’11. Domain adaptation via pseudo in-domain data selection, (2011), pp. 355–362. https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Domain+adaptation+via+pseudo+in-domain+data+selection&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Domain+adaptation+via+pseudo+in-domain+data+selection&btnG= ID - ref7 ER - TY - STD TI - A Sethy, P Georgiou, SS Narayanan, in Proceedings of the Human Language Technologies (HLT) Conference. Selecting relevant text subsets from web-data for building topic specific language models (New York City, 2006), pp. 145–148. https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Selecting+relevant+text+subsets+from+webdata+for+building+topic+specific+language+models&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Selecting+relevant+text+subsets+from+webdata+for+building+topic+specific+language+models&btnG= ID - ref8 ER - TY - STD TI - A Jaech, M Ostendorf, Leveraging twitter for low-resource conversational speech language modeling. arXiv preprint arXiv:1504.02490 (2015). https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Leveraging+twitter+for+lowresource+conversational+speech+language+modeling&btnG=. 
UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Leveraging+twitter+for+lowresource+conversational+speech+language+modeling&btnG= ID - ref9 ER - TY - STD TI - A Prasithrathsint, Sociolinguistic Research on Thailand Languages. Language Sciences, (1998). (https://www.sciencedirect.com/science/article/pii/0388000188900174). UR - https://www.sciencedirect.com/science/article/pii/0388000188900174 ID - ref10 ER - TY - STD TI - A Chotimongkol, K Thangthai, C Wutiwiwatchai, in Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014 17th Oriental Chapter of the International Committee for The. Utilizing social media data through similarity-based text normalization for lvcsr language modeling, (2014), pp. 1–6. https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Utilizing+social+media+data+through+similaritybased+text+normalization+for+lvcsr+language+modeling&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Utilizing+social+media+data+through+similaritybased+text+normalization+for+lvcsr+language+modeling&btnG= ID - ref11 ER - TY - STD TI - S Kasuriya, V Sornlertlamvanich, P Cotsomrong, S Kanokphara, N Thatphithakkul, in Oriental COCOSDA. Thai speech corpus for Thai speech recognition, (2003), pp. 54–61. https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Thai+speech+corpus+for+Thai+speech+recognition&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Thai+speech+corpus+for+Thai+speech+recognition&btnG= ID - ref12 ER - TY - STD TI - A Chotimongkol, N Thatphithakkul, S Purodakananda, C Wutiwiwatchai, P Chootrakool, C Hansakunbuntheung, A Suchato, P Boonpramuk, in Oriental COCOSDA Held Jointly with 2010 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2010 International Conference. The development of a large thai telephone speech corpus: Lotus-cell 2.0, (2010).
https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=The+development+of+a+large+thai+telephone+speech+corpus%3A+Lotus-cell+2.0&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=The+development+of+a+large+thai+telephone+speech+corpus%3A+Lotus-cell+2.0&btnG= ID - ref13 ER - TY - STD TI - A Chotimongkol, V Chunwijitra, S Thatphithakkul, N Kurpukdee, C Wutiwiwatchai, in Oriental COCOSDA Held Jointly with 2015 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2015 International Conference. Elicit spoken-style data from social media through a style classifier, (2015), pp. 7–12. https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Elicit+spokenstyle+data+from+social+media+through+a+style+classifier.&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Elicit+spokenstyle+data+from+social+media+through+a+style+classifier.&btnG= ID - ref14 ER - TY - STD TI - P Chootrakool, V Chunwijitra, P Sertsi, S Kasuriya, C Wutiwiwatchai, in Oriental COCOSDA Held Jointly with 2016 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2016 International Conference. Lotus-soc: A social media speech corpus for Thai lvcsr in noisy environments, (2016). https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Lotus-soc%3A+A+social+media+speech+corpus+for+Thai+lvcsr+in+noisy+environments.&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Lotus-soc%3A+A+social+media+speech+corpus+for+Thai+lvcsr+in+noisy+environments.&btnG= ID - ref15 ER - TY - STD TI - V Sornlertlamvanich, N Takahashi, H Isahara, in Proc. Oriental COCOSDA 1998. Thai part-of-speech tagged corpus: ORCHID, (1998), pp. 131–138. http://www.academia.edu/1215347/ORCHID_Thai_part-of-speech_tagged_corpus. UR - http://www.academia.edu/1215347/ORCHID_Thai_part-of-speech_tagged_corpus ID - ref16 ER - TY - STD TI - C Haruechaiyasak, S Kongyoung, in Proc. of SNLP.
Tlex: Thai lexeme analyser based on the conditional random fields, (2009). https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Thai+lexeme+analyser+based+on+the+conditional+random+fields&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Thai+lexeme+analyser+based+on+the+conditional+random+fields&btnG= ID - ref17 ER - TY - STD TI - T Joachims, Advances in kernel methods, (1999). Chap. Making Large-scale Support Vector Machine Learning Practical. https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Advances+in+kernel+methods+Joachims&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Advances+in+kernel+methods+Joachims&btnG= ID - ref18 ER - TY - STD TI - JD Lafferty, A McCallum, FCN Pereira, in Proceedings of the Eighteenth International Conference on Machine Learning, ICML ’01. Conditional random fields: Probabilistic models for segmenting and labeling sequence data, (2001), pp. 282–289. https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Conditional+random+fields%3A+Probabilistic+models+for+segmenting+and+labeling+sequence+data.&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Conditional+random+fields%3A+Probabilistic+models+for+segmenting+and+labeling+sequence+data.&btnG= ID - ref19 ER - TY - JOUR AU - Hochreiter, S. AU - Schmidhuber, J. PY - 1997 DA - 1997// TI - Long short-term memory JO - Neural Comput. VL - 9 UR - https://doi.org/10.1162/neco.1997.9.8.1735 DO - 10.1162/neco.1997.9.8.1735 ID - Hochreiter1997 ER - TY - STD TI - K Thangthai, A Chotimongkol, C Wutiwiwatchai, in INTERSPEECH. A hybrid language model for open-vocabulary Thai LVCSR, (2013), pp. 2207–2211. https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=A+hybrid+language+model+for+open-vocabulary+Thai+LVCSR&btnG=. 
UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=A+hybrid+language+model+for+open-vocabulary+Thai+LVCSR&btnG= ID - ref21 ER - TY - STD TI - M Yang, H Jiang, T Zhao, S Li, in Chinese Spoken Language Processing: 5th International Symposium, ISCSLP 2006, Singapore, December 13-16, 2006. Proceedings. Construct trilingual parallel corpus on demand, (2006), pp. 760–767. https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Construct+trilingual+parallel+corpus+on+demand&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Construct+trilingual+parallel+corpus+on+demand&btnG= ID - ref22 ER - TY - STD TI - G Kikui, E Sumita, T Takezawa, S Yamamoto, in INTERSPEECH. Creating corpora for speech-to-speech translation, (2003). https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Creating+corpora+for+speech-to-speech+translation&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Creating+corpora+for+speech-to-speech+translation&btnG= ID - ref23 ER - TY - STD TI - K Kosawat, M Boriboon, P Chootrakool, A Chotimongkol, S Klaithin, S Kongyoung, K Kriengket, S Phaholphinyo, S Purodakananda, T Thanakulwarapas, C Wutiwiwatchai, in SNLP. BEST 2009: Thai word segmentation software contest, (2009), pp. 83–88. https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=BEST+2009%3A+Thai+word+segmentation+software+contest&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=BEST+2009%3A+Thai+word+segmentation+software+contest&btnG= ID - ref24 ER - TY - STD TI - A Chotimongkol, K Saykhum, P Chootrakool, N Thatphithakkul, C Wutiwiwatchai, in Oriental COCOSDA. LOTUS-BN: A Thai broadcast news corpus and its research applications, (2009), pp. 44–50. https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=LOTUS-BN%3A+A+Thai+broadcast+news+corpus+and+its+research+applications&btnG=. 
UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=LOTUS-BN%3A+A+Thai+broadcast+news+corpus+and+its+research+applications&btnG= ID - ref25 ER - TY - STD TI - P Boonkwan, Part-of-speech tagging guidelines for Thai. National Electronics and Computer Technology, 1–34 (2012). ID - ref26 ER - TY - STD TI - DP Kingma, J Ba, Adam: A method for stochastic optimization. CoRR. abs/1412.6980: (2014). https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=A+method+for+stochastic+optimization&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=A+method+for+stochastic+optimization&btnG= ID - ref27 ER - TY - STD TI - D Povey, A Ghoshal, G Boulianne, L Burget, O Glembek, N Goel, M Hannemann, P Motlicek, Y Qian, P Schwarz, J Silovsky, G Stemmer, K Vesely, in IEEE 2011 Workshop on Automatic Speech Recognition and Understanding. The Kaldi speech recognition toolkit, (2011). https://infoscience.epfl.ch/record/192584. UR - https://infoscience.epfl.ch/record/192584 ID - ref28 ER - TY - STD TI - L Bahl, P Brown, P de Souza, R Mercer, in Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP ’86, 11. Maximum mutual information estimation of hidden markov model parameters for speech recognition, (1986), pp. 49–52. https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Maximum+mutual+information+estimation+of+hidden+markov+model+parameters+for+speech+recognition&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Maximum+mutual+information+estimation+of+hidden+markov+model+parameters+for+speech+recognition&btnG= ID - ref29 ER - TY - STD TI - A Stolcke, in Proc. of the International Conference on Spoken Language Processing (ICSLP). SRILM - an extensible language modeling toolkit, (2002), pp. 901–904. https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=SRILM+-+an+extensible+language+modeling+toolkit&btnG=. 
UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=SRILM+-+an+extensible+language+modeling+toolkit&btnG= ID - ref30 ER - TY - STD TI - R Schwartz, L Nguyen, F Kubala, G Chou, G Zavaliagkos, J Makhoul, in Proceedings of the Workshop on Human Language Technology. On using written language training data for spoken language modeling, (1994), pp. 94–98. https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=On+using+written+language+training+data+for+spoken+language+modeling&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=On+using+written+language+training+data+for+spoken+language+modeling&btnG= ID - ref31 ER - TY - STD TI - Y Akita, T Kawahara, in 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP ’07, vol. 4. Topic-independent speaking-style transformation of language model for spontaneous speech recognition, (2007), pp. 33–36. https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Topicindependent+speakingstyle+transformation+of+language+model+for+spontaneous+speech+recognition.&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Topicindependent+speakingstyle+transformation+of+language+model+for+spontaneous+speech+recognition.&btnG= ID - ref32 ER - TY - STD TI - R Masumura, S Hahm, A Ito, in Interspeech. Training a language model using web data for large vocabulary japanese spontaneous speech recognition, (2011), pp. 1465–1468. https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Training+a+language+model+using+web+data+for+large+vocabulary+japanese+spontaneous+speech+recognition.&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Training+a+language+model+using+web+data+for+large+vocabulary+japanese+spontaneous+speech+recognition.&btnG= ID - ref33 ER - TY - STD TI - S Burusphat, Speech analysis: Nakhonpathom discourse analysis. Research Institute for Languages and Culture for rural development Mahidol University (1994). 
http://e-book.ram.edu/ebook/t/TH103/chapter11.pdf. UR - http://e-book.ram.edu/ebook/t/TH103/chapter11.pdf ID - ref34 ER - TY - STD TI - S Chodchoey, in Proc. of the Second International Symposium on Language and Linguistics. Spoken and written discourse in thai: The difference, (1998). https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Spoken+and+written+discourse+in+thai%3A+The+difference&btnG=. UR - https://scholar.google.com/scholar?hl=en&as_sdt=0%2C5&q=Spoken+and+written+discourse+in+thai%3A+The+difference&btnG= ID - ref35 ER -