Speech recognition system for Tamil language using Cmusphinx

FANOON, A. R. F. S.; UWANTHIKA, G. A. I.

dc.contributor.author	FANOON, A. R. F. S.
dc.contributor.author	UWANTHIKA, G. A. I.
dc.date.accessioned	2019-12-14T09:05:34Z
dc.date.available	2019-12-14T09:05:34Z
dc.date.issued	2019-11-27
dc.identifier.citation	9th International Symposium 2019 on “Promoting Multidisciplinary Academic Research and Innovation”. 27th - 28th November 2019. South Eastern University of Sri Lanka, University Park, Oluvil, Sri Lanka. pp. 666-674.	en_US
dc.identifier.isbn	978-955-627-189-8
dc.identifier.uri	http://ir.lib.seu.ac.lk/handle/123456789/4094
dc.description.abstract	Speech recognition is the process of converting an audio signal into sequences of words paving the way for enabling a rich human-computer interaction in many emerging applications. It is a very difficult and a complex task to recognize speech using a computer though several approaches have been made to develop an accurate speech recognition system. Most of the researches have been done for English, Chinese, mandarin and Arabic languages while a very few have been done for Tamil language. Therefore, this research intends to develop a continuous speech recognition system for Tamil language. Tamil language has 12 vowels and 18 consonants. This paper is focused on building a Tamil speech recognition system using Cmusphinx toolkit. The system was developed for several desktop applications with the aim that in the future, this system could be integrated with those applications and executes those applications through Tamil speech input rather than typing and Clickin. Audio recordings from several people were recorded in different environment which were later converted to wav file format. A text corpus, transcription file and fileids file for both training and testing database were prepared accordingly. Three models namely acoustic model, language model and lexicon model have been developed. Hidden Markov model was employed for building acoustic model. The performance of the system over various speaker subsets of different sex, age and dialect was examined. The word error rate was calculated in order to measure the performance of the system while the accuracy of the system was calculated using another formula. The results from both the measurements showed a satisfactory performance.	en_US
dc.language.iso	en_US	en_US
dc.publisher	South Eastern University of Sri Lanka, University Park, Oluvil, Sri Lanka.	en_US
dc.subject	Human-computer interaction	en_US
dc.subject	Hidden Markov Model	en_US
dc.subject	Acoustic model	en_US
dc.subject	Word error rate	en_US
dc.title	Speech recognition system for Tamil language using Cmusphinx	en_US
dc.type	Article	en_US