Please use this identifier to cite or link to this item: http://ir.lib.seu.ac.lk/handle/123456789/4094
Title: Speech recognition system for Tamil language using Cmusphinx
Authors: FANOON, A. R. F. S.
UWANTHIKA, G. A. I.
Keywords: Human-computer interaction
Hidden Markov Model
Acoustic model
Word error rate
Issue Date: 27-Nov-2019
Publisher: South Eastern University of Sri Lanka, University Park, Oluvil, Sri Lanka.
Citation: 9th International Symposium 2019 on “Promoting Multidisciplinary Academic Research and Innovation”. 27th - 28th November 2019. South Eastern University of Sri Lanka, University Park, Oluvil, Sri Lanka. pp. 666-674.
Abstract: Speech recognition is the process of converting an audio signal into sequences of words paving the way for enabling a rich human-computer interaction in many emerging applications. It is a very difficult and a complex task to recognize speech using a computer though several approaches have been made to develop an accurate speech recognition system. Most of the researches have been done for English, Chinese, mandarin and Arabic languages while a very few have been done for Tamil language. Therefore, this research intends to develop a continuous speech recognition system for Tamil language. Tamil language has 12 vowels and 18 consonants. This paper is focused on building a Tamil speech recognition system using Cmusphinx toolkit. The system was developed for several desktop applications with the aim that in the future, this system could be integrated with those applications and executes those applications through Tamil speech input rather than typing and Clickin. Audio recordings from several people were recorded in different environment which were later converted to wav file format. A text corpus, transcription file and fileids file for both training and testing database were prepared accordingly. Three models namely acoustic model, language model and lexicon model have been developed. Hidden Markov model was employed for building acoustic model. The performance of the system over various speaker subsets of different sex, age and dialect was examined. The word error rate was calculated in order to measure the performance of the system while the accuracy of the system was calculated using another formula. The results from both the measurements showed a satisfactory performance.
URI: http://ir.lib.seu.ac.lk/handle/123456789/4094
ISBN: 978-955-627-189-8
Appears in Collections:9th International Symposium - 2019

Files in This Item:
File Description SizeFormat 
Final Proceedings - Page 685-693.pdf691.9 kBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.