Please use this identifier to cite or link to this item: http://ir.lib.seu.ac.lk/handle/123456789/2633
Title: Design and development of automatic speech recognition system for Tamil language using CMU Sphinx 4
Authors: Kalith, IM.
Keywords: Speech recognition
CMU Sphinx 4
Tamil language
Issue Date: 28-Mar-2012
Publisher: Faculty of Applied Sciences,South Eastern University of Sri Lanka
Citation: Empowering regional development through science and technology First Annual Science Research Session -2012
Abstract: This paper presents a design and development of Speech Recognition System for Tamil language. This system is based on CMU Sphinx 4 open source speech recognition (ASR) engine developed by Carnegie Mellon University. This system should be adapted to speaker specific automatic, continuous speech. One of the main components of this system is a core Tamil speech recognition system that can be trained with field specific data. The target domain is the accent spoken by illiterate Tamil-speaker from Eastern area of Sri Lanka. The phonetically rich and balanced sentence text corpus were developed and recorded in conditional environment to set up speaker specific speech corpus. Using this speech corpus the system was trained and tested with speaker specific (testing with same word uttered by same person) and speaker independent data (testing with different word uttered by different person). The system currently gives a satisfactory peak performance of 39.5% Word Error Rate (WER) for speaker specific and unsatisfactory rate for speaker independent data, which is comparable with the best word error rates of most of the recognition systems for continuous speech available for any language.
URI: http://ir.lib.seu.ac.lk/handle/123456789/2633
ISBN: 9789556270273
Appears in Collections:ASRS - FAS 2012

Files in This Item:
File Description SizeFormat 
33.pdf114.15 kBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.