Please use this identifier to cite or link to this item: http://ir.lib.seu.ac.lk/handle/123456789/5812
Full metadata record
DC FieldValueLanguage
dc.contributor.authorShafana, M.S.-
dc.contributor.authorRagel, R.G.-
dc.contributor.authorKumara, T.N.-
dc.date.accessioned2021-10-08T04:12:14Z-
dc.date.available2021-10-08T04:12:14Z-
dc.date.issued2021-09-14-
dc.identifier.citationJournal of the National Science Foundation of Sri Lanka, 49(2), pp.195–208en_US
dc.identifier.issn2362-0161-
dc.identifier.urihttp://doi.org/10.4038/jnsfsr.v49i2.9466-
dc.identifier.urihttp://ir.lib.seu.ac.lk/handle/123456789/5812-
dc.description.abstractSelection of features for extraction and classification are the essential factors in achieving high performance in character recognition. Feature extraction process produces feature vectors that define the shape and characteristics of the pattern to identify them uniquely. Many feature extraction and classification approaches are available for Tamil and other languages, but there is still room to identify a better set of features for extraction to obtain higher recognition rate of Optical Character Recognition (OCR) for Tamil printed text. This research aims at producing an efficient set of features for extraction, which is capable of increasing the accuracy and reducing the runtime to improve the performance of the best OCR system to classify isolated Tamil printed characters. The proposed set of features is experimented on a large dataset using One-versus-All (OVA) Support Vector Machine (SVM). Two types of the pool of different feature vectors are created with features used in this study such as basic, density, histogram oriented gradients (HOG), and transition. In comparison with the current best approach, the testing results of Pool 1 gives better recognition accuracy of 94.87 % for OVA SVM and 97.07 % for the Unbalanced Decision Tree (UDT) SVM algorithms, but could not reach an improved recognition speed. Likewise, the results of Pool 2 improves the performance of the system by giving not only better recognition accuracy of 94.30 % for OVA SVM and 96.35% for the UDT SVM algorithms but also reached an improved recognition speed than the selected best OCR approach. The proposed set of features improves the recognition rate by 2.57–3.14% on OVA SVM and 3.22–3.94% on UDT SVM.en_US
dc.language.isoenen_US
dc.publisherNational Science Foundation of Sri Lankaen_US
dc.subjectBasic featuresen_US
dc.subjectfeature extractionen_US
dc.subjectOCRen_US
dc.subjectOVA SVMen_US
dc.subjectTamil character recognitionen_US
dc.subjectUDT SVMen_US
dc.titleAn effective feature set for enhancing printed Tamil character recognitionen_US
dc.typeArticleen_US
Appears in Collections:Research Articles

Files in This Item:
File Description SizeFormat 
9466-41414-1-PB(1).pdf2 MBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.