Please use this identifier to cite or link to this item:
http://ir.lib.seu.ac.lk/handle/123456789/6168
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Shafana, A. R. F. | - |
dc.contributor.author | Nihla, M. I. F. | - |
dc.contributor.author | Musfira, A. F. | - |
dc.contributor.author | Naja, M. M. F. | - |
dc.date.accessioned | 2022-07-06T10:25:56Z | - |
dc.date.available | 2022-07-06T10:25:56Z | - |
dc.date.issued | 2022-05-25 | - |
dc.identifier.citation | Book of Abstracts - Proceedings of the 10th International Symposium 2022 on "Multidisciplinary Research for Encountering Contemporary Challenges”. 25th May 2022. South Eastern University of Sri Lanka, Oluvil, Sri Lanka. pp. 42. | en_US |
dc.identifier.isbn | 978-624-5736-37-9 | - |
dc.identifier.uri | http://ir.lib.seu.ac.lk/handle/123456789/6168 | - |
dc.description.abstract | The proliferation of social media enables the public to express their views and perceptions readily online. Twitter is one such platform that helps in obtaining a huge amount of textual data and performing useful analysis. Sentiment Classification is one such analysis undertaken to gain insights into public opinion on a certain topic. Although this has been prevalently done using many approaches, the limitations still exist in non-English languages. This study aims to compare the use of the lexical-based approach and machine learning-based approach for classifying the Tamil tweets based on their sentiment. Twitter API was used to perform twitter scraping that resulted in 45852 tweets in total. 300 random tweets were then classified to their respective sentiments by subject experts in the field, this annotated data was used as ground truth and 06 underlying studies were performed on the processed and cleaned data. Four machine learning algorithms (Support Vector Machine, eXtreme Gradient Boosting, Random Forest, and Gaussian Naïve Bayes) and two lexical-based analyzers (VADER and TextBlob) were used for this comparative analysis. The results suggested that the machine learning algorithms performed extremely well where the Support Vector Machine secured the best performance score of all. This study serves as empirical evidence for those interested in performing sentiment analysis on Tamil language tweets. | en_US |
dc.language.iso | en_US | en_US |
dc.publisher | South Eastern University of Sri Lanka, Oluvil, Sri Lanka. | en_US |
dc.subject | Machine Learning | en_US |
dc.subject | Sentiment Analysis | en_US |
dc.subject | Lexicon | en_US |
dc.subject | en_US | |
dc.subject | Supervised Learning | en_US |
dc.title | Comparative analysis of machine learning algorithms along with lexical analyzers for sentiment analysis in Tamil Language | en_US |
dc.type | Article | en_US |
Appears in Collections: | 10th International Symposium - 2022 |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
IntSym2022BookofAbstracts-42.pdf | 348.02 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.