SEUIR Repository

Comparative analysis of machine learning classification approaches for heart disease prediction: a study of preprocessing techniques and algorithms

Show simple item record

dc.contributor.author Karlavi, M. M.
dc.contributor.author Chatrabgoun, O.
dc.date.accessioned 2025-06-01T10:12:54Z
dc.date.available 2025-06-01T10:12:54Z
dc.date.issued 2024-11-06
dc.identifier.citation Conference Proceedings of 13th Annual Science Research Session – 2024 on “"Empowering Innovations for Sustainable Development Through Scientific Research" on November 6th 2024. Faculty of Applied Sciences, South Eastern University of Sri Lanka, Sammanthurai.. pp. 38. en_US
dc.identifier.isbn 978-955-627-029-7
dc.identifier.uri http://ir.lib.seu.ac.lk/handle/123456789/7580
dc.description.abstract The growing number of cardiovascular diseases is a big concern for health worldwide, showing the need for effective tools to help detect these diseases early and provide timely treatment. This research contributes to the growing body of knowledge in healthcare analytics by investigating the effectiveness of machine learning algorithms to enhance patient diagnosis and treatment strategies in cardiovascular care. We compare four different Machin Learning Algorithms as K-Nearest Neighbors (K-NN), Logistic Regression (LR), Support Vector Machine (SVM), and Naive Bayes (NB) using a comprehensive dataset of patient attributes. Our methodology includes preprocessing techniques such as one-hot encoding and label encoding to convert categorical variables for optimal model performance. Additionally in our study, we explored the influence of these preprocessing techniques, identifying that one-hot encoding generally enhanced accuracy for most algorithms. Hyperparameter tuning was conducted for SVM, optimizing parameters as the kernel type and regularization strength, which further improved the model’s accuracy. The dataset was systematically split into 80% training and 20% testing subsets, allowing us to assess each algorithm's accuracy on the testing set. The results revealed that SVM outperformed the other algorithms, achieving an accuracy of 89.69%, highlighting the critical role of methodological choices in developing effective predictive models. What sets this research apart from recent studies is its comprehensive comparison of multiple algorithms alongside detailed data preprocessing techniques, providing insights into the impact of these choices on predictive performance. en_US
dc.language.iso en_US en_US
dc.publisher Faculty of Applied Sciences, South Eastern University of Sri Lanka, Sammanthurai. en_US
dc.subject Cardiovascular Disease en_US
dc.subject Data Preprocessing en_US
dc.subject Hyperparameter Tuning en_US
dc.subject ML Algorithm en_US
dc.subject One-hot Encoding. en_US
dc.title Comparative analysis of machine learning classification approaches for heart disease prediction: a study of preprocessing techniques and algorithms en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search SEUIR


Advanced Search

Browse

My Account