SEUIR Repository

Classification of resources in an e-library using machine learning algorithms

Show simple item record

dc.contributor.author Akmal Jahan, MAC
dc.contributor.author Ragel, Roshan G
dc.date.accessioned 2017-06-12T07:40:45Z
dc.date.available 2017-06-12T07:40:45Z
dc.date.issued 2012-03-28
dc.identifier.citation Empowering regional development through science and technology First Annual Science Research Session -2012 en_US
dc.identifier.isbn 9789556270273
dc.identifier.uri http://ir.lib.seu.ac.lk/handle/123456789/2631
dc.description.abstract Library is the heart of a university and students spend a large amount of time in library in search of knowledge. The trend of reading resources in printed materials such as books, journals and other research publications is gradually changing. Since it is an uneasy and time-consuming process, students are interested in soft materials such as e-journals, e books and other web based resources. Nowadays, in a library most of the resources in digital form are stored without any classification. They are not categorized or utilized by the users since it does not have any proper way to access or find appropriate material when the users' queries applied. Even though there are a lot of manual ways to access text based materials or resources in a library, they cannot be applied to the digital resources since it needs some kind of text mining and machine learning. This project addresses this issue through a closed domain question answering system for a resource pool in an e-library. As the initial step, the project uses a narrowed down search space by processing the abstracts of the resources. More than 300 abstracts are extracted along with their title and pre-processed. 75% of the data are used as training sets and the remaining are used for testing. Different machine learning techniques such as classification and clustering are applied with this large collection of textual data using Weikato Environment of Knowledge Analysis (WEKA) and their performance metrics and error rates were compared. The most suitable machine learning technique and the mode of testing for the textual data were selected and applied for training models as the solution for the classification problem of the electronic resources. en_US
dc.language.iso en en_US
dc.publisher Faculty of Applied Sciences,South Eastern University of Sri Lanka en_US
dc.subject WEKA en_US
dc.subject Machine learning en_US
dc.subject Cross-validation en_US
dc.subject Clustering en_US
dc.title Classification of resources in an e-library using machine learning algorithms en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search SEUIR


Advanced Search

Browse

My Account