Classification of resources in an e-library using machine learning algorithms

Akmal Jahan, MAC; Ragel, Roshan G

dc.contributor.author	Akmal Jahan, MAC
dc.contributor.author	Ragel, Roshan G
dc.date.accessioned	2017-06-12T07:40:45Z
dc.date.available	2017-06-12T07:40:45Z
dc.date.issued	2012-03-28
dc.identifier.citation	Empowering regional development through science and technology First Annual Science Research Session -2012	en_US
dc.identifier.isbn	9789556270273
dc.identifier.uri	http://ir.lib.seu.ac.lk/handle/123456789/2631
dc.description.abstract	Library is the heart of a university and students spend a large amount of time in library in search of knowledge. The trend of reading resources in printed materials such as books, journals and other research publications is gradually changing. Since it is an uneasy and time-consuming process, students are interested in soft materials such as e-journals, e books and other web based resources. Nowadays, in a library most of the resources in digital form are stored without any classification. They are not categorized or utilized by the users since it does not have any proper way to access or find appropriate material when the users' queries applied. Even though there are a lot of manual ways to access text based materials or resources in a library, they cannot be applied to the digital resources since it needs some kind of text mining and machine learning. This project addresses this issue through a closed domain question answering system for a resource pool in an e-library. As the initial step, the project uses a narrowed down search space by processing the abstracts of the resources. More than 300 abstracts are extracted along with their title and pre-processed. 75% of the data are used as training sets and the remaining are used for testing. Different machine learning techniques such as classification and clustering are applied with this large collection of textual data using Weikato Environment of Knowledge Analysis (WEKA) and their performance metrics and error rates were compared. The most suitable machine learning technique and the mode of testing for the textual data were selected and applied for training models as the solution for the classification problem of the electronic resources.	en_US
dc.language.iso	en	en_US
dc.publisher	Faculty of Applied Sciences,South Eastern University of Sri Lanka	en_US
dc.subject	WEKA	en_US
dc.subject	Machine learning	en_US
dc.subject	Cross-validation	en_US
dc.subject	Clustering	en_US
dc.title	Classification of resources in an e-library using machine learning algorithms	en_US
dc.type	Article	en_US