Please use this identifier to cite or link to this item: http://ir.lib.seu.ac.lk/handle/123456789/6779
Title: Extractive text summarization of online scientific articles for digital library repository
Authors: Gunathilaka, R. D. R. M
Akmal Jahan, M. A. C
Keywords: K-Means
ROUGE
Extractive summarization
Text-Rank
TF-IDF
Issue Date: 3-May-2023
Publisher: South Eastern University of Sri Lanka Oluvil, Sri Lanka
Citation: 11th International Symposium (IntSym 2023) Managing Contemporary Issues for Sustainable Future through Multidisciplinary Research Proceedings 03rd May 2023 South Eastern University of Sri Lanka p. 708-719.
Abstract: In a scientific community including researchers and students who are dedicated in reading, experimenting and writing ideas to the research world, they must refer to a significant number of articles on a daily basis. Digital libraries play a key role in supplementing scientific articles within online platforms. Due to the much abundance quantities of such articles presented in various platforms, the searching process tends toward time consuming, and identifying much related resources among them becomes again difficult. On the other hand, the majority of the scientific articles are available on a subscription-based and the online archives show only a document abstract but users necessitate extra material to determine if the article is extremely relevant or not, even if it merely provides a quick description. Therefore, this work aims to introduce an alternative approach to simplify the searching and sorting of the scientific articles for a digital library where a short subunit of sentences of the subscribed articles will be provided before purchasing it. To find a best algorithm for the process of summarisation of scientific articles within a short time and provide a good comprehension of the scientific document are to be investigated. Summaries from the publicly available SumPubMed dataset of scientific articles are evaluated using supervised and unsupervised approaches and manual summaries from them are compared. Text Rank algorithm performed better than the TF-IDF and K-Means algorithms, and the system achieved a better result when increasing the content size of the article.
URI: http://ir.lib.seu.ac.lk/handle/123456789/6779
ISBN: 978-955-627-013-6
Appears in Collections:11th International Symposium - 2023

Files in This Item:
File Description SizeFormat 
IntSym 2023 Proceedings-708-719.pdf758.63 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.