SEUIR Repository

Extractive text summarization of online scientific articles for digital library repository

Show simple item record

dc.contributor.author Gunathilaka, R. D. R. M
dc.contributor.author Akmal Jahan, M. A. C
dc.date.accessioned 2023-08-17T08:57:06Z
dc.date.available 2023-08-17T08:57:06Z
dc.date.issued 2023-05-03
dc.identifier.citation 11th International Symposium (IntSym 2023) Managing Contemporary Issues for Sustainable Future through Multidisciplinary Research Proceedings 03rd May 2023 South Eastern University of Sri Lanka p. 708-719. en_US
dc.identifier.isbn 978-955-627-013-6
dc.identifier.uri http://ir.lib.seu.ac.lk/handle/123456789/6779
dc.description.abstract In a scientific community including researchers and students who are dedicated in reading, experimenting and writing ideas to the research world, they must refer to a significant number of articles on a daily basis. Digital libraries play a key role in supplementing scientific articles within online platforms. Due to the much abundance quantities of such articles presented in various platforms, the searching process tends toward time consuming, and identifying much related resources among them becomes again difficult. On the other hand, the majority of the scientific articles are available on a subscription-based and the online archives show only a document abstract but users necessitate extra material to determine if the article is extremely relevant or not, even if it merely provides a quick description. Therefore, this work aims to introduce an alternative approach to simplify the searching and sorting of the scientific articles for a digital library where a short subunit of sentences of the subscribed articles will be provided before purchasing it. To find a best algorithm for the process of summarisation of scientific articles within a short time and provide a good comprehension of the scientific document are to be investigated. Summaries from the publicly available SumPubMed dataset of scientific articles are evaluated using supervised and unsupervised approaches and manual summaries from them are compared. Text Rank algorithm performed better than the TF-IDF and K-Means algorithms, and the system achieved a better result when increasing the content size of the article. en_US
dc.language.iso en_US en_US
dc.publisher South Eastern University of Sri Lanka Oluvil, Sri Lanka en_US
dc.subject K-Means en_US
dc.subject ROUGE en_US
dc.subject Extractive summarization en_US
dc.subject Text-Rank en_US
dc.subject TF-IDF en_US
dc.title Extractive text summarization of online scientific articles for digital library repository en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search SEUIR


Advanced Search

Browse

My Account