SEUIR Repository

Optimization of plagiarism detection using vector space model on CUDA architecture

Show simple item record

dc.contributor.author Jiffriya Mohamed, Abdul-Cader
dc.contributor.author Akmal Jahan, Mohamed Abdul Cader
dc.contributor.author Hasindu, Gamaarachchi
dc.contributor.author Roshan, G. Ragel
dc.date.accessioned 2022-10-14T04:27:31Z
dc.date.available 2022-10-14T04:27:31Z
dc.date.issued 2022-09-26
dc.identifier.citation International Journal of Innovative Computing and Applications, Vol.13, No.4, 2022. pp.232 - 244 (pp. 1-20). en_US
dc.identifier.uri https://doi.org/10.1504/IJICA.2022.125675
dc.identifier.uri http://ir.lib.seu.ac.lk/handle/123456789/6267
dc.description.abstract Plagiarism is a rapidly rising issue among students that occurs during the submission of assignments, reports, and publications in universities and educational institutions because of the easy accessibility of abundant e-resources on the Internet. To mitigate plagiarism among students, many tools are available for natural language plagiarism detection. However, they become inefficient when dealing with a prolific number of documents with large content due to the time they consume. Therefore, we have proposed a way for software-based acceleration on text-based plagiarism detection using a suitable model on CPU/GPU. For the evaluation on the CPU, initially, a software-based serial vector space model was implemented on the CPU and tested with 1000 text-based documents particularly, students’ assignments, where it consumed 1641s for plagiarism detection. As the computation time of plagiarism detection is a bottleneck of performance while treating a prolific number of text-based sources with different sizes, we focus on accelerating and optimizing the model with the number of documents. Therefore, this research intends to implement and optimize the vector space model on the Graphics Processing Units (GPU) using Compute Unified Device Architecture (CUDA). In order to speed up, a parallel version of the model was developed on GPU using CUDA and tested with the same dataset which consumed only 36s and gained a 45x speedup compared to CPU, and when optimized further it took only 4s for the same dataset which was 389x faster than serial implementation. en_US
dc.language.iso en_US en_US
dc.publisher Inder Science Publishers en_US
dc.subject Graphics Processing Units (GPU) en_US
dc.subject Computer Unified Device Architecture (CUDA) en_US
dc.subject Plagiarism Detection en_US
dc.subject Vector Space Model en_US
dc.title Optimization of plagiarism detection using vector space model on CUDA architecture en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

  • Research Articles [915]
    THESE ARE RESEARCH ARTICLES OF ACADEMIC STAFF, PUBLISHED IN JOURNALS AND PROCEEDINGS ELSWHERE

Show simple item record

Search SEUIR


Advanced Search

Browse

My Account