SEUIR Repository

Accelerating text-based plagiarism detection using GPUs

Show simple item record

dc.contributor.author Jiffriya, M. A. C.
dc.contributor.author Akmal Jahan, M. A. C.
dc.contributor.author Hasindu, Gamaarachchi
dc.contributor.author Roshan, G. Ragel
dc.date.accessioned 2022-09-28T10:28:21Z
dc.date.available 2022-09-28T10:28:21Z
dc.date.issued 2016-02-04
dc.identifier.citation 10th International Conference on Industrial and Information Systems (ICIIS) on 18th December 2015, University of Peradeniya, Peradeniya. pp. 1-6. en_US
dc.identifier.isbn 978-1-5090-1741-6
dc.identifier.uri https://www.researchgate.net/publication/304298204
dc.identifier.uri http://ir.lib.seu.ac.lk/handle/123456789/6253
dc.description.abstract Plagiarism is known as unauthorized use of other's contents in writing and ideas in thinking without proper acknowledgment. There are several tools implemented for text-based plagiarism detection using various methods and techniques. However, these tools become inefficient while handling a large number of datasets due to the process of plagiarism detection which comprises a lot of computational tasks and large memory requirements. Therefore, when we deal with a large number of datasets, there should be a way to accelerate the process by applying acceleration techniques to optimize plagiarism detection. In response to this, we have developed a parallel algorithm using Compute Unified Device Architecture (CUDA) and tested it on a Graphics Processing Unit (GPU) platform. An equivalent algorithm is run on the CPU platform as well. From the comparison of the results, the CPU shows better performance when the number and the size of the documents are small. Meantime, GPU is an effective and efficient platform when handling a large number of documents and is high in data size due to the increase in the amount of parallelism. It was found that for our dataset, the performance of the algorithm on the GPU platform is approximately 6x faster than CPU. Thus, introducing GPU based optimization algorithm to plagiarism detection gives a real solution while handling a large number of data for inter-document plagiarism detection. en_US
dc.language.iso en_US en_US
dc.publisher Institute of Electrical and Electronics Engineers (IEEE) en_US
dc.subject CPU en_US
dc.subject GPU en_US
dc.subject NVIDIA en_US
dc.subject CUDA en_US
dc.subject Jaccard Similarity en_US
dc.subject Vector Space Model en_US
dc.subject Hashing Strategy en_US
dc.subject Thread en_US
dc.subject Block en_US
dc.title Accelerating text-based plagiarism detection using GPUs en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

  • Research Articles [915]
    THESE ARE RESEARCH ARTICLES OF ACADEMIC STAFF, PUBLISHED IN JOURNALS AND PROCEEDINGS ELSWHERE

Show simple item record

Search SEUIR


Advanced Search

Browse

My Account