SEUIR Repository

Accelerating text-based plagiarism detection using GPUs

Show simple item record

dc.contributor.author Jiffriya, M.A.C.
dc.contributor.author Jahan, M.A.C. Akmal
dc.contributor.author Gamaarachchi, Hasindu
dc.contributor.author Ragel, Roshan G.
dc.date.accessioned 2018-09-11T04:31:29Z
dc.date.available 2018-09-11T04:31:29Z
dc.date.issued 2016-07-20
dc.identifier.citation 10th International Conference on Industrial and Information Systems. 18th-20th Dec, 2015. Peradeniya, Sri Lanka. en_US
dc.identifier.other 15757192
dc.identifier.uri http://ir.lib.seu.ac.lk/handle/123456789/3125
dc.identifier.uri https://doi.org/10.1109/ICIINFS.2015.7399044
dc.description.abstract Plagiarism is known as an unauthorized use of other’s contents in writing and ideas in thinking without proper acknowledgment. There are several tools implemented for textbased plagiarism detection using various methods and techniques. However, these tools become inefficient while handling a large number of datasets due to the process of plagiarism detection which comprises of a lot of computational tasks and large memory requirement. Therefore, when we deal with a large number of datasets, there should be a way to accelerate the process by applying acceleration techniques to optimize the plagiarism detection. In response to this, we have developed a parallel algorithm using Computer Unified Device Architecture (CUDA) and tested it on a Graphical Processing Unit (GPU) platform. An equivalent algorithm is run on CPU platform as well. From the comparison of the results, CPU shows better performance when the number and the size of the documents are small. Meantime, GPU is an effective and efficient platform when handling a large number of documents and high in data size due to the increase in the amount of parallelism. It was found out that for our dataset, the performance of the algorithm on the GPU platform is approximately 6x faster than CPU. Thus, introducing GPU based optimization algorithm to the plagiarism detection gives a real solution while handling a large number of data for inter-document plagiarism detection. en_US
dc.language.iso en_US en_US
dc.publisher IEEE en_US
dc.subject CPU en_US
dc.subject GPU en_US
dc.subject NVIDIA en_US
dc.subject CUDA en_US
dc.subject Jaccard similarity en_US
dc.subject Vector space model en_US
dc.subject Hashing strategy en_US
dc.subject Thread en_US
dc.subject Block en_US
dc.title Accelerating text-based plagiarism detection using GPUs en_US
dc.type Article en_US


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

  • Research Articles [904]
    THESE ARE RESEARCH ARTICLES OF ACADEMIC STAFF, PUBLISHED IN JOURNALS AND PROCEEDINGS ELSWHERE

Show simple item record

Search SEUIR


Advanced Search

Browse

My Account