Please use this identifier to cite or link to this item: http://ir.lib.seu.ac.lk/handle/123456789/6253
Title: Accelerating text-based plagiarism detection using GPUs
Authors: Jiffriya, M. A. C.
Akmal Jahan, M. A. C.
Hasindu, Gamaarachchi
Roshan, G. Ragel
Keywords: CPU
GPU
NVIDIA
CUDA
Jaccard Similarity
Vector Space Model
Hashing Strategy
Thread
Block
Issue Date: 4-Feb-2016
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Citation: 10th International Conference on Industrial and Information Systems (ICIIS) on 18th December 2015, University of Peradeniya, Peradeniya. pp. 1-6.
Abstract: Plagiarism is known as unauthorized use of other's contents in writing and ideas in thinking without proper acknowledgment. There are several tools implemented for text-based plagiarism detection using various methods and techniques. However, these tools become inefficient while handling a large number of datasets due to the process of plagiarism detection which comprises a lot of computational tasks and large memory requirements. Therefore, when we deal with a large number of datasets, there should be a way to accelerate the process by applying acceleration techniques to optimize plagiarism detection. In response to this, we have developed a parallel algorithm using Compute Unified Device Architecture (CUDA) and tested it on a Graphics Processing Unit (GPU) platform. An equivalent algorithm is run on the CPU platform as well. From the comparison of the results, the CPU shows better performance when the number and the size of the documents are small. Meantime, GPU is an effective and efficient platform when handling a large number of documents and is high in data size due to the increase in the amount of parallelism. It was found that for our dataset, the performance of the algorithm on the GPU platform is approximately 6x faster than CPU. Thus, introducing GPU based optimization algorithm to plagiarism detection gives a real solution while handling a large number of data for inter-document plagiarism detection.
URI: https://www.researchgate.net/publication/304298204
http://ir.lib.seu.ac.lk/handle/123456789/6253
ISBN: 978-1-5090-1741-6
Appears in Collections:Research Articles

Files in This Item:
File Description SizeFormat 
Accelerating text-based plagiarism.pdf578.28 kBAdobe PDFThumbnail
View/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.