A Survey on Plagiarism Detection Techniques
DOI:
https://doi.org/10.61841/egmxp566Keywords:
Plagiarism, Plagiarism Detection, Cross-Language Plagiarism Detection, Deep Learning FrameworkAbstract
Plagiarism is a major act of academic dishonesty; hence the detection of plagiarism is very essential. Therefore, Plagiarism Detection is a thriving area of research in Natural Language Processing that involves the identification of misappropriated segments of text and the retrieval of the source of the original text. This paper surveys the types of plagiarism and tasks involved in the detection of plagiarism, and analyses the existing algorithms and methods used in the Plagiarism Detection Framework. The techniques explored in this paper are: Word2vec, Monte Carlo ANN, Candidate Retrieval and Text Alignment, PV-DM and PV-DBOW, Rabin-Karp Algorithm, IR-based plagiarism detection, LSI, and Joint Word Embedding. This survey concludes that Deep Learning Based Plagiarism Detection methods show a higher accuracy than others. The survey also concludes that the existing methods (excluding LSI), lack the ability to effectively perform Cross-Language Plagiarism Detection.
Downloads
References
[1] https://en.wikipedia.org/wiki/Plagiarism
[2] S. M. Alzahrani, N. Salim and A. Abraham, "Understanding Plagiarism Linguistic Patterns, Textual Features, and Detection Methods," in IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), vol. 42, no. 2, pp. 133-149, March 2012.
[3] D. Suleiman, A. Awajan and N. Al-Madi, "Deep Learning Based Technique for Plagiarism Detection in Arabic Texts," 2017 International Conference on New Trends in Computing Sciences (ICTCS), Amman, 2017, pp. 216-222.
[4] R. K. Bachchan and A. K. Timalsina, "Plagiarism Detection Framework Using Monte Carlo Based Artificial Neural Network for Nepali Language," 2018 IEEE 3rd International Conference on Computing, Communication and Security (ICCCS), Kathmandu, 2018, pp. 122-127
[5] S. Lazemi, H. Ebrahimpour-Komleh and N. Noroozi, "Persian Plagirisim Detection Using CNN s," 2018 8th International Conference on Computer and Knowledge Engineering (ICCKE), Mashhad, 2018, pp. 171-175.
[6] M. Sarı and A. M. Özbayoğlu, "Classification of Turkish Documents Using Paragraph Vector," 2018 International Conference on Artificial Intelligence and Data Processing (IDAP), Malatya, Turkey, 2018, pp.
1-5.
[7] R. Sutoyo et al., "Detecting documents plagiarism using winnowing algorithm and k-gram method," 2017 IEEE International Conference on Cybernetics and Computational Intelligence (CyberneticsCom), Phuket, 2017, pp. 67-72.
[8] R. M. A. Nawab, M. Stevenson and P. Clough, "An IR-Based Approach Utilizing Query Expansion for Plagiarism Detection in MEDLINE," in IEEE/ACM Transactions on Computational Biology and
Bioinformatics, vol. 14, no. 4, pp. 796-804, 1 July-Aug. 2017.
[9] E. Hattab, "Cross-Language Plagiarism Detection Method: Arabic vs. English," 2015 International Conference on Developments of E-Systems Engineering (DeSE), Duai, 2015, pp. 141-144.
[10] M. Liu, B. Lang, Z. Gu and A. Zeeshan, "Measuring similarity of academic articles with semantic profile and joint word embedding," in Tsinghua Science and Technology, vol. 22, no. 6, pp. 619-632, December 2017.
[11] Z. Ceska, M. Toman, and K. Jezek, “Multilingual plagiarism detection, “Lecture Notes in Computer Science (including subseries Lecture Notes inArtificial Intelligence Lecture Notes in Bioinformatics), vol. 5253 LNAI,pp. 83–92, 2008.
Downloads
Published
Issue
Section
License
You are free to:
- Share — copy and redistribute the material in any medium or format for any purpose, even commercially.
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
- The licensor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms:
- Attribution — You must give appropriate credit , provide a link to the license, and indicate if changes were made . You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
Notices:
You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation .
No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.