A Survey on Plagiarism Detection Techniques

Authors

  • Aditya Shankar Narayanan SRM Institute of Science and Technology, Kattankulathur, Kanchipuram, Tamil Nadu, India. Author
  • Anjana Vishwanath SRM Institute of Science and Technology, Kattankulathur, Kanchipuram, Tamil Nadu, India Author
  • Senthil Kumar K. SRM Institute of Science and Technology, Kattankulathur, Kanchipuram, Tamil Nadu, India Author
  • Malathi D. SRM Institute of Science and Technology, Kattankulathur, Kanchipuram, Tamil Nadu, India Author

DOI:

https://doi.org/10.61841/egmxp566

Keywords:

Plagiarism, Plagiarism Detection, Cross-Language Plagiarism Detection, Deep Learning Framework

Abstract

 Plagiarism is a major act of academic dishonesty; hence the detection of plagiarism is very essential. Therefore, Plagiarism Detection is a thriving area of research in Natural Language Processing that involves the identification of misappropriated segments of text and the retrieval of the source of the original text. This paper surveys the types of plagiarism and tasks involved in the detection of plagiarism, and analyses the existing algorithms and methods used in the Plagiarism Detection Framework. The techniques explored in this paper are: Word2vec, Monte Carlo ANN, Candidate Retrieval and Text Alignment, PV-DM and PV-DBOW, Rabin-Karp Algorithm, IR-based plagiarism detection, LSI, and Joint Word Embedding. This survey concludes that Deep Learning Based Plagiarism Detection methods show a higher accuracy than others. The survey also concludes that the existing methods (excluding LSI), lack the ability to effectively perform Cross-Language Plagiarism Detection.

Downloads

Download data is not yet available.

References

[1] https://en.wikipedia.org/wiki/Plagiarism

[2] S. M. Alzahrani, N. Salim and A. Abraham, "Understanding Plagiarism Linguistic Patterns, Textual Features, and Detection Methods," in IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), vol. 42, no. 2, pp. 133-149, March 2012.

[3] D. Suleiman, A. Awajan and N. Al-Madi, "Deep Learning Based Technique for Plagiarism Detection in Arabic Texts," 2017 International Conference on New Trends in Computing Sciences (ICTCS), Amman, 2017, pp. 216-222.

[4] R. K. Bachchan and A. K. Timalsina, "Plagiarism Detection Framework Using Monte Carlo Based Artificial Neural Network for Nepali Language," 2018 IEEE 3rd International Conference on Computing, Communication and Security (ICCCS), Kathmandu, 2018, pp. 122-127

[5] S. Lazemi, H. Ebrahimpour-Komleh and N. Noroozi, "Persian Plagirisim Detection Using CNN s," 2018 8th International Conference on Computer and Knowledge Engineering (ICCKE), Mashhad, 2018, pp. 171-175.

[6] M. Sarı and A. M. Özbayoğlu, "Classification of Turkish Documents Using Paragraph Vector," 2018 International Conference on Artificial Intelligence and Data Processing (IDAP), Malatya, Turkey, 2018, pp.

1-5.

[7] R. Sutoyo et al., "Detecting documents plagiarism using winnowing algorithm and k-gram method," 2017 IEEE International Conference on Cybernetics and Computational Intelligence (CyberneticsCom), Phuket, 2017, pp. 67-72.

[8] R. M. A. Nawab, M. Stevenson and P. Clough, "An IR-Based Approach Utilizing Query Expansion for Plagiarism Detection in MEDLINE," in IEEE/ACM Transactions on Computational Biology and

Bioinformatics, vol. 14, no. 4, pp. 796-804, 1 July-Aug. 2017.

[9] E. Hattab, "Cross-Language Plagiarism Detection Method: Arabic vs. English," 2015 International Conference on Developments of E-Systems Engineering (DeSE), Duai, 2015, pp. 141-144.

[10] M. Liu, B. Lang, Z. Gu and A. Zeeshan, "Measuring similarity of academic articles with semantic profile and joint word embedding," in Tsinghua Science and Technology, vol. 22, no. 6, pp. 619-632, December 2017.

[11] Z. Ceska, M. Toman, and K. Jezek, “Multilingual plagiarism detection, “Lecture Notes in Computer Science (including subseries Lecture Notes inArtificial Intelligence Lecture Notes in Bioinformatics), vol. 5253 LNAI,pp. 83–92, 2008.

[12] http://en.wikipedia.org/Latent_semantic_analysis

Downloads

Published

29.02.2020

How to Cite

Shankar Narayanan, A., Vishwanath, A., K. , S. K., & D. , M. (2020). A Survey on Plagiarism Detection Techniques. International Journal of Psychosocial Rehabilitation, 24(1), 1564-1572. https://doi.org/10.61841/egmxp566