Skip to main content

A Framework for Checking Plagiarized Contents in Programs Submitted atĀ Online Judging Systems

  • Conference paper
  • First Online:
Image Processing and Capsule Networks (ICIPCN 2020)

Abstract

Plagiarism has always been a problem in the academic area through digitalization and the widespread use of computers, copying documents has become very easy. Nowadays, all kinds of documents are available as digital content and freely accessible either legally or illegally. This development makes it easy for culprits to copy digital content found on the web without much effort. Therefore, plagiarism can be found in any field such as literature, design, scientific papers, and source code. This project focused on the detection of a particular type of plagiarism: source code plagiarism, also known as software plagiarism. Source code plagiarism is often done by computer science students. Often, they solve their assignments by copying from other students or copying it from the internet and presenting it as their own work. This project implemented source code plagiarism detection framework by detecting finder prints of programs and using them to compare each of them instead of using the whole file. We used winnowing to select fingerprints among k-gram hash values of a source code, which was generated by Rabin-Karp Algorithm. At long last, we likewise give trial results on applying enormous data sets and contrasting the runtime with Measure of Software Similarity (MOSS), the extensively used plagiarism perception facility.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Aho, A.V., Lam, M.S., Sethi, R., Ulman, J.D.: Compilers: Principles, techniques and tools (for Anna University), 2nd edn. Pearson Education India, Chennai (2003)

    Google ScholarĀ 

  2. Bostan, A.S.: Winnowing algorithm for program code (2017)

    Google ScholarĀ 

  3. Ji, J.H., Woo, G., Cho, H.G.: An experience of detecting plagiarized source codes in competitive programming contests. ACM SIGCSE Bull. 40(3), 369ā€“369 (2008)

    ArticleĀ  Google ScholarĀ 

  4. Joy, M., Luck, M.: Plagiarism in programming assignments. IEEE Trans. Educ. 42(2), 129ā€“133 (1999)

    ArticleĀ  Google ScholarĀ 

  5. Karnalim, O., Aldiansyah, A.: Python source code plagiarism attacks in object-oriented environment. Comput. Eng. Appl. J. 6(3), 87ā€“93 (2017)

    Google ScholarĀ 

  6. Karnalim, O., Budi, S.: The effectiveness of low-level structure-based approach toward source code plagiarism level taxonomy. In: 6th International Conference on Information and Communication Technology (ICoICT), pp. 130ā€“134. IEEE (2018)

    Google ScholarĀ 

  7. Karp, R.M., Rabin, M.O.: Efficient randomized pattern-matching algorithms. IBM J. Res. Dev. 31(2), 249ā€“260 (1987)

    ArticleĀ  MathSciNetĀ  Google ScholarĀ 

  8. Kermek, D., Novak, M.: Process model improvement for source code plagiarism detection in student programming assignments. Inf. Educ. 15(1), 103ā€“126 (2016)

    Google ScholarĀ 

  9. Lim, J.S., Ji, J.H., Cho, H.G., Woo, G.: Plagiarism detection among source codes using adaptive local alignment of keywords. In: Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication, p.Ā 24. ACM (2011)

    Google ScholarĀ 

  10. Liu, C., Chen, C., Han, J., Yu, P.S.: GPLAG: detection of software plagiarism by program dependence graph analysis. In: Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 872ā€“881. ACM (2006)

    Google ScholarĀ 

  11. Prechelt, L., Malpohl, G., Philippsen, M.: Finding plagiarisms among a set of programs with JPlag. J. UCS 8(11), 1016 (2002)

    Google ScholarĀ 

  12. Rahal, I., Wielga, C.: Source code plagiarism detection using biological string similarity algorithms. J. Inf. Knowl. Manage. 13(03), 1450028 (2014)

    ArticleĀ  Google ScholarĀ 

  13. Schleimer, S., Wilkerson, D.S., Aiken, A.: Winnowing: local algorithms for document fingerprinting. In: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, pp. 76ā€“85. ACM (2003)

    Google ScholarĀ 

  14. Sue, Y.: How plagiarism detection works at HackerRank (2018). https://blog.hackerrank.com/how-plagiarism-detection-works-at-hackerrank/

  15. TooNewbie: How plagiarism checkers are implemented? Letā€™s help CodeForces to find cheaters! (2019). https://codeforces.com/blog/entry/65874

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohammad Shamsul Arefin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

Ā© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ferdows, J., Khatun, S., Mokarrama, M.J., Arefin, M.S. (2021). A Framework for Checking Plagiarized Contents in Programs Submitted atĀ Online Judging Systems. In: Chen, J.IZ., Tavares, J.M.R.S., Shakya, S., Iliyasu, A.M. (eds) Image Processing and Capsule Networks. ICIPCN 2020. Advances in Intelligent Systems and Computing, vol 1200. Springer, Cham. https://doi.org/10.1007/978-3-030-51859-2_50

Download citation

Publish with us

Policies and ethics