Plagiarism Detection in SQL Student Assignments
An original method for plagiarism detection in SQL student assignments is proposed. The method is based on identifying so-called “SQL lexemes”, the persistent elements of an SQL statement, and “SQL variables”, the easily modifiable elements of an SQL statement. Any SQL statement can thus be replaced with a so-called token: a sequence of SQL lexemes and SQL variables. The distance between two SQL tokens can be calculated using a well-known algorithm such as the Levenshtein metric. A small Levenshtein distance between two tokens suggests that one of the SQL statements was built by modifying the other.
We also present the first practical results of applying the algorithm and discuss further development of the method.
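The idea of reducing statements to tokens and comparing them by Levenshtein distance can be illustrated with a minimal Python sketch. It is not the authors' implementation: the keyword set `SQL_KEYWORDS`, the placeholder `"V"` for every SQL variable, the regular expression, and the function names `tokenize` and `levenshtein` are all assumptions made for illustration. Here keywords, operators and punctuation are treated as SQL lexemes, while identifiers and literals are treated as SQL variables.

```python
import re

# Assumed, deliberately small keyword set; the source does not list its lexemes.
SQL_KEYWORDS = {
    "SELECT", "FROM", "WHERE", "AND", "OR", "NOT", "IN", "AS", "JOIN", "ON",
    "GROUP", "ORDER", "BY", "HAVING", "DISTINCT", "COUNT", "SUM", "AVG",
}

# String literals, numbers, identifiers, two-character operators, then any other symbol.
_PIECE = re.compile(
    r"'[^']*'|\d+(?:\.\d+)?|[A-Za-z_][A-Za-z_0-9]*|<=|>=|<>|!=|\S"
)


def tokenize(statement):
    """Reduce an SQL statement to a token: a list of lexemes and 'V' placeholders."""
    token = []
    for piece in _PIECE.findall(statement):
        upper = piece.upper()
        if upper in SQL_KEYWORDS:
            token.append(upper)   # lexeme: keyword, case-normalized
        elif piece[0].isalnum() or piece[0] in "_'":
            token.append("V")     # variable: identifier or literal (assumed placeholder)
        else:
            token.append(piece)   # lexeme: operator or punctuation
    return token


def levenshtein(a, b):
    """Edit distance between two sequences (insert, delete, substitute cost 1)."""
    previous = list(range(len(b) + 1))
    for i, item_a in enumerate(a, start=1):
        current = [i]
        for j, item_b in enumerate(b, start=1):
            cost = 0 if item_a == item_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


# Hypothetical submissions, invented for this example.
original = "SELECT name, salary FROM employees WHERE salary > 1000"
renamed = "select n, s from staff where s > 2500"
different = "SELECT COUNT(*) FROM orders GROUP BY customer_id"

print(levenshtein(tokenize(original), tokenize(renamed)))
print(levenshtein(tokenize(original), tokenize(different)))
```

In this sketch the first two statements differ only in identifiers, literals and letter case, so they reduce to the same token and their distance is 0, while the third statement has a different structure and yields a positive distance. Which elements count as lexemes and which as variables, and what distance threshold flags a pair as suspicious, are choices the abstract leaves open.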
Keywords: e-Learning, Automatic evaluation, Automatic grading