Java code for evaluation of fingerprint selection algorithms for two-stage plagiarism detection
Gints Jēkabsons

This software was used for the experiments in the paper "Evaluation of Fingerprint Selection Algorithms for Two-Stage Plagiarism Detection" (https://sciendo.com/article/10.2478/acss-2021-0022). This software is developed for evaluating the effectiveness of fingerprint selection algorithms for a two-stage (source retrieval + aligning) local text reuse detection. It implements Full fingerprinting, Every p-th, 0 mod p, Winnowing, Hailstorm, Frequency-Biased Winnowing (FBW), and Modified Frequency-Biased Winnowing (MFBW). Indexing of the fingerprints is implemented using the Apache Lucene library.


Date
30.12.2021.
Keywords
Text reuse detection, plagiarism detection, fingerprint selection, indexing
Hyperlink
http://www.cs.rtu.lv/jekabsons/nlp.html
Department for Research Coordination and Information.
E-mail: elza.vecpuise@rtu.lv; Phone: +371 26013889