Comparing Text-Matching Software Systems Using the Document Set in the Latvian Language

Laima Vītoliņa; Alla Anohina-Naumeca

Comparing Text-Matching Software Systems Using the Document Set in the Latvian Language

Journal of Academic Ethics 2020
Laima Vītoliņa, Alla Anohina-Naumeca

There are many internationally developed text-matching software systems that help successfully identify potentially plagiarized content in English texts using both their internal databases and web resources. However, many other languages are not so widely spread but they are used daily to communicate, conduct research and acquire education. Each language has its peculiarities, so, in the context of finding content similarities, it is necessary to determine what systems are more suitable for a document set written in a specific language. The research focuses on testing the existent text-matching software systems on a set of documents prepared in the Latvian language. The corpus includes documents containing verbatim plagiarism, paraphrasing, translation plagiarism and original text to test both false positive and false negative cases. In total, 16 different text-matching software systems are compared on the plagiarism coverage using the prepared document corpus. The research presented is a part of an international initiative “Testing of Support Tools for Plagiarism Detection (TeSToP)” established under the European Network for Academic Integrity.

Keywords
plagiarism detection, academic integrity, text-matching software, plagiarism coverage
DOI
10.1007/s10805-019-09355-z
Hyperlink
https://link.springer.com/article/10.1007%2Fs10805-019-09355-z

Kamzola, L., Anohina-Naumeca, A. Comparing Text-Matching Software Systems Using the Document Set in the Latvian Language. Journal of Academic Ethics, 2020, Vol. 18, No. 2, pp.129-141. ISSN 1570-1727. e-ISSN 1572-8544. Available from: doi:10.1007/s10805-019-09355-z

Publication language
English (en)

Publication Type
Scientific article indexed in SCOPUS or WOS database
Funding for basic activity
State funding for education
Field of research
2. Engineering and technology
Sub-field of research
2.2 Electrical engineering, Electronic engineering, Information and communication engineering
Research platform
Information and Communication
ID: 30408
Citation count

0