Evaluation of Clustering Results in the Aspect of Information Theory
2020 61st International Scientific Conference on Information Technology and Management Science of Riga Technical University (ITMS 2020): Proceedings 2020
Pēteris Grabusts

Clustering is the process of finding possible groups in a given set, taking into account the signs of similarity or difference between the elements of this set. In cluster analysis, it is necessary to classify the data in some way or to find regularities in given data, therefore the concept of “regularity” is gaining more and more importance in the context of intelligent data processing systems. It is necessary to find out how the data are related to each other, what is the similarity or difference of different data, what is the measure of comparison of these data. For such purposes, various clustering algorithms can be used, which divide the data into groups according to certain criteria - metrics. Metric in this context means the distance between the points in the cluster. The existing methods of entropy clustering is an information-theoretical approach to the problem of clustering. In the work the evaluation of clustering quality was performed with the help of clustering quality criteria and in the context of information theory concepts.


Atslēgas vārdi
clustering, entropy, information theory, purity, Rand index
DOI
10.1109/ITMS51158.2020.9259227
Hipersaite
https://ieeexplore.ieee.org/document/9259227

Grabusts, P. Evaluation of Clustering Results in the Aspect of Information Theory. No: 2020 61st International Scientific Conference on Information Technology and Management Science of Riga Technical University (ITMS 2020): Proceedings, Latvija, Riga, 15.-16. oktobris, 2020. Piscataway: IEEE, 2020, 1.-5.lpp. ISBN 978-1-7281-9106-5. e-ISBN 978-1-7281-9105-8. Pieejams: doi:10.1109/ITMS51158.2020.9259227

Publikācijas valoda
English (en)
RTU Zinātniskā bibliotēka.
E-pasts: uzzinas@rtu.lv; Tālr: +371 28399196