Evaluation framework of hierarchical clustering methods for binary data

Warning

This publication does not belong to the Faculty of Sports Studies but to the Faculty of Informatics. The official publication page is on the muni.cz website.
Authors

KRIKSCIUNIENE Dalia, SAKALAUSKAS Virgilijus, TAMASAUSKAS Darius

Year of publication 2012
Type Article in Proceedings
Conference Proceedings of 12th International Conference on Hybrid Intelligent Systems (HIS)
MU Faculty or unit

Faculty of Informatics

Citation
www http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6421371&isnumber=6421298
Doi http://dx.doi.org/10.1109/HIS.2012.6421371
Field Informatics
Keywords Cluster analysis; binary data; Monte Carlo simulation; distance matrix; hierarchical clustering
Description The article evaluates hierarchical clustering methods according to their performance on binary data. We explore the accuracy of ten hierarchical clustering methods by experimenting with ten different distance measures. Binary data sets with three types of clusters (well, poorly, and very poorly separated) are generated with the Monte Carlo method by selecting appropriate parameters for the binomial distribution. To evaluate the precision of the clustering methods, the binary data sets are transformed into distance matrices. The error level of each method is explored in relation to distance measures, cluster types, and data distributions. The Complete linkage, Flexible-beta, and Ward's methods have the best clustering performance for the case of two well-separated clusters of binary data.
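
The pipeline described in the abstract (simulate binary clusters, convert them to a distance matrix, cluster hierarchically, measure the error) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names, the Bernoulli probabilities 0.1 and 0.9, the sample sizes, and the choice of the simple matching distance are all assumptions made for illustration; the record does not list the ten distance measures or the simulation parameters used in the paper. Only complete linkage is taken from the abstract.

```python
import random

def generate_binary_clusters(n_per_cluster, n_features, p_a, p_b, seed=0):
    """Monte Carlo draw of two groups of binary vectors.

    Each bit is a Bernoulli trial (binomial with n=1); p_a and p_b are
    hypothetical parameters, far apart for well-separated clusters.
    """
    rng = random.Random(seed)
    data, labels = [], []
    for label, p in ((0, p_a), (1, p_b)):
        for _ in range(n_per_cluster):
            data.append([1 if rng.random() < p else 0 for _ in range(n_features)])
            labels.append(label)
    return data, labels

def simple_matching_distance(x, y):
    """Fraction of positions where two binary vectors disagree (assumed measure)."""
    return sum(1 for a, b in zip(x, y) if a != b) / len(x)

def distance_matrix(data, dist=simple_matching_distance):
    """Transform the binary data set into a symmetric distance matrix."""
    n = len(data)
    d = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d[i][j] = d[j][i] = dist(data[i], data[j])
    return d

def complete_linkage(d, n_clusters):
    """Naive agglomerative clustering with complete linkage.

    The distance between two clusters is the largest pairwise distance
    between their members; the closest pair of clusters is merged until
    n_clusters remain.
    """
    clusters = [[i] for i in range(len(d))]
    while len(clusters) > n_clusters:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                link = max(d[i][j] for i in clusters[a] for j in clusters[b])
                if best is None or link < best[0]:
                    best = (link, a, b)
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return clusters

def error_rate(clusters, labels):
    """Share of misassigned points in the two-cluster case.

    Cluster ids are arbitrary, so both labelings are tried and the
    better one is reported.
    """
    n = len(labels)
    pred = [0] * n
    for idx in clusters[1]:
        pred[idx] = 1
    wrong = sum(1 for p, t in zip(pred, labels) if p != t)
    return min(wrong, n - wrong) / n

# Two well-separated clusters (assumed parameters).
data, labels = generate_binary_clusters(15, 50, p_a=0.1, p_b=0.9, seed=1)
clusters = complete_linkage(distance_matrix(data), 2)
print("error rate:", error_rate(clusters, labels))
```

Moving p_a and p_b closer together (for example 0.4 and 0.6) gives a rough analogue of the poorly separated case, and swapping the `dist` argument allows other binary distance measures to be compared in the same way.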