AN IMPROVED FUZZY K-MEANS CLUSTERING ALGORITHM BASED ON WEIGHT ENTROPY MEASUREMENT AND CALINSKI-HARABASZ INDEX

Authors

  • Nguyễn Như Đồng Training Department, Hochiminh Vocational College of Technology, Viet Nam
  • Phan Thành Huấn The Division of Information Technology, University of Social Sciences and Humanities, VNU Hochiminh City, Viet Nam http://orcid.org/0000-0002-2886-9352

DOI:

https://doi.org/10.37569/DalatUniversity.8.2.408(2018)

Keywords:

Calinski-Harabasz Index, Fuzzy K-means, Weight entropy.

Abstract

Clustering plays an important role in data mining and is applied widely in fields of pattern recognition, computer vision, and fuzzy control. In this paper, we proposed an improved clustering algorithm combined of both fuzzy k-means using weight Entropy and Calinski-Harabasz index. The advantage of this method is that it does not only create efficient clustering but also has the ability to measure clusters and rate clusters to find the optimal number of clusters for practical needs. Finally, we presented experimental results on real-life datasets, which showed that the improved algorithm has the accuracy and efficiency of the existing algorithms.

Downloads

Download data is not yet available.

References

Chan, Y., Ching, W., Ng, M. K., & Huang, J. Z. (2004). An optimization algorithm for clustering using weighted dissimilarity measures. Pattern Recognition, 37(5), 943-952.

Friguiand, H., & Nasraoui, O. (2004). Unsupervised learning of prototypes and attribute weights. Pattern Recognition, 37(3), 567-581.

Hoàng, X. H., & Nguyễn, T. X. H. (2006). Mở rộng thuật toán gom cụm K-means cho dữ liệu hỗn hợp. Tạp chí Tin học và Điều khiển học, 22(3), 267-274.

Huang, J. Z., Ng, M. K., Rong, H., & Li, Z. (2005). Automated variable weighting in K-Means type clustering. IEEE Transactions on Pattern Analysis, 27(5), 657-668.

Jing, L., Ng, M. K., & Huang, J. Z. (2007). An entropy weighting K-Means algorithm for subspace clustering of high dimensional sparse data. IEEE Transactions on Knowledge and Data Engineering, 19(8), 1026-1041.

Li, T., & Chen, Y. (2008). An improved K-means algorithm for clustering using Entropy weighting measures. Paper presented at The 7th World Congress on Intelligent Control and Automation, China.

Li, T., & Chen, Y. (2010). Fuzzy K-Means incremental clustering based on K-Center and vector quantization. Journal of Computer, 5(11), 1670-1677.

Lichman, M. (2013). UCI machine learning repository. California, USA: University of California. Retrieved from http://archive.ics.uci.edu/ml.

Published

01-07-2018

Volume and Issues

Section

Natural Sciences and Technology

How to Cite

Đồng, N. N., & Huấn, P. T. (2018). AN IMPROVED FUZZY K-MEANS CLUSTERING ALGORITHM BASED ON WEIGHT ENTROPY MEASUREMENT AND CALINSKI-HARABASZ INDEX. Dalat University Journal of Science, 8(2), 13-23. https://doi.org/10.37569/DalatUniversity.8.2.408(2018)