Acceso abierto

Statistical method for clustering high-dimensional data based on fuzzy mathematical modeling

   | 11 dic 2023

Cite

In this paper, firstly, the fuzzy mathematical model and its types and algorithms of fuzzy numbers are studied. Then, the concept of fuzzy mathematics is added to cluster analysis, and fuzzy cluster analysis is carried out on samples or objects after data standardization and the construction of a similarity matrix. Then, the fuzzy C-mean FCM algorithm is proposed, and the FCM algorithm is improved by introducing the point density of data objects, the maximum minimum distance method of point density sampling, and the reduction of computation to improve the statistical function of clustering high-dimensional data. Finally, the algorithm of this paper is compared with other algorithms by AC, PR, RE, convergence speed, running time, DBI index and other indexes in order to analyze the advantages of the FCM algorithm in high dimensional data clustering statistics. The results show that the F value of the FCM algorithm in the Zoo dataset reaches 0.976, which improves 61% compared to FKM and nearly 12.6% compared to the IIFKM0 algorithm, with the best clustering effect. The DBI index of the FCM algorithm is the lowest in the four datasets, which is basically stabilized in the range of 0.5-0.6. This is better than other algorithms.

eISSN:
2444-8656
Idioma:
Inglés
Calendario de la edición:
Volume Open
Temas de la revista:
Life Sciences, other, Mathematics, Applied Mathematics, General Mathematics, Physics