Acceso abierto

A Multi-View Fuzzy Clustering Framework for Semantic-Rich Text Data Using SBERT and Ensemble Learning

, ,  y   
05 jun 2025

Cite
Descargar portada

Q. Chen, Y. Peng and Z. Lu, “BioSentVec: creating sentence embeddings for biomedical texts,” in 2019 IEEE International Conference on Healthcare Informatics (ICHI), Xi’an, China, Jun. 2019, pp. 1–5. https://doi.org/10.1109/ichi.2019.8904728 Search in Google Scholar

Y. Liu, M. Liu, and X. Wang, “Towards semantically sensitive text clustering: A feature space modeling technology based on dimension extension,” PLoS ONE, vol. 10, no. 3, Mar. 2015, Art. no. e0117390. https://doi.org/10.1371/journal.pone.0117390 Search in Google Scholar

V. Mehta, S. Bawa, and J. Singh, “WEClustering: word embeddings based text clustering technique for large datasets,” Complex & Intelligent Systems, vol. 7, no. 6, pp. 3211–3224, Sep. 2021. https://doi.org/10.1007/s40747-021-00512-9 Search in Google Scholar

D. Zhukov, E. Andrianova, K. Otradnov, and L. Istratov, “Soft clustering method for text mining, with an opportunity to attribute them to different semantic groups,” ITM Web of Conferences, vol. 18, Apr. 2018, Art. no. 03004). https://doi.org/10.1051/itmconf/20181803004 Search in Google Scholar

L. Fu, P. Lin, A. V. Vasilakos, and S. Wang, “An overview of recent multi-view clustering,” Neurocomputing, vol. 402, pp. 148–161, Aug. 2020. https://doi.org/10.1016/j.neucom.2020.02.104 Search in Google Scholar

C. Yuan, Y. Zhu, Z. Zhong, W. Zheng, and X. Zhu, “Robust self-tuning multi-view clustering,” World Wide Web, vol. 25, pp. 489–512, Feb. 2022. https://doi.org/10.1007/s11280-021-00945-9 Search in Google Scholar

A. Kumar, P. Rai, and H. Daume, “Co-regularized multi-view spectral clustering,” in NIPS’11 Proceedings of the 25th International Conference on Neural Information Processing Systems, Dec. 2011, pp. 1413–1421. Search in Google Scholar

K. Chaudhuri, S. M. Kakade, K. Livescu, and K. Sridharan, “Multi-view clustering via canonical correlation analysis,” in Proceedings of the 26th Annual International Conference on Machine Learning, Jun. 2009, pp. 129–136. https://doi.org/10.1145/1553374.1553391 Search in Google Scholar

D. Greene and P. Cunningham, “A matrix factorization approach for integrating multiple data views,” in Joint European Conference on Machine Learning and Knowledge Discovery in Databases, 2009, pp. 423–438. https://doi.org/10.1007/978-3-642-04180-8_45 Search in Google Scholar

X. Xie and S. Sun, “Multi-view clustering ensembles,” in 2013 International Conference on Machine Learning and Cybernetics, vol. 1, Tianjin, China, Jul. 2013, pp. 51–56. https://doi.org/10.1109/ICMLC.2013.6890443 Search in Google Scholar

Z. Xu and S. Sun, “An algorithm on multi-view Adaboost,” in International Conference on Neural Information Processing, 2010, pp. 355–362. https://doi.org/10.1007/978-3-642-17537-4_44 Search in Google Scholar

V. Kumar and S. Minz, “Multi-view ensemble learning: an optimal feature set partitioning for high-dimensional data classification,” Knowledge and Information Systems, vol. 49, pp. 1–59, Sep. 2016. https://doi.org/10.1007/s10115-015-0875-y Search in Google Scholar

Z. Tao, H. Liu, S. Li, Z. Ding, Y. Fu, “From ensemble clustering to multi-view clustering,” in Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence (IJCAI), 2017, pp. 2843–2849. https://doi.org/10.24963/ijcai.2017/396 Search in Google Scholar

E. Hammami and R. Faiz, “Text clustering based on multi-view representations,” in CIRCLE’22: Conference of the Information Retrieval Communities in Europe, Jul. 2022. [Online]. Available: https://ceurws.org/Vol-3178/CIRCLE_2022_paper_32.pdf Search in Google Scholar

G. Cui, “Analysis on the country differences of CSR of multinational corporations based on fuzzy C-Means clustering,” Journal of Physics: Conference Series, vol. 1533, no. 2, Apr. 2020, Art. no. 022079. https://doi.org/10.1088/1742-6596/1533/2/022079 Search in Google Scholar

J. C. Bezdek, “A convergence theorem for the fuzzy ISODATA clustering algorithms,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PAMI-2, no. 1, pp. 1–8, Jan. 1980. https://doi.org/10.1109/TPAMI.1980.4766964 Search in Google Scholar

S. Subudhi and S. Panigrahi, “A hybrid mobile call fraud detection model using optimized fuzzy C-means clustering and group method of data handling-based network,” Vietnam Journal of Computer Science, vol. 5, pp. 205–217, May 2018. https://doi.org/10.1007/s40595-018-0116-x Search in Google Scholar

I. D. Mienye and Y. Sun, “A survey of ensemble learning: Concepts, algorithms, applications, and prospects,” IEEE Access, vol. 10, pp. 99129–99149, Sep. 2022. https://doi.org/10.1109/ACCESS.2022.3207287 Search in Google Scholar

K. A. Nguyen, W. Chen, B. S. Lin, and U. Seeboonruang, “Comparison of ensemble machine learning methods for soil erosion pin measurements,” ISPRS International Journal of Geo -Information, vol. 10, no. 1, Jan. 2021, Art. no. 42. https://doi.org/10.3390/ijgi10010042 Search in Google Scholar

S. Alelyani, “Stable bagging feature selection on medical data,” Journal of Big Data, vol. 8, 2021, Art. no. 11. https://doi.org/10.1186/s40537-020-00385-8 Search in Google Scholar

L. Hickman, S. Thapa, L. Tay, M. Cao, and P. Srinivasan, “Text preprocessing for text mining in organizational research: Review and recommendations,” Organizational Research Methods, vol. 25, no. 1, pp. 114–146, 2022. https://doi.org/10.1177/1094428120971683 Search in Google Scholar

C. Galli, C. Cusano, S. Guizzardi, N. Donos, and E. Calciolari, “Embeddings for efficient literature screening: A primer for life science investigators,” Metrics, vol. 1, no. 1, Sep. 2024, Art. no. 1. https://doi.org/10.3390/metrics1010001 Search in Google Scholar

B. Asadi and R. Hajj, “Prediction of asphalt binder elastic recovery using tree-based ensemble bagging and boosting models,” Construction and Building Materials, vol. 410, Jan. 2024, Art. no. 134154. https://doi.org/10.1016/j.conbuildmat.2023.134154 Search in Google Scholar

A. A. Wani, “Comprehensive analysis of clustering algorithms: exploring limitations and innovative solutions,” PeerJ Computer Science, vol. 10, Aug. 2024, Art. no. e2286. https://doi.org/10.7717/peerj-cs.2286 Search in Google Scholar