Acceso abierto

Robust Hybrid Data-Level Approach for Handling Skewed Fat-Tailed Distributed Datasets and Diverse Features in Financial Credit Risk

, , ,  y   
10 jun 2025

Cite
Descargar portada

Ahmad M., Aftab S., Muhammad S. S., Ahmad S., Machine learning techniques for sentiment analysis: A review, International Journal of Multidisciplinary Sciences and Engineering, 2017, 8(3), 27. Search in Google Scholar

Akosa J., Predictive accuracy: A misleading performance measure for highly imbalanced data, in Proceedings of the SAS Global Forum, 2017, 12, 1–4. SAS Institute Inc. Cary, NC, USA. Search in Google Scholar

Arafa A., El-Fishawy N., Badawy M., Radad M., RN-SMOTE: Reduced Noise SMOTE based on DBSCAN for enhancing imbalanced data classification, Journal of King Saud University-Computer and Information Sciences, 2022, 34(8), 5059–5074. Search in Google Scholar

Araf I., Idri A., Chairi I., Cost-sensitive learning for imbalanced medical data: a review, Artificial Intelligence Review, 2024, 57(4), 1–72. Search in Google Scholar

Asselman A., Khaldi M., Aammou S., Enhancing the prediction of student performance based on the machine learning XGBoost algorithm, Interactive Learning Environments, 2023, 31(6), 3360–3379. Search in Google Scholar

Awaji B., Senan E.M., Olayah F., Alshari E.A., Alsulami M., Abosaq H.A., Alqahtani J., Janrao P., Hybrid techniques of facial feature image analysis for early detection of autism spectrum disorder based on combined CNN features, Diagnostics, 2023, 13(18), 2948. Search in Google Scholar

Azhar N.A., Pozi M.S.M., Din A.M., Jatowt A., An investigation of SMOTE based methods for imbalanced datasets with data complexity analysis, IEEE Transactions on Knowledge and Data Engineering, 2022. Search in Google Scholar

Bader-El-Den M., Teitei E., Perry T., Biased random forest for dealing with the class imbalance problem, IEEE Transactions on Neural Networks and Learning Systems, 2018, vol. 30, no. 7, pp. 2163–2172, IEEE. Search in Google Scholar

Bansal A., Jain A., Analysis of focussed under-sampling techniques with machine learning classifiers, 2021 IEEE/ACIS 19th International Conference on Software Engineering Research, Management and Applications (SERA), 2021, 91–96. Search in Google Scholar

Batista G.E.A.P.A., Prati R.C., Monard M.C., A study of the behavior of several methods for balancing machine learning training data, ACM SIGKDD Explorations Newsletter, 2004, 6(1), 20–29. Search in Google Scholar

Belgiu M., Drăguţ L., Random forest in remote sensing: A review of applications and future directions, ISPRS Journal of Photogrammetry and Remote Sensing, 2016, 114, 24–31. Search in Google Scholar

Bi J., Zhang C., An empirical comparison on state-of-the-art multi-class imbalance learning algorithms and a new diversified ensemble learning scheme, Knowledge-Based Systems, 2018, vol. 158, pp. 81–93, Elsevier. Search in Google Scholar

Brown I., Mues C., An experimental comparison of classification algorithms for imbalanced credit scoring data sets, Expert Systems with Applications, 2012, 39(3), 3446–3453. Search in Google Scholar

Breiman L., Random forests, Machine Learning, 2001, 45, 5–32. Search in Google Scholar

Budjač R., Nikmon M., Schreiber P., Zahradníková B., Janáčová D., Automated machine learning overview, Research Papers Faculty of Materials Science and Technology Slovak University of Technology, 2019, 27(45), 107–112. Search in Google Scholar

Carmona P., Climent F., and Momparler A., Predicting failure in the US banking sector: An extreme gradient boosting approach, International Review of Economics & Finance, 2019, vol. 61, pp. 304–323, Elsevier. Search in Google Scholar

Caouette J.B., Altman E.I., Narayanan P., Nimmo R., Managing credit risk: The great challenge for global financial markets, John Wiley & Sons, 2011. Search in Google Scholar

Carrington A.M., Fieguth P.W., Qazi H., Holzinger A., Chen H.H., Mayr F., Manuel D.G., A new concordant partial AUC and partial c statistic for imbalanced data in the evaluation of machine learning algorithms, BMC Medical Informatics and Decision Making, 2020, 20, 1–12. Search in Google Scholar

Chandra W., Suprihatin B., Resti Y., Median-KNN Regressor-SMOTE-Tomek links for handling missing and imbalanced data in air quality prediction, Symmetry, 2023, 15(4), 887. Search in Google Scholar

Chawla N.V., Bowyer K.W., Hall L.O., Kegelmeyer W.P., SMOTE: Synthetic minority over-sampling technique, Journal of Artificial Intelligence Research, 2002, 16, 321–357. Search in Google Scholar

Chen T., Guestrin C., XGBoost: A scalable tree boosting system, Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016, 785–794. Search in Google Scholar

Chen N., Ribeiro B., Chen A., Financial credit risk assessment: a recent review, Artificial Intelligence Review, 2016, 45, 1–23, Springer. Search in Google Scholar

Chen W., Liu C., Yu W., Wen J., Zhou L., Asymmetric Loss-Oriented Cost Sensitive Learning Model for Credit Risk Assessment, (Journal Name Pending), 2024. Search in Google Scholar

Cheng K., Zhang C., Yu H., Yang X., Zou H., Gao S., Grouped SMOTE with noise filtering mechanism for classifying imbalanced data, IEEE Access, 2019, 7, 170668–170681. Search in Google Scholar

Cheng L.-C., Wu Y.-T., Chao C.-T., Wang J.-H., Detecting fake reviewers from the social context with a graph neural network method, Decision Support Systems, 2024, vol. 179, p. 114150, Elsevier. Search in Google Scholar

Couronné R., Probst P., Boulesteix A.-L., Random forest versus logistic regression: A large-scale benchmark experiment, BMC Bioinformatics, 2018, 19, 1–14. Search in Google Scholar

Davis J., Goadrich M., The relationship between Precision-Recall and ROC curves, in Proceedings of the 23rd International Conference on Machine Learning, 2006, 233–240. Search in Google Scholar

Demšar J., Statistical comparisons of classifiers over multiple data sets, The Journal of Machine Learning Research, 2006, 7, 1–30. Search in Google Scholar

De Corso E., Pasquini E., Trimarchi M., La Mantia I., Pagella F., Ottaviano G., Garzaro M., Pipolo C., Torretta S., Seccia V., et al., Dupilumab in the treatment of severe uncontrolled chronic rhinosinusitis with nasal polyps (CRSwNP): a multicentric observational phase IV real-life study (DUPIREAL), Allergy, 2023, 78(10), 2669–2683. Search in Google Scholar

Ding H., Sun Y., Wang Z., Huang N., Shen Z., Cui X., RGAN-EL: A GAN and ensemble learning-based hybrid approach for imbalanced data classification, Information Processing & Management, 2023, 60(2), 103235. Search in Google Scholar

Dong J., Chen Y., Yao B., Zhang X., Zeng N., A neural network boosting regression model based on XGBoost, Applied Soft Computing, 2022, 125, 109067. Search in Google Scholar

Douzas G., Bacao F., Geometric SMOTE: a geometrically enhanced drop-in replacement for SMOTE, Information Sciences, 2019, 501, 118–135. Search in Google Scholar

Elreedy D., Atiya A.F., A comprehensive analysis of synthetic minority oversampling technique (SMOTE) for handling class imbalance, Information Sciences, 2019, 505, 32–64. Search in Google Scholar

Ferreira A.J., Figueiredo M.A.T., Boosting algorithms: A review of methods, theory, and applications, Ensemble Machine Learning: Methods and Applications, 2012, 35–85. Search in Google Scholar

Fernández A., García S., Herrera F., Chawla N.V., SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary, Journal of Artificial Intelligence Research, 2018, 61, 863–905. Search in Google Scholar

Fränti P., Mariescu-Istodor R., Soft precision and recall, Pattern Recognition Letters, 2023, 167, 115–121. Search in Google Scholar

Friedman M., A comparison of alternative tests of significance for the problem of m rankings, The Annals of Mathematical Statistics, 1940, 11(1), 86–92. Search in Google Scholar

Fonseca J., Bacao F., Geometric SMOTE for imbalanced datasets with nominal and continuous features, Expert Systems with Applications, 2023, 234, 121053. Search in Google Scholar

Fu G.-H., Yi L.-Z., Pan J., Tuning model parameters in class-imbalanced learning with precision-recall curve, Biometrical Journal, 2019, 61(3), 652–664. Search in Google Scholar

Galar M., Fernández A., Barrenechea E., Bustince H., Herrera F., A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 2011, 42(4), 463–484. Search in Google Scholar

García S., Fernández A., Luengo J., Herrera F., Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: Experimental analysis of power, Information Sciences, 2010, 180(10), 2044–2064. Search in Google Scholar

Ganganwar V., An overview of classification algorithms for imbalanced datasets, International Journal of Emerging Technology and Advanced Engineering, 2012, 2(4), 42–47. Search in Google Scholar

Gnip P., Vokorokos L., Drotár P., Selective oversampling approach for strongly imbalanced data, PeerJ Computer Science, 2021, 7, e604. Search in Google Scholar

Gu J., Random Forest Based Imbalanced Data Cleaning and Classification, 2007, Citeseer. Search in Google Scholar

Gu Q., Cai Z., Zhu L., Huang B., Data mining on imbalanced data sets, in 2008 International Conference on Advanced Computer Theory and Engineering, 2008, 1020–1024, IEEE. Search in Google Scholar

Gür E., Duyan Y.A., Balcı F., Mice make temporal inferences about novel locations based on previously learned spatiotemporal contingencies, Animal Cognition, 26(3), 771–779, 2023. Search in Google Scholar

Guo K., Wan X., Liu L., Gao Z., Yang M., Fault diagnosis of intelligent production line based on digital twin and improved random forest, Applied Sciences, 2021, 11(16), 7733. Search in Google Scholar

Han S., Williamson B.D., Fong Y., Improving random forest predictions in small datasets from two-phase sampling designs, BMC Medical Informatics and Decision Making, 2021, 21, 1–9, Springer. Search in Google Scholar

Han H., Wang W.-Y., Mao B.-H., Borderline-SMOTE: A new over-sampling method in imbalanced data sets learning, International Conference on Intelligent Computing, 2005, 878–887, Springer. Search in Google Scholar

Hand D. J., Till R. J., A simple generalisation of the area under the ROC curve for multiple class classification problems, Machine Learning, 2001, 45, 171–186. Search in Google Scholar

Hancock J.T., Khoshgoftaar T.M., Johnson J.M., Evaluating classifier performance with highly imbalanced big data, Journal of Big Data, 2023, 10(1), 42. Search in Google Scholar

Hanley J.A., McNeil B.J., The meaning and use of the area under a receiver operating characteristic (ROC) curve, Radiology, 1982, 143(1), 29–36. Search in Google Scholar

He H., Garcia E.A., Learning from imbalanced data, IEEE Transactions on Knowledge and Data Engineering, 2009, 21(9), 1263–1284, IEEE. Search in Google Scholar

He H., Ma Y., Imbalanced learning: foundations, algorithms, and applications, 2013, John Wiley & Sons. Search in Google Scholar

Hossin M., Sulaiman M.N., A review on evaluation metrics for data classification evaluations, International Journal of Data Mining & Knowledge Management Process, 2015, 5(2), 1. Search in Google Scholar

Islam A., Belhaouari S.B., Rehman A.U., Bensmail H., KNNOR: An oversampling technique for imbalanced datasets, Applied Soft Computing, 2022, 115, 108288. Search in Google Scholar

Ishwaran H., Lu M., Standard errors and confidence intervals for variable importance in random forest regression, classification, and survival, Statistics in Medicine, 2019, 38(4), 558–582. Search in Google Scholar

Jiang C., Lu W., Wang Z., Ding Y., Benchmarking state-of-the-art imbalanced data learning approaches for credit scoring, Expert Systems with Applications, 2023, 213, 118878, Elsevier. Search in Google Scholar

Johnson J.M., Khoshgoftaar T.M., Survey on deep learning with class imbalance, Journal of Big Data, 2019, 6(1), 1–54. Search in Google Scholar

Kaur H., Pannu H.S., Malhi A.K., A systematic review on imbalanced data challenges in machine learning: Applications and solutions, ACM Computing Surveys (CSUR), 2019, 52(4), 1–36. Search in Google Scholar

Ke G., Meng Q., Finley T., Wang T., Chen W., Ma W., Ye Q., Liu T.-Y., LightGBM: A highly efficient gradient boosting decision tree, Advances in Neural Information Processing Systems, 2017, 30. Search in Google Scholar

Khalilia M., Chakraborty S., Popescu M., Predicting disease risks from highly imbalanced data using random forest, BMC Medical Informatics and Decision Making, 2011, 11, 1–13. Search in Google Scholar

Khoshgoftaar T.M., Golawala M., and Van H.Ja, An empirical study of learning from imbalanced data using random forest, 19th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2007), 2007, vol. 2, pp. 310–317, IEEE. Search in Google Scholar

Kim T., Lee J.-S., Maximizing AUC to learn weighted naive Bayes for imbalanced data classification, Expert Systems with Applications, 2023, 217, 119564. Search in Google Scholar

Kotsiantis S., Kanellopoulos D., Pintelas P., Handling imbalanced datasets: A review, GESTS International Transactions on Computer Science and Engineering, 2006, 30(1), 25–36. Search in Google Scholar

Kraiem M.S., Sánchez-Hernández F., Moreno-García M.N., Selecting the suitable resampling strategy for imbalanced data classification regarding dataset properties. An approach based on association models, Applied Sciences, 2021, 11(18), 8546. Search in Google Scholar

Kumar V., Lalotra G.S., Sasikala P., Rajput D.S., Kaluri R., Lakshmanna K., Shorfuzzaman M., Alsufyani A., Uddin M., Addressing binary classification over class imbalanced clinical datasets using computationally intelligent techniques, Healthcare, 2022, 10(7), 1293. Search in Google Scholar

Kyiu A., Tawiah V., IFRS 9 implementation and bank risk, in Accounting Forum, 2023, 1–25, Taylor & Francis. Search in Google Scholar

Lai S.B.S., Shahri N.H.N.B.M., Mohamad M.B., Rahman H.A.B., Rambli A.B., Comparing the performance of AdaBoost, XGBoost, and logistic regression for imbalanced data, Mathematics and Statistics, 2021, 9(3), 379–385. Search in Google Scholar

Laker J., The Evolution of Risk and Risk Management–A Prudential Regulator’s Perspective, Conference–2007, 2007, Reserve Bank of Australia. Search in Google Scholar

Le T., Vo M.T., Vo B., Lee M.Y., Baik S.W., A hybrid approach using oversampling technique and cost-sensitive learning for bankruptcy prediction, Complexity, 2019, 2019, 8460934. Search in Google Scholar

Li X., Liu Q., A hybrid sampling algorithm for imbalanced and class-overlap data based on natural neighbors and density estimation, Knowledge and Information Systems, 2024, 1–32. Search in Google Scholar

Liang X.W., Jiang A.P., Li T., Xue Y.Y., Wang G.T., LR-SMOTE—An improved unbalanced data set oversampling based on K-means and SVM, Knowledge-Based Systems, 2020, 196, 105845. Search in Google Scholar

Limanto S., Buliali J.L., Saikhu A., GLoW SMOTE-D: Oversampling Technique to Improve Prediction Model Performance of Students Failure in Courses, IEEE Access, 2024. Search in Google Scholar

Ling C. X., Huang J., Zhang H., AUC: a statistically consistent and more discriminating measure than accuracy, Ijcai, 2003, 3, 519–524. Search in Google Scholar

Liu F., Qian Q., Cost-sensitive variational autoencoding classifier for imbalanced data classification, Algorithms, 2022, 15(5), 139. Search in Google Scholar

Liu Y., Yu X., Huang J.X., An Aijun, Combining integrated sampling with SVM ensembles for learning from imbalanced datasets, Information Processing & Management, 2011, 47(4), 617–631. Search in Google Scholar

Liu Y., Wang H., Fei Y., Liu Y., Shen L., Zhuang Z., Zhang X., Research on the prediction of green plum acidity based on improved XGBoost, Sensors, 2021, 21(3), 930. Search in Google Scholar

Lommers K., Harzli O. E., Kim J., Confronting machine learning with financial research, arXiv preprint arXiv:2103.00366, 2021. Search in Google Scholar

Malhotra R., Jain J., Predicting defects in imbalanced data using resampling methods: an empirical investigation, PeerJ Computer Science, 2022, 8, e573. Search in Google Scholar

Ma G., Wang Y., Can the Chinese domestic bond and stock markets facilitate a globalising renminbi, Economic and Political Studies, 2020, vol. 8, no. 3, pp. 291–311, Taylor & Francis. Search in Google Scholar

Mellor A., Boukir S., Haywood A., and Jones S., Exploring issues of training data imbalance and mislabelling on random forest performance for large area land cover classification using the ensemble margin, ISPRS Journal of Photogrammetry and Remote Sensing, vol. 105, 2015, pp. 155–168, Elsevier. Search in Google Scholar

More A.S., Rana D.P., Performance enrichment through parameter tuning of random forest classification for imbalanced data applications, Materials Today: Proceedings, 2022, vol. 56, pp. 3585–3593, Elsevier. Search in Google Scholar

Mukherjee M., Khushi M., SMOTE-ENC: A novel SMOTE-based method to generate synthetic data for nominal and continuous features, Applied System Innovation, 2021, 4(1), 18. Search in Google Scholar

Niu A., Cai B., Cai S., Big Data Analytics for Complex Credit Risk Assessment of Network Lending Based on SMOTE Algorithm, Complexity, 2020, vol. 2020, no. 1, p. 8563030, Wiley Online Library. Search in Google Scholar

Ozdemir B., Evolution of risk management from risk compliance to strategic risk management: From Basel I to Basel II, III and IFRS 9, Journal of Risk Management in Financial Institutions, vol. 11, no. 1, 2018, pp. 76–85, Henry Stewart Publications. Search in Google Scholar

Pes B., Learning from high-dimensional and class-imbalanced datasets using random forests, Information, 2021, 12(8), 286. Search in Google Scholar

Paing M.P., Pintavirooj C., Tungjitkusolmun S., Choomchuay S., Hamamoto K., Comparison of sampling methods for imbalanced data classification in random forest, 2018 11th Biomedical Engineering International Conference (BMEiCON), 2018, 1–5. Search in Google Scholar

Prakash A., Thangaraj J., Roy S., Srivastav S., Mishra J. K., Model-aware XG-Boost method towards optimum performance of flexible distributed Raman amplifier, IEEE Photonics Journal, 2023, 15(4), 1–10. Search in Google Scholar

Qiu W., Credit risk prediction in an imbalanced social lending environment based on XGBoost, 2019 5th International Conference on Big Data and Information Analytics (BigDIA), 2019, pp. 150–156, IEEE. Search in Google Scholar

Qolomany B., Al-Fuqaha A., Gupta A., Benhaddou D., Alwajidi S., Qadir J., Fong A. C., Leveraging machine learning and big data for smart buildings: A comprehensive survey, IEEE Access, 2019, 7, 90316–90356. Search in Google Scholar

Brennan, P. “A comprehensive survey of methods for overcoming the class imbalance problem in fraud detection.” Institute of Technology Blanchardstown, Dublin, Ireland, 2012. Search in Google Scholar

Rao C.R., Karl Pearson chi-square test the dawn of statistical inference, Goodness-of-fit tests and model validity, 2002, 9–24, Springer. Search in Google Scholar

Ratnaningsih T., SMOTE-MRS: A Novel SMOTE-Multiresolution Sampling Technique for Imbalanced Distribution to Improve Prediction of Anemia, (Journal Name Pending), 2024. Search in Google Scholar

Rawat S.S., Mishra A.K., Review of Methods for Handling Class-Imbalanced in Classification Problems, arXiv preprint arXiv:2211.05456, 2022. Search in Google Scholar

Rijsbergen V., Information retrieval; Butterworth, 1978, J. Librariansh., 1979, 11, 237. Search in Google Scholar

Rodriguez-Galiano, V. F., Ghimire, B., Rogan, J., Chica-Olmo, M., & Rigol-Sanchez, J. P., An assessment of the effectiveness of a random forest classifier for land-cover classification, ISPRS Journal of Photogrammetry and Remote Sensing, 2012, 67, 93–104. Search in Google Scholar

Rodríguez-Galiano V.F., Abarca-Hernández F., Ghimire B., Chica-Olmo M., Atkinson P. M., Jeganathan C., Incorporating spatial variability measures in land-cover classification using Random Forest, Procedia Environmental Sciences, vol. 3, 2011, pp. 44–49, Elsevier. Search in Google Scholar

Rout N., Mishra D., Mallick M.K., Handling imbalanced data: a survey, in International Proceedings on Advances in Soft Computing, Intelligent Systems and Applications: ASISA 2016, 2018, 431–443, Springer. Search in Google Scholar

Schapire R.E., Freund Y., Boosting: Foundations and algorithms, Kybernetes, 2013, 42(1), 164–166. Search in Google Scholar

Sarker I. H., Machine learning: Algorithms, real-world applications and research directions, SN Computer Science, 2021, 2(3), 160. Search in Google Scholar

Shi S., Li J., Zhu D., Yang F., Xu Y., A hybrid imbalanced classification model based on data density, Information Sciences, 2023, 624, 50–67. Search in Google Scholar

Singh A., Ranjan R. K., Tiwari A., Credit card fraud detection under extreme imbalanced data: a comparative study of data-level algorithms, Journal of Experimental & Theoretical Artificial Intelligence, 2022, 34(4), 571–598. Search in Google Scholar

Singla P., Domingos P., Discriminative training of Markov logic networks, in AAAI, 2005, 868–873. Search in Google Scholar

Sibindi R., Mwangi R. W., Waititu A. G., A boosting ensemble learning based hybrid light gradient boosting machine and extreme gradient boosting model for predicting house prices, Engineering Reports, 2023, 5(4), e12599. Search in Google Scholar

Soltanzadeh P., Hashemzadeh M., RCSMOTE: Range-Controlled synthetic minority over-sampling technique for handling the class imbalance problem, Information Sciences, 542, 92–111, 2021. Search in Google Scholar

Srivastava J., Sharan A., SMOTEEN Hybrid Sampling Based Improved Phishing Website Detection, Authorea Preprints, 2023. Search in Google Scholar

Sun Z., Song Q., Zhu X., Sun H., Xu B., Zhou Y., A novel ensemble method for classifying imbalanced data, Pattern Recognition, 2015, 48(5), 1623–1637. Search in Google Scholar

Sun Z., Ying W., Zhang W., Gong S., Undersampling method based on minority class density for imbalanced data, Expert Systems with Applications, 2024, 249, 123328. Search in Google Scholar

Szeghalmy S., Fazekas A., A comparative study on noise filtering of imbalanced data sets, Knowledge-Based Systems, 2024, 301, 112236. Search in Google Scholar

Tao X., Guo X., Zheng Y., Zhang X., Chen Z., Self-adaptive oversampling method based on the complexity of minority data in imbalanced datasets classification, Knowledge-Based Systems, 2023, 277, 110795. Search in Google Scholar

Taye M.M., Understanding of machine learning with deep learning: architectures, workflow, applications and future directions, Computers, 2023, 12(5), 91. Search in Google Scholar

Ting K.M., An instance-weighting method to induce cost-sensitive trees, IEEE Transactions on Knowledge and Data Engineering, 2002, vol. 14, no. 3, pp. 659–665, IEEE. Search in Google Scholar

Tomašević N., Mladenić D., Class imbalance and the curse of minority hubs, Knowledge-Based Systems, 2013, 53, 157–172. Search in Google Scholar

Tomek I., Two modifications of CNN, IEEE Transactions on Systems, Man, and Cybernetics, 1976, 6, 769–776. Search in Google Scholar

Uddin Md G., Nash S., Diganta M. T. M., Rahman A., Olbert A. I., Robust machine learning algorithms for predicting coastal water quality index, Journal of Environmental Management, 2022, 321, 115923. Search in Google Scholar

Verikas A., Gelzinis A., Bacauskiene M., Mining data with random forests: A survey and results of new tests, Pattern Recognition, 2011, 44(2), 330–349. Search in Google Scholar

Wang A.X., Chukova S.S., Nguyen B.P., Synthetic minority oversampling using edited displacement-based k-nearest neighbors, Applied Soft Computing, 2023, 148, 110895. Search in Google Scholar

Wang K., Wan J., Li G., Sun H., A hybrid algorithm-level ensemble model for imbalanced credit default prediction in the energy industry, Energies, 2022, 15(14), 5206. Search in Google Scholar

Wang C., Deng C., Wang S., Imbalance-XGBoost: leveraging weighted and focal losses for binary label-imbalanced classification with XGBoost, Pattern Recognition Letters, 2020, 136, 190–197. Search in Google Scholar

Wilson D.L., Asymptotic properties of nearest neighbor rules using edited data, IEEE Transactions on Systems, Man, and Cybernetics, 1972, 3, 408–421. Search in Google Scholar

Yan J., Tang B., He H., Detection of false data attacks in smart grid with supervised learning, 2016 International Joint Conference on Neural Networks (IJCNN), 2016, 1395–1402. Search in Google Scholar

Yanenkova I., Nehoda Y., Drobyazko S., Zavhorodnii A., Berezovska L., Modeling of bank credit risk management using the cost risk model, Journal of Risk and Financial Management, 2021, 14(5), 211. Search in Google Scholar

Yin S., Dey D.K., Valdez E.A., Gan G., Vadiveloo J., Skewed link regression models for imbalanced binary response with applications to life insurance, arXiv preprint arXiv:2007.15172, 2020. Search in Google Scholar

Xu T., Comparative Analysis of Machine Learning Algorithms for Consumer Credit Risk Assessment, Transactions on Computer Science and Intelligent Systems Research, 2024, vol. 4, pp. 60–67. Search in Google Scholar

Zhang C., Tan K.C., Li H., Hong G.S., A cost-sensitive deep belief network for imbalanced classification, IEEE Transactions on Neural Networks and Learning Systems, 2018, 30(1), 109–122. Search in Google Scholar

Zhang C., Wang G., Zhou Y., Yao L., Jiang Z.L., Liao Q., Wang X., Feature selection for high dimensional imbalanced class data based on F-measure optimization, in 2017 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC), 2017, 278–283, IEEE. Search in Google Scholar

Zhang W., He Y., Wang L., Liu S., Meng X., Landslide susceptibility mapping using random forest and extreme gradient boosting: A case study of Fengjie, Chongqing, Geological Journal, 2023, 58(6), 2372–2387. Search in Google Scholar

Zhang W., Wu C., Zhong H., Li Y., Wang L., Prediction of undrained shear strength using extreme gradient boosting and random forest based on Bayesian optimization, Geoscience Frontiers, 2021, 12(1), 469–477. Search in Google Scholar

Zhao Z., Cui T., Ding S., Li J., Bellotti A.G., Resampling Techniques Study on Class Imbalance Problem in Credit Risk Prediction, Mathematics, 2024, vol. 12, no. 5, p. 701, MDPI. Search in Google Scholar

Zheng M., Wang F., Hu X., Miao Y., Cao H., Tang M., A method for analyzing the performance impact of imbalanced binary data on machine learning models, Axioms, 2022, 11(11), 607. Search in Google Scholar

Zhu L., Qiu D., Ergu D., Ying C., Liu K., A study on predicting loan default based on the random forest algorithm, Procedia Computer Science, 2019, 162, 503–513. Search in Google Scholar

Zhu T., Lin Y., Liu Y., Synthetic minority oversampling technique for multiclass imbalance problems, Pattern Recognition, 2017, 72, 327–340. Search in Google Scholar

Zou Y., Gao C., Gao H., Business failure prediction based on a cost-sensitive extreme gradient boosting machine, IEEE Access, 2022, vol. 10, pp. 42623–42639, IEEE. Search in Google Scholar

Idioma:
Inglés
Calendario de la edición:
4 veces al año
Temas de la revista:
Informática, Inteligencia artificial, Desarrollo de software