Open Access

A comparative analysis of classification algorithms for consumer credits


Cite

Machine Learning is a constantly growing area which has the capacity to analyze massive amounts of data and find relevant patterns, a very important feature in the era of big data. It has a wide range of application areas, including the financial field, and proved to be efficient in solving various problems, including the prediction of the default probability of a customer to meet their obligations to the bank, using classification algorithms. Their output is further used when deciding whether to approve a loan or no, based on the previous behavior of the customers, hence reduces the loss of the bank. Even though Machine Learning algorithms proved to be efficient in solutioning this type of problems, none was identified for remarkable results. This paper studies 10 different methods applied on the same dataset (Logistic Regression, K-Nearest Neighbor, Support Vector Machine, Kernel Support Vector Machine, Naïve Bayes, Decision Tree, Random Forest, Bagging Classifier, Linear Discriminant Analysis, Neural Network - Multi Layer Perceptron) and performs a comparative analysis aiming to identify the one which outperforms the others. Their performance is evaluated based on some well-known statistical measures such as Accuracy, Misclassification Rate, Precision and Specificity. In addition, this paper also presents and evaluates the impact of feature selection on the overall performance of an algorithm.

eISSN:
2558-9652
Language:
English