MACHINE LEARNING-BASED APPROACHES FOR CREDIT CARD DEBT PREDICTION

Nurain Ibrahim1*,Umi Munirah Ishak2, Nur Nabilah Arina Ali3, Norshahida Shaadan4

1*,4School of Mathematical Sciences, College of Computing, Informatics and Mathematics, Universiti Teknologi MARA, 40450 Shah Alam, Selangor, Malaysia

1*Institute for Big Data Analytics and Artificial Intelligence (IBDAAI), Kompleks Al-Khawarizmi, Universiti Teknologi MARA, 40450 Shah Alam, Selangor, Malaysia

2D’Monte Laguna Merbok Sungai Petani 31, Persiaran BLM 1A, Bandar Laguna Merbok, 08000 Sungai Petani, Kedah, Malaysia

3Fresenius Medical Care, Axis Technology Centre, 2nd Floor, Lot Petaling Jaya, Jalan 51A/225, Seksyen 13, 46100 Petaling Jaya, Selangor, Malaysia

1*This email address is being protected from spambots. You need JavaScript enabled to view it., 2This email address is being protected from spambots. You need JavaScript enabled to view it., 3This email address is being protected from spambots. You need JavaScript enabled to view it., 4This email address is being protected from spambots. You need JavaScript enabled to view it.



ABSTRACT

 

The primary concern in the stock market and banks that offer credit cards has been a problem over time. Regardless of their capacity to pay, most card users abuse their credit cards and accrue debt from cash cards. The most significant issue facing cardholders and banks alike is this calamity. Predicting credit card customers' default payments became vital to lowering this risk. Data mining approaches, including decision tree, logistic regression, and Naïve Bayes with feature selection methods, were applied to secondary credit card debt data to identify the significant factors that impact credit card default and to enhance the prediction of credit card default. As a result, the decision tree with Gini index splitting criteria forward selection wrapper method was identified as the best model with the highest percentages of accuracy, precision, sensitivity, and area under ROC of 76.39%, 72.02%, 85.08%, and 0.891 respectively. Additionally, the significant factors that impact credit card default are gender, education level, repayment status in July 2005, repayment status in August 2005, status of repayment in September 2005, and the amount paid in June 2005 and May 2005. This study may help financial institutions assess creditworthiness and give consumers insights into their financial behaviors.

 


Keywords: Credit Card Debt, Decision Tree, Logistic Regression, Naïve Bayes

 

Published On: 1 April 2024

Full Download