ISSN: 2226-6348
Open access
In the era of burgeoning big data and the expansive reach of the Internet, commercial banks are confronted with the challenge of managing an extensive customer base while striving to meet their evolving needs. A nuanced and reliable understanding of consumer preferences is imperative for banks to ensure customer retention and to preemptively address potential churn. This research introduces a sophisticated approach to predict customer churn through the lens of multiclass categorization, leveraging the prowess of ensemble machine learning algorithms. By integrating the strengths of XGBoost, LightBoost, and CatBoost with a bagging ensemble method, our model offers a refined prediction of customer churn, distinguishing between various levels of churn risk. This multiclass ensemble learning framework not only enhances the predictive accuracy but also provides a more granular insight into customer behavior patterns. The efficacy of our model is assessed using the kappa statistic, a robust measure for evaluating the consistency of predictions across multiple categories. Our experimental results reveal that the kappa value of our multiclass ensemble model significantly surpasses that of single-algorithm approaches, indicating a superior predictive performance and reliability. The insights gleaned from our model can inform targeted marketing strategies and customer retention efforts, thereby mitigating the risk of customer churn. Through the application of this multiclass ensemble learning model, banks can achieve a more strategic and informed approach to maintaining customer loyalty and optimizing their service offerings.
Bashir, M. A., Ali, M. H., Wai, L. M., Hossain, M. I., & Rahaman, M. S. (2020). Mediating Effect of Customer Perceived Value on the Relationship between Service Quality and Customer Satisfaction of E-Banking in Bangladesh. International Journal of Advanced Science and Technology. 29(2), 3590 – 3606
Bhattacharjee, A., Jahanshahi, A. A., Polas, M. R. H., Hossain, M. I., & Asheq, A. S. (2019). Customer Care Service Management is Moving Forward to Achieve Sustainable Customer Retention in Every Industry. Does it play a Role to Increase Brand Retention. International Journal of Management and Sustainability, 8(2), 88-97.
Ben-lan, H. (2014). A Study of the Application of SVM in Prediction about Decrease in Bank's Customers. Financial Forum, 19(9), 70-74.
Cai-xian, Y. U., & Zhi-rong, Z. (2013). Mathematical modeling analysis of customer churn prediction in banks [Mathematical modeling and analysis on bank customer churn prediction]. Journal of Changchun University of Technology (Natural Science Edition), 34(1), 5-8. http://doi.org/10.3969/j.issn.1674-1374.2013.01.002
Dalvi, P. K., Khandge, S. K., Deomore, A., Bankar, A., & Kanade, V. A. (2016, 2016/1/1). Analysis of customer churn prediction in telecom industry using decision trees and logistic regression. Paper presented at the.
De Caigny, A., Coussement, K., & De Bock. (2018). A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees. European Journal of Operational Research(269 (2)), 760-772.
Ganesh, J., Arnold, M. J., & Reynolds, K. E. (2000). Understanding the customer base of service providers: An examination of the differences between switchers and stayers. Journal of Marketing, 64(3), 65-87.
Gervasi, O., Murgante, B., Misra, S., Garau, C., Ble?i?, I., Taniar, D., Apduhan, B. O., Rocha, A. M. A. C., Tarantino, E., & Torre, C. M. (2020). Machine Learning for Customer Churn Prediction in Retail Banking (12251, pp. 576-589). Springer International Publishing AG. http://doi.org/10.1007/978-3-030-58808-3_42.
Hossain, M. I., Limon, N., Amin, M. T., & Asheq, A. S. (2018). Work Life Balance Trends: A Study on Malaysian GenerationY Bankers. IOSR Journal of Business and Management, 20 (9), 01-09.
Jain, H., Khunteta, A., & Srivastava, S. (2020). Churn prediction in telecommunication using logistic regression and logit boost. Procedia Computer Science, 167, 101-112.
Jiyang, D. (2020). Research on Predicting Fund Customer churn Based on Decision Tree Automated Feature Selection: Reflections in the Post Epidemic Era. Shandong Social Science(09), 74-80.
John Britto, M., & Gobinath, R. (2020). WITHDRAWN: REHC: Reduction through exclusive homogeneous clusters for imbalance dataset. Materials Today: Proceedings http://doi.org/10.1016/j.matpr.2020.11.093
Jones, K. I., & Sah, S. (2023). The Implementation of Machine Learning In The Insurance Industry With Big Data Analytics. International Journal of Data Informatics and Intelligent Computing, 2(2), 21-38.
Khaled, A. S., Ahmed, S., Tabash, M. I., Al-Homaidi, E. A., & Hossain, M. I. (2019). The Impact of Technological and Marketing Innovations on Retailing Industry: Evidence of India. Journal of Reviews on Global Economics, 8, 948-957
Li, B., & Xie, J. (2020). Study on the Prediction of Imbalanced Bank Customer Churn Based on Generative Adversarial Network. Journal of Physics. Conference Series, 1624(3), 32054. http://doi.org/10.1088/1742-6596/1624/3/032054
Lin Rui, C. X. (2012). A Customer churn analysis model for banks based on artificial. Computer Knowledge and Technology(3)
Maldonado, S. (2015). Churn prediction via support vector classification: An empirical comparison. Intelligent Data Analysis, 19(s1), S135-S147.
Mavri, M., & Ioannou, G. (2008). Customer switching behaviour in Greek banking services using survival analysis. Managerial Finance, 34(3), 186-197.
Parvez, M. O. (2020). Use of machine learning technology for tourist and organizational services: high-tech innovation in the hospitality industry. Journal of Tourism Futures, 7(2), 240-244.
Polas, M. R. H., Juman ,M. K., Karim, A. M., Tabash, M. I., Hossain, M. I., (2020). Do Service Quality Dimensions Increase the Customer Brand Relationship among Gen Z? The Mediation Role of Customer Perception between the Service Quality Dimensions (SERVQUAL) and Brand Satisfaction. International Journal of Advanced Science and Technology. 29( 4), 1050-1070
Reinartz, W. J., & Kumar, V. (2003). The impact of customer relationship characteristics on profitable lifetime duration. Journal of Marketing, 67(1), 77-99.
Shehab, M., Abualigah, L., Shambour, Q., Abu-Hashem, M. A., Shambour, M. K. Y., Alsalibi, A. I., & Gandomi, A. H. (2022). Machine learning in medical applications: A review of state-of-the-art methods. Computers in Biology and Medicine, 145, 105458.
Shirazi, F., & Mohammadi, M. (2019). A big data analytics model for customer churn prediction in the retiree segment. International Journal of Information Management, 48, 238-253.
Tékouabou, S. C. K., Gherghina, ?. C., Toulni, H., Mata, P. N., & Martins, J. M. (2022). Towards Explainable Machine Learning for Bank Churn Prediction Using Data Balancing and Ensemble-Based Methods. Mathematics, 10(14), 2379.
http://doi.org/10.3390/math10142379
Tran, H., Le, N., & Nguyen, V. (2023). Customer Churn Prediction In The Banking Sector Using Machine Learning-Based Classification Models. Interdisciplinary Journal of Information, Knowledge & Management, 18
Ullah, I., Raza, B., Malik, A. K., Imran, M., Islam, S. U., & Kim, S. W. (2019). A churn prediction model using random forest: analysis of machine learning techniques for churn prediction and factor identification in telecom sector. Ieee Access, 7, 60134-60149.
Verma, P. (2020). Churn prediction for savings bank customers: A machine learning approach. Journal of Statistics Applications & Probability, 9(3), 535-547.
Wang, Y., & Yu, W. (2021, December). Research on the Influencing Factors of Consumer Experience Under the New Retail Mode of Fresh Food. In Proceedings of the 2021 5th International Conference on Computer Science and Artificial Intelligence (pp. 337-346).
Weiqing, W., Rao, Y., & Cheng, L. (2014). The influencing factors of customer churn in commercial banks: a study based on survival analysis method. Financial Forum, 19(01), 73-79.
Zhu, B., Baesens, B., Backiel, A., & Vanden Broucke, S. K. (2018). Benchmarking sampling techniques for imbalance learning in churn prediction. Journal of the Operational Research Society, 69(1), 49-65.
Zobair, K. M., Sanzogni, L., Houghton, L., & Islam, M. Z. (2021). Forecasting care seekers satisfaction with telemedicine using machine learning and structural equation modeling. Plos One, 16(9), e257300.
Shuofeng, C., Karim, A. M., & LinLi. (2024). A Multiclass Ensemble Learning Approach for Predicting Customer Churn in Commercial Banks. International Journal of Academic Research in Progressive Education and Development, 13(4), 787–804.
Copyright: © 2024 The Author(s)
Published by HRMARS (www.hrmars.com)
This article is published under the Creative Commons Attribution (CC BY 4.0) license. Anyone may reproduce, distribute, translate and create derivative works of this article (for both commercial and non-commercial purposes), subject to full attribution to the original publication and authors. The full terms of this license may be seen at: http://creativecommons.org/licences/by/4.0/legalcode