Building Reliable Loan Approval Systems: Leveraging Feature Engineering and Machine Learning

Maryam Shoaeinaeini(1), Milad Shoaeinaeini(2), Brent Harrison(3), Milad Jasemi(4)


(1) University of Kentucky
(2) Islamic Azad University, Islamshahr Branch
(3) University of Kentucky
(4) University of Montevallo

Abstract


Automating the loan approval process is essential in today's banking system. Even with the shift to online platforms, the traditional method remains cumbersome and requires a large amount of customer-related data. This study proposes a robust solution to overcome these challenges. Unlike previous studies, it introduces new financial indicators in the feature engineering stage to extract more informative client data, thereby improving prediction robustness and accuracy. To implement our integrated approach, an online dataset from a finance company is used. The dataset is preprocessed with several data preparation techniques, including cleaning, transformation, and feature engineering. The preprocessed data is then passed to a range of machine learning techniques: K-Nearest Neighbor, Decision Tree, Gaussian Naive Bayes, and Logistic Regression. Additionally, three ensemble methods (Random Forest, AdaBoost Classifier, and Gradient Boosting Classifier) are employed to further improve performance. The presented approach achieves its highest accuracy, 88%, with the AdaBoost Classifier. A comparison with the original preprocessed model using ROC curves and feature importance analysis demonstrates the superior performance of our approach, with a larger area under the ROC curve and a reduced false positive rate. Furthermore, the comparison shows that our model relies more strongly on financial features than on personal customer features, highlighting its robust classification performance. These results indicate the potential of our model to replace the current loan approval system in real-world applications.
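To illustrate the kind of feature engineering the abstract describes, the following minimal Python sketch derives a few financial indicators from raw applicant fields. The field names (`applicant_income`, `coapplicant_income`, `loan_amount`, `loan_term_months`) and the derived indicators are assumptions chosen for illustration; the abstract does not specify which indicators the authors introduce, so this should not be read as their method.

```python
from dataclasses import dataclass


@dataclass
class Applicant:
    # Hypothetical raw fields; the abstract does not list the dataset's columns.
    applicant_income: float    # monthly income of the applicant
    coapplicant_income: float  # monthly income of the co-applicant
    loan_amount: float         # requested principal
    loan_term_months: int      # repayment period in months


def engineer_features(a: Applicant) -> dict:
    """Derive illustrative (assumed) financial indicators from raw fields."""
    total_income = a.applicant_income + a.coapplicant_income
    # Simplified instalment estimate that ignores interest (an assumption).
    monthly_installment = a.loan_amount / a.loan_term_months
    # Guard against applicants with no recorded income.
    ratio = monthly_installment / total_income if total_income > 0 else float("inf")
    return {
        "total_income": total_income,
        "monthly_installment": monthly_installment,
        "installment_to_income": ratio,
        "balance_income": total_income - monthly_installment,
    }


features = engineer_features(Applicant(4000.0, 1000.0, 120000.0, 240))
```

Indicators of this kind combine several raw columns into a single measure of repayment capacity, which is one plausible way a model could come to rely more on financial features than on personal ones.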

Keywords


Loan Approval Prediction; Feature Engineering; Machine Learning; Ensemble Learning





Journal of Computer Science and Engineering (JCSE)
ISSN 2721-0251 (online)
Published by: ICSE (Institute of Computer Sciences and Engineering)
Website: http://icsejournal.com/index.php/JCSE/
Email: jcse@icsejournal.com

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.