TY - GEN
T1 - Evaluating Machine Learning and Deep Learning Analytics for Predicting Bankruptcy of Companies
AU - Sen, Prasenjit
AU - Assi, Sulaf
AU - Assi, Jolnar
AU - Liatsis, Panos
AU - Jayabalan, Manoj
AU - Al-Jumeily, Dhiya
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.
PY - 2024
Y1 - 2024
N2 - Corporate bankruptcy is a global issue that has increased over the last few years. Due to lack of adequate historical data, current models have not been able to correctly predict cases of bankruptcy. This research proposed a composite procedure at four stages which includes pre-processing and data rebalancing methods to curate the data, perform feature selection, use various machine learning and deep learning models to construct a robust predictive bankruptcy model and the use of explainable AI to understand the various features that contribute to the model prediction. Models were based on the labelled historic data of various bankrupted and non-bankrupted Polish companies between the period of 2007–2013. Prior to models’ application, data were split to training and test sets that consisted of 25,122 and 12,375 datapoints, respectively. Models were evaluated on various metrics: ROC-AUC, recall, and F-Beta score to determine the best predictive model. Our comparative study showcases that missing data imputation performed using KNN Imputer, skewness reduction using Yeo-Johnson transformation, feature elimination using Recursive Feature Elimination technique along with cost sensitive learning used in tandem with XGBoost algorithm produces the best model with test AUC score of 96.1%, recall score as 96%, and F-Beta score of 92.42%. Implementation of Explainable AI has also helped in realizing top four significant features that impact negatively to bankruptcy prediction globally and locally across all the models created are the ratios “total_cost_overtotal_sales”, “gross_profit_in_3_years_over_total_assests”, “profit_on_sales_over_sales”, and “profit_on_sales_over_total_assests”. Such insights on the classification outcome which instils confidence amongst the decision makers about the validity of the model and its prediction capabilities.
AB - Corporate bankruptcy is a global issue that has increased over the last few years. Due to lack of adequate historical data, current models have not been able to correctly predict cases of bankruptcy. This research proposed a composite procedure at four stages which includes pre-processing and data rebalancing methods to curate the data, perform feature selection, use various machine learning and deep learning models to construct a robust predictive bankruptcy model and the use of explainable AI to understand the various features that contribute to the model prediction. Models were based on the labelled historic data of various bankrupted and non-bankrupted Polish companies between the period of 2007–2013. Prior to models’ application, data were split to training and test sets that consisted of 25,122 and 12,375 datapoints, respectively. Models were evaluated on various metrics: ROC-AUC, recall, and F-Beta score to determine the best predictive model. Our comparative study showcases that missing data imputation performed using KNN Imputer, skewness reduction using Yeo-Johnson transformation, feature elimination using Recursive Feature Elimination technique along with cost sensitive learning used in tandem with XGBoost algorithm produces the best model with test AUC score of 96.1%, recall score as 96%, and F-Beta score of 92.42%. Implementation of Explainable AI has also helped in realizing top four significant features that impact negatively to bankruptcy prediction globally and locally across all the models created are the ratios “total_cost_overtotal_sales”, “gross_profit_in_3_years_over_total_assests”, “profit_on_sales_over_sales”, and “profit_on_sales_over_total_assests”. Such insights on the classification outcome which instils confidence amongst the decision makers about the validity of the model and its prediction capabilities.
KW - Classification
KW - Corporate bankruptcy
KW - Explainable AI
KW - Machine learning
UR - http://www.scopus.com/inward/record.url?scp=85187803629&partnerID=8YFLogxK
U2 - 10.1007/978-981-99-8498-5_32
DO - 10.1007/978-981-99-8498-5_32
M3 - Conference contribution
AN - SCOPUS:85187803629
SN - 9789819984978
T3 - Lecture Notes in Networks and Systems
SP - 407
EP - 419
BT - Advances in Intelligent Manufacturing and Robotics - Selected Articles from ICIMR 2023
A2 - Tan, Andrew
A2 - Zhu, Fan
A2 - Jiang, Haochuan
A2 - Mostafa, Kazi
A2 - Yap, Eng Hwa
A2 - Chen, Leo
A2 - Olule, Lillian J. A.
A2 - Myung, Hyun
PB - Springer Science and Business Media Deutschland GmbH
T2 - International Conference on Intelligent Manufacturing and Robotics, ICIMR 2023
Y2 - 22 August 2023 through 23 August 2023
ER -