Predicting Super-Efficiency of Commercial Bank Branches Using Regression Models
Abstract
This study focuses on predicting the super-efficiency scores of commercial bank branches by employing various regression models. The analysis is conducted on a dataset comprising 375 bank branches from the fiscal year 2017, utilizing a range of financial, operational, and cost-related indicators as input features. A suite of regression techniques, including linear regression, ensemble methods such as Random Forest and XGBoost, as well as neural network models, is implemented to estimate the super-efficiency values. Model performance is assessed through metrics including Mean Absolute Error (MAE) and the coefficient of determination (R²). The findings reveal that non-linear models, especially ensemble-based algorithms, outperform linear models in terms of accuracy and generalizability. This regression framework offers a robust decision-support tool for evaluating and benchmarking the operational efficiency of bank branches.
Keywords:
Data envelopment analysis, Machine learning, Commercial banks, Bank branch performance evaluationReferences
- [1] Mozaffari, M. R., Kamyab, P., Jablonsky, J., & Gerami, J. (2014). Cost and revenue efficiency in DEA-R models. Computers & industrial engineering, 78, 188–194. https://doi.org/10.1016/j.cie.2014.10.001
- [2] Gerami, J., Mozaffari, M. R., Wanke, P. F., & Correa, H. (2022). A novel slacks-based model for efficiency and super-efficiency in DEA-R. Operational research, 22(4), 3373–3410. https://doi.org/10.1007/s12351-021-00679-6
- [3] Mozaffari, M. R., Dadkhah, F., Jablonsky, J., & Wanke, P. F. (2020). Finding efficient surfaces in DEA-R models. Applied mathematics and computation, 386, 125497. https://doi.org/10.1016/j.amc.2020.125497
- [4] Mozaffari, M. R., Mohammadi, S., Wanke, P. F., & Correa, H. L. (2021). Towards greener petrochemical production: Two-stage network data envelopment analysis in a fully fuzzy environment in the presence of undesirable outputs. Expert systems with applications, 164, 113903. https://doi.org/10.1016/j.eswa.2020.113903
- [5] Noura, A. A., Hosseinzadeh Lotfi, F., Jahanshahloo, G. R., Rashidi, S. F., & Parker, B. R. (2010). A new method for measuring congestion in data envelopment analysis. Socio-economic planning sciences, 44(4), 240–246. https://doi.org/10.1016/j.seps.2010.06.003
- [6] Noura, A. A., Hosseinzadeh Lotfi, F., Jahanshahloo, G. R., & Fanati Rashidi, S. (2011). Super-efficiency in DEA by effectiveness of each unit in society. Applied mathematics letters, 24(5), 623–626. https://doi.org/10.1016/j.aml.2010.11.025
- [7] Rashidi, S. F., & Barati, R. (2014). On the comparison of supply chain with sub-Dmus in Dea. Advances in environmental biology, 2387–2391. https://b2n.ir/mx3313
- [8] Rashidi, S. F. (2015). Evaluation of productivity indicators in the oil industry by using multi-attribute decision making approach (MADM). International journal of advanced and applied sciences, 2(6), 25–31. https://b2n.ir/zn7859
- [9] Barati, R., & Fanati Rashidi, S. (2024). Fuzzy AHP and fuzzy TOPSIS synergy for ranking the factor influencing employee turnover intention in the Iran hotel industry. Journal of applied research on industrial engineering, 11(1), 57–75. https://doi.org/10.22105/jarie.2022.336603.1464
- [10] Aigner, D., Lovell, C. A. K., & Schmidt, P. (1977). Formulation and estimation of stochastic frontier production function models. Journal of econometrics, 6(1), 21–37. https://doi.org/10.1016/0304-4076(77)90052-5
- [11] Aparicio, J., Barbero, J., Kapelko, M., Pastor, J. T., & Zofío, J. L. (2017). Testing the consistency and feasibility of the standard Malmquist-Luenberger index: Environmental productivity in world air emissions. Journal of environmental management, 196, 148–160. https://doi.org/10.1016/j.jenvman.2017.03.007
- [12] Aparicio, J., Esteve, M., & Kapelko, M. (2023). Measuring dynamic inefficiency through machine learning techniques. Expert systems with applications, 228, 120417. https://doi.org/10.1016/j.eswa.2023.120417
- [13] Guerrero, N. M., Aparicio, J., & Valero-Carreras, D. (2022). Combining data envelopment analysis and machine learning. Mathematics, 10(6), 909. https://doi.org/10.3390/math10060909
- [14] Guillen, M. D., Aparicio, J., & Esteve, M. (2023). Gradient tree boosting and the estimation of production frontiers. Expert systems with applications, 214, 119134. https://doi.org/10.1016/j.eswa.2022.119134