A Framework for Business Failure Prediction

Islek, Irem; Atakli, Idris Murat; Oguducu, Sule Gunduz

doi:10.1007/978-3-319-59060-8_8

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10246))

Included in the following conference series:

International Conference on Artificial Intelligence and Soft Computing

Abstract

Business failure prediction systems help predict financial failures before they actually happen and provide an early warning for enterprises. Using machine learning techniques instead of traditional statistical models has brought a considerable increase in performance to the area of business failure prediction. This paper presents a framework for predicting business failures using different machine learning techniques. We also implemented a novel model for business failure prediction, included in this framework, based on the NARX (nonlinear autoregressive network with exogenous inputs) feedback neural network, which is a recurrent dynamic network with feedback connections. Detailed experiments are conducted to compare the performance of these approaches. In particular, no other papers investigate the performance of NARX for long-term business failure prediction. To the best of our knowledge, this is the first time the NARX algorithm has been applied to long-term business failure prediction.

1 Introduction

From the 1960s to the present, researchers have paid a great deal of attention to finding a successful way of predicting business failures. The task can be described as developing a methodology to predict financial distress using several existing financial features of an enterprise. Business failure prediction, which is also known as financial distress prediction or firm failure prediction, is of considerable importance to shareholders, investors, credit managers, etc. Business failure prediction models alert a stakeholder or a manager to take timely precautions to prevent failures before they occur. For investors, such a model provides vital information that helps them decide whether to invest in a firm or not. In other words, the model reduces the risk of poor investment decisions and prevents financial loss. The model can also be used by credit managers to evaluate the level of risk and the credit limit for an enterprise.

There are many studies in the literature on business failure prediction. The first study on this topic was carried out by Beaver in 1966 [1]. Beaver used univariate analysis to forecast bankruptcy. After that, Altman proposed multivariate discriminant analysis to solve this problem [2]. Most of the subsequent studies were based on Altman’s study. After 1980, different types of regression models, such as logit and probit, were proposed to develop a model that can predict business failures accurately. Afterwards, machine learning algorithms were introduced as alternatives to the statistical models. Most of the recent studies compare traditional statistical models with machine learning models or combine several models in one methodology [3,4,5,6,7,8]. In general, the reported results show that machine learning algorithms outperform statistical models in predicting business failure.

In this study, we propose a framework for predicting business failures. This framework contains nine different prediction models, namely Logistic Regression, Multilayer Perceptron (MLP), Sequential Minimal Optimization (SMO), Bayesian Network, Naive Bayes, J48, Random Forest, Random Tree and the NARX (nonlinear autoregressive network with exogenous inputs) feedback neural network. To the best of our knowledge, NARX has never been used for business failure prediction before this study. In addition, the framework makes multistep-ahead prediction possible through the NARX model. For evaluation purposes, the nine models were applied to the same datasets within the same framework, and the results obtained are reported in detail.

The paper is organized as follows: In Sect. 2, we review the related work. Details of the constructed datasets and the proposed methodology are given in Sect. 3. The performances of the applied methodologies are evaluated and compared in Sect. 4. The paper is concluded in Sect. 5 by summarizing the achievements and giving future directions.

2 Related Work

Financial distress prediction has remained highly popular since the 1960s. After Altman’s multivariate discriminant analysis, Ohlson proposed logit analysis for bankruptcy prediction for the first time [9].

After that, machine learning algorithms came into use as an alternative to statistical models. For instance, neural networks were used in numerous studies to predict business failure [3,4,5, 10, 11]. In these studies, neural networks were compared with traditional statistical models such as multivariate discriminant analysis. Most of these studies claim that neural networks performed better than discriminant analysis. SVM has also been used in several studies for predicting business failures, and it was found to outperform the classical methods [12, 13]. Another popular machine learning approach used for firm failure prediction is tree algorithms such as ID3 and decision trees [14, 15]. In these studies, tree algorithms were compared with discriminant analysis and provided better results than statistical models. Based on this literature review, we can say that machine learning models generally outperform traditional statistical models such as multivariate discriminant analysis.

Combining a model with other models to compensate for its weak points is a common approach in machine learning studies. In this direction, researchers compared neural networks to decision trees, SVM and majority voting, and concluded that neural networks were the best method for forecasting financial distress [7]. Azayite and Achchab composed a hybrid model based on discriminant analysis, a back-propagation neural network and self-organizing maps [8]. They applied the hybrid model to Moroccan firms and claimed that it outperformed discriminant analysis. Wu et al. proposed a genetic-algorithm-based SVM to predict bankruptcy [16]. This methodology was tested on a dataset from Taiwan and compared with discriminant analysis, logit, probit, neural networks and traditional SVM. According to the experimental results, the proposed hybrid methodology gave the best predictive accuracy. Another hybrid study brought together SVM and logistic regression [17]. The methodology modified the outputs of the SVM classifiers according to the result of the logistic regression analysis. In [18], single classifiers were trained by SVM algorithms with different kernel functions on different feature subsets of one initial dataset. This ensemble SVM provided better performance than an individual SVM classifier. Lin et al. proposed another hybrid method, which combines locally linear embedding (LLE) and SVM to predict firm failures [19].

Even though the big data approach is extremely popular, it is rarely used for predicting business failures. In the literature, there is only one study that uses a big data approach for business failure prediction [20]. The reason may be that it is quite difficult to obtain huge amounts of data for business failure prediction.

In this study, we propose a framework for business failure prediction and make the following contributions:

Our framework contains the NARX network algorithm, which has never been used for business failure prediction before.
Thanks to the NARX network, multistep-ahead prediction can be performed in addition to one-step-ahead prediction.
Due to its flexible structure, the proposed framework can be used not only for business failure prediction but also for other suitable prediction problems in areas such as finance and biomedicine.

3 The Dataset and the Proposed Framework

3.1 Details of Dataset

Financial statements of enterprises registered with IMKB BIST [21] are published periodically on the Public Disclosure Platform. Deteriorated firms are also announced on the Public Disclosure Platform. The datasets for our study are derived from these resources. From these datasets, 10 different financial ratios are defined as input variables. These variables are selected according to Aktan’s study, which identifies the 10 best financial ratios for bankruptcy prediction among 53 financial ratios [22]. The selected financial ratios can be seen in Table 1.

Table 1. Selected financial ratios

Class values, which correspond to the financial status of firms, are defined as good, bad and very bad in the constructed datasets.

In the first dataset, input variables and class values are calculated for quarterly periods. A second dataset is constructed using yearly values of the selected variables.

3.2 Proposed Framework

The proposed framework contains three main steps: Data Preparation, Prediction and Evaluation, as shown in Fig. 1.

Data Preparation Step. In the data preparation step, data rows that include null values for some financial ratios are removed from the dataset. After cleaning, the 10 financial ratios are calculated from several financial variables. Lastly, a matrix data structure is composed from the calculated financial ratios and class values.
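The data preparation step can be illustrated with a minimal Python sketch. It is not the implementation used in the study: the field names (`current_assets`, `total_debt`, etc.), the two ratios and the helper names are hypothetical stand-ins for the 10 ratios of Table 1, chosen only to show the order of operations (drop incomplete rows, compute ratios, build the matrix).

```python
# Illustrative sketch only: field names and the two ratios are assumptions,
# standing in for the 10 financial ratios selected in the paper.

def drop_incomplete(rows, required):
    """Keep only the rows in which every required field is present."""
    return [r for r in rows if all(r.get(k) is not None for k in required)]

def compute_ratios(row):
    """Turn raw financial variables into a vector of financial ratios."""
    return [
        row["current_assets"] / row["current_liabilities"],
        row["total_debt"] / row["total_assets"],
    ]

def build_matrix(rows, required, label_key="status"):
    """Return (X, y): one ratio vector and one class value per clean row."""
    clean = drop_incomplete(rows, required)
    X = [compute_ratios(r) for r in clean]
    y = [r[label_key] for r in clean]
    return X, y

REQUIRED = ["current_assets", "current_liabilities", "total_debt", "total_assets"]
raw = [
    {"current_assets": 200.0, "current_liabilities": 100.0,
     "total_debt": 50.0, "total_assets": 200.0, "status": "good"},
    {"current_assets": None, "current_liabilities": 80.0,
     "total_debt": 90.0, "total_assets": 100.0, "status": "bad"},
    {"current_assets": 50.0, "current_liabilities": 100.0,
     "total_debt": 150.0, "total_assets": 100.0, "status": "very bad"},
]
X, y = build_matrix(raw, REQUIRED)
```

In this toy input the second row is discarded because one of its variables is missing, so `X` holds two ratio vectors and `y` the two matching class values.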

Prediction Step. This step is responsible for producing the business failure prediction results. For this purpose, we constructed Logistic Regression [23], Multilayer Perceptron [24], Sequential Minimal Optimization [25], Bayesian Network [26], Naive Bayes [27], J48 [28], Random Forest [29], Random Tree [30] and NARX models in this step. One of these nine prediction models must be selected to continue with this step of the framework. The selected model is then trained on the given data, and the prediction results are produced by the trained model.

Due to page limitations, we only briefly explain the NARX model, which has not been employed for business failure prediction before.

NARX, which is a dynamic network, is useful for time series modeling. As can be seen in Eq. 1, the previous output values of the network and the previous values of the input parameters are used to produce the next value of the output.

$$\begin{aligned} y(t) = f(y(t - 1), y(t - 2), \ldots, y(t - n_y), u(t - 1), u(t - 2), \ldots, u(t - n_u)) \end{aligned} \tag{1}$$

In this equation, u represents the training inputs, y represents the target variable to be predicted, and t is the discrete time step. To predict the next value y(t), the previous values of the exogenous input and the previous values of the output are regressed together through the function f. A general NARX network architecture can be seen in Fig. 2.
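To make the argument list of Eq. 1 concrete, the following Python sketch assembles the regressor vector that is passed to f at time step t. It only illustrates the indexing; the function name `narx_regressor` and the toy values are assumptions, and f itself (a neural network in the paper) is left out.

```python
def narx_regressor(y_hist, u_hist, t, n_y, n_u):
    """Build the argument list of Eq. 1 for time step t.

    y_hist[k] is the scalar output y(k); u_hist[k] is the exogenous input
    vector u(k). The result is
    [y(t-1), ..., y(t-n_y), u(t-1)..., ..., u(t-n_u)...] flattened.
    """
    if t < max(n_y, n_u):
        raise ValueError("not enough history for the requested lags")
    past_y = [y_hist[t - k] for k in range(1, n_y + 1)]
    past_u = [v for k in range(1, n_u + 1) for v in u_hist[t - k]]
    return past_y + past_u

# Toy series with two exogenous inputs per time step (hypothetical values).
y_hist = [0.0, 1.0, 2.0]
u_hist = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]

# Regressor for y(2) with output lag n_y = 2 and input lag n_u = 1.
x = narx_regressor(y_hist, u_hist, t=2, n_y=2, n_u=1)
```

Here `x` contains y(1), y(0) and the two components of u(1), in that order, which is exactly the input f would receive to produce y(2).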

There are two types of NARX network: the series-parallel architecture and the parallel architecture. The series-parallel architecture, also called open-loop, uses the existing (measured) output as one of the network inputs. The parallel architecture (closed-loop) uses the output produced by the previous iteration as one of the network inputs.

First, the series-parallel architecture is constructed to train the network. In this network, the inputs are the selected financial ratios (u1(t), u2(t), .., u10(t)) and the existing outputs (y(t)). Series-parallel NARX completes the training phase in a shorter time than parallel NARX because it uses the existing output values.

Afterwards, the trained network is transformed to the parallel architecture in the prediction step. The reason for using the parallel NARX network is that it makes multi-step-ahead prediction possible.
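The difference between the two architectures can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not the MATLAB network used in the study: `f` is a hand-written linear function standing in for the trained network, and the exogenous inputs `u` are assumed to be available for every time step that the closed loop visits.

```python
def regressor(y, u, t, n_y, n_u):
    """Argument list of Eq. 1: past outputs followed by past input vectors."""
    past_y = [y[t - k] for k in range(1, n_y + 1)]
    past_u = [v for k in range(1, n_u + 1) for v in u[t - k]]
    return past_y + past_u

def open_loop(f, y_true, u, n_y, n_u):
    """Series-parallel: every prediction is fed the measured past outputs."""
    start = max(n_y, n_u)
    return [f(regressor(y_true, u, t, n_y, n_u)) for t in range(start, len(y_true))]

def closed_loop(f, y_init, u, steps, n_y, n_u):
    """Parallel: predictions are fed back, which allows multi-step-ahead output."""
    if len(y_init) < max(n_y, n_u):
        raise ValueError("y_init must cover the largest lag")
    y = list(y_init)
    for _ in range(steps):
        y.append(f(regressor(y, u, len(y), n_y, n_u)))
    return y[len(y_init):]

def f(x):
    """Toy stand-in for the trained network (assumption, not the paper's model)."""
    return 0.5 * x[0] + x[1]

y_true = [1.0, 2.0, 4.0, 8.0]
u = [[0.0], [1.0], [1.0], [1.0]]

one_step = open_loop(f, y_true, u, n_y=1, n_u=1)
multi_step = closed_loop(f, y_init=[1.0], u=u, steps=3, n_y=1, n_u=1)
```

Both calls agree on the first prediction, since only measured values are available at that point. From the second step onward the closed loop consumes its own earlier predictions, so any error it makes is carried forward into later steps.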

Evaluation Step. Accuracy, Type I error and Type II error are measured to assess the performance of the applied algorithms. Brief descriptions of them are given below:

Accuracy is the ratio of the number of correct predictions to the total number of predictions.

Type I error (false positive) means predicting a firm’s financial status as good when it is actually bad or very bad. Predicting a firm’s financial status as bad when it is actually very bad is also a Type I error.

Type II error (false negative) means predicting a firm’s financial status as bad when it is actually good. Predicting a firm’s status as very bad when it is actually bad or good is also a Type II error.

Comparing the two, Type I error is more significant than Type II error for our problem. If a firm’s financial status is bad or very bad but our methodology says that it is good, the firm’s managers will not take the necessary precautions and the firm may end up in bankruptcy.
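The three measures can be sketched in Python by ordering the classes by severity. The definitions of Type I (prediction more optimistic than reality) and Type II (prediction more pessimistic than reality) follow the descriptions above; dividing each error count by the total number of predictions is an assumption of this sketch, since the exact denominators are not stated here.

```python
# Severity ordering of the three class values used in the datasets.
SEVERITY = {"good": 0, "bad": 1, "very bad": 2}

def evaluate(actual, predicted):
    """Return accuracy, Type I and Type II rates over all predictions.

    Type I:  predicted status is better than the actual status.
    Type II: predicted status is worse than the actual status.
    Rates are counts divided by the number of predictions (an assumption).
    """
    if len(actual) != len(predicted) or not actual:
        raise ValueError("need two non-empty label lists of equal length")
    correct = type_1 = type_2 = 0
    for a, p in zip(actual, predicted):
        if SEVERITY[p] == SEVERITY[a]:
            correct += 1
        elif SEVERITY[p] < SEVERITY[a]:
            type_1 += 1
        else:
            type_2 += 1
    n = len(actual)
    return {"accuracy": correct / n, "type_1": type_1 / n, "type_2": type_2 / n}

actual = ["good", "bad", "very bad", "bad"]
predicted = ["good", "good", "bad", "very bad"]
scores = evaluate(actual, predicted)
```

In this toy example one prediction is correct, two are too optimistic (Type I) and one is too pessimistic (Type II).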

4 Performance Evaluation Results

As mentioned before, two separate datasets are constructed from the raw data. The first one contains quarterly data and the second one contains annual data. In both datasets, the 2015 data is used for testing. The test sets of the quarter-period dataset and the annual dataset contain 222 and 66 samples, respectively. Class values for the datasets are defined as good, bad and very bad. The optimal parameters are determined using a validation set that consists of the 2014 data.
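A time-based split of this kind can be sketched as follows. The use of 2014 for validation and 2015 for testing comes from the text; treating all earlier years as training data, as well as the `year` field and the toy records, are assumptions of this sketch.

```python
def split_by_year(rows, validation_year=2014, test_year=2015):
    """Split records chronologically so no later period leaks into training.

    Assumption: everything before the validation year is training data.
    """
    train = [r for r in rows if r["year"] < validation_year]
    validation = [r for r in rows if r["year"] == validation_year]
    test = [r for r in rows if r["year"] == test_year]
    return train, validation, test

# Hypothetical records: one row per firm and year.
rows = [
    {"firm": "A", "year": 2012, "status": "good"},
    {"firm": "A", "year": 2013, "status": "good"},
    {"firm": "A", "year": 2014, "status": "bad"},
    {"firm": "A", "year": 2015, "status": "very bad"},
    {"firm": "B", "year": 2014, "status": "good"},
    {"firm": "B", "year": 2015, "status": "good"},
]
train, validation, test = split_by_year(rows)
```

Splitting by period rather than at random keeps the evaluation faithful to the forecasting setting, where the model never sees data from the period it is asked to predict.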

Table 2. Comparison results for quarter-period dataset

The constructed NARX network contains one hidden layer with 10 neurons, and the Levenberg-Marquardt algorithm [31] is used to train the network. The 10 financial ratios given in Table 1 are used as inputs, and the network has a single output, which corresponds to the financial status of the firm. The transfer function of the NARX model is the sigmoid function. In Eq. 1, $n_y$ and $n_u$ are the lags of the output and the input of our NARX model, respectively. $n = 1$ means one-step-ahead prediction, while any larger value of $n$ means multi-step-ahead prediction (if $n = 2$, the model predicts the value 2 steps ahead).

All prediction algorithms other than NARX are applied using Weka, whereas the NARX algorithm is implemented in MATLAB. The results of the applied methods for the first dataset (quarter-period dataset) are given in Table 2.

As can be seen in Table 2, Random Forest gives the best accuracy for the quarter-period dataset. It also yields the lowest Type I and Type II error rates for this dataset. One-step-ahead NARX provides the second-best results for accuracy, Type I and Type II error rates.

As shown in Table 3, one-step-ahead NARX gives the best accuracy for the annual dataset. For Type I error, Random Forest outperforms one-step-ahead NARX. For Type II error, NARX gives the lowest error rate.

Table 3. Comparison results for annual dataset

Random Forest gives satisfactory results because it is an ensemble learning method. It contains a multitude of decision trees, and the prediction is chosen by a voting mechanism. The ensemble learning approach is based on obtaining highly accurate classifiers by combining less accurate ones.

In addition to Random Forest, one-step-ahead NARX also gives better results than the other prediction models of the framework on our datasets. NARX is commonly used for time-series-based prediction, and our datasets have a temporal ordering for the financial ratios.

Since this framework also provides multi-step-ahead business failure prediction, we carried out additional experiments for multi-step-ahead prediction using the parallel NARX network. Detailed results of these experiments are given in Tables 4 and 5.

In Table 4, the 5-step-ahead prediction gives the result for one year later in the quarter-period dataset. As can be seen from Table 4, the accuracy of 3-step-ahead NARX is lower than expected. We suspect that this decrease is caused by the imbalanced dataset of the 3-step-ahead test.
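The voting mechanism can be illustrated with a small Python sketch. It shows only the combination step: the three "trees" are hand-written threshold rules with hypothetical ratio names, whereas a real Random Forest learns its trees from bootstrap samples and random feature subsets, which this sketch omits.

```python
from collections import Counter

def majority_vote(labels):
    """Return the most frequent label; ties go to the label seen first."""
    return Counter(labels).most_common(1)[0][0]

# Three deliberately weak, hand-written classifiers (illustrative only).
def tree_a(x):
    return "bad" if x["debt_ratio"] > 0.8 else "good"

def tree_b(x):
    return "bad" if x["current_ratio"] < 1.0 else "good"

def tree_c(x):
    return "bad" if x["debt_ratio"] > 0.5 else "good"

def ensemble_predict(trees, x):
    """Ask every tree for a label and return the majority decision."""
    return majority_vote([tree(x) for tree in trees])

firm = {"debt_ratio": 0.7, "current_ratio": 0.9}
prediction = ensemble_predict([tree_a, tree_b, tree_c], firm)
```

For this firm the first rule votes good while the other two vote bad, so the ensemble returns bad even though one of its members disagrees.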

Table 4. Comparison results for one step and multistep ahead NARX for quarter-period dataset

Table 5. Comparison results for one step and multistep ahead NARX for annual dataset

In Table 5, each step corresponds to one year, so the 5-step-ahead prediction gives results for five years later. Not surprisingly, prediction performance, in terms of accuracy and Type I and Type II error rates, deteriorates year after year. Long-term business failure prediction is challenging because political and societal changes also play a role in business failure, and such changes are difficult to predict.

5 Conclusions

In this study, we presented a framework for business failure prediction. To achieve that, Logistic Regression, Multilayer Perceptron, Sequential Minimal Optimization, Bayesian Network, Naive Bayes, J48 Tree, Random Forest, Random Tree and NARX models were constructed within this framework. We also want to emphasize that, to the best of our knowledge, this is the first study that uses NARX for business failure prediction. All prediction models of the framework were tested separately using two different datasets that contain firms from Turkey. The first dataset uses quarterly data, while the second one uses annual data for the financial ratios and class values.

In conclusion, the proposed framework is useful for business failure prediction: it allows a suitable prediction model for a given dataset to be chosen easily. Moreover, the NARX model makes multi-step-ahead business failure prediction possible.

References

1. Beaver, W.H.: Financial ratios as predictors of failure, empirical research in accounting: selected studies. J. Acc. Res. 5, 179–199 (1966)
2. Altman, E.I.: Financial ratios, discriminant analysis and the prediction of corporate bankruptcy. J. Financ. 23, 589–609 (1968)
3. Tam, K.Y., Kiang, M.Y.: Managerial applications of neural networks: the case of bank failure predictions. Manage. Sci. 38, 926–947 (1992)
4. Coats, P.K., Fant, L.F.: Recognizing financial distress patterns using neural network tool. Financ. Manage. 22, 142–155 (1993)
5. Wilson, R.L., Sharda, R.: Bankruptcy prediction using neural networks. Decis. Sci. 11, 545–557 (1994)
6. Ahn, B.S., Cho, S.S., Kim, C.Y.: The integrated methodology of rough set theory and artificial neural network for business failure prediction. Expert Syst. Appl. 18(2), 65–74 (2000)
7. Geng, R., Bose, I., Chen, X.: Prediction of financial distress: an empirical study of listed Chinese companies using data mining. Eur. J. Oper. Res. 241(1), 236–247 (2015)
8. Azayite, F.Z., Achchab, S.: Hybrid discriminant neural networks for bankruptcy prediction and risk scoring. Procedia Comput. Sci. 83, 670–674 (2016)
9. Ohlson, J.: Financial ratios and the probabilistic prediction of bankruptcy. J. Account. Res. 18, 109–131 (1980)
10. Odom, M.D., Sharda, R.A.: A neural networks model for bankruptcy prediction. In: The 2nd IEEE International Joint Conference on Neural Network, pp. 163–168 (1990)
11. Altman, E.I., Marco, G., Varetto, F.: Corporate distress diagnosis: comparisons using linear discriminate analysis and neural networks. J. Bank. Financ. 18, 505–529 (1994)
12. Gestel, T.V., Baesens, B., Suykens, J., Espinoza, M., Baestaens, D.E., Vanthienen, J., De Moor, B.: Bankruptcy prediction with least squares support vector machine classifiers. In: Computational Intelligence for Financial Engineering, pp. 1–8. IEEE Press (2003)
13. Shin, K.S., Lee, T.S., Kim, H.: An application of support vector machines in bankruptcy prediction model. Expert Syst. Appl. 28(1), 127–135 (2005)
14. Messier, W.F., Hansen, J.V.: Inducing rules for expert system development: an example using default and bankruptcy data. Manage. Sci. 34(12), 1403–1415 (1988)
15. Gepp, A., Kumar, K., Bhattacharya, S.: Business failure prediction using decision trees. J. Forecast. 29(6), 536–555 (2010)
16. Wu, C.H., Tzeng, G.H., Goo, Y.J., Fang, W.C.: A real-valued genetic algorithm to optimize the parameters of support vector machine for predicting bankruptcy. Expert Syst. Appl. 32(2), 397–408 (2007)
17. Hua, Z., Wang, Y., Xu, X., Zhang, B., Liang, L.: Predicting corporate financial distress based on integration of support vector machine and logistic regression. Expert Syst. Appl. 33(2), 434–440 (2007)
18. Sun, J., Li, H.: Financial distress prediction using support vector machines: ensemble vs. individual. Appl. Soft Comput. 12(8), 2254–2265 (2012)
19. Lin, F., Yeh, C.C., Lee, M.Y.: A hybrid business failure prediction model using locally linear embedding and support vector machines. Romanian J. Econ. Forecast. 16(1), 82–97 (2013)
20. Hafiz, A., Lukumon, O., Muhammad, B., Olugbenga, A., Hakeem, O., Saheed, A.: Bankruptcy prediction of construction businesses: towards a big data analytics approach. In: Big Data Computing Service and Applications (BigDataService), pp. 347–352. IEEE Press (2015)
21. IMKB BIST. http://www.borsaistanbul.com
22. Aktan, S.: Application of machine learning algorithms for business failure prediction. Investment Manage. Financ. Innov. 8(2), 52–65 (2011)
23. Hosmer Jr., D.W., Lemeshow, S.: Applied Logistic Regression. Wiley, New York (2004)
24. Alpaydin, E.: Introduction to Machine Learning. The MIT Press, Cambridge (2004)
25. Platt, J.: Sequential Minimal Optimization: A Fast Algorithm for Training Support Vector Machines (1998)
26. Friedman, N., Geiger, D., Goldszmidt, M.: Bayesian Network Classifiers. Mach. Learn. 29(2-3), 131–163 (1997)
27. Mitchell, T.M.: Machine Learning. McGraw-Hill, Maidenhead (1997)
28. Quinlan, J.R.: C4.5: Programs for Machine Learning. Elsevier, New York (2014)
29. Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
30. Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. SIGKDD Explor. 11(1), 10–18 (2009)
31. Weisstein, E.W.: Levenberg-Marquardt Method. From MathWorld-A Wolfram Web Resource. http://mathworld.wolfram.com/Levenberg-MarquardtMethod.html

Acknowledgments

This research was partially supported by The Scientific and Technological Research Council of Turkey (TUBITAK) under TEYDEB grant 3150156.

Author information

Authors and Affiliations

Idea Teknoloji Cozumleri, Istanbul, Turkey
Irem Islek & Idris Murat Atakli
Department of Computer Engineering, Istanbul Technical University, Istanbul, Turkey
Sule Gunduz Oguducu

Corresponding author

Correspondence to Irem Islek.

Editor information

Editors and Affiliations

Częstochowa University of Technology, Częstochowa, Poland
Leszek Rutkowski
Częstochowa University of Technology, Częstochowa, Poland
Marcin Korytkowski
Częstochowa University of Technology, Częstochowa, Poland
Rafał Scherer
AGH University of Science and Technology, Kraków, Poland
Ryszard Tadeusiewicz
University of California, Berkeley, California, USA
Lotfi A. Zadeh
University of Louisville, Louisville, Kentucky, USA
Jacek M. Zurada

About this paper

Cite this paper

Islek, I., Atakli, I.M., Oguducu, S.G. (2017). A Framework for Business Failure Prediction. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L., Zurada, J. (eds) Artificial Intelligence and Soft Computing. ICAISC 2017. Lecture Notes in Computer Science(), vol 10246. Springer, Cham. https://doi.org/10.1007/978-3-319-59060-8_8

DOI: https://doi.org/10.1007/978-3-319-59060-8_8
Published: 24 May 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59059-2
Online ISBN: 978-3-319-59060-8
eBook Packages: Computer Science, Computer Science (R0)
