Keywords

1 Introduction

Prediction of financial distress of a company on the basis of financial analysis is nowadays an important and interesting issue. This prediction is based on estimating of whether the analyzed company is in a state of financial threats or not or whether its next development leads to bankruptcy or not. For the purpose of early detection of imminent problems of an analyzed company, prediction models are often used. Their task is to evaluate the financial health of the company on the basis of selected financial indicators or characteristics of the company and then to identify imminent bankruptcy in advance (Misankova and Kral 2015). The identification of the impending problems of the company can be important not only for company owners but also for business partners, potential or existing creditors, or employees (Bartosova et al. 2014).

In recent years many authors in different countries devote to the creation of prediction models for predicting imminent bankruptcy. Prediction of corporate financial distress and bankruptcy is an interesting topic for financial researchers. The first prediction models were constructed in the late 1960s by well-known authors, such as Beaver (1966), Altman (1968), Ohlson (1980), Springate et al. (1983), Zmijewski (1984), Fulmer et al. (1984), Shirata (1995), Dimitras et al. (1996), Shumway (2001), and others. Currently, there exist hundreds of prediction models developed at a particular time and in conditions of particular economies. Many of them are used in economic practice.

Many models were developed in different countries all over the world in recent years by the application of real data about financial ratios of companies. For example, Chung et al. (2008) have created MDA classification model for companies in the field of finance industry in New Zealand. Anandarajan et al. (2004) have used the artificial neural network for modeling bankruptcy in Germany. Altman and Sabato (2005) have developed the logit model for small companies in the USA. Lieu et al. (2008) have used logit model for companies in Taiwan. Logit model has also been used in the work of Lin and Piesse (2004) for industrial companies in the UK. Bandyopadhyay (2006) has used MDA for companies in India for bankruptcy prediction 1 year or 2 years in advance. Sun and Li (2012) have used logit and MDA too for China companies. Shams et al. in Iran (2011) have developed a logit model for securities and exchange organization. Bellovary et al. (2007) have processed the situation about existing bankruptcy prediction studies from 1930 to present.

These bankruptcy prediction models use for analysis different financial indicators of the company, the most frequently used are financial ratios, or some other characteristics of the company or environment in which it operates (Misankova et al. 2015). Our work in the future will lead to a comprehensive bankruptcy prediction model for Slovak companies, based on real data from the accounting statements of real Slovak companies. This model will be created using multivariate statistical methods, e.g., multidiscriminant analysis (MDA) or logistic regression (logit). By MDA we would like to create a prediction model that would be able to discriminate companies into a group of prosperous or into a group of non-prosperous ones. Using logit method we would like to create a model of logistic regression, which would be able to predict a probability of company bankruptcy. Our effort will lead to create a prediction model with strong predictive ability. That means the ratio of correctly classified companies into a group of prosperous or non-prosperous ones will be as high as possible. For these reasons, it is important to do an initial statistical analysis of the variables used in the model first. In this paper we analyze the financial ratios used for prediction and their basic statistical characteristics. This analysis is made separately in eight different regions of Slovak Republic.

2 Methodology

As was mentioned above, in this paper we focus on the initial statistical analysis of predictors (financial ratios of companies) used for prediction model creation. In order to describe the basic relationships and characteristics of financial ratios of the companies, we use basic statistical methods.

To detect the basic characteristics of predictors that later will be used for failure prediction model creation, we use descriptive statistical characteristics of the financial ratios of a given set of Slovak companies. For each variable used to create a prediction model, we list the average value together with standard deviation. The average value gives us an idea of concentration of values of this financial indicator. The standard deviation gives extra information about the distraction of all the values of the indicator around the mean value. Moreover, we mention the maximum and minimum value of the financial indicator to get an idea about the range of all values. Further information gives the median value, which is the center of a set of values of the ratio indicator, that is, medianhalve the ratio values—one half of values are smaller than the median value and the second half are larger than the median.

All the statistical descriptions are made separately for every eight Slovak regions. From the basic characteristics of predictors in each region, we can get an idea of how much these financial ratios of companies are different in parts of Slovakia. We also can determine which regions of Slovakia are sufficiently similar. These results may be useful in the future in the process of model development. Based on an analysis of the basic characteristics, we can deduce which Slovak regions can be analyzed together.

By using the cluster analysis on the characteristics of financial ratios, we could obtain the information which regions of Slovakia can be merged into groups. Then we could analyze them together in the process of model development, because they are sufficiently similar. Cluster analysis is an explorative statistical method that identifies structures within the data. More specifically, it tries to identify homogenous groups of cases or observations. Cluster analysis is used to identify groups of cases if the grouping is not previously known. Its task is to identify groups of individuals or objects that are similar to each other but different from individuals in other groups. Because it is an explorative method, it makes any distinction between dependent and independent variables. We use in our analysis the hierarchical cluster analysis, which is the most common method for clustering. It generates a series of models with cluster solutions from 1 (i.e., all cases are in one cluster) to n (i.e., all cases are an individual cluster). From the set of solutions, we choose the one that has such number of groups, which is for us highly usable and easily interpretable. The graphical representation of solutions with all possible models is called dendrogram.

The whole analysis is made on a dataset of 109,550 Slovak companies. We have the values of 11 financial ratios for these companies, obtained from accounting statements of companies of year 2014. The set of financial ratios uses the following notation: OM-current assets; KZ-current liabilities; CK-foreign assets; CM-total assets; VI-the equity; EBIT-earnings before interest and taxes; DT-sales∕360; KP-accounts receivable; Z-inventory; CPK-networking capital; and NZ-retained earnings (Kocisova and Misankova 2014; Valaskova and Zvarikova 2014).

The notation of Slovak regions is the following: BA-Bratislava region; BB-Banska Bystrica region; KE-Kosice region; NT-Nitra region; PO-Presov region; TN-Trencin region; TT-Trnava region; and ZA-Zilina region.

3 Results

First we analyzed the means and standard deviations of all financial ratios that we have available. The results are in the following Table 1. The mean and standard deviation are listed for every financial ratio (in columns) and for every region of Slovakia (in rows). In the first line for every region, there is mean value, and in the second line, there is standard deviation. Using this characteristic we could compare the regions of Slovakia, how much they are different, or, on the other hand, which of them are similar in the characteristics of companies operating in the region.

Table 1 Mean and standard deviation of financial ratios in all regions of Slovakia

In Table 2 there are the minimum and maximum values of all financial ratios that we have available for companies in all regions of Slovakia. Financial ratios are in the columns of the table; in the rows there are the regions of Slovakia. Each region has two lines of the table, in the first line there is a minimum value for the ratio, and in the second line, there is a maximum value. By comparing the values between each other, we could see the differences between them. Moreover, we could rate the similarity between some regions in order to analyze them together in our next work.

Table 2 Minimum and maximum values of financial ratios in all regions of Slovakia

Finally, in Table 3 there are the median values of financial ratios in all regions of Slovakia. All ratios are in the columns of the table, and regions are in rows. By checking the median value, we have added information about concentration and dispersion of the values. Median value divides the set of all values into two parts. One half of all values of the given financial ratio are lower than the median value in the table, and the second half are higher than the median value. By comparing medians with the mean value (Table 1), we could recognize the dispersion of values of a particular ratio. Mean value is very sensitive to extreme values. If there is some extreme value, too high or too low in comparison with other values, it has strong influence on the mean value. In contrast, median value is insensitive to existence of some outliers.

Table 3 Median value of financial ratios in all regions of Slovakia and frequencies of companies in regions

By comparing median values between the regions of Slovakia, we have the information which of them are too different and, on the contrary, which are sufficiently similar. This information could be very useful in the following analyses.

In the last column of Table 3, there are the numbers of companies, which operate in the regions. These frequencies will be used during the process of cluster analysis, where we want to create segmentation of regions, which could be analyzed together in the next process of bankruptcy prediction.

3.1 Cluster Analysis

Basic statistical characteristics, which are in Tables 1, 2, and 3, can be used for creating segments of Slovak regions, groups of regions that are so similar that can be analyzed together. We are checking for regions that are as similar in their characteristics as possible and different with regions in other groups. We have performed the hierarchical clustering that gives us all possible solutions. It starts from eight groups, where every region is in its own cluster and stops with one cluster with all regions in it. In the next dendrogram, we can see the models (Fig. 1).

Fig. 1
figure 1

Dendrogram of hierarchical cluster analysis for regions of Slovakia

The model with three or four groups of regions is most suitable for us. So we specify these two possibilities. In the model with three clusters, the following regions of Slovakia are in the clusters:

  1. 1.

    Group: Bratislava region

  2. 2.

    Group: Banska Bystrica region

  3. 3.

    Group: other regions—Trnava region, Nitra region, Trencin region, Presov region, Zilina region, and Kosice region

As in this case, both first and second groups are only one region, and all the other regions are in the last group; the possibility with four groups is better usable and more helpful. The model with four groups consists of the following:

  1. 1.

    Group: Bratislava region

  2. 2.

    Group: Banska Bystrica region

  3. 3.

    Group: Trnava region and Nitra region

  4. 4.

    Group: Trencin region, Presov region, Zilina region, and Kosice region

Again, first and second clusters are one region alone, but it means that there regions which are really different from the other regions in other groups and between each other too. In fact, especially Bratislava region is really different from other parts of Slovakia, and the companies there really have different values of financial characteristics. In the third group, there are two regions, Trnava and Nitra. These two regions are both close to Bratislava region of the capital city, and companies in both could be affected by the capital city. Other four regions, namely, Trencin, Zilina, Presov, and Kosice, are together in one group of regions. That means that companies in these regions are similar in characteristics of their financial ratios. Consequently we could analyze these regions together.

4 Conclusion

The essence of our work lies in the analysis of financial ratios of companies in different regions of Slovakia. These ratios are often used in the process of creating the bankruptcy prediction model for companies. Prediction model is able either to predict the probability level, at which the next life of the company will lead into a financial distress, or to classify the company into a group of prosperous or a group of non-prosperous ones. Our next work in the future will be dedicated to derivation of such prediction model for companies in Slovakia. The first step of the process for us is the basic statistical analysis of the predictors, used in this prediction model. Thus we have introduced the descriptive characteristics and then used them in hierarchical cluster analysis. We get the model with four groups of regions. Similar analysis could be done with the sectors of national economy, in which the companies in our dataset cooperate. The number of these sectors is quite high, so that it will be helpful and usable to find the model with smaller number of groups of these sectors. Then in the next process, the cluster membership of sectors of national economy could be included in the prediction model. Similarly it could be the membership in the group of regions that has been derived in this analysis.