The Selection of Variables in the Models for Financial Condition Evaluation

Tomczak, Sebastian Klaudiusz; Górski, Arkadiusz; Wilimowska, Zofia

doi:10.1007/978-3-319-28567-2_4

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 432)

Abstract

The quality of classification of the studied phenomena or objects depends on the selection of variables (features) and on the assessment criteria. The choice of financial ratios is therefore crucial when studying the financial standing of companies. The article proposes applying a measure of selection quality to choose sub-optimal subsets of financial ratios that best describe the object of the research, which is the company. The aim of the study is to present a solution that allows the selection of financial ratios with very high cognitive value, enabling the construction of integrated measures that assess the financial condition of the company. The presented results show the process of selecting a five-element subset from a set of 13 financial ratios.

1 Introduction

In rapidly changing market economies, continuous assessment of the financial phenomena occurring in businesses, and in particular continuous evaluation of their financial condition, is expected. Proper evaluation of the processes occurring in an enterprise makes it possible to predict the company’s financial situation and to take pre-emptive action that could protect it from bankruptcy.

Enterprises can be described by certain characteristics (features), which can be financial and non-financial indicators or ratios. Using synthetic indicators in the assessment process allows an integrated assessment of a company’s financial standing. Not every financial indicator (feature) is equally important in the evaluation of companies, so it is crucial to select the financial indicators that are most valuable and useful from the point of view of the assessed enterprise.

Why are some indicators used more often than others? Various factors affect the frequency of their use. One of them is the availability of data: for example, not all companies are listed on the stock exchange, which means that the market ratios of most companies are not known and should therefore be removed from the set of financial ratios [1].

An analysis of the research by Hamrol [2], Hołda and Micherda [3] and Kowalak [4] shows that authors have used different techniques for selecting financial ratios. One technique is to use, for example, a correlation matrix. A second technique is to rely on the researcher’s own expert judgment in selecting appropriate indicators. This technique was used by Altman, who was one of the first researchers to construct a discriminant model for evaluating a company’s financial condition. Another technique is to be guided by the literature: authors currently draw on the indicators that are most often used to assess the insolvency of companies, as discussed in a number of publications. More information on the selection of features for building a synthetic index can be found in [5, 6].

The selection of features, that is, the choice of the indicators that enter an integrated assessment model, can be based on different methods. This paper proposes using quality measures of selection for this purpose. These measures make it possible to evaluate the quality of a selection and, in effect, to optimize the selection of a set of characteristics, which indirectly allows individual characteristics to be selected.

2 Quality Measures of Selection

We can evaluate the quality of feature selection by using selection measures, which assess the correctness and the level of adjustment of the selection carried out. This means that quality measures of selection do not select features directly. They are used to assess a set of features that has already been selected, so they can be used indirectly to select a set of features. If the assessment of the selected features is not satisfactory, the features for building the synthetic indicator have to be selected again and re-evaluated. However, given the very large number of possible combinations of features, evaluating individual subsets is time-consuming [7]. In this paper we propose a method for selecting features for the construction of the synthetic index, that is, an integrated model for evaluating a company’s financial condition.

For example, a company has specific characteristics (in the assessment of financial condition these can be financial ratios) that describe it as an object. These characteristics are expressed by a sequence s of N variables $y_{1}, y_{2}, \ldots, y_{N}$. The larger N, i.e. the number of features, the more difficult it is to choose the financial indicators that can be used to build the synthetic indicator. A suitably chosen method that can measure the quality of the features must therefore be used. Based on measurements of the selected set of features, the object can be assigned to a specific class. In the evaluation of a company’s financial condition this can be a two-element set of classes, defined as anticipated bankruptcy or continuation of activity. The classes can be denoted by $x_{1}, x_{2}, \ldots, x_{L}$, where L is their number [8, 9]. When full probabilistic information is available, that is, the a priori probabilities of the classes $P(x_{i})$ and the class-conditional probability densities $f(y|x_{i})$, i = 1, 2, …, L, classification into one of the designated classes consists in comparing the a posteriori conditional probabilities $P(x_{i}|y)$, i = 1, 2, …, L.

Various measures of selection quality have been suggested in the literature; a selection of these is presented in Table 1.
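The decision rule described in this paragraph, assigning the object to the class with the largest a posteriori probability, can be sketched in Python as follows. The function name `classify` and the class labels used in the example are illustrative assumptions, not part of the original study.

```python
from typing import Sequence


def classify(posteriors: Sequence[float], labels: Sequence[str]) -> str:
    """Return the label of the class with the largest posterior P(x_i | y).

    posteriors[i] is P(x_i | y) for the observed object y;
    labels[i] is a human-readable name for class x_i (hypothetical names).
    """
    if len(posteriors) != len(labels):
        raise ValueError("posteriors and labels must have the same length")
    # Index of the largest a posteriori probability.
    best = max(range(len(labels)), key=lambda i: posteriors[i])
    return labels[best]


# Hypothetical two-class example (L = 2).
example = classify([0.9, 0.1], ["continuation of activity", "anticipated bankruptcy"])
```

With ties, `max` returns the first class that attains the largest value, so the order of the labels acts as a tie-breaker in this sketch.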

Table 1 Quality measures of information selection

Most of the measures are specific to two-class problems only, while the measure $C_{k}$ can be used for any L ≥ 2. To illustrate the use of quality measures of selection in choosing indicators for a synthetic index, the measure $C_{k}$ is used, which is distinguished from the other measures by its specific properties.

2.1 Measure $C_{k}$

The measure $C_{k}$ can be used when L ≥ 2, so it allows the quality of a selected subset of features to be assessed with an established accuracy and for any number of classes [10–12]. The measure is given by:

$$C_{k} (\underline{X} |\underline{Y} ) = \sum\limits_{Y} {P(y)} \left[ {\frac{1}{L}\sum\limits_{i = 1}^{L} {P^{k} (x_{i} |y)} } \right]^{1/k} = \mathop E\limits_{Y} \left[ {\frac{1}{L}\sum\limits_{i = 1}^{L} {P^{k} (x_{i} |y)} } \right]^{1/k}$$

(11)

where

k:: any natural number, k ≥ 2,
L:: number of classes, L ≥ 2,
$\mathop E\limits_{Y}$ :: averaging operator over the set of all possible Y,
$P^{k} (x_{i} |y)$ :: k-th power of the a posteriori conditional probability that the object y belongs to class $x_{i}$,
$\underline{X}$ :: random variable representing the class $x_{i}$,
$\underline{Y}$ :: random variable representing the object y.
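As a minimal sketch of how the measure could be evaluated numerically, the function below computes $C_{k}$ from a finite list of observations. It assumes that the posteriors $P(x_{i}|y)$ are already available for each observation and that the averaging over Y is approximated by weights P(y) supplied by the caller. The name `c_k` and its arguments are hypothetical.

```python
from typing import Sequence


def c_k(posteriors: Sequence[Sequence[float]],
        weights: Sequence[float],
        k: int = 2) -> float:
    """Weighted average of [ (1/L) * sum_i P(x_i|y)^k ]^(1/k) over observations y.

    posteriors[j][i] is P(x_i | y_j); weights[j] plays the role of P(y_j)
    and is expected to sum to 1 (an assumption of this sketch).
    """
    if k < 2:
        raise ValueError("k must be a natural number >= 2")
    if len(posteriors) != len(weights):
        raise ValueError("one weight per observation is required")
    total = 0.0
    for p_y, post in zip(weights, posteriors):
        n_classes = len(post)                        # L in the formula
        inner = sum(p ** k for p in post) / n_classes
        total += p_y * inner ** (1.0 / k)
    return total
```

For fixed k, observations whose posteriors are concentrated on one class contribute more to the sum than observations whose posteriors are spread evenly across the classes.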

3 The Synthetic Index—Discriminant Analysis Method

Discriminant models are most often used to construct the synthetic index that assesses the financial condition of a company. They classify companies into two classes: bankrupt and not bankrupt, or good and bad financial condition.

The literature suggests several methods of selecting features (indicators) to build discriminant models. The authors are of the opinion that quality measures of selection may allow the creation of a new method supporting the construction of successful discriminant models.

Researchers have built such models since the 1960s. The models presented in the literature are based on different combinations of financial ratios, with differing numbers of elements within these combinations [3, 4, 13, 14]. Table 2 shows the number of financial indicators used in the most popular discriminant models (on the basis of the 47 examined models).

Table 2 Number of indicators in selected discriminant models

An analysis of the number of financial indicators used in the discriminant models (Table 2) shows that it ranges from 3 to 12. However, the number of indicators used in the construction of a model is typically from 4 to 6. The main question that should be asked at this point is how the authors of each model chose this number and the financial ratios themselves. It is worth noting that some financial indicators are used in the models more often than others.

4 The Use of Measure for Selection of Indicators—Study

The main purpose of the study is to select, from 13 financial indicators denoted Y1, Y2, …, Y13, the combination of 5 indicators that is best for building discriminant models. The selection aims to choose the indicators that best describe the company’s financial condition, classified as either poor (expected bankruptcy) or good.

The study used financial ratios of the largest companies listed on the Warsaw Stock Exchange, with the exception of companies in the financial sector, because of the specific structure of their balance sheets. As a result, the number of examined companies is limited to 13. The values of the individual indicators were calculated for each company, and the research period covers three years (see Tables 4, 5 and 6).

In applying the measure $C_{k}$ in the study, the following assumptions are made:

number of classes L = 2,
a priori probabilities: $P(x_{1} ) = 0.75$, $P(x_{2} ) = 0.25$, calculated from the sample as the ratio of the number of companies in good financial condition to the total number of enterprises and, accordingly, the ratio of the number of companies in poor financial condition to the total number of enterprises,
parameter k = 2,
the class-conditional probability density functions are normal.

The conditional probability $P(x_{i} |y)$ can be calculated with the Bayes formula [11, 16] for two classes:

$$P(x_{i} |y) = \frac{{P(x_{i} ) \cdot f(y|x_{i} )}}{{P(x_{1} ) \cdot f(y|x_{1} ) + P(x_{2} ) \cdot f(y|x_{2} )}}, \quad i = 1, 2$$

(12)

where

$P(x_{1} ),P(x_{2} )$ :: a priori probability for class 1 and 2,
$f(y|x_{1} )$ :: the conditional probability distribution density of the class 1,
$f(y|x_{2} )$ :: the conditional probability distribution density of the class 2.
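The Bayes formula above can be sketched in a few lines of Python. The function name `bayes_posteriors` is hypothetical, and the sketch accepts any number of classes, with the two-class case of the study as a special case.

```python
from typing import List, Sequence


def bayes_posteriors(priors: Sequence[float],
                     densities: Sequence[float]) -> List[float]:
    """Return P(x_i | y) for each class from priors P(x_i) and densities f(y | x_i)."""
    if len(priors) != len(densities):
        raise ValueError("one density value per class is required")
    # Numerators P(x_i) * f(y | x_i) for every class.
    joint = [p * f for p, f in zip(priors, densities)]
    # Denominator: sum of the numerators over all classes.
    evidence = sum(joint)
    if evidence == 0.0:
        raise ValueError("all class densities are zero at this observation")
    return [j / evidence for j in joint]
```

When both class densities are equal at the observed y, the posteriors reduce to the priors, for example 0.75 and 0.25 under the assumptions of the study.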

Assuming that the features are statistically independent and normally distributed, the class-conditional densities are:

$$f(y|x_{1} ) = \prod\limits_{i = 1}^{5} {\frac{1}{{\sqrt {2\pi \sigma_{i1}^{2} } }}\exp \left[ {\frac{{ - (y_{i} - \overline{{y_{i1} }} )^{2} }}{{2\sigma_{i1}^{2} }}} \right]}$$

(13)

$\sigma_{i1}^{2}$ :: variance of the ith feature in the first class,
$\overline{{y_{i1} }}$ :: mean of the ith feature in the first class.

$$f(y|x_{2} ) = \prod\limits_{i = 1}^{5} {\frac{1}{{\sqrt {2\pi \sigma_{i2}^{2} } }}\exp \left[ {\frac{{ - (y_{i} - \overline{{y_{i2} }} )^{2} }}{{2\sigma_{i2}^{2} }}} \right]}$$

(14)

$\sigma_{i2}^{2}$ :: variance of the ith feature in the second class,
$\overline{{y_{i2} }}$ :: mean of the ith feature in the second class.
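The two density formulas have the same form and differ only in the class-specific means and variances, so a single helper can evaluate both. The sketch below is an illustration under the stated independence and normality assumptions; the name `class_density` and its arguments are hypothetical.

```python
import math
from typing import Sequence


def class_density(y: Sequence[float],
                  means: Sequence[float],
                  variances: Sequence[float]) -> float:
    """Product of univariate normal densities, one factor per selected feature.

    y[i] is the value of the ith feature, means[i] and variances[i] are the
    mean and variance of that feature within the class being evaluated.
    """
    if not (len(y) == len(means) == len(variances)):
        raise ValueError("y, means and variances must have the same length")
    density = 1.0
    for y_i, mean_i, var_i in zip(y, means, variances):
        if var_i <= 0.0:
            raise ValueError("variances must be positive")
        norm = 1.0 / math.sqrt(2.0 * math.pi * var_i)
        density *= norm * math.exp(-((y_i - mean_i) ** 2) / (2.0 * var_i))
    return density
```

Calling the helper once with the statistics of the first class and once with those of the second class gives the two values needed in the Bayes formula.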

Descriptive statistics were calculated from the sample to allow the calculation of the probability density. Table 3 shows the designated assessment interval, the mean and the variance for each feature.

Table 3 Compilation of descriptive statistics for features and for first and second class

The assessment ratio was determined on the basis of the average (13 indicators of 13 companies), whereas the mean and variance are based on the assessment interval.

Then the value of each indicator for the selected companies was calculated. The indicators calculated for the individual companies are shown in Tables 4, 5 and 6.

Table 4 Summary of indicators for the investigated companies in 2009

Table 5 Summary of indicators for the investigated companies in 2008

Table 6 Summary of indicators for the investigated companies in 2007

In Table 4, some indicators of the companies TPSA, CEZ and GTC take negative values. For CEZ and TPSA, the few indicators below zero are due to negative working capital. By contrast, the negative indicator values for GTC result from an operating loss and a net loss.

In Table 5, some indicators of the companies TPSA, CEZ, LOTOS, PGE, PKNORLEN and POLIMEXMS are also negative. For TPSA, CEZ, PGE and POLIMEXMS, the few indicators below zero result from negative working capital. In contrast, the non-positive ratios of LOTOS and PKNORLEN were caused by both negative working capital and a loss.

As mentioned earlier, 5 characteristics of the company have most often been used to construct the integrated model. Therefore, it was decided to test combinations of five features to find those that give the best value of the measure $C_{k}$. The number of possible combinations of features, $\binom{13}{5} = 1287$, is quite large. To test such a large number of combinations, a computer program was used to enumerate all of them and to choose the five characteristics for which the measure $C_{k}$ took the greatest value.

The study showed that a very large number of combinations of indicators give the highest value, $P(x_{1} |y)$ = 1, $C_{k}$ = 0.375. Therefore, the results were rounded to ten decimal places. Table 7 shows, for each company in the successive years, the number of combinations for which the value of the measure $C_{k}$ was the highest.
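The exhaustive search described here can be sketched with the Python standard library. This is not the program used in the study: the function `best_subsets`, the caller-supplied `score` function and the `digits` parameter are assumptions made for illustration. Scores are rounded before comparison so that subsets with equal values are reported together, following the rounding to ten decimal places mentioned in the study.

```python
from itertools import combinations
from math import comb
from typing import Callable, List, Optional, Sequence, Tuple


def best_subsets(features: Sequence[str],
                 score: Callable[[Tuple[str, ...]], float],
                 size: int = 5,
                 digits: int = 10) -> Tuple[Optional[float], List[Tuple[str, ...]]]:
    """Evaluate every subset of the given size and return the best score with all ties."""
    best_value: Optional[float] = None
    best: List[Tuple[str, ...]] = []
    for subset in combinations(features, size):
        value = round(score(subset), digits)
        if best_value is None or value > best_value:
            best_value, best = value, [subset]   # new maximum found
        elif value == best_value:
            best.append(subset)                  # tie with the current maximum
    return best_value, best


# The 13 indicators of the study, denoted Y1, ..., Y13.
features = [f"Y{i}" for i in range(1, 14)]
n_subsets = comb(len(features), 5)  # number of five-element subsets
```

In practice `score` would compute the value of $C_{k}$ for the given subset of indicators; any function that maps a tuple of feature names to a number can be plugged in.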

Table 7 Summary of the best combinations of five indicators

Table 7 shows that in most cases a single best combination of indicators for assessing the company cannot be selected, except for TPSA, PBG, PGNiG and POLIMEXMS. Therefore, the next step is to compile the sets of the most frequently occurring combinations. These sets are intended to reduce the number of available combinations of features (see Table 8).
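One possible way to compile the most frequently occurring combinations is to count, for each combination, the number of companies for which it is among the best. The sketch below assumes this counting rule, which the text does not spell out. The function name `most_common_combinations`, the company labels and the two-element combinations in the example are hypothetical and do not reproduce data from the study.

```python
from collections import Counter
from typing import Dict, List, Sequence, Tuple

Combination = Tuple[str, ...]


def most_common_combinations(
    best_by_company: Dict[str, Sequence[Combination]]
) -> Tuple[int, List[Combination]]:
    """Return the highest company count and the combinations that reach it."""
    counts: Counter = Counter()
    for combos in best_by_company.values():
        # set() ensures each combination is counted once per company.
        counts.update(set(combos))
    if not counts:
        return 0, []
    top = max(counts.values())
    return top, sorted(c for c, n in counts.items() if n == top)


# Hypothetical input: best combinations found for three unnamed companies.
example_input = {
    "A": [("Y1", "Y3"), ("Y1", "Y4")],
    "B": [("Y1", "Y3")],
    "C": [("Y1", "Y3"), ("Y2", "Y5")],
}
example_top, example_sets = most_common_combinations(example_input)
```

The same helper can be applied per year or to the pooled results of several years, depending on which set of best combinations is passed in.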

Table 8 Sets of the optimal combination in a given year

The analysis of Table 8 shows that two best sets of combinations can be found for 2009, whereas there are 9 for 2008 and 3 for 2007. These sets significantly reduce the number of best combinations for a given year. However, when all three years are analyzed together, it becomes clear that only two combinations occur most frequently in this period (see Table 9).

Table 9 Summary of the set of the most common sub-optimal combinations of financial ratios selected by applying the quality measure of selection $C_{k}$ over the three-year period (2007–2009)

Table 9 shows that two optimal combinations of indicators were selected: Y1, Y3, Y7, Y11, Y13 and Y1, Y4, Y9, Y11, Y13. These combinations consist mostly of debt, profitability and liquidity ratios. Since the goal is to select a single best combination, further study is necessary.

The next step towards obtaining the optimal combination is to apply a reduction, which reduces the number of possible combinations. The reduction uses a mathematical operation: each result is raised to the tenth power. Tables 10 and 11 summarize the best combinations after the reduction.

Table 10 Set of the best combinations of indicators—after the reduction

Table 11 The set of the most frequently occurring combinations of the year—after reduction

The analysis of all three years showed that only one best combination of indicators can be distinguished (see Table 12).

Table 12 The set of the most frequently occurring combinations over all 3 years

From the point of view of the quality measure of selection, the optimal combination consists of liquidity, turnover, debt and profitability ratios.

5 Summary

The analysis of 47 discriminant models showed that, on average, 5 characteristics (features, ratios) are most commonly used to construct the synthetic index for evaluating a company’s financial condition. The 13 features chosen in the article are those most commonly used in the examined discriminant models (a minimum of 5 times). These features are debt ratios (4 indicators) and liquidity, profitability and turnover ratios (3 ratios each). From these 13 features, the combination of five characteristics that yields the highest value of the measure should be chosen. The number five was selected on the basis of the number of indicators most frequently used to build discriminant models. Every possible combination of 5 out of the 13 features was checked, that is, 1287 combinations.

Efforts were made to find as small a set of best combinations as possible, which required further research. In further studies a reduction was used. Its aim was to reduce the number of best combinations through a mathematical operation: each value of the measure was raised to the tenth power. In this case, too, the number of best combinations was too large for the results to be presented, although in some cases it was reduced. As before, sets of best combinations were used. Only the analysis of all three years yielded one best combination: y1, y2, y4, y11, y13 (its total number of occurrences in the set is 21).

The use of the quality measure of selection did not immediately give clear results, which is why different kinds of reductions were used to determine the best combination. Choosing 5 out of 13 features is debatable. The analysis of the literature showed that four, five and six indicators were most commonly used to construct the models. This suggests that a combination of four or six indicators could prove to be better.

References

1. Nowak, M.: Praktyczna ocena kondycji finansowej przedsiębiorstwa, fRr, Warsaw (1998)
2. Hamrol, M. (ed.): Analiza finansowa przedsiębiorstwa—ujęcie sytuacyjne, UE, Poznan (2010)
3. Hołda, A., Micherda, B.: Kontynuacja działalności jednostki i modele ostrzegające przed upadłością. Krajowej Izby Biegłych Rewidentów, Warsaw (2007)
4. Kowalak, R.: Ocena kondycji finansowej przedsiębiorstwa. ODDK, Gdansk (2008)
5. Grabiński, T., Wydymus, S., Zeliaś, A.: Metody doboru zmiennych w modelach ekonometrycznych. PWN, Warsaw (1982)
6. Graves, S.B., Ringuest, J.L.: Models and Methods for Project Selection. Kluwer Academic Publishers, London (2003)
7. Wilimowska, Z.: Kombinatoryczna metoda selekcji cech w rozpoznawaniu obrazów na podstawie wzrostu ryzyka, Archiwum Automatyki i Telemechaniki. 1980, t. 25, z. 3 (1980a)
8. Wilimowska, Z.: Względna dyskretna ocena ryzyka w szacowaniu wartości firmy, in: Information Systems Applications and Technology ISAT 2002 Seminar. Modele zarządzania, koncepcje, narzędzia i zastosowania. Materiały międzynarodowego seminarium, Karpacz, 11–13 grudnia 2002, Wroclaw (2003)
9. Wilimowska, Z.: Models of the firm’s financial diagnose, in: Information Systems Applications and Technology ISAT 2003 Seminar. Proceedings of the 24th international scientific school, Szklarska Poręba, 25–26 September 2003, Wroclaw (2003)
10. Sobczak, W., Malina, W.: Metody selekcji i redukcji informacji. NT, Warsaw (1985)
11. Wilimowska, Z.: Selekcja cech w rozpoznawaniu obrazów. Wroclaw University of Technology, Wroclaw, Rozprawa doktorska (1976)
12. Wilimowska, Z.: Oszacowanie ryzyka Bayesa w problemie rozpoznawania obrazów, Archiwum Automatyki i Telemechaniki. 1980, t. 25, z. 4 (1980b)
13. Karol, T., Prusak, B.: Upadłość przedsiębiorstw a wykorzystanie sztucznej inteligencji. CeDeWu, Warsaw (2005)
14. Zaleska, M.: Identyfikacja ryzyka upadłości przedsiębiorstwa i banku. Difin, Warsaw (2002)
15. Rooth, A.B.: Pattern Recognition, Data Reduction, Catchwords and Semantic Problems. Etnologiska Institutionens Smaskriftsserie, Uppsala (1979)
16. Jajuga, K.: Statystyczna teoria rozpoznawania obrazów. PWN, Warsaw (1990)

Author information

Authors and Affiliations

The Faculty of Computer Science and Management, Wroclaw University of Technology, Wroclaw, Poland
Sebastian Klaudiusz Tomczak & Arkadiusz Górski
University of Applied Sciences in Nysa, Nysa, Poland
Zofia Wilimowska

Corresponding author

Correspondence to Sebastian Klaudiusz Tomczak.

Editor information

Editors and Affiliations

Faculty of Computer Science and Manageme, Wrocław University of Technology, Wrocła, Wrocław, Poland
Zofia Wilimowska
Faculty of Computer Science and Manageme, Wrocław University of Technology, Wrocła, Wroclaw, Poland
Leszek Borzemski
Faculty of Computer Science and Manageme, Wrocław University of Technology, Wrocła, Wrocław, Poland
Adam Grzech
Faculty of Computer Science and Manageme, Wrocław University of Technology, Wrocła, Wrocław, Poland
Jerzy Świątek

Cite this paper

Tomczak, S.K., Górski, A., Wilimowska, Z. (2016). The Selection of Variables in the Models for Financial Condition Evaluation. In: Wilimowska, Z., Borzemski, L., Grzech, A., Świątek, J. (eds) Information Systems Architecture and Technology: Proceedings of 36th International Conference on Information Systems Architecture and Technology – ISAT 2015 – Part IV. Advances in Intelligent Systems and Computing, vol 432. Springer, Cham. https://doi.org/10.1007/978-3-319-28567-2_4

DOI: https://doi.org/10.1007/978-3-319-28567-2_4
Published: 24 February 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-28565-8
Online ISBN: 978-3-319-28567-2
eBook Packages: Engineering, Engineering (R0)
