1 Introduction

High-growth firms, which are also known as “gazelles” (Lopez-Garcia and Puente 2012, p. 1029), are an intensively researched topic which is closely observed by policymakers (Daunfeldt and Halvarsson 2015).Footnote 1 There are several explanations for this considerable research interest. From the perspective of various firms, growth is a crucial strategic priority and is often seen as an indicator of corporate success and market acceptance (Shin et al. 2005, p.6; Barringer et al. 2005, p. 665). Especially for new firms, rapid growth is essential for survival (Coad 2007b, p. 51). Concerning their impact on job creation, it is frequently mentioned that HGFs create a substantial number of long-term jobs (Coad 2007a, p. 81; Acs and Mueller 2008, p. 86; Coad et al. 2014b, p. 92; Lopez-Garcia and Puente 2012, p. 1030).Footnote 2 The hired people are oftentimes long-term unemployed or immigrants so that HGFs help these people to get a foothold in the job market (Coad et al. 2014b, p. 293). It is impressive that approximately three percent of all firms are responsible for the majority of private sector’s revenue growth (Acs et al. 2008, p. 2). Oftentimes, this growth is sustainable since a considerable share of HGFs manages to stay a HGF in the following period (Acs et al. 2008, p. 8).

For private investors, HGFs are also an attractive investment opportunity: especially big firms are more likely to continue to grow after such a period than the remaining firms and therefore, continue to be a promising investment (Acs et al. 2008, p.8, p. 18, p. 27, p. 46).

Another motivation to study HGFs is put forward by Schreyer (2000) and also Acs et al. (2008) who regard these firms’ potential to advance national economies. Especially in some European countries, unemployment rates are relatively high which is due to a lacking ability of the economies to adapt to ongoing changes. The authors argue that although governments can create a favorable environment, it is up to private companies to carry out the required implementations. HGFs are believed to be especially capable of adapting to change, are willing to take hazards, and show higher-than-average research and development (R&D) expenses to finance important innovations. Birch and Medoff (1994, p. 163) have also stressed the HGFs’ innovative capacity. Companies like Apple and Cisco are frequently mentioned examples for such HGFs (Barringer et al. 2005, p. 664). Especially technology based HGFs are known to have positive spill-over effects and to have a higher propensity to invest in R&D (Schreyer 2000, p. 1; Fotopoulos and Louri 2004, p. 163). Consequently, HGFs foster an increase in productivity and competition. There is evidence that this competition creates more jobs than it destroys (Acs et al. 2008, p. 11 and p. 19).

To reduce unemployment and because of other previously mentioned reasons both the European Commission and the OECD have taken measures to foster HGFs (European Commission 2010, p. 14; Organisation for Economic Co-operation and Development 2010; Coad et al. 2014b, p. 93, Schreyer 2000, p. 6). A mechanism to anticipate future HGFs would allow to well-directedly support HGFs without wasting taxpayers’ money and increasing inflation (Birch 1981, p. 4). Current HGF related initiatives invest money in start-ups without knowing if they will ever become HGFs which is criticized by Acs et al. (2008, p. 7). However, Birch (1981, p. 8), Schreyer (2000, p. 29), Coad et al. (2014a, p. 91), and Acs et al. (2008, p.45) point out that a priori, future HGFs are impossible to anticipate because of their high heterogeneity and their nonuniform growth development which hampers the identification of common characteristics. Coad (2007b, p. 1 and p. 56) even believes that “Firm growth is characterized by a predominant stochastic element, making it difficult to predict” and that it is an “idiosyncratic and fundamentally random process”. This is why Coad and also Acs et al. (2008, p. 40) believe that it is challenging to implement HGF related policies.

In this context, this study aims to evaluate the capability of the random forest (RF) algorithm to predict future HGFs using publically available financial data. Therefore, we follow the appeal of Coad (2007b, p. 1) to apply novel statistical techniques in the area of HGFs. To provide a realistic impression, the out-of-sample prediction quality is evaluated on a more recent data set. Moreover, since the RF’s byproducts allow to analyze the data’s underlying patterns, the second contribution is to present and discuss these findings. For instance, an analysis of variable importance rankings can foster the identification of the previously mentioned common characteristics or patterns. According to Coad (2007b, p. 58), there was only a little progress in identifying the determinants of firm growth, making this analysis a valuable contribution. The analysis is based on nine countries, covering eleven years (2004–2014) and 179970 unique firms. Therefore, the results are robust and enable cross-national comparisons. In contrast to studies like (Schreyer 2000), the data originates from the same source so that differences cannot be attributed to different data collection and processing methods.Footnote 3 As it will become obvious in the literature review, previous empirical analysis either regard the influence of single variables on (high) firm growth or perform regression and panel data estimations. Although these studies were able to deliver various interesting insights into the determinants of HGFs, none of them evaluates whether these insights enable the prediction of future firm growth based on new data.

Our last contribution will be an analysis of the so-called “survivorship bias” (cf. Coad et al. 2014a, p. 96). Based on our data, such bias cannot be confirmed.

Hence, this study is supposed to introduce the predictive paradigm to the research area of HGFs. This paradigm is already common in other financial data applications. For instance, in the tradition of Altman (1968) and Ohlson (1980), scientists try to predict future corporate defaults using statistical techniques like the multiple discriminant analysis and the logistic regression. In the recent past, data mining approaches like support vector machines (Härdle W et al. 2005), artificial neural networks (ANNs) (Cross and Rarnchandani 1995) and RFs (Kartasheva and Traskin 2011; Behr and Weinblat 2017) are applied as well, usually outperforming the previously mentioned techniques. Another application of forecasting is the corporate financial performance prediction for instance in Lam (2004) using ANNs. In this study, ANNs enable to derive investment decisions which outperform the average market return.

The remaining article is structured as follows. In Section 2 we present important HGF related findings based on descriptive and regression studies. In Section 3 we introduce our data source, the structure of our train and test data and our used growth indicator. Section 4 describes the RF algorithm, how its prediction performance can be quantified, and the proceeding of our main analysis. The selected prediction variables following this proceeding are presented and described in Section 5 while our results can be found in Section 6. A summary and directions for further research are given in Section 7.

2 Literature Review

This section provides an overview of the body of literature on firm growth. Studies, which are mentioned here, are revisited in Section 5.1 to identify promising predictors for HGFs. Since there is currently no literature following the predictive paradigm in the area of high-growth firms, we will focus on studies which conducted panel model estimations or contributed otherwise to the research in this area.

Using data for western Germany, Boeri and Cramer (1992) stated that firm growth could usually be observed for young firms. About two years after foundation, these firms were found to have doubled their initial size. However, the growth rate decreased over time and was highly unstable. A higher-than-average growth rate was often followed by a lower-than-average rate in the consecutive year. The longer a company’s number of employees remained in a certain size range, the more likely it became that it is going to stay in this range. Additionally, the authors identified a tough selection process for such young firms: the risk to fail was highest during the first years and lessened afterwards. Eight to nine years after birth, only approximately 40% of all firms founded in the same year still existed. The jobs created by the surviving firms still overcompensated the job losses of the closed firms. Boeri and Cramer (1992) could not determine a sector-effect.

The empirical results obtained by Schreyer (2000) are based on cross-national firm-level data from Germany, Italy, Netherlands, Spain, Sweden, and Canada. The data originate from different sources like surveys and commercial data bases and cover a maximal space of time from 1985 to 1996. The study’s objective is the identification of distinctive features of HGFs. Similar to Boeri and Cramer (1992), Schreyer (2000) pointed out that small HGFs are responsible for a significant percentage of job creation (but not of total employment). The bigger the firm, the more its number of employees stabilized. However, relatively big HGFs were found to create considerable numbers of jobs, which questions the political focus on rather small firms. Concerning the sector effect, Schreyer (2000) observe that HGFs in all sectors but certain sectors contained a higher share of HGFs.Footnote 4 In this context, it is necessary to discriminate between growing firms and HGFs since both kinds of firms are concentrated in certain sectors but often not in the same ones. Regarding the firm’s location, the number of HGFs was usually proportional to the overall number of firms in the geographical area. However, a few exceptions like Paris in the case of France with an above average share of HGFs were observed.

Moreover, HGFs were found to be younger than the other firms, more often owned by other companies and investing substantially in R&D. Country-specific differences became obvious regarding the relationship between age and growth. In Spain, older firms did not turn out to be less likely to grow intensively. In contrast to that, young German and Dutch firms rather tend to grow extensively than older ones. The finding that HGFs are oftentimes not economically independent is explained by an easier access to e.g. financing, staff and market knowledge.

A rather different approach was conducted by Barringer et al. (2005) who performed a textual analysis using 100 descriptions of the winners of the Ernst & Young LLP Entrepreneur of the Year award complemented with some financial data. Based on this data set, they intended to extract characteristics from the four areas “founder characteristics”, “firm attributes”, “business practices” and “human resource management practices” which were frequently mentioned for rapidly growing firms and less so for slowly growing firms. Several characteristics from all four areas have been found to distinguish rapidly growing firms. For instance, such firm’s founder had a high impact especially if she had a college education, was experienced in her industry and showed an exceptional motivation to succeed. Moreover, the firm’s participation in interorganizational relationships and products, which creates a unique value, were beneficial. Concerning the staff, training and financial incentives were found to foster fast growth.

Analyzing North American firms from two data bases, Acs et al. (2008) divided their sample into three subsets: 1994–1998, 1998–2002 and 2002–2006. Firms, which expanded both revenues and the number of employees, have been identified in the second period; the remaining periods have been used to study the characteristics of these firms before and after this expansion.Footnote 5 Unlike Boeri and Cramer (1992), the authors differentiated their analysis by firm size. It was discovered that the majority of big expanding firms (500 employees and more) were expanding firms in the previous period and continued to grow afterwards. In contrast to this, expanding firms with less than 20 employees were found to be highly volatile since only less than ten percent of them expanded in the previous period and most of them declined after expansion. The authors could not find any considerable sectoral or regional effect. One interlocational finding was that most expanding firms were located 6 to 15 miles from the central business district. Moreover, it was confirmed that such expanding firms tend to be smaller and younger than the remaining ones. The age appears to have a higher impact than firm size. This does not mean that expanding firms are typically start-ups. In fact, only about six percent of all expanding firms were start-ups. This contradicts the findings of Birch (1981). The average age of an expanding firm increased with the firm’s size and turned out to be 17 for small firms and 34 years for big firms. Expanding firms were also found to be 40% more efficient (measured in revenue per employee) than non-expanding firms.

Lopez-Garcia and Puente (2012, p. 1029) pointed out that the previously mentioned univariate studies may be misleading due to multivariate dependencies. The following studies considered interdependencies of several variables simultaneously and therefore have a higher explanatory power.

Apart from frequently used variables like firm size, age and sector, Harhoff et al. (1998) have also included the firm’s legal form in their OLS and two-stage Heckman regression. The data set stems from the CreditreformFootnote 6 database covering 8,068 firms from West Germany starting with the year 1989. The data set was supplemented by telephone interviews. On average, private limited liability firms showed both a 4.5% higher growth rate than a single proprietorship but also a higher rate of bankruptcy. These results are explained by different tax liabilities, financial accountabilities, and ownership structures, which come with various legal forms. For instance, owners of proprietorships are liable without limitation whereas limited liability firm owners only risk the value of their equity. Public limited companies did not show any clear differences to proprietorships as far as growth is concerned. The previously mentioned firm’s independence status only had a significantly positive effect in construction and trade industries and turned out to be insignificant for manufacturing and service.

Becchetti and Trovato (2002) regard 4,000 Italian small and medium-sized enterprises (SMEs) between 1989 and 1997. Referring to the study by Harhoff et al. (1998), the authors also studied the effect of the ownership structure on firm growth using a multivariate approach. In contrast to the previous study, a significant effect of the ownership structure could not be discovered. Instead, an effect of the accessibility to credit capital was shown for the first time. Not granted loans impede future growth and government subsidies foster it. Whether a firm exports their product also had a significant positive influence.

To analyze the determinants of the probability of high-growth, Lopez-Garcia and Puente (2012) applied a dynamic probit analysis on panel data taken from the local National Institute of Statistics containing 1,411 Spanish trading companies between 1996 and 2003. Growth was observed over a period of one year. Hence, controlling for other potential determinants is inevitable. Besides that, the used random effect model controlled for past growth to avoid interpretation problems caused by autocorrelation. This autocorrelation has been, indeed, found in their study: high-growth firms were 14% more likely to also be a high-growth firm in the following year than a non-high-growth firm. Additionally, it was discovered that a paid wage premium also increased a firm’s probability to be a high-growth firm stressing the importance of human capital. Similar to Schreyer (2000), the authors found it necessary to distinguish between average growth and high growth since the amount of financial debt seemed to only restrain average growth but not high-growth firms.

Autocorrelation has also been studied by Coad (2007a) by analyzing 10000 French manufacturing firms from 1996 until 2002 using quantile regression. In contrast to Lopez-Garcia and Puente (2012), it was analyzed whether the autocorrelation depended on the firm’s size and growth rate. Coad (2007a) showed for his sample that above-average growth was usually followed by a poor growth in the following period and vice versa. When disaggregating for ten different firm sizes, it was observed that big firms had a slightly positive autocorrelation while small firms showed the previously mentioned negative autocorrelation. Hence, bigger firms were often able to grow over several periods. The results were justified by the long-term planning horizons of big firms, which might lead to long-lasting growth and are in line with the results of Jovanovic (1982).

In a survey, Coad (2007a) compared different autocorrelation studies concluding that “there does not appear to be an emerging consensus” as far as the strength and lag are concerned (Coad 2007b, p. 16). A more robust finding is the decrease of both the growth rate and its variance with increasing firm’s age. Concerning innovations, Coad (2007b) identified contradictions in the regarded literature but recognized that product innovations usually lead to greater employment while process innovations have an unclear effect. No strong influence is found for a firm’s financial performance. The same was found for relative productivity, which is either explained by firms which downsize to increase productivity or by a lack of competition. Coad (2007b) found evidence for the existence of the previously mentioned industry effect. For instance, overall growth in a certain industry was also beneficial for this sector’s firms. However, the sector’s explanatory power is rather small. Besides these factors, the author mentioned other determinants like country-specific legal regulations and different taxation requirements for certain sizes. Moreover, a highly uncertain future demand also seemed to affect the firm’s efforts to grow.

The ability of HGFs to remain a HGF in the next periods continues to be a highly researched area. Recently, Daunfeldt and Halvarsson (2015) regarded an almost complete sample of Swedish firms between 1997 and 2008 split in three consecutive periods. A quantile autocorrelation model was applied. They found only a negligible probability for this recurrence. This is why they believe HGFs to be “one-hit wonders”. Hence, they concluded that policy initiatives to foster HGFs are likely to be useless.

Levratto et al. (2010) analyze high-growth firms separately from average-growth firms. The authors regarded 12,811 French manufacturing small and medium-sized enterprises (SMEs) from 1997 through 2007. The data originated from Bureau Van Dijk and the French National Institute of Statistics and Economic Studies. They identified that high firm growth is more likely to be found in Paris than in other French regions. Being an exporting firm had no significant effect on high-growth firms but a positive and significant effect on the remaining firms. One of the most important variables appeared to be the corporate structure: main firms of a group grew faster than controlled subsidiaries and independent firms. Contrary to Gibrat (1931) who assumes that firm growth is random, Levratto et al. (2010) believe that growth is influenced by structural variables such as the firm’s sector and age and strategic choices like the firm’s financial structure. The underlying analysis was carried out using a pooled multinomial logit model and a hybrid multinomial logit model which are able to capture non-linear interdependencies. For all firms, it was found that growing firms in general were rather young and small regarding the number of employees. In contrast to Boeri and Cramer (1992), a sector effect could be discovered. As far as the strategic choices are concerned, labor productivity and the share of obligations to the supplier to total liabilities were found to be positively correlated with growth.

In summary, the research of HGFs remains to be a much-publicized research topic. The determinants of HGFs and their impact on the economy as well as their persistence are often contradictory. While most authors seem to agree on the importance of age and firm size, the influences of the sector, legal form, and financial variables are controversial. Moreover, the models are not evaluated on out-of-sample data so that overfitting is likely leading to results, which are strongly influenced by the peculiarities of the used data sets and are not generalizable. Furthermore, cross-country studies, which are based on data sets from the same source, are rare. This is why we estimate country-specific RF models for nine countries and determine their out-of-sample performance on more recent data. The RF’s variable importance ranking enables us to compare the determinants of HGFs leading to results which are more robust.

3 Data Base and Identification of High-growth Firms

We use accounting records from the Amadeus data base. In this section, we provide a short description of this data source and explain how we define a high-growth firm.

3.1 The Amadeus Data Base

The Amadeus data base is generated by ‘Bureau van Dijk’(BvD). Amadeus comprises information for both large firms and SMEs and contains data about East and West European countries with a focus on nonincorporated firms facilitating international comparisons (Van Dijk Electronic Publishing GmbH B 2015).Footnote 7

For our analysis, we use the finance data and the master file data for Finland (FI), France (FR), Germany (DE), Italy (IT), Portugal (PT), Spain (ES), Great Britain (GB), Poland (PL) and Sweden (SE). We converted figures from British Pound, Swedish Krona, and Polish Zloty to Euro using the 2014 exchange rates. The selection of countries is due to the lack of data for other countries.

For every country, we created two data sets: a train data set and a test data set. The train data sets are used to identify the most promising corresponding country-specific RFs. As shown in Fig. 1, each train data set contains predictive variables from 2004 to 2006. These variables are presented in Section 5.1. Growth and a potential bankruptcy is observed between 2007 and 2010.Footnote 8 The more recent test data sets contain the same predictive variables from 2008 to 2010 and growth or bankruptcy indicator variables for the period between 2011 and 2014. The test data sets are only used to evaluate the out-of-sample prediction performance of the models. Such a set-up simulates a prediction at the end of year 2010 based on all the necessary data to both generate the RFs using the train data and apply them to the (already available) financial data of the corresponding test data set to predict which firms are likely to grow in the upcoming four years.Footnote 9 An overview of the numbers of firms per country and the shares of HGFs and insolvent firms is given in Table 1.Footnote 10

Table 1 Total, HGF and bankruptcy figures per country and data set
Fig. 1
figure 1

Overview of the analyzed years in the train and test data sets

Apart from removing incomplete cases, we applied additional selection criteria to improve the data quality. For instance, we ensured that every regarded balance sheet covers exactly 12 months. Furthermore, we defined sensible intervals for the predictor variables to prevent implausible observations biasing the results. E.g., we checked that each firm’s turnover, short-term and long-term debts and interest payments were positive. While RFs are known to be rather robust towards the unavoidable arbitrariness of the data cleaning process, erroneous data are still disadvantageous (Williams 2011, 246). Furthermore and similar to Coad (2007a), this study only regards “organic growth” which is not caused by merger or take-over. Therefore, such firms have also been removed.

3.2 Identification of High-growth Firms

To estimate prediction models and to evaluate their performance, the information on firm’s growth status is crucial. This information is generated from the data contained in the Amadeus data base. A firm is regarded to be a HGF if its Birch-Schreyer growth indicatorFootnote 11 value belongs to the top 10% of all firms from the same country and data set over a period of three years. Therefore, this study follows the proceeding of Schreyer (2000), Acs et al. (2008), and Lopez-Garcia and Puente (2012). The index is defined as follows in our study:

$$growth=(e_{t+4}-e_{t+1}) \cdot \frac{e_{t+4}}{e_{t+1}}.$$

e t is the firm’s number of employees in year t. For the train data set, growth is observed between t + 1 = 2007 and t + 4 = 2010. The corresponding interval thresholds for the test data are 2011 and 2014. Such a timeframe is common in the literature (Coad et al. 2014a, p. 95).

The target variable Y for a given data set (d s ∈{t r a i n, t e s t}) and a particular country c is, therefore, defined as follows:Footnote 12

$$Y_{ds, c} = \left\{\begin{array}{ll} 1, & \text{if} \ growth_{ds, c}\ge q_{0.9, ds, c} \\ 0, & \text{if} \ growth_{ds, c} < q_{0.9, ds, c} \end{array}\right. $$

q 0.9, d s, c is the 90% quantile of all growth values in the country c from data set ds. Firms with a growth value equal or above the 90% quantile are labeled as HGFs whereas the remaining firms are treated as low-growth firms (LGFs). It is important to point out that firms, which went bankrupt, are neither considered to be LGF nor HGF and are not considered for model estimation. Apart from that, firms of every age can become a HGF so that the study is not limited to start-ups. The shortcoming of our proceeding is that the value of q 0.9, d s, c depends on the analyzed country and period which complicates cross-country comparisons. Another possibility would be to require that every HGF grows by a certain percentage (Hölzl 2014, p. 204). However, this would lead to different shares of HGFs for each country and period. Since the extent of a HGF’s growth is affected by the sectoral and overall economic growth, this does not seem to facilitate comparisons either (Audretsch and Mahmood 1994, p. 243; Bravo Biosca 2010, p. 2).

The Birch-Schreyer growth indicator combines absolute and relative employment growth. This makes the indicator less dependent on the firm size than its components. Large firms usually have larger absolute changes in the number of employees than smaller firms (absolute growth) whereas smaller firms rather can increase their staff by a given percentage than bigger firms (Schreyer 2000, p.14; Lopez-Garcia and Puente 2012, p. 1030). Because of these favorable properties, this is the most frequently used indicator (Coad et al. 2014a, p. 94). However, the Birch-Schreyer growth indicator is known to prefer absolute over relative growth (Hölzl 2014, p. 226). This is not necessarily a disadvantage because politicians will rather be interested in high absolute numbers of new jobs than in high relative employment increases in certain firms with possibly low impacts on the country’s unemployment rates. However, studies like National Commission on Entrepreneurship (2011), Coad (2007b), and Harhoff et al. (1998) purely focus on relative growth.

Focusing purely on employment growth is common but has been criticized by Aiginger (2006) since similar policies might foster inefficient behavior. Furthermore, employment is not a parameter, which firms try to maximize (Levratto et al. 2010, p. 9). In fact, an employment growth based definition is not the only possible approach. Another frequently used measure is the firm’s turnover growthFootnote 13 which has several deficiencies and which is the reason why we focus on employment growth (Daunfeldt et al. 2014). For instance, the definition of turnover depends on the observed country’s accounting practice, which hampers cross-country comparisons. Since consecutive years are observed, country-specific deflation becomes necessary which leads to controversial assumptions (Lopez-Garcia and Puente 2012, p. 1035); Levratto et al. 2010, p. 9). Acs et al. (2008) even analyzed both employee and revenue growth at the same time. Daunfeldt and Halvarsson (2015) followed a similar approach. Further growth indicators are occasionally created based on the firm’s value-added, total assets or total profit (Coad 2007b, p. 3).

According to Coad et al. (2014a, p. 99 – 104) and Hölzl (2014, p. 199), the choice of the HGF indicator is extremely important and has a strong influence on the obtained results. However, irrespective of the applied definition, HGFs achieve higher growth rates after their high growth period than a control group. This is why the dependence of the indicator does not question the HGF concept as a whole.

4 Methodological Considerations

In this section, the used prediction method, the measure, which quantifies the appropriateness of the estimated HGF propensities, and our proceeding are explained.

We use the RF algorithm to estimate HGF propensities for a given firm in a given country. We have chosen this particular algorithm because of several reasons. RFs were found to have a very promising prediction performance for default prediction which will become relevant in Section 6.3 (Behr and Weinblat 2017). Furthermore, the RF provides several byproducts which can assist the user in different stages of the data analysis process. As shown in the further course of this section, a RF helps to identify promising predictors, can assess their contribution to predict a HGF and can be used to derive “typical” country-specific HGFs and LGFs. Thus, the RF offers a powerful framework for HGF prediction.

A RF consists of a user-defined number B of classification trees, which are generated using a slightly modified classification and regression trees (CART) algorithm (Verikas et al. 2010, p. 331). As the generation of individual classification trees is the core of the RF and such a tree is presented in Section 6.2, we provide a brief description of the CART algorithm first.

4.1 Classification and Regression Trees Algorithm

CART is a tree-based approach to generate qualitative (classification) or quantitative (regression) predictions for given observations (Breiman et al. 1984). Due to our specific context of HGF prediction, we only regard the classification application. CART starts with the complete data set and recursively splits it into two sub data sets, which should be as different as possible in their shares of HGFs and LGFs (Kumar and Ravi 2007, p. 3). Based on the value of a single variable, the observation is assigned to one of the two subjacent sub data sets. The variable used for splitting the data set and the specific threshold value are identified by considering all potential variables and all their observed values. To be more specific, the decision of which variable and which threshold value should be used, is determined by comparing the impurity of the class distribution inside the original node and the two subjacent nodes. The split that enables the highest impurity reduction is chosen. The impurity for a node t is quantified using the Gini coefficient I. In this context, p(j|t) is the share of the members of the class j in this node (Breiman et al. 1984, p. 20–26)

$$ I(t)=1-\sum\limits_{j}^{}p^{2}(j|t). $$
(1)

Leaves are the lowermost nodes of the tree. The majority class (binary prediction) in the reached leaf is predicted for a given observation (Breiman et al. 1984, p. 33; Frydman et al. 1985, p. 272).

The main advantage of such trees is that they are easy to comprehend and do not depend on any distributional assumptions. Furthermore, both quantitative and qualitative variables are suited as predictors (Breiman et al. 1984, p. 56; Chandra et al. 2009, p. 4832; Shirata 1998, p. 3). An important downside is that even minor changes in the data can lead to different decision trees. This is why CART models are considered to be weak learners having a small bias but a high variance.

4.2 Random Forest Algorithm

The basic idea of a RF is to reduce the variance without increasing the low bias. This is achieved by averaging over a large number of decision trees, which usually leads to a better forecasting performance (Breiman 1996, p. 123–140).. The concept of creating a user-defined number B of sub-models (decision trees in case of a RF) is called bagging which stands for “bootstrap aggregation”. Each of the RF’s trees is estimated based on its own data set, which is obtained by drawing a bootstrapped sample from the original data set. Each tree’s data contain as many observations as the original data set. This means that about two-thirds of all observations will be considered by a tree once or more, and others will not be considered at all (Breiman 2001, p. 5; Zighed et al. 2000, p. 117–118; Hastie et al. 2009, p. 283, 587; Rokach 2007, p. 106–107)

Each of the B trees is grown using a slightly modified version of the CART algorithm. The modification affects the number of variables that are analyzed at each node to select the splitting variable and its threshold value. Instead of analyzing all available variables, the RF algorithm first randomly selects m variables for each node and only regards these selected variables when determining the best split. This is called “random subspace”. Once again, the condition that enables the highest Gini value is chosen (Breiman 2001, p. 11; Han et al. 2011, p. 377–378). Similar to B, m is also defined by the user.

The estimated RF provides B individual binary predictions based on the B classification trees for a new observation in a first step. These predictions are called votes and are represented by v 1v 100. The most frequently occurring class v is returned in the second step. A propensity can be obtained by returning the share of a certain class instead.

A model (forest) that embodies B sub-models (trees), each of which is estimated based on a sub data set and which combines their individual predictions towards an overall prediction is called “ensemble”. Such an ensemble can usually better adapt to the data patterns than a single sub-model can.

Another of RF’s beneficial features is that it is found to perform well in the case of highly imbalanced data as shown by Brown and Mues (2012). This is highly relevant for our study because, by definition, only a small fraction of all firms are HGFs. Yeh et al. (2014, p. 101) find the RF to be less affected by overfitting than the other considered models. Moreover, random subspacing speeds up the computation process considerably (Verikas et al. 2010, p. 331). Unfortunately, RFs are less transparent compared with a single classification tree due to simultaneously analyzing B classification trees (Olson et al. 2012, p. 464). However, this can be compensated by analyzing the RF’s byproducts (global variable importance rankings and prototypes).

Based on the classification accuracy and the unused observations of the single trees (out-of-bag observations), a RF allows the estimation of a global variable importance ranking (Breiman 2001, p. 23–25; Verikas et al. 2010, p. 331–334; Strobl et al. 2007, p. 17–18). The underlying idea is that a permutationFootnote 14 of the value of an important variable should lead to a high increase in the error rate whereas permutating unimportant variables should not cause a considerable effect (Breiman 2001, p. 11; Breiman and Cutler 2004; Hastie et al. 2009, p. 593). The higher the significance value, the higher the importance of the current variable (Breiman 2001, p. 23–24; Verikas et al. 2010, p. 331). According to Verikas et al. (2010, p. 340), the RF’s variable importance ranking has certain drawbacks like its inability to capture variable dependencies.

Prototypes are representative observations of every class. A prototype of a given class c can be derived by identifying an observation o of this class, which has the highest number of other class c observations among its w nearest neighbors. The distance between o and another observation l can be determined from the proximity matrix of the RF: p r o x o, l . The further proceeding depends on the type of the variable. For a quantitative variable, the prototype’s corresponding value is the median of the values of the w neighbors. For a qualitative variable, the prototype’s value is the most common value of its w neighbors (Breiman and Cutler 2004).

4.3 Evaluation of the Classification Performance

In this study, it is necessary to assess the appropriateness of our RFs in an out-of-sample prediction setting. Predictive performance measures quantify this appropriateness based on the values from the so-called ‘confusion matrix (Hart et al. 2005, 362). The components of this matrix are explained in Hassan et al. (2010). Based on the matrix components, performance evaluation measures like the accuracy, precision, true positive rate (TPR) and false positive rate (FPR) can be obtained. It is crucial to point out in this context that the accuracy and precision measures are highly misleading for imbalanced data (cf. Maimon and Rokach 2006, p. 666 and Han et al. 2011, p. 365–367).Footnote 15

To obtain informative indicators even in settings of highly imbalanced class distributions, it is important to only rely on measures from the same row of the confusion matrix. Two such metrics are the TPR and the FPR. Ideally, a RF should have a TPR of one and a FPR of zero which is usually unrealistic: a model which tends to classify a new firm as a HGF in case of doubt will usually achieve a high TPR (close to one) because it predicts many HGFs correctly. However, the FPR rate will also be quite high because many actual LGFs will also be treated as HGFs. A model which rather tends to classify firms as LGFs will have a low TPR rate but also a low FPR rate. This is why Pagans (2015, p. 156) mentions a trade-off between these two indicators. Another problem in this context is the arbitrary cutoff point up to which a firm is treated as a LGF. The choice influences all the previously mentioned indicators so that it is not clear whether unsatisfying indicator values represent a weak model performance or just a disadvantageous selection of threshold (Fawcett 2006, p. 861–864).

The Receiver Operating Characteristic (ROC) curve is a graphical tool to assess and select prediction models based on their prediction performance. It is adapted from the field of signal detection and addresses the two previously mentioned problems (Gorunescu 2011, p. 323). As it is the case for our RFs, the ROC requires a prediction model which can quantify how certain a given firm belongs to a particular class in a two-class-setting. The advantages of a ROC are that it does not require calibrated class probabilities and it is independent of uneven class distributions because it only relies on the TPR and FPR, which do not depend on class distributions either. Furthermore, it regards all possible thresholds instead of only one. However, it does not lead to a numerical value facilitating comparisons of several curves, which possibly lie close to each other or even intersect (Fawcett 2006, p. 861–867).

To obtain such a numerical value, the area under the ROC curve (AUC) can be calculated. In the case of an ideal model, this area will be one (area of the unit square) and in the case of the useless model, it is going to be 0.5 (half of area of the unit square). The closer the area of a model is to one, the rather does the model assign higher propensities to HGFs than to LGFs. More precisely, the AUC is the probability that the model assigns a higher propensity to a HGF than to a LGF (Fawcett 2006, p. 868).

In this study, one important challenge is to compare the performance of nine country-specific RFs on their test data sets. Unfortunately, it is not appropriate to just compare the corresponding AUC values because the differences might be caused by chance. A statistical test can be applied to ensure that the observed superiority of one country is unlikely to be random. The test must be applied in an unpaired setting because firms are taken from different data sets and are, therefore, uncorrelated (Krzanowski and Hand 2009, p. 107; Zhou et al. 2014). Such a two-sided test has been developed by Venkatraman (2000) extending the work of Venkatraman and Begg (1996). Both methods are based on permutation tests and compare the ROC curves instead of just the AUC values. The null hypothesis states that two analyzed ROC curves are equal.

4.4 Model Specification Based on Cross-validation and Variable Importance

After having explained the used prediction methods and a technique to evaluate the obtained results, the proceeding of the main analysis is presented in this section.

To conduct the analysis, several obstacles have to be overcome. The first one originates from the fact that the HGF status is only assigned to every tenth firm so that most of the firms belong to the low growing class. This is why it is necessary to cope with the class imbalance problem in this analysis. A whole branch of data mining literature deals with this topic, which goes beyond the scope of this article. Contributions have been made, for instance, by Chawla et al. (2002), Batista et al. (2004) and Chawla (2005). In this study, the Synthetic Minority Over-sampling Technique (SMOTE) by Chawla et al. (2002) is usedFootnote 16. Its main idea is to generate synthetic instances of the minority class, which lie between two existing adjacent minority class instances. This is achieved by randomly placing this instance on the connecting line of these two neighbors in the feature space. In contrast to rather primitive alternatives like random oversampling, SMOTE prevents the RF from overfitting by widening the regions of the minority class (Chawla et al. 2002, p. 328). In a comparison study of different approaches to the class imbalance problem, Batista et al. (2004) find that SMOTE delivers the best or competitive results for data sets with class distributions which are similar to our data sets’ distributions. SMOTE is also attractive from an economical point of view because its created virtual firms can at least be seen as imaginable because their structural and financial information are derived from existing firms.

The second obstacle of this analysis is related to the problem that there is no consensus in the literature on the variables, which contribute to anticipating future high growth. Apart from variables like firm size, previous growth, age and sector (c.f. Section 2), the findings are oftentimes contradicting. Levratto et al. (2010, p. 2) add that financial, environmental, productive and technical aspects all contribute to future growth. In order not to overlook valuable predictors, it is necessary to evaluate a relatively wide range of potential predictors. Thus, the analysis starts with a total number of 30 potential predictors. Although the RF is known to be able to deal with a high number of predictors some of which are almost unrelated to the class membership (Han et al. 2011, p. 383), Breiman and Cutler (2004) recommend estimating a preliminary RF using all predictors. This RF is only used to estimate a first variable importance ranking. We chose the 15 most important variables and therefore follow Yeh et al. (2014) who applied a similar approach in the area of going-concern prediction. This means that separately for each of the nine countries, the training data is balanced using SMOTE. Afterwards, nine country-specific RFs are estimated based on this data, and the variable importance rankings are obtained. To achieve that country-specific differences are not caused by different sets of variables, the same 15 variables are selected for all countries although their rankings are not identical.Footnote 17 This is done by averaging each variable’s country-specific ranks and selecting the variables with the 15 highest averages. The 30 initially chosen variables and a detailed presentation of the 15 selected variables can be found in Section 5.1.

An intuitive reason why not all 30 initial variables should be considered for the final analysis is the RF’s mode of operation. The probability that the analyzed subset contains no useful predictor for a given split of a given tree with m = 3 and 16 out of 30 variables with almost no use for HGF prediction is:

$$\frac{\binom{15}{0}\binom{15}{3}}{\binom{30}{3}}\approx 11.2\% $$

This means that without preliminary variable selection, approximately every eighth split will be counterproductive.

After having extracted 15 promising variables and applying SMOTE to the country-specific data sets to establish an even distribution of HGFs and LGFs, the final country-specific RFs can be estimated. As explained in Section 4.2, the RF’s two main tuning parameters are the number of trees (B) and the m-value. In Behr and Weinblat (2017), we have also found the maximum number of nodes per tree (maxnodes) to be a valuable tuning parameter to limit overfitting. Based on observations of the OOB error, which stabilizes for approximately 100 trees, the number of trees is set to 100 according to Breiman and Cutler (2004) to limit the computational overhead. The m and maxnodes-value for each country are determined in a grid-search by performing three three-fold stratified cross-validations (CVs).Footnote 18 Inside of each CV, the AUC measure is calculated and averaged so that the parameter combination with the highest average AUC values is chosen. The optimal parameters according to CV are presented in Table 2.

Table 2 Best m and maxnodes values of the country-specific RFs

It turned out that for six of the nine countries, it is not necessary to vary maxnodes. These parameter combinations were used to estimate the final RF based on the country-specific training data. In the main analysis, these final RFs are used to predict the HGF status of the test data, to estimate the prototypical HGF and LGF as well as to estimate the variable importance ranking of the remaining 15 variables. Please note that SMOTE is not applied to the test data to simulate a realistic setting in which all reputedly highly growing companies contain a huge number of false positives, which will also be the case in practical applications.

Becchetti and Trovato (2002) and Coad et al. (2014a) believe that a risky corporate strategy might succeed and lead to high growth or fail and end up in bankruptcy. To evaluate this belief, we estimated nine further RFs (one RF for each country) which predict bankruptcy instead of high growth. These RF’s predictions are compared with the prediction of the previous nine RFs to evaluate whether high growth and bankruptcy are “two sides of one coin”.

5 Predicting Variables and Descriptive Evidence

In this section, we discuss the potential predicting variables and provide some descriptive statistics for the train data set.

5.1 Predictive Variables

Pytlik (1995, p. 233) states the following conditions a key figure has to meet to be regarded as a promising predictor: a key figure turned out to be useful in previous studies, theoretical considerations suggest that it is useful or it has a high importance in practical applications.

Their values (except of firm age and sector) are both regarded one year before the first year of the growth or bankruptcy period X t , in first differences covering both one year Δ1 X = X t X t−1 and two years Δ2 X = X t X t−2. The only exception is the employment-based variables. We analyze the employment (emp) in year X t . Instead of regarding its first differences, we have calculated two Birch - Schreyer growth indicators Δ1 g r o w t h and Δ2 g r o w t h from Section 3.2 covering a time lag of one and two years respectively. As Coad (2007b, p. 16) pointed out, there is no agreement in the literature concerning the time lag so that we decided to consider two different ones simultaneously and to delegate the choice to the RFs. Besides that, we also consider the three non-accounting related key figures age, sector and legal form.

As explained in Section 4.4, country-specific preliminary RFs have been used to estimate a variable importance ranking out of 30 considered variables. Both the initial 30 variables and the chosen ones can be seen in Table 3.

Table 3 All analyzed predictors

Only the chosen variables are discussed in this section.

  • Age of the firm: age

  • The firm’s age is regarded in this study because several studies like Harhoff et al. (1998) showed that young firms tend to grow faster than older ones. The growth process of younger firms is often characterized to have a high variance, e.g. by Coad (2007b), Dunne et al. (1989) and Boeri and Cramer (1992). Furthermore, the HGF literature found that a small group of young firms is responsible for a disproportionately high number of new jobs (Acs et al. 2008, p. 11 and p. 19; Schreyer 2000, p. 6). Henrekson and Johansson (2010) did not confirm that idea. One possible explanation is that young firms have to overcome an intensive selection process by quickly growing to be capable of competing (Boeri and Cramer 1992, p. 555).

  • Size of the firm: size

  • age and size are considered to be important in almost all HGF related studies. Since bigger firms are usually older than smaller ones, there is possible a correlation between these two predictors (Coad 2007b, p. 18 and p. 51). Furthermore, small firms are under constant pressure to grow to reduce costs to the same level as other (bigger) competitors (Coad 2007b, p. 51). A certain minimal size is important in fields with high fixed costs whereas a relatively small size increases the firm’s flexibility (Schreyer 2000, p. 13). Consequently, large firms have longer planning horizons and more long-term investments, which pay-off over several years and lead to a higher autocorrelation of growth (Coad 2007a, p. 74). Besides that, national legislation is often firm’s size-specific. For instance, bigger firms in certain countries have higher firing costs and have to pay higher taxes. On the other hand, they also have a higher lobbying power, which might facilitate growth (Coad 2007b, p. 27 and p. 78–79; Schreyer 2000, p. 7). There seems to be no consensus in the literature whether a small or a big size facilitates high growth. For instance, Harhoff et al. (1998) find that a bigger size leads to lower growth rates whereas Henrekson and Johansson (2010, p. 1) disagree on that. We also consider relative size changes using Δ1 l s i z e and Δ2 l s i z e.

  • Number of employees: emp

  • emp is a different indicator of the firm’s size (Coad 2007b, p. 3). Lopez-Garcia and Puente (2012, p. 1038) regard emp as a proxy for human capital which is an important determinant of HGFs.

  • Birch-Schreyer growth indicator: growth

  • Following Lopez-Garcia and Puente (2012, p. 1031), we considered past growth to allow for autocorrelation. Coad (2007a, p. 78) also reported a positive autocorrelation for big firms and a negative one for small ones. Coad (2007b, p. 15–16) points out that findings on autocorrelation are highly contradictory so that neither the existence of autocorrelation nor its influence and the length of the time lags are certain.

  • Sector of the firm: sec

  • We define six different sectors based on the NACE taxonomy ensuring that every sector contains a sufficient number of firms in every country. The regarded six sectors including their NACE code intervals are presented in Table 4. Although it is known that HGFs can be found in all industries, there are still reasons to consider this variable as a possible predictor (Schreyer 2000, p. 3; Acs et al. 2008, p. 2). For instance, Lopez-Garcia and Puente (2012, p. 1037) find a disproportionately high number of HGFs in some sectors. Other studies as Boeri and Cramer (1992, p. 546) only find a small sectoral effect. Another reason to include sec is that growth within a sector also influences the growth patterns of individual firms (Levratto et al. 2010, Audretsch and Mahmood 1994).

  • Debt ratio: dr

  • This is a measure of a firm’s leverage indicating the borrowed share of a firm’s funding. The remaining funding originates from past retained profits or has been introduced by the shareholders (Albrecht et al. 2007, 476; Penner 2004, 218). One important motivation to include dr is put forward by Lopez-Garcia and Puente (2012, p. 1036–1039). They argue that firm growth requires financing and that a high dr might lead to future financing constraints and the inability to realize all reputedly profitable projects. In their study, dr was found to have a significant non-linear influence when not controlling for firm-specific time-invariant factors. The importance of dr for firm growth was also confirmed by Fagiolo and Luzzi (2006, p.33), Becchetti and Trovato (2002, p. 294) and Levratto et al. (2010, p. 10). Lopez-Garcia and Puente (2012, p. 1031) found no effect of dr. One possible reason for doubt is that especially in case of start-ups, not bank credits but risk capital and internal finance are important funding sources (Lopez-Garcia and Puente 2012, p. 1038).

  • Return on assets: roa

  • roa measures how efficient a firm uses its assets and is an indicator of profitability (Stickney et al. 2009, 245). It is a widely used financial ratio and might be a predictor of future growth because investors consider firms with high returns as secure investments (Chen et al. 1985, p. 202; Levratto et al. 2010, p. 8). However, according to Coad (2007b, p. 24–25) roa can be expected to have a significant but rather small explanatory power.

  • Sales per employee ratio: spe

  • According to Vause (2009, p. 176) and Puri (2012, p. 131) both spe and Δs p e quantify the (average) firm’s employee’s ability to generate sales and therefore their productivity. High spe values might incentivize certain firms to hire additional staff to increase sales. In opposition to this reasoning, Coad and Broekel (2012) determined a negative correlation of spe and growth. The contribution of roa and spe is questioned by Coad (2007b, p. 25) and Baily et al. (1996, p. 259) because both figures can be increased by growth but also by downsizing so that its explanatory power might be inconclusive. For instance, Levratto et al. (2010, p. 18) did not find a significant influence on growth rates for HGFs.

  • Fixed assets ratio: far

  • far indicates the degree of capital commitment. High values imply low flexibility and constant pressure to keep capacity utilization high to cover the recurring assets’ expenses (e. g. for energy and maintenance). We assume that a high far might motivate a firm to grow to spread its high fixed costs over a greater number of products (Schneider and Lindner 2010, 317). Levratto et al. (2010, p. 5) add that too high fixed costs can be a serious threat for future growth.

  • Equity fixed asset ratio: efar

  • This indicator is also known as the ‘golden rule of balance sheet’ and is designed to enable a rather medium-term or even long-term perspective (Löbbe 2001; Buenstorf et al. 2013, p 250). The intention is to ensure that a firm’s assets should be financed by equity to a sufficient degree. This indicator is known to have a high relevance to practitioners (Becker 2010, 16).

Table 4 Sector classification and the corresponding NACE regions

It is surprising that the firm’s legal form is not among the most important variables. After all, the legal form determines growth-related issues like taxation, whether a firm can issue shares, and who is liable in case of financial problems (Levratto et al. 2010; Harhoff et al. 1998). Therefore, Harhoff et al. (1998, p. 455) expected for their regarded West German firms that limited liability firms will show riskier entrepreneurial activity with more growth potential. They found 4.5% higher growth rates for limited liability firms than for unlimited ones (Harhoff et al. 1998, p. 481).

5.2 Descriptive Results

After identifying the final set of predicting variables, we present some descriptive statistics disaggregated by the HGF status. The results for the median (Table 13), mean (Table 14), standard deviation (Table 15) and inter-quantile range (Table 16) are presented in the appendix. Please note that size is presented in Millions and spe in Thousands.

It can be observed that HGFs are usually substantially bigger regarding the number of employees and balance sheet total. The differences are particarly big for Portugal and GB. In all cases for Δ1 g r o w t h and Δ2 g r o w t h except for Portugal and Poland, the previous employment growth quantified by the Birch-Schreyer growth indicator is much higher for HGFs. Almost the same holds for Δ1 l s i z e and Δ2 l s i z e. The only difference is that Δ2 l s i z e for Portugal is bigger for HGFs than for LGFs while Δ1 l s i z e is not.

Moreover, the median HGF is younger than the corresponding LGF in all countries except GB where there is no difference. The age-related observations are in line with the considerations, which are presented in Section 5.1 while the size related findings are not. Still, most HGFs are not start-ups. Only about 18% of all HGFs are less than six years old and approximately every second HGF is 15 years old or even older.

The variable efar also show a clear tendency, being smaller for HGFs than for LGFs in all cases except the Finnish one. This finding is counter-intuitive. Moreover, in most countries, HGFs have a higher roa in the year 2006 than the LGFs.

As far as dr and far are concerned, no clear patterns can be noticed.

To identify sectors with disproportionately high or low occurrences of HGFs, we analyzed the sector distribution in Table 5. The relative frequencies are calculated using the total number of HGFs or LGFs, respectively. By trend, HGFs often seem to originate from the service and social sector and infrequently from the manufacturing and trade sector.

Table 5 Country-specific sector distributions

In the HGF literature, studies like Harhoff et al. (1998) assume that being a HGF and going bankrupt are two sides of a coin. The underlying idea is that certain firms make risky decisions. If everything goes according to plan, the firms might eventually become a HGF. In the opposite case, the firm might even fail. According to Harhoff et al. (1998, p. 455), this risky behavior is especially predominant among firms with a limited liability legal form. If this is the case, this has serious implications for industrial policy because support for HGFs would also foster risky economic activity, which is presumably not intended. This difficulty is denoted as “survivorship bias” (Becchetti and Trovato 2002, p. 291; Coad et al. 2014a, p. 96).

Table 6 is only supposed to present the train and test data. The findings do not find their way into the RFs so that the performed prediction remains to be a true out-of-sample prediction.

Table 6 Transitions between HGFs, LGFs, and bankrupt firms between train- and test-data

We observed that only in France and in Spain, former HGFs are likely to fail in near future. This is probably also caused by the fact that there is a considerable number of bankrupt firms in the corresponding test data sets. Even for these two countries, it is not appropriate to assume a survivorship bias because LGFs from the same countries are more likely to fail than HGFs. Considering our high numbers of bankrupt firms, this study’s results can be seen as in line with the findings of Acs and Mueller (2008) that only three percent of former HGFs fail in the consecutive period

Another question which is, for instance, put forward by Daunfeldt and Halvarsson (2015) is the likelihood that a former HGF remains to be a HGF in the next period. Depending on the answer, it might not be reasonable in the long-run to invest in HGFs by politicians or private investors. In this study, approximately every fourth HGF manages to repeat its strong growth. However, the results are highly country-specific. Acs et al. (2008) find that approximately every fourth big HGF was a HGF in the previous period, too, and Hölzl (2014) also determines for Austrian firms that Birch-Schreyer based high growth is rather persistent.

However, we can neither confirm the results of Lopez-Garcia and Puente (2012) that the majority of all HGFs also were HGFs in the previous period. Nor can we confirm the conclusion of Coad (2007a, p. 80) for his regarded French manufacturing HGFs that they “[...] may grow a lot in one period, but it is unlikely that the spurt will last long”. Daunfeldt and Halvarsson (2015, p. 361) support (Coad 2007a) by stating that (Swedish) HGFs are usually “one hit wonders” and are unlikely to grow intensively in two consecutive periods. Only about one percent of their firms remained to be HGFs during two consecutive periods. However, former HGFs are most likely to experience a modest employment growth.

6 Model Estimation and Prediction Results

After having explained our proceeding and the used training and test data, this section presents the estimation results. It starts with the analysis of the performance of the out-of-sample prediction and an inspection whether HGF prediction of bigger firms is more reliable than of small ones as it is frequently assumed in the literature. Furthermore, it is evaluated if a higher propensity to grow is associated with a higher propensity to default. This is another frequently assumed hypothesis. The last part of this section focusses on the RF’s byproducts namely prototypes and the variable importance analysis.

6.1 Prediction Results Evaluation

This part presents the AUC and other measures to illustrate the performance of the RFs. It also discusses whether HGFs differ from firms, which grow but do not belong to the highest growing 10% as assumed by Lopez-Garcia and Puente (2012) and Schreyer (2000). Table 7

Table 7 Five country-specific AUC quantiles by employee number

Table 8 presents the obtained results of the country-specific HGF prediction. The true positive rate (TPR), false positive rate (FPR), the Accuracy and Precision are all calculated by predicting a firm as highly growing if the majority of all trees of the forest believe the firm to be highly growing.Footnote 19 Using a cutoff value of 50% is reasonable because SMOTE was applied in a way that established an even class distribution. Still, higher values can be used to only predict firms to be a HGF when the RF is “very sure” or lower values in order not to oversee as many true HGFs as possible. In both cases, the four mentioned measures will change while the AUC is independent of such cutoff points. Moreover, the later discussed Table 7 illustrates the models’ performance for different thresholds. Please be reminded that the accuracy and the precision measures are also influenced by the share of HGFs in the test data as it is mentioned in Section 4.3. A better recognition of LGFs has a higher effect on these two measures than of HGFs, which is against the intention of the study.

Table 8 Out-of-sample prediction results using 0.5 as cutoff for all measures except AUC

Based on the AUC, the following ranking can be obtained for true out-of-sample predictions: GB (highest prediction performance), FR, IT, ES, PL, DE, PT, FI, SE (lowest prediction performance). To evaluate if country-specific differences are caused by different predictabilities of HGFs in this countries and not by chance, we compared the ROCs using Venkatraman’s unpaired test.

The test results are presented in Table 9. According to these results, there is strong evidence for the superior performance of the British RF and the inferiority of the Swedish RF applying the usual 5% level of significance. The Polish model only significantly differs from the British and Swedish one. Moreover, it cannot be said with certainty whether the results of two consecutive country-specific models always deviate significantly. Examples for such countries are France and Italy but also Finland and Poland. However, the test seems to mainly confirm the mentioned ranking.

Table 9 p-values of the ROC based unpaired Venkatraman test

According to the AUC’s interpretation, the British RF has a probability of 81.1% to assign a randomly chosen HGF a higher HGF-propensity than a randomly chosen LGF (cf. Table 8).

$$P(P(\widehat{HGF}|HGF)>P(\widehat{HGF}|LGF))=0.811$$

For a Swedish HGF, this probability is only 52.32%. In France, the highest share of HGFs (65.28%) is correctly anticipated (TPR) whereas the same anticipation works worst in Portugal (15.82%). Portuguese decision maker might want to use a lower threshold at the expense of a higher number of false positives. For the current threshold of 50%, the Portuguese, Spanish and Finnish RFs act conservatively referred to Fawcett (2006, p. 863) since they are rather hesitant to predict high growth (low TPR) but make only a few incorrect high growth predictions (low FPR). The Polish and French models can be designated as liberal since they seem to predict high growth more willingly so that more true HGFs are anticipated (high TPR) but also more wrong high growth predictions occur (high FPR).

For the 50% threshold, almost every third British and Italian firm, which is assumed to be a HGF, is truly a HGF which can be seen from the precision measure. This is a considerable result because by simple random sampling, only about every tenth firm would have been a HGF. However, especially in Sweden but also in Finland and France, the RF’s results are only slightly better than random choice. The Portuguese RF has the highest share of correct predictions (accuracy). Because of the low TPR and FPR value, this is achieved by seldom predicting high growth so that most of the LGFs (90% of all firms) are correctly predicted.

Table 10 shows the RF’s prediction performance from another perspective and illustrates it for six different cutoff points. In this section, only the HGF-rows are analyzed. The remaining rows are covered in Section 6.3. The HGF-rows show for each country, what share of all HGFs in the corresponding test data set is correctly predicted when regarding the top x% of all firms with the highest propensity to grow substantially. When regarding Italian, French, and British models, the top one percent group contains around five times more HGFs than random sampling would identify. When regarding the five percent group, similar statements can be made. The density of correctly anticipated HGFs is much bigger between 0 and 25% than between 25% and 50% for all countries except for Finland and Sweden so that a big number of true HGFs receive rather high propensities. This is a desirable outcome.

Table 10 Relative frequencies in the test data of all predicted HGFs and bankrupt firms per country among x% of all firms with the highest propensity to be a HGF

The boxplots in Fig. 2 confirm this finding. They show the distribution of the estimated propensities to be a HGF subject to the growth category. Once again, the results of the bankrupt firms are discussed in Section 6.3. On average, the Italian, French, and British HGFs have a much higher propensity to be a HGF than the LGF. Their medians are always over the 75% quantile of the LGFs. In contrast to that, no difference can be seen for the mentioned two groups in Sweden. Countries like Germany and Portugal lie between these two extremes. By trend, HGFs have a higher propensity to be HGFs than the LGFs counterparts do. However, every possible cutoff point (parallel line to the x-axis) will always lead to a remarkable number of false positives which receive high propensity scores but are LGFs.

Fig. 2
figure 2

Country-specific boxplots of the estimated out-of-sample propensity to be a HGF subject to the firm’s true status

Another claim in the literature, for instance by Schreyer (2000), is that HGFs differ from firms which grow but do not belong to the highest growing 10%. To contribute to this discussion, the correlation measure by Bravais and Pearson is presented in Table 8. It is calculated by confronting the firms’ Birch-Schreyer growth indicator value with its estimated propensity to be a HGF: \(cor(growth, \hat {p})\). The basic idea of this analysis is that if there is no structural difference between HGFs and the remaining firms, the indicator value should correlate with the estimated propensity: the higher the propensity, the higher the growth. Although all except the Spanish correlation are indeed positive, they are quite small which hints to the frequently mentioned structural difference of HGFs. These findings support the results from Lopez-Garcia and Puente (2012) who have also stressed the peculiarities of HGFs.

Another assumption, which is vividly discussed in the literature, is whether high growth of big firms is more predictable than of small firms. For instance, Coad (2007a) and Acs et al. (2008) concluded that big firms have a rather steady growth path due to longer planning horizons and investment plans. Coad (2007b, p. 51) and Coad and Hölzl (2009) described the growth of small firms as erratic and, therefore, harder to predict.

To contribute to this discussion and to evaluate whether big firms can be more reliably predicted by the RF algorithm, the firms from the test data were evenly divided in five groups based on their number of employees. Table 7 shows the corresponding AUCs for each of these groups. If small firms’ growth is truly more irregular than big firms’ growth, small firm’s growth should be harder to predict leading to smaller AUC values. It can be confirmed that the country-specific AUCs of the smallest firms (q 0q 20) are most often slightly smaller or substantially smaller (Poland) than the average value. However, for countries like Spain, France, Great Britain, Italy, and Portugal, no remarkable differences can be observed. Concerning the biggest firms (q 80q 100), no cross-national statements can be made either. While prediction seems to work better-than-average in Spain, Italy, Poland, and Sweden, the same prediction delivers below average results for the remaining countries. Based on these results, above average results can be found for the second smallest (q 20q 40) size category of firms for all countries except Poland, which does not go in line with the usual findings.

To find out if these differences could have also been caused by chance, we performed the unpaired Venkatraman test, which is mentioned in Section 4.3. We tested the difference between the second smallest size category and the biggest size category which is assumed to be most predictable in literature. We found the deviation to be “significant” for Germany, Finland, and Portugal using the usual 5% level of significance and also for Sweden using 10%. This is why we cannot find any clear tendency as far as predictability of different size groups is concerned.

6.2 Variable Importance and Prototypes

As mentioned before, the RF does not only enable predictions but also several interpretation mechanisms. This section starts with a visual analysis of a “typical” tree and continues with a more illustrative variable importance ranking. It concludes with an observation of typical HGFs, LGFs, and bankrupt firms.

Figure 3 exemplifies one “typical” decision tree of the British RF.Footnote 20 The presented tree is the outcome of the CART algorithm and is similar to the ones, which are used inside the RF. This tree is more suited for demonstration purposes because CART chooses the best split out of all possible ones based on its Gini measure and not out of m randomly chosen variables, all of which might be inappropriate. The tree illustrates the 1 to 9 univariate evaluations which need to be conducted to generate a prediction for a given firm. CART also carries out an implicit dimension reduction since only nine out of the fifteen available variables are considered for a prediction. For firms with Δ1 g r o w t h < 39.04, even only one evaluation is sufficient. As a provisional result, it can be stated that the prediction relies on the absolute and delta values of the growth indicator, the company size, and age, the number of employees, and the sales per employee. This tree already confirms the findings of Becchetti and Trovato (2002, p. 291) that there are several determinants of firm growth in addition to the firm’s size and age. Financial variables like roa and dr and the sector designation are also not considered which is not in line with Becchetti and Trovato (2002) who stress the importance of creditworthiness for future growth.

Fig. 3
figure 3

British example decision tree created by CART

Another approach to understanding the functionality of the RF is the analysis of the RF’s variable importance, which also enables a cross-national comparison of HGF patterns. The x-axis of Fig. 4 contains the analyzed countries and the y-axis the regarded predictors. These predictors are ordered based on the average RF’s variable importance over all nine countries so that the variable at the top has the highest importance and the one at the bottom the lowest. The numbers in the diagram indicate the importance rank for the corresponding country and variable. In this context, 1 represents the highest rank and 15 the lowest. Moreover, the higher a variable’s importance, the bigger the radius of the circle.

Fig. 4
figure 4

Variable importance rankings of the nine regarded country-specific RFs

The findings confirm and complement the existing literature, which is mainly driven by regression analysis. The size both measured using the number of employees and the balance sheet total, is highly important to predict future high growth. This is in line with Levratto et al. (2010, p. 6) who state that size is a determinant of growth when firms of different sizes are analyzed simultaneously. According to this authors, small firms grow at a higher rate than bigger firms. The number of employees is the almost certain most important variable. Other very useful variables are the two Birch-Schreyer indicator values Δ1 g r o w t h and Δ2 g r o w t h. Moreover, the absolute change of the balance sheet total over one or two years is also valuable to anticipate future HGFs in most countries. This means that both the firm’s size and its variation over time play a substantial role. These patterns seem to be valid for all regarded countries. Acs et al. (2008, p. 46) determined that past HGF status is an important predictor of future growth. Coad (2007a, p. 80) could also show that even less recent growth changes have an influence on current growth.

However, different studies came up with contradicting results as Coad (2007b, p. 15–16) summarized in his review. This disagreement can also be observed for the number of relevant lags.

Besides that, the firm’s age also has a noteworthy prediction contribution in most but not in all countries. As already determined by the CART tree, age is quite unimportant in Great Britain but also in Germany, Poland, and Portugal. In contrast to that, age is the fourth most important predictor in Spain. In the literature, it is assumed that age is very important for HGF prediction and that its contribution ranges slightly behind the firm’s size (Harhoff et al. 1998, p. 479; Acs et al. 2008, p. 10). The explained country-specific differences have not been mentioned in the literature yet.

The ranking also confirms the regression-based finding from previous studies that size- and age-based variables are more important than financial data based predictors (Coad 2007b, p. 23). However, there are exceptions to this heuristic. For instance, far is rather important in Portugal and Great Britain. spe and its variation are worth considering in Germany, Spain, Poland, and Portugal. On average, the first difference of the sales per employee ratio is the most important financial variable. Variables like dr and roa but also the firm’s sector are only of minor (average) importance. For some countries, these variables are quite important like roa for France and Great Britain and sec for France and Portugal. The low importance of dr is in line with the findings of Lopez-Garcia and Puente (2012) for Spanish firms, with Levratto et al. (2010) findings for French firms and with Fagiolo and Luzzi (2006) results for Italian firms. Lopez-Garcia and Puente (2012, p. 1038) explain the low importance of dr for young HGFs by their reliance on risk capital instead of bank credits. However, contradictory findings like Becchetti (1995) for Great Britain exist as well.

Similar to this study’s results, Boeri and Cramer (1992) did not find an important sector effect for the regarded German firms and Coad (2007b, p. 29) reported that most other regression studies made similar findings. A possible reason might be that HGFs can be found in all sectors although they are known to occur more frequently in certain ones (Schreyer 2000, p. 5; Acs and Mueller 2008, p. 86; Acs et al. 2008, p. 2). Meanwhile, Becchetti and Trovato (2002) found a sector effect in both regarded models. Note that the mentioned variables have all been assessed to be relatively important by an initial variable importance check (c.f. Section 4.4) so that even being on the last position does not mean that the variable is absolutely unimportant.

Besides being an illustration of the different RF, the ranking might provide first hints for the bad performance of the Swedish, Finnish, Portuguese, and German models. This does not seem to be the case in this study because these countries’ variable importance rankings do not seem to substantially differ from the other countries. This is why other explanations must be taken into consideration like the presence of other variables, which are omitted in this analysis. An alternative explanation might be that HGF prediction is simply more challenging for these countries.

RF’s variable importance ranking illustrates the intensity of the single variables’ contribution to the prediction of a HGF. Which values a “typical” HGF or LGF has in a certain country remains unclear. Table 11 presents the values of the six most important variables of a country-specific prototype of a HGF and LGF.Footnote 21 The values of all 15 available variables can be found in Table 17 in the appendix.

Table 11 Prototypes - six most important variables

In all countries, such a typical HGF has a higher number of employees and a larger size than a corresponding LGF like it was already discovered in Section 5.2. Schreyer (2000, p. 13) explains this by high research costs in certain areas like pharmaceuticals which only big firms can afford to pay. Whether a HGF is younger or older than a LGF seems to depend on the regarded country. However, a typical HGF is older than 13 years. For the USA, Acs and Mueller (2008) concluded that HGFs are at least five years in business. Using a more restricting definition of HGFsFootnote 22, Acs et al. (2008, p. 1) find that their average HGFs are between 17 and 34 years old and, therefore, explicitly mention that HGFs are not start-ups. Schreyer (2000, p. 19) also determined for Spain that the propensity to be a HGF does not decline with age and that there is no unidirectional effect of age. Henrekson and Johansson (2010, p. 227) concluded that HGFs are “not necessarily small and young”. Coad (2007b, p. 51–52) added that since most start-ups do not come up with innovating ideas but rather replicate already existing business models so that there is no reason to expect remarkable growth from the majority of start-ups. Since a high rate of start-ups fails soon after market entry, start-ups often lead to a waste of economic resources. Shane (2009) argued similarly. Several past studies often came to the opposite conclusion that HGFs are usually younger than LGFs (Birch 1981, p. 8; Lopez-Garcia and Puente 2012, p. 1037; Becchetti and Trovato 2002, p. 17; Harhoff et al. 1998, p. 455; Boeri and Cramer 1992). This study’s findings contradict the notion that a support of HGFs can be achieved by subsidizing (small and young) start-ups. Many countries have support programs, which mainly focus on start-ups. Still, the high contribution of small HGFs to job creation should not be overseen (Schreyer 2000, p. 7). Our findings might be caused by choice of the growth indicator. Hölzl (2014, p. 226) argues that the Birch-Schreyer growth indicator favors large firms. Since it is easier for a large firm to achieve a high indicator value, our RFs might be biased towards bigger firms. Additional to the previously stated arguments, this might be another reason why large firms become the center of attention as far as HGFs are concerned.

Moreover, typical HGFs most often have substantially higher Birch-Schreyer growth indicator values in the past than LGFs similar to the findings from the variable description section. This finding is in line with Lopez-Garcia and Puente (2012, p. 1037) who have found that “past extreme growth episodes increase the probability of current fast growth” and are “a significant predictor of current fast growth”. Boeri and Cramer (1992) comes to a similar conclusion. In Portugal, both values are negative, but the HGFs’ values are still higher.

As far as financial variables are concerned, in all countries except Poland, Sweden, and France, typical HGFs have a lower dr than their LGFs counterparts do (cf. Table 17). This confirms the findings of Becchetti and Trovato (2002) who conclude that financial restrictions limit the growth of Italian SMEs. Constrained HGFs are not able to finance all their potentially profitable projects (Becchetti and Trovato 2002, p. 297). Meanwhile, for French manufacturing HGFs this limitation could not be confirmed (Levratto et al. 2010). Our descriptive findings also do not hint to such a tendency.

Most but not all countries’ HGF prototypes have a higher roa than the corresponding LGFs. It is surprising that only in Poland, HGFs have a higher spe than LGFs because one could expect that a firm will hire new employees if they “contribute” to a relatively large amount of sales. However, Coad and Broekel (2012) and Becchetti and Trovato (2002) came to a similar conclusion. For North American HGFs, Acs et al. (2008) found a higher revenue per employee values than for LGFs.

As far as the industry is concerned, previous findings that a large number of HGFs belongs to the service sector are confirmed. This is also the case for Spain, which was analyzed by Lopez-Garcia and Puente (2012) and also for Sweden, Germany, and the Netherlands (Schreyer 2000, p. 22). Because of similar results, Coad et al. (2014a, p. 98) argue that it is a common misconception among politicians that most HGFs are high-technology firms, which leads to misguided economic development schemes. To the contrary, Schreyer (2000, p. 22) find that a lot of HGFs came from manufacturing sector which we can confirm for Poland and Finland. Levratto et al. (2010, p. 5) argue that only a few big manufacturing firms grow substantially due to a survival bias caused by high sunk costs and high capital investment. Our prototype-based findings are in line with the purely descriptive observations from Section 5.2.

6.3 Connection between high growth and bankruptcy

Section 5.2 explains why it is of major interest if a close relationship between HGFs and bankrupt firms (survivorship bias) exists as it is frequently mentioned in the literature. This section takes up on this question.Footnote 23

A graphical out-of-sample approach can be seen in Fig. 2. Besides the two already discussed boxes, every country’s boxplot contains a third box which depicts the HGF propensity distribution for companies which are known to have failed between 2011 and 2014. This period is also regarded to determine the HGF/LGF status of the firms. Therefore, a HGF propensity score was also estimated for every failing firm. These propensities are also presented in Fig. 2. The results clarify that defaulting firms usually do not have a higher propensity to grow and will usually not be wrongly predicted to be a HGF. The HGF-box always lies (more or less distinctively) above the box of the failed firms. In GB, where most accurate predictions have been obtained according to AUC, the two boxes do not overlap at all. For the second-best and third-best prediction (FR and IT), there is a small overlap so that high growth and bankruptcy rather seem to be antipodes. Still, the median HGF propensity of all HGFs lies above the 75% quantile of the HGF propensity of the bankrupt firms. The less accurate the prediction, the higher the intersection between these two boxes seems to be. This can be seen for countries like Finland, Portugal, and Sweden. The reason for these differences might be that for the last three countries, the corresponding RF was almost not able to recognize any useful HGF patterns so that virtually arbitrary propensities are assigned to both failing firms and HGFs. This leads to similar distributions.

Besides the already discussed HGF prediction results, Table 10 also presents what share of all failing firms can be found among the x% of all firms with the highest HGF propensity. Even for the three countries Finland, Portugal, and Sweden, the presented quantiles almost always contain a higher share of all HGFs than of all bankrupt firms. The latter share is also almost always smaller than the regarded quantile so that the hypothesis that bankruptcy and high growth are “two sides of one coin” is rejected.

Table 12 presents the results of a correlation analysis. For all countries except Finland and Sweden, Footnote 24 a second RF was estimated which predicts bankruptcy instead of high growth using train data. These RFs were used to generate an out-of-sample bankruptcy propensity score for all test data firms of their countries so that both a bankruptcy propensity and a HGF propensity are estimated for every firm. Table 12 shows the correlations between these two propensities per country. A correlation of 1 would mean that high HGF propensities are always accompanied with high bankruptcy propensities whereas −1 would mean that high HGF propensities are always accompanied with low bankruptcy propensities. However, both results are highly unrealistic because neither high growth nor bankruptcy can be faultlessly predicted as shown in Kumar and Ravi (2007) for bankruptcy prediction.

Table 12 Correlation between the propensity to be a HGF and the propensity to fail

In countries like GB and PL, there is a rather strong negative relationship between the two propensities. In Germany, France, and Portugal this relationship is also negative, but the corresponding correlation is quite low. A weak positive correlation is observed for Spain and Italy.

To confront HGFs and bankrupt firms, prototypes of bankrupt firms are presented as well for all countries where enough bankrupt firms are observed in the test data. The results can be found in Tables 11 and 17. It can be seen that the likelihood of confusion does not seem to be very high. The values of most of the variables differ strongly between bankrupt and highly growing firms. Based on the prototypes, it is more difficult to distinguish between LGFs and bankrupt firms. For example, typical HGFs always have a much higher number of employees than LGFs and bankrupt firms. Bankrupt firms are often younger than HGFs, always have a lower roa, and are stronger indebted than HGFs.

As a result, it can be said that no evidence could be found for the hypothesis that firms that are likely to grow have a high risk to fail. Based on the mentioned analysis, it is unlikely that subsidies for HGFs would lead to riskier economic activities. This does not mean that no firm that is going to be supported is going to fail, but the apprehensions from the literature cannot be confirmed for the regarded countries. Additionally, in countries where HGF prediction is rather reliable, the corresponding RF can usually distinguish between bankrupt and highly growing firms although they are not explicitly trained to do so.

7 Conclusion

Highly growing firms (HGF) or “gazelles” have been regarded in many studies because they are assumed to have an above-average contribution to job creation and revenue growth (Coad 2007a, p. 81; Acs and Mueller 2008, p. 86; Coad et al. 2014b, p. 92; Acs et al. 2008, p. 8; Lopez-Garcia and Puente 2012, p. 1030). Nowadays, important political institutions like the European Commission and the OECD foster HGFs to spur growth (Coad et al. 2014b, p. 93). To achieve a precise promotion of HGFs, it is crucial to have a mechanism to predict future HGFs using readily accessible data. The literature has been rather skeptical of the existence of such a mechanism due to the high heterogeneity of HGFs (Acs et al. 2008, p. 45).

This study concludes that although not every HGF could have been predicted in the past, a prediction is often still possible in most of the regarded countries. Therefore, we agree with Coad (2007b, p. 58) that regression-based analysis might be inappropriate to analyze HGFs. In our study, we analyzed 179970 unique firms from nine European countries between 2004 and 2014 using the random forest (RF) algorithm. The RF is a modern data mining algorithm which evaluates a user-defined number of decision trees to assign a binary prediction to every regarded firm. This study describes a true out-of-sample prediction using 15 structural and financial variables, which have been determined using preliminary country-specific RFs. After having found a promising set of variables, the country-specific RFs were obtained by extensive cross-validations. It was shown for the first time that HGF prediction is most reliable in Great Britain, France, Italy and Spain and also worthwhile in Poland, Germany, Portugal, and Finland. In Sweden, a reliable out-of-sample prediction could not be realized. In the first four countries, the probability to assume a randomly chosen true HGF being more likely to be a HGF than a true LGF is above 76% for a 50% threshold. When regarding the top 15% of all firms according to their propensity to be a HGF, up to 49% of all contained HGFs can be correctly predicted. In contrast to studies like Coad (2007a) and Acs et al. (2008), it was found that especially precise predictions could be obtained for the second fifth of all firms with respect to their number of employees.

The most important variables for this prediction turned out to be the previous absolute value of the firm’s size, the number of employees, and their first differences as well as the firm’s age. However, certain financial variable based predictors like the fixed assets ratio, equity fixed asset ratio, and the sales per employee also contribute, but to a lower extent. This variables’ ranking is to some extent country-specific. This is also the case for “typical” HGFs (prototypes) which is another contribution of this study. Prototype HGFs have more employees than a prototype LGF and have usually grown stronger in the past. Most HGFs are not start-ups which questions political initiatives to foster start-ups to enable future growth.

No evidence was found for a common concern in the literature that HGFs have often made risky but eventually successful decisions in the past so that fostering them also fosters risky behavior. The correlation between the propensity to be a HGF and the propensity to be a bankrupt firm is either close to zero or even negative which is the case in most countries. High propensities to grow intensively are only rarely assigned to failing firms because they turned out to be more similar to LGFs than to HGFs.

For future research, our findings imply that single countries’ results should not be generalized due to the often substantial country-specific differences. Out-of-sample HGF prediction should no longer be considered to be impossible but analyzed thoroughly using further countries, variables, growth indicators and prediction models. Especially other prediction models and further variables might further improve the prediction quality. For instance, Lopez-Garcia and Puente (2012) and Barringer et al. (2005) point out the importance of extensive staff training and financial incentives for the employees, which could be considered as additional variables. The engagement in export of a firm and its research and development efforts also seem to be important predictors (Wagner 2007; Becchetti and Trovato 2002; Schreyer 2000). Moreover, Barringer et al. 2005, p. 664) highlighted the characteristics of the founder as an important factor of success. We did not analyze these variables due to very high amounts of missing values in our data set.

Before fostering HGFs, their effect on the economy should be analyzed carefully. The effects of existing HGFs have already been studied but the consequences of an increased share of HGFs is still unclear (Coad et al. 2014a, p. 101). It might even happen that a higher rate of HGFs comes at the expense of a lower industry growth because of high numbers of bankruptcies, which may not be desired by policy-makers (Bravo Biosca 2010, p. 2).