1 Introduction

As discussed in detail in Sect. 2, Gibrat’s Law has been rejected by recent studies. Firms’ growth does not appear to be independent of initial size; in fact, small and young firms have a higher growth rate than their larger and older counterparts. However, earlier studies, based on small subsamples of well-established, mature, large firms, tended to confirm the Law. This poses a puzzle.

One way to reconcile these two streams of literature is to test whether a given population of firms tends to converge to Gibrat-like behavior through time, the idea being that market selection—with the correlated exit of the less efficient firms and the convergence to the minimum efficient scale (MES) of the most efficient ones—is able to bring about a “steady state” where a much more homogeneous population of firms behaves according to the Law of Proportionate Effect. In this context it may well be that Gibrat’s Law can be rejected “ex ante” when considering the entire population of firms and the entire period examined and confirmed ex post within the sole population of surviving firms at the end of the period.

The innovative feature of this study is that it tests Gibrat’s Law through time (i.e., year-by-year) allowing the selection of the initial population, while previous studies either investigated a subsample of large mature firms (ex post approach) or a given population of firms over a certain time period (ex ante approach). In the former approach the investigated firms are the outcome of a previous (not investigated) market selection and constitute the industrial “core” within which Gibrat’s Law tends to be confirmed. In the latter approach—testing the Law over a period of time using a given, immutable population—the role of smaller firms, which has to grow fast in order to survive, is magnified and the Law is generally rejected.

As far as we know, this article is the first attempt to track an initial population of existing firms in a sector year-by-year, to see whether the overall rejection of Gibrat’s Law over a given period of time may actually conceal a possible convergence in favor of the Law through time. If such is the case, earlier and recent seminal papers on the subject might be reconciled.

The article is organized as follows: the next section expands the discussion of our hypothesis through an extensive survey of previous literature; Sect. 3 describes the data used and the empirical experiments; Sect. 4 presents the econometric results while Sect. 5 briefly concludes.

2 Previous literature: Gibrat’s Law as a theoretical and empirical crossroad

The debate on Gibrat’s Law of Proportionate Effect has been going on for some time (cf. Santarelli et al. 2006). A commonly accepted interpretation of the Law originally “discovered” by Robert Gibrat (1931) is that the growth rate of a given firm is independent of its size at the beginning of the period examined. In other words, “the probability of a given proportionate change in size during a specified period is the same for all firms in a given industry—regardless of their size at the beginning of the period” (Mansfield 1962, p. 1031). Until the 1970s, the Law was quite popular among both applied and theoretical industrial economists, basically for two reasons.

On the one hand, the stochastic properties of the Law were consistent with the prevailing dynamic patterns of development observable in most industrial sectors of the time. In particular, as Simon and Bonini (1958) pointed out, if one “...incorporates the Law of proportionate effect in the transition matrix of a stochastic process, [...] then the resulting steady-state distribution of the process will be a highly skewed distribution.” This is also the case nowadays, when most economic sectors are characterized by a log-normal size distribution with a bulk of small- and medium-sized firms and a few larger ones. The theoretical consistency between Gibrat’s Law and the observed size distribution of firms across different industrial sectors has also been proposed by Steindl (1965) and treated through examples and simulations by Prais (1976, Chap. 2).

On the other hand, from a purely theoretical perspective, Gibrat’s Law proved consistent (or at least not in contrast) with some by now classic economic models of the firm size distribution. For example, Robert Lucas put forward an important updating of Jacob Viner’s original theory addressed to explain the size distribution of business firms. While Viner (1932) had seen business size distribution as the outcome of cost-minimizing firms (characterized by U-shaped long-run average cost functions and facing a given market demand), Lucas (1978) postulated size distribution as a solution to the problem of allocating productive factors over managers of different “talent,” so as to maximize aggregate output. His model predicted the “full” size distribution of firms, conditioning it to a given distribution of the managerial talents; in fact, in his view, firms were collections of assets whose matching to managers can change arbitrarily from period to period. As stated by Lucas himself, this modern “organizational” theory of the firm was not in contrast with the “....independence of firm growth and size: Gibrat’s Law, or the Law of proportionate effect” (ibidem, p. 514); therefore he devoted the fifth section of his seminal article to the special case which satisfied the Law.

However, its theoretical popularity notwithstanding, empirical tests of the Law were highly controversial. While earlier studies based on subsamples of large and mature firms had tended to confirm the Law (Hart and Prais 1956; Simon and Bonini 1958; Hymer and Pashigian 1962), further research began to challenge its overall validity. In his seminal work, Mansfield (1962) investigated US steel, petroleum and tires sectors in different time periods and found that Gibrat’s Law was failing in the majority of cases, with smaller firms growing faster than their larger counterparts. This outcome was largely confirmed by many subsequent empirical studies using more comprehensive specifications (including firm’s age and other controlling regressors) and more sophisticated econometric techniques able to take into account both sample selection and heteroskedasticity problemsFootnote 1 (Hall 1987; Evans 1987a, 1987b; Dunne et al. 1989; Dunne and Hughes 1994; Hart and Oulton 1996; Audretsch et al. 1999; Calvo 2006). In the meantime, other studies managed to generate feasible features in sectoral concentration, lognormality in business size distribution and adequate representations of the entry and exit dynamics, using models where the basic Gibrat process was heavily modified in order to capture the departures from the Law found by the empirical research (Cabral 1995; McCloughan 1995).

This new evidence in contrast with the Law was quite challenging from a purely theoretical point of view. The first author who took up this challenge seriously was Boyan Jovanovic, with his well-known and important contribution published in Econometrica (1982).Footnote 2 The author proposed a Bayesian model of noisy selection, according to which efficient firms grow and survive, whereas inefficient firms decline and fail. In particular, in Jovanovic’s model of passive learning firms are initially endowed with unknown, time-invariant characteristics, i.e., ex-ante efficiency parameters, while ex-post the prior distribution is updated as evidence comes in which leads some firms to discover that they are more efficient than others. Thus, each firm has to decide on its strategy: whether to exit, to continue at the same size, to expand, or to reduce its productive capacity. This model appears to be consistent with the empirical rejection of Gibrat’s Law in the short run (ibidem, 1982; Fig. 1, p. 650), when efficient surviving smaller firms accelerate their growth in comparison with their larger and more experienced counterparts. However, there is no reason—in Jovanovic’s model—to deny the Law in the very long run, when noisy selection has been completed and a steady state is reached by the economy.

The same line of argument applies to more recent models of active learning. Richard Ericson and Ariel Pakes (1995) assume that all the decisions taken by firms are meant to maximize the expected discounted value of the future net cash flow, conditional on the current information set. In their model, a firm knows its own characteristics and those of its competitors, along with the future distribution of industry structure, conditional on the current structure. Jovanovic’s assumptions concerning small industry size and product homogeneity are relaxed, and new entries may either adjust their size to the minimum efficiency scale (MES) level of output of the “core” of the industry, or choose/find a niche within which the likelihood of survival is relatively high even though the firm does not grow fast. In a subsequent work, Pakes and Ericson (1998) examine two cohorts of firms from Wisconsin in the retail and the manufacturing industries, and find that the structure of the former industry is compatible with Jovanovic’s passive learning model, while that of the latter is compatible with their model of active exploration. In both models optimal behavior generates a set of stopping states which can imply exit from the market. These Markov Perfect Nash equilibria appear consistent with the violation of Gibrat’s Law in the short run, and with the validation of the Law in the very long run (cf. also Lotti and Santarelli 2004).

Bearing the theoretical and empirical models discussed above in mind, in the late 1990s scholars put forward the hypothesis that “Gibrat’s Legacy” (Sutton 1997; Caves 1998) was defendable not as a general Law, but only as a dynamic rule valid for large and mature firms that had already attained the MES level of output, but not for smaller (younger) ones operating at a sub-optimal scale (Geroski 1995).

The purpose of this article is indeed to reconcile earlier literature with more recent research, showing that the rejection of Gibrat’s Law ex ante can be coupled with the defense of the Law ex post, as a long-run regularity. We will try to test this hypothesis through an empirical experiment where the same population of firms is tracked over time on a yearly base. The basic hypothesis is that Gibrat’s Law should be rejected ex ante and over the entire time period, but validated ex post when noisy (Jovanovic) or active (Ericson and Pakes) selection has played its evolutionary role in reshaping the original population. In fact, repeating the test of Gibrat’s Law year-by-year enables us to consider what happens when the original heterogeneous population of existing firms is gradually reshaped in favor of the most efficient ones, i.e., to set up an empirical counterpart of the learning and selection processes put forward by the theoretical models described above.

While previous recent literature has found that Gibrat’s Law must be rejected over a given period of time, since smaller firms tend to grow faster than their larger counterparts, no previous study has attempted to determine whether this result is robust once selection and learning through time are fully taken into account.Footnote 3

3 Data and methodology

In this article we deal with the Italian radio, TV, and communication equipment (ISIC 3832) industryFootnote 4 from 1987 to 1994, including all the firms which were active in the sector at the beginning of the period. More specifically, we use a unique data set from the Italian National Institute for Social Security (INPS). This data set identifies all existing firms with at least one paid employee in the examined sector and tracks their employment performance—our measure of size—at yearly intervals from January 1987 to January 1994.Footnote 5 The original INPS file was checked in order to identify entry and failure times correctly and to detect inconsistencies in individual tracks due to administrative factors, and cancellations due to firm transfers, mergers, and takeovers. This cleaning procedure reduced the total number of firms in the database to 3,285.

The rationale behind the decision to choose this industry is twofold. First, it has long been recognized as the core of ‘‘electronics’’, a crucial sector as far as new technologies are concerned (cf. Carree 2002). Second, at the beginning of the period examined it was a rather outdated sector in Italy, still lagging behind the technological revolution brought about by the passage from analog to digital signals. Thus, between the late 1980s and early 1990s it experienced a significant “shakeout” (as defined by Klepper and Miller 1995), which—as has been the case in several countries—accompanied its shift to the group of top technology-intensive industries (cf. Martins et al. 1996 Footnote 6). Indeed, as can be seen in Table 1, the examined population exhibited a remarkable rate of failure (more than 40% over 7 years) as a consequence of the severe technological and market shakeout experienced by the sector.Footnote 7 In this respect, in the period under examination the Italian radio, TV, and communication equipment industry can be regarded as an adequate example of a sector characterized by learning and selection as described by the theoretical models discussed above.Footnote 8

Table 1 Number of active firms, average size, and standard deviation

Using the existing firms in the sector at January 1987, and according to the general hypotheses proposed above, Gibrat’s Law will be tested both over the entire period (1987–1994) and year-by-year. This set of estimates will enable us to answer the following two questions:

  • (H1) Is Gibrat’s Law valid over the entire period (ex ante hypothesis)?

  • (H2) Is there any convergence toward a Gibrat-like pattern of growth through time (ex post hypothesis)?

If both these hypotheses were confirmed, the puzzling debate about the validity of Gibrat’s Law might be recomposed: although the Law can be rejected when considering the overall evolution of an ex ante population of existing firms, it may nevertheless be an accurate description of the patterns of growth of the well-established firms belonging to that ex post sub-population which results from market selection and learning processes.

The central relationship tested in this study is the logarithmic specification of Gibrat’s Law:

$$ \ln S_{{i,t}} = \beta _{0} + \gamma _{1} \ln S_{{i,t - 1}} + \varepsilon _{{i,t}} $$
(1)

where S i,t is the size of firm i at time t, S i,t−1 is the size of the same firm in the previous period and ε i,t is a random variable distributed independently of S i,t−1. Following Chesher (1979, p. 404), if both sides of Eq. 1 are exponentiated, it becomes clear that if γ1 is equal to unity, then growth rate and initial size are independently distributedFootnote 9 and Gibrat’s Law is in operation. By contrast, if γ1 < 1, smaller firms grow at a systematically higher rate than do their larger counterparts, while the opposite is the case if γ1 > 1. For estimation purposes and to make the interpretation of the results easier, the model is parameterized as follows:

$$ \ln S_{{i,t}} - \ln S_{{i,t - 1}} = \beta _{0} + (\gamma _{1} - 1)\ln S_{{i,t - 1}} + \varepsilon _{{i,t}} $$
(2)

where the new coefficient β1 is defined as β1 = (γ1 − 1), so that one can test the validity of the Law with the significance of parameter β1 (as before, if β1 = 0 Gibrat’s Law holds; if β1 < 0 smaller firms grow at a higher rate than their larger counterparts, while the opposite is the case if β1 > 0). Let χ i,t be an indicator function which takes value 1 if firm i is still alive at time t and 0 otherwise. Accordingly, observed data on firm size can give only the conditional expectation of the difference ΔlnS i,t  = ln S i,t  − ln S i,t−1, given S i,t−1 and χ i,t  = 1; i.e., according to our specification:

$$ E{\left( {\Updelta \ln S_{{i,t}} |S_{{i,t - 1}} ,\chi _{{i,t}} = 1} \right)} = \beta _{0} + \beta _{1} \ln S_{{i,t - 1}} + E{\left( {\varepsilon _{{i,t}} |S_{{i,t - 1}} ,\chi _{{i,t}} = 1} \right)} $$
(3)

If the conditional expectation of ε t is zero, the regression function for the selected sub-sample is the same as the population regression function, the only drawback being a loss of efficiency due to the smaller number of observations available. However, if this is not true, the last term of Eq. 3 needs to be included in the regression function. It is for this reason that a rule for χ t is required, and the most natural way to deal with this kind of selection is to use a survival equation, i.e., a probit model, given that we can detect exactly when a firm exits the market. In a more general formulation, this is the same as saying that:

$$ {\text{Prob}}(\chi _{i} = 1) = \Upphi (\alpha^{\prime} z_{i} )\;{\text{Probit Selection Equation}} $$
(4a)
$$ y_{i} = {\beta }\ifmmode{'}\else$'$\fi X_{i} + \varepsilon _{i} \;{\text{observed only if }}\chi _{i} = 1 $$
(4b)

If we denote the residual of Eq. 4a with μ i,t and if we assume that the error terms are normal, respectively ε i  ∼ N(0,σ ε ) and μ i  ∼ N(0,σ μ ) with corr(ε i , μ i ) = ρ, we can reformulate Eq. 3 as:

$$ E{\left( {\Updelta \ln S_{{i,t}} |S_{{i,t - 1}} ,\chi _{{i,t}} = 1} \right)} = \beta _{0} + \beta _{1} \ln S_{{i,t - 1}} + \rho \sigma _{\varepsilon } \lambda _{i} $$
(5)

where \( \lambda _{i} = \frac{{\phi {\left( {\alpha 'z_{i} } \right)}}} {{\Upphi {\left( {\alpha 'z_{i} } \right)}}} \) is the inverse of Mills’ ratio.Footnote 10

The two-step estimation procedure requires one to estimate the probit selection model first.Footnote 11 Once λ i has been obtained for each observation, the main Eq. 5 is estimated, augmenting the observations with the Mills’ ratio inverse, to obtain an additional parameter estimate \( \beta _{M} = \hat{\rho }\hat{\sigma }_{\varepsilon } \) from which we can simply recover the two-step estimate of \( \hat{\rho } = \frac{{\beta _{M} }} {{\hat{\sigma }_{\varepsilon } }}. \) We used Maximum Likelihood estimationFootnote 12 with robust standard errors since heteroskedasticity has been detected in all the specifications (as reported in the following tables).

Consistently with most previous studies, robustness checks have been developed augmenting the main equation with the age variable, squared and interaction terms. The same specifications were tested over the entire period (1987–1994) and year-by-year (seven separate estimates).

Chesher’s logarithmic specification of Gibrat’s Law only considers surviving firms and calls for a sample selection approach. A possible alternative methodology is to test directly a growth equation and consider exit as a −100% rate of change (see Evans 1987a, b):Footnote 13

$$ G_{{i,t}} = \beta _{0} + \beta _{1} \ln S_{{i,t - 1}} + \varepsilon _{{i,t}} $$
(6)

where: G i,t  = (S i,t  − S i,t−1)/S i,t−1 with: S i,t  = 0 in case of exit.

In this alternative specification if β1 = 0, Gibrat’s Law holds; if β1 < 0, smaller firms grow at a higher rate than their larger counterparts, while the opposite is the case if β> 0. Robustness checks taking into account the possible role of age, squared and interaction variables were implemented in this specification as well.

4 Results

Tables 2 and following ones present the regression results over the entire period examined (1987–1994) and year-by-year. The model specification is given in the headline, while coefficient estimates are presented together with robust standard errors and level of statistical significance (* = 90%; ** = 95%; *** = 99%). Gibrat’s coefficients β1 are central to our analysis, and the validity of the Law is tested through their estimates.

Table 2 Sample selection model

In Tables 2, 3, and 4 coefficient estimates from the sample selection model are presented, and overall diagnostic tests are reported, including the estimate of the correlation between the residuals of the main and the selection equation (ρ) with the related significance level of the corresponding Likelihood Ratio test.Footnote 14 As can be seen from all the three tables, the need for the sample selection model has been confirmed, especially in the first years of the period examined. Next, estimates of λ, Wald tests for the overall validity of the model, and White heteroskedasticity tests are reported. Finally, the reader can follow the market selection process by looking at the number of observations, the decrease in these marking the incidence of firms’ failures.

Table 3 Sample selection model augmented with age
Table 4 Sample selection model augmented with age, squares, and interaction

Examination of Table 2 prompts two basic considerations:

  1. (1)

    Consistently with previous studies (see Sect. 2), Gibrat’s Law is rejected ex ante over the entire period 1987–1994, with a β1 = −0.153 significantly different from 0 at the 99% level of confidence; smaller firms seem to grow faster than their larger counterparts. Moreover, an initial larger size improves the likelihood of survival, although in a non-linear fashion.

  2. (2)

    β1 is not only smaller—in absolute terms—in the yearly estimates (this being simply a consequence of the expected close similarity in size in two adjacent years), but also increases over time (although not monotonically) and is not statistically different from 0 in the last period.Footnote 15 Overall, some evidence of an ex post convergence to Gibrat-like behavior emerges.

Some further considerations arise when one looks at the more comprehensive Table 3, which takes the age variable into account.

  1. (3)

    Consistently with previous studies, the inverse relationship between age and growth (β2) is confirmed over the entire period, yet age seems to lose its role in the second sub-period (1990–1994).

  2. (4)

    As in the previous estimates, Gibrat’s Law is rejected in general, but convergence toward the validity of the Law occurs through time. In this table, β1 is not statistically significant from 1990 to 1991 onwards;Footnote 16 the departure from Gibrat’s Law is confined to the first 3 years, when the population of firms is still strongly characterized by the presence of smaller and younger firms exhibiting a higher propensity to grow. However, in the long run, size and age diminish in significance and ex post the steady-state sub-population of surviving firms displays a Gibrat-like behavior. Indeed, it is plausible to conclude that the resulting 1,933 most efficient firms show—and will probably continue to show in the subsequent periods for which data are not available—growth patterns consistent with the Law of Proportionate Effect.

  3. (5)

    The findings discussed in the previous point are further confirmed by the results reported in Table 4, where additional robustness checks have been inserted (squared and interaction terms).

Turning our attention to the alternative specification (6), results are reported in the following Tables 5, 6, and 7.

  1. (6)

    As in Chesher’s specification, Gibrat’s Law is rejected over the entire 1987–1994 period (β1 = −0.202 at a 99% level of statistical significance). However, in this case too, convergence towards the Law can be detected through time: β1 turns out to be not significant from 1990 to 1991 onwards.

  2. (7)

    This basic result is confirmed in Tables 6 and 7, where the complementary role of age and further controls are taken into account. Finally, as in the previous estimates, age loses its significance in the last years of the examined period.

Table 5 Growth model
Table 6 Growth model augmented with age
Table 7 Growth model augmented with age, squares, and interaction

5 Conclusions

The main finding of this study on the Italian radio, TV, and communication equipment industry over the period 1987–1994 is that the rejection of Gibrat’s Law, common to most previous empirical research and also found here, should be reconsidered from a different perspective once market selection is properly taken into account.

Indeed, while we find that Gibrat’s Law must be rejected ex ante, convergence toward Gibrat-like behavior through time is detected ex post; in other words, the reshaped and shrunken population of surviving and therefore most efficient firms exhibits within itself patterns of growth that are consistent with Gibrat’s Law.

This evidence entails that through time, learning (either passive or active) and market selection originate a “core” of surviving firms which tends to behave according to the Law of Proportionate Effect, especially when the sector is subjected to some sort of shakeout as in our example. This evidence reconciles the diverging results emerging from the previous empirical literature analyzed in Sect. 2 and is consistent with the recent theoretical models of market selection also discussed in that section. If these results are confirmed by future research, Gibrat’s Law can be regarded as a way to describe what occurs in the long run, once learning and market selection have fully played their roles. From this point of view, Gibrat’s Law remains a useful tool for theoretical reasoning.