1 Introduction

A key indicator of the performance of new ventures is their post-entry growth (Audretsch 1995; Parker 2004). The aim of this paper is to contribute to our knowledge of how growth patterns vary with age. Building on previous research into the growth paths of new ventures (Delmar et al. 2003; Coad et al. 2013a), we provide new evidence on how growth persistence is moderated by firm age.

This paper addresses recent concerns that the characteristics of new firm growth remain poorly understood (McKelvie and Wiklund 2010), which is partly due to a lack of high quality data on young firms. The paucity of research into how firm growth varies with age can be explained by two data-related issues. First, there is limited availability of data on firm age: Headd and Kirchhoff (2009, p548) recently commented on “the dearth of information by business age” and explained that “[s]imply stated, industrial organization and small business researchers are deprived of firm-age data.” Relatedly, Decker et al. (2014, p3) observe that information on firm age has only recently been added to administrative databases. Second, it is very difficult to obtain representative data on very young firms, since they are often only included in the dataset when they exceed a certain threshold size (Coad et al. 2013b). We maintain that our exploratory search for empirical regularities requires analysis of large-sample data (cf Helfat 2007, p189).

We overcome previous data limitations by using a unique and rich dataset compiled from the Swedish Patent and Registration Office (PRV) on all limited liability firms during 1998–2008. The data cover all young firms and also include information on the registered start year. While previous research has had difficulties in obtaining data on the early years of new ventures (for a survey, see Bamford et al. 2004, their Table 1), we are thus in a unique position to look at growth paths of firms of all ages.

Previous research linking firm age to firm growth generally took the form of adding firm age as an explanatory variable in a firm growth regression model, usually finding that younger firms have faster expected growth rates (e.g., Fizaine 1968; Evans 1987; Dunne et al. 1989; Robson and Bennett 2000; Yasuda 2005). Recent work has even suggested that, controlling for age, firm size no longer has any systematic relationship with firm growth, such that it is age, rather than size, that best explains a firm’s growth rate (Haltiwanger et al. 2013; Lawless 2014).Footnote 1 Dunne et al. (1989) also observed that a firm’s growth rate variance decreases with age. A few studies have investigated the effect of age on growth across the growth rate distribution, finding that young firms are equally likely to experience decline as old firms, but that young firms are more likely to experience fast growth than old firms (Reichstein et al. 2010; Coad et al. 2013b; Barba Navaretti et al. 2014). Daunfeldt et al. (2014) also observed that high-growth firms in general were younger than other firms, irrespective of whether employment, sales, labor productivity, or value added was used as growth indicator.

Another strand of literature has focused on growth rate autocorrelation. The possible existence of growth-rate autocorrelation has prompted a large number of empirical investigations (e.g., Ijiri and Simon 1967; Singh and Whittington 1975; Dunne and Hughes 1994; Goddard et al. 2002; Coad 2007; Coad and Hölzl 2009; Capasso et al. 2014; Bianchini et al. 2016), because of interest in understanding firms longer-term growth trajectories, as well as concerns about industrial concentration. Indeed, “serial correlation in firm growth rates ... is of considerable economic interest and deserves to be examined in its own right.” (Singh and Whittington 1975, p17). Early studies (Ijiri and Simon 1967; Singh and Whittington 1975), using mostly data on large manufacturing firms, indicated that the process of firm growth was characterized by positive autocorrelation. Results from recent studies are more ambiguous, with some finding that firm growth is characterized by positive autocorrelation rates (Dunne and Hughes 1994) and others negative autocorrelation (Goddard et al. 2002; Bottazzi et al. 2011), while still others conclude that there is no growth persistence (Almus and Nerlinger 2000; Lotti et al. 2003). Coad (2007), Coad and Hölzl (2009) and Capasso et al. (2014) investigate the role played by firm size using quantile regression techniques. The results from these studies indicate that autocorrelation in general is negative for small firms, whereas large firms show positive or no persistence in growth rates. The highest negative autocorrelation was found among the 10% fastest growing firms, making sustained high growth rates a very unlikely growth process. This result is also supported by Parker et al. (2010), Hölzl (2014) and Daunfeldt and Halvarsson (2015), who have found that high-growth firms are essentially “one-hit wonders.”

The most closely related paper to our own is probably Coad et al. (2013b) who present some preliminary results on how growth rate persistence changes with firm age, reporting that sales-growth autocorrelation was positive for firms that were less than 5 years old, but soon turned negative, and remained negative, for older firms. However, these authors cautioned that survivor bias and selection bias could be driving these results, such that young firms with relatively high growth rates were over-represented in their data.

We contribute to the literature by providing new evidence on how the growth-rate distribution changes with firm age, including how the first four moments change with age, and providing findings on how growth rate autocorrelation varies with age. We present robust evidence that young firms experience a sudden burst of growth shortly after entry, and that soon afterwards their growth rates slow down and become more erratic (in the sense of experiencing negative autocorrelation).

Our results thus indicate that young firms are characterized by positive growth autocorrelation in the years immediately after entry, but that the autocorrelation coefficient quickly turns negative and remains negative as firms get older. Nascent ventures, therefore, enjoy a brief spell of positive growth persistence—a sort of “success-breeds-success” dynamic—which lasts for about 5 years, until persistence becomes negative. This finding can be tentatively linked to the struggle for new ventures to grow and overcome the vulnerabilities related to their initial small scale (Stinchcombe 1965). We can thus reject the hypothesis that older firms should have a higher degree of growth persistence due to learning effects, better foresight, or longer-range planning horizons. Instead our results support theories arguing that older firms might have difficulties in adapting their strategies to changing market conditions, whereas new firms need to grow in order to achieve a minimum efficient scale.

The paper is organized as follows. Section 2 contains our theoretical background. Section 3 describes our data, and Section 4 presents our methodology. Our results and robustness analyses are in Section 5, and Section 6 concludes.

2 Background

Autocorrelation in growth rates is an important feature of firm growth. Theoretical models of firm growth, such as Gibrat’s “Law of Proportionate Effect” (Gibrat 1931) or variations on this model (e.g., Levinthal 1991), first suggested that there is no autocorrelation in growth rates, such that growth in one period is independent of growth in the following period. The intuition behind this assumption of no autocorrelation could be explained as follows. Young firms are exposed to business challenges for which they may be initially unprepared; they are vulnerable to unexpected shocks on all sides, which may result in an erratic growth performance that is characterized by the “prevalence of interruptions to growth” (Garnsey and Heffernan 2005, p675). “New firms are hampered by their need to make search processes a prelude to every new problem they encounter” (Garnsey 1998, p541)—which suggests that new firm growth lacks continuity and suggests that superior growth performance in 1 year need not imply superior growth performance in the following year. Young firms with under-developed routines and capabilities may therefore be expected to grow in unpredictable and erratic ways—i.e., the lack of persistence in new venture growth would correspond to a lack of autocorrelation in growth rates for young firms.

Hypothesis 1a: There is no autocorrelation in the growth rates of young firms.

In essence, positive growth rate autocorrelation (i.e., when growth over the period t-2:t-1 is positively correlated with growth over the period t-1:t) corresponds to a case of increasing returns to growth, or a “virtuous cycle” of success-breeds-success (or, alternatively, failure-breeds-failure), where any growth (or decline) that a firm achieved in 1 year will enhance the expected growth (or decline) in the following year. Positive growth autocorrelation corresponds to sustained smooth growth profiles, whereas negative growth autocorrelation corresponds to erratic growth paths (where positive growth in 1 year decreases the expected growth rate in the following year).

Young firms have traditionally been associated with a “liability of newness” (Stinchcombe 1965) as they struggle to develop routines, learn new roles, forge new relationships with customers, and—more generally—establish themselves in their new environment. Part of the liability of newness comes from their liability of smallness, whereby new firms are often relatively small, strive to grow in order to quickly reach a larger, more efficient scale of operation (close to, or above, the “Minimum Efficient Scale”). Indeed, empirical work has consistently found that young firms have higher average growth rates than older firms (Fizaine 1968; Haltiwanger et al. 2013; Lawless 2014). Hence, young firms could be expected to have a sustained period of relatively high growth rates in the years after entry:

Hypothesis 1b: young firms display positive growth rate autocorrelation.

Older firms, in contrast, are prone to suffer from a “liability of obsolescence” and also a “liability of senescence” (Barron et al. 1994). This implies lower growth persistence for old firms, since they may have problems adapting their strategies to changing business conditions as well as increasing inertia and organizational rigidities. Older firms may therefore be buffeted around by business phenomena beyond their control, which thwart the continuity of any longer-term business plans they may have. In the years after entry, young firms might struggle to reach a “minimum efficient scale” (MES) and achieve economies of scale (Lotti et al. 2009). As time goes by, however, their initial burst of energy may fade, and once they have survived the first few years and have settled into their new organizational routines, their growth will lose its momentum. Previous evidence has shown that the growth of older firms is more random and harder to predict (Coad et al. 2016). While younger firms might display positive autocorrelation in their early years, older firms that have become well-established in their niches may stabilize at a certain size, and any growth that occurs later lacks persistence and may correspond to mere fluctuations around a steady-state size (where growth in 1 year would be unrelated to growth in the next year). Hence:

Hypothesis 2a: there is no autocorrelation in the growth rates of old firms.

An alternative view, however, would be that older firms have more experience and foresight when it comes to their business environment, which leads to longer planning horizons, and have built up routines and capabilities that may lead to sustained superior performance, and can therefore be expected to have smoother growth paths with fewer bumps and surprises (that is—more positive autocorrelation in their growth rates). Older firms are also more likely to purchase insurance to protect themselves from unexpected shocks (such as hurricane Sandy: see Collier et al. 2016). Learning-by-doing models (Arrow 1962; Sorensen and Stuart 2000; Chang et al. 2002) furthermore suggest that older firms may benefit from their greater business experience and therefore can be expected to have a higher degree of growth persistence than their younger counterparts. Although the relationship between growth persistence and age has so far escaped attention, nevertheless the evidence on how growth persistence varies with size suggests that larger firms have smoother growth profiles than smaller firms (Coad 2007; Coad and Hölzl 2009). To the extent that age and size are closely related (e.g., Greiner 1998), we might also expect that older firms will have positive growth-rate autocorrelation.

Hypothesis 2b:old firms display positive growth rate autocorrelation.

3 Data

3.1 Data description

The main challenges when investigating the effects of firm age are data availability and the necessity of a comprehensive representation of young firms (Headd and Kirchhoff 2009; Decker et al. 2014). In order to overcome these challenges, we chose to use the PAR-dataset, which comprises all Swedish limited liability firms during 1997–2010. Swedish administrative datasets have previously been shown to be an unusually rich information source for entrepreneurship research (e.g., Davidsson et al. 2009; Folta et al. 2010).

In Sweden, all limited liability firms are required by law to submit an annual report to the Swedish patent and registration office (PRV), and PAR, a Swedish consulting firm, gathered this information from PRV. The dataset thus covers all limited liability firms, which means that young firms are not under-represented as in many other studies (e.g., Huergo and Jaumandreu 2004; Coad et al. 2013b). The data include all variables that can be found in the annual reports (e.g., number of employees, sales, financial ratios), industry classification codes, M&A activity, and whether the firm is part of a business group.

Another attractive feature of the dataset is that it includes information on the registered start year, with the oldest firm being registered already in 1877. For firms that are registered during the study period 1998 to 2008, we even have information on the month of the first accounting year. This means that we can identify firms during their first year of business and determine firm age exactly for companies that started during 1998–2008. This level of detail allows us to observe firms within their first year as a startup. For firms that registered prior to 1998, we cannot identify the correct age down to a single month and therefore rely on the registered start year to determine a firm’s age.Footnote 2 We restrict our analysis to active firms (i = 1,  … , n), which we define as firms that have at least one employee and positive sales. We focus on sales growth rather than employment growth, because the employment growth of micro firms is heavily affected by “lumpiness” in growth due to the integer constraints that arise from data on employee headcounts (Coad et al. 2013a).Footnote 3 More specifically, we measure firm growth by taking the log-difference of firm size, which is the usual way of calculating growth rates (Tornqvist et al. 1985; Coad 2009), i.e.,

$$ growt{h}_{i, t}=\mathit{\log}\ \left( siz{e}_{i, t}\right)-\mathit{\log}\left( siz{e}_{i, t-1}\right) $$
(1)

where size i , t is inflation adjusted sales i , t . Employment and sales are the two growth indicators that are most commonly used within the firm growth literature (Delmar 1997). Although sales and employment can be thought of as output and input variables in the production function, they are still modestly correlated (Shepherd and Wiklund 2009). The correlation for all years between sales growth and employment growth in our data is 0.35.

With few exceptions, most firm growth studies rely on measures of total growth (i.e., the sum of organic and acquired growth) since they do not have access to any data on mergers and acquisitions (Coad et al. 2014). However, mergers and acquisitions (M&As) are likely to be related to the age of the firm and can thus influence the relationship between firm age and growth persistence. Fortunately, the database includes information on M&As and we therefore choose to exclude all firms from that registered any such activity during the study period. This ensures us that we capture how firm age is related to organic growth autocorrelation and not to growth episodes that occur through external growth via M&As. We also include a dummy variable to capture differences between firms that operated in a business group in the beginning of the study period and those that did not. In contrast to most previous studies, we can thus focus on organic growth and also control whether the firm was part of a business group at the beginning of the growth period studied.

Our main variable of interest is firm age, which is defined as the observation year minus the registered start year. Compared to other studies on firm age, we do not need to work with left-censored age distributions due to our complete coverage. The extensive information of firm age is unique and should enable us to accurately assess the age effect on growth persistence.

3.2 Non-parametric plots

The age distribution for the population of firms in 2008 is presented in Fig. 1, showing that most firms are young. This is to be expected, since we know that young firms have high exit rates (Lotti et al. 2003), with about 50% of firms exiting in the first 3 years (Anyadike-Danes and Hart 2014; Coad 2017). Except for the humps around ages 20 and 40, the distribution appears to show roughly exponential decay.Footnote 4 The humps can be explained by legal changes in Sweden that increased the minimum amount of capital required to start an incorporation.Footnote 5

Fig. 1
figure 1

Frequency plot over firm age for the whole population in 2008. Note: A discrete frequency plot over firm age (one bar for each age) for the population of 316,298 incorporated firms registered in 2008

Turning to our sample, the mean age in 2008 is 14.4 (median = 14 years) and the corresponding mean in 2000 is 14.4 (median = 11 years), which partly reflects the shifts of the hump around 1995.Footnote 6 With a standard deviation of age equal to 13.2 (in 2008), we find that the mean and standard deviation of age are fairly close in magnitude, but not equal as would be the case for strictly exponentially distributed variables. The oldest firms in the sample are 111 years old, and there are 30 such firms, which means that we can completely rule out right-censoring, especially since our primary focus is on firms with ages of up to 40 years.

Figure 2 shows the kernel density plots for annual sales growth rates for different age groups during 2008. Plotted on semi-log axes, the growth-rate distribution exhibits the familiar “tent-shape” (Stanley et al. 1996; Bottazzi and Secchi 2006) which indicates that the growth rate distribution is far from the Gaussian case and instead is Laplace-distributed. Moreover, the distribution of the youngest firms (age <5 years) is different from that of older firms, because it has more probability mass with positive growth rates. This indicates that younger firms are more likely than older firms to experience fast sales growth rates, confirming earlier results (Coad et al. 2013b; Barba Navaretti et al. 2014). An interesting observation is that the numbers of young firms in the middle of the growth rate distribution (i.e., with growth rates close to zero) are also far fewer than what can be found in the distributions of older firms. This implies that younger firms also are less likely to experience marginal growth rates compared to older firms. Finally, the left tail of the growth rate distribution seems roughly invariant to firm age, suggesting that younger firms have almost the same likelihood of facing fast rates of decline as older firms.

Fig. 2
figure 2

Density plot of sales growth for firms of different age groups in 2008. Note: Density plot with an epanechnikov kernel function at 50 points, delimited at ±2

Figure 3 contains further analysis of how the first four moments of the growth rate distribution (i.e., the average, standard deviation, skewness, and kurtosis) vary with age. The top-left panel shows that the average growth rate is the highest in the first observed period (i.e., year 2),Footnote 7 after which it quickly stabilizes at a level which is slightly negative. The beneficial effects of youth on growth rate therefore appear to be short-lived (see also Haltiwanger et al. 2013 and Lawless 2014). The top-right plot of Fig. 3 shows how the standard deviation of firm growth decreases comparatively steadily over the first 40 years. The skewness of growth rates starts at values of close to zero in year 2 and appears to generally become more negative in the years until year 40. In other words, the growth rate distribution may be relatively symmetric to start with, but as firms age, it becomes more negatively skewed, with firm decline overshadowing firm growth. Finally, the fourth moment of the growth rate distribution (i.e., the kurtosis, shown in the bottom right plot of Fig. 3) shows that the kurtosis generally increases in the years after entry. This complements the earlier finding that the standard deviation decreases—hinting that the growth rate distribution becomes slightly more heavy-tailed (but with lower variance) and further from the Gaussian case as firms age.

Fig. 3
figure 3

Mean, standard deviation, skewness, and kurtosis of growth rates as a function of age 2008–2007. Note: The plots correspond to the first four moments for firms of ages 2 to 40. Each plot is fitted with a nonlinear trend based on locally weighted regressions (lowess in Stata)

The higher dispersion in growth rates among the very youngest firms can also be seen in Table 1 that shows some descriptive statistics for the sales-growth variable. Firms less than 10 years old show positive average growth rates and higher standard deviation than older age categories (which have negative average growth rates).

Table 1 Description of sales growth by age categories (2008)

One of the most important dimensions for distinguishing between heterogeneous firms (and also the growth performance of these firms) is firm size. Small firms have been repeatedly observed to grow faster than large firms (Sutton 1997), and growth rate autocorrelation has also been shown to depend on firm size (Coad 2007). We therefore split our sample into firms with, on average, up to 4 employees, 5–9 employees, and 10 employees or more, over the periods t − 1 and t − 2. Descriptive statistics of growth rates for each size class are presented in Table 5 in the “Appendix” section. The positive average growth rates observed for firms with age less than 10 in Table 1 remain for each size class, whereas the negative average growth rates in Table 1 for firms aged 10+ seem to be driven by firms with fewer than five employees.

In order to identify the effect of firm age on growth persistence, we also control for industry variation (at the 2-digit NACE industry level) in average industry growth rate, market concentration (measured using a Herfindahl concentration index), entry rate and exit rate. We also control for the effect on firm growth persistence of being part of a larger business group. Descriptive statistics of all control variables are presented in Table 2.

Table 2 Description of industry variables defined at the 2-digit NACE level (2008) and sales

To get a first impression of the relationship between intertemporal growth rates, we look at the bivariate density of sales growth of consecutive annual growth rates. Figure 4 is a representation of the bivariate density of sales growth in periods t and t − 1 and is in itself an important contribution to empirical work on growth autocorrelation, because it provides a “big picture” summary representation of the frequencies of growth paths across two periods. The frequency is projected into the plane on a 0.1 × 0.1 grid by the aid of a contour plot, illustrated through eight shades of gray into equidistant logarithmic categories, starting from 1 firm. Growth in either period is restricted to the range from −2 to 2, and every non-white shade indicates that some firms are present. The darker the color, the higher the frequency of firms with the observed intertemporal pair of sales growth rates.

Fig. 4
figure 4

Contour plot for pairs of consecutive growth rates 2006–2008. Note: Coloring shades correspond to the frequency of the number of firms in categorized into logarithmic bins. White areas contain no observations without interpolation. The contour is delimited to growth rates ±2

The bivariate distribution is unimodal with a dark center, indicating that many firms have growth rates close to zero in both periods. Almost all areas of the grid are covered, which means that most configurations of consecutive growth rates are represented. In each corner, however, we see that the shading becomes lighter in color, which suggests that only a few firms experience extreme growth rates in consecutive periods (whether it be extreme positive or extreme negative growth).

Figure 4 shows the probability of growth in two consecutive periods, but it does not include information about firm age. Since we are interested in understanding the relationship between growth-rate autocorrelation and firm age, we change the shading parameter that determines the contour lines. Instead of using frequency of firms, Fig. 5 lets the shading parameter represent changes in median age.Footnote 8 The plot can be split into four quadrants. For example, firms in the top-right (TR) quadrant (with growth in the range [0,2] in both periods) experienced positive growth rates in both 2007 and 2008. If for example hypothesis 1b holds, which states that young firms experience positive autocorrelation, then we would observe lighter colored areas in the top-right (TR) and/or bottom-left (BL) quadrants in Fig. 5. On the other hand, should hypothesis 2b hold, we would instead observe darker shading in the diagonal quadrants: top-right (TR) and/or bottom-left (BL). Looking at the figure, we can discern at least one clear tendency. In the TR quadrant, there is a noticeable amount of lighter shaded areas, suggesting that young firms are more likely to experience positive growth rates in both periods t − 1 and t.

Fig. 5
figure 5

Contour plot over median grid age for pairs of consecutive growth rates in 2006–2008. Note: Coloring shades corresponds to the four age groups 0–4 years; 5–9 years; 10–19 years; 20–40+, which contains 40+ group. White areas contain no observations without interpolation. The contour is delimited to growth rates ±2

It is worthwhile to test for statistically significant differences in the densities of average age for each of the four quadrants. If indeed young firms are over-represented in the TR quadrant, we would see the weighted density corresponding to TR to be left-leaning, which is clearly the case if we look at Fig. 6.Footnote 9 From the same figure, we see a second tendency, although much subtler, with older firms (that are represented by dark shading in Fig. 5) being over-represented in the BL quadrant, meaning that they are more likely to experience negative growth rates in the periods t − 1 and t. Both of these findings offer early support for hypotheses 1b and 2b, but for different reasons: while Fig. 6 suggests that both young and old firms experience positive autocorrelation, positive growth rates in consecutive years are more likely for younger firms, whereas negative growth rates in consecutive years are more likely for older firms. Differences between the distributions in Fig. 6 can be formally tested with pairwise two-sided Kolmogorov-Smirnov tests accompanied with standard t tests. Except for the TL and BR distributions, all pairwise combinations are significantly different (results not shown here). Thus, the contour plots offer some early support for hypothesis 1b (and weaker support for hypothesis 2b). While graphical models are well-suited for exploratory empirical research (Bettis et al. 2014), the next section tests our hypotheses in more detail using multivariate regressions that include control variables. By combining visual analysis and rigorous statistical/econometric analysis, we seek to provide richer insights into the robustness and the economic and statistical significance of the hypotheses.

Fig. 6
figure 6

Empirical density of median ages for the four quadrants in the contour plot (Fig. 5). Note: Analysis of the quadrants in Fig. 5. TR top right (positive to positive), TL top left (negative to positive), BR bottom right (positive to negative), BL bottom left (negative to negative). Density plot with an epanechnikov kernel function at 50 points. Pairwise equality of distributions and means are rejected for each pair except for TL and BR by the two-sided Kolmogorov-Smirnov test and the standard t test

4 Regression methodology

This paper follows in the tradition of modeling firm growth as a stochastic process (Gibrat 1931; Levinthal 1991; Geroski 2000; Coad et al. 2013a; Denrell et al. 2015). At any point in time, even if there is a multitude of different factors (internal resources as well as external conditions) affecting the process of growth for the individual firm, the stochastic framework regards those factors as approximately random at the aggregate firm-level. In the cross-sectional analysis of firm growth, the combined effect of these forces amounts to a probability density that describes the dynamic of firm growth (Singh and Whittington 1975). Considering the probability density of growth rates, autocorrelation refers to a type of intra-distributional movement, where the realization of past growth affects the expected future growth rates.

To model the dynamics of firm growth, we opt for a simple regression specification, which reflects the fact that there are no clearly identified variables that are able to explain the majority of variation in growth rates (Geroski 2000; Coad 2009; Denrell et al. 2015). More specifically, we consider an age-augmented version of the growth-rate autocorrelation model used in previous research (Coad 2007; Coad and Hölzl 2009; Bottazzi et al. 2011), given by

$$ growt{h}_{i, t}=\mu +{\gamma}_s siz{e}_{i, t-1}+{\gamma}_g growt{h}_{i, t-1}+{\gamma}_a ag{e}_{i, t.1}+{\epsilon}_{i, t}, $$
(2)

In this model, growth is governed by four parameters. The first is the constant term, μ, that captures average yearly growth rates; and the second parameter, γ s , captures the effect from lagged size. The model is closely related to Gibrat’s Law of Proportionate Effect (LPE), which states that growth rates in time t are statistically independent of initial firm size (Gibrat 1931; for overviews, see e.g., Sutton 1997; Caves 1998; Lotti et al. 2003; Gilbert et al. 2006). This condition is usually associated with having γ s  = 0, which means that there are no effects on growth from lagged size. If the LPE is violated, lagged size either has a positive effect on growth with γ s  < 0 or a negative effect γ s  < 0. In the first case (γ s  > 0), growth becomes explosive as they grow increasingly faster when they become larger. Evidently, this scenario can only be temporary, as we do not observe infinitely large firms, and cannot result in a steady-state distribution for firm size. The other case (γ s  < 0) means that size regresses to the mean over time and, as a consequence, allows smaller firms to grow faster than already large firms.

The third parameter in the model is γ g , which refers to the effect from lagged growth rates, i.e., whether growth rates are persistent. We use persistence and autocorrelation in growth rates interchangeably, henceforth. This parameter is also intimately linked to the LPE. Should ∣γ g ∣ > 0, such that growth rates are (auto) correlated, growth can either encourage (γ g  > 0) or discourage (γ g  < 0) subsequent growth, which also results in a dependence between firm size and growth (Chesher 1979). Thus, in order for the LPE to apply, the condition γ g  = 0 also needs to be fulfilled in addition to γ s  = 0. It is mainly because of this auxiliary condition that growth autocorrelation is considered in the Gibrat literature. While the literature on the LPE is vast, the number of studies that consider growth persistence as a self-contained phenomenon is considerably smaller (see e.g., Daunfeldt and Halvarsson 2015 for a recent survey). The fourth and final parameter γ a captures the effect from firm age, here measured at the initial period (t-1) of the growth model, which means that the smallest possible age a firm can have in period t is two. The last term ϵ i , t is a possibly non-Gaussian disturbance term.

From the basic age-augmented growth-autocorrelation model presented above, we make several extensions. First, and most importantly, we control for industry specific components in growth rates, using our variables for industry entry- and exit-rates, the average industry growth rate, and the Herfindahl concentration ratio. The industry variables are included for year t-1. As a further precaution, we remove firms that went through some form of M&A activity during the period t, t-1, or t-2. We also have information at the firm-level regarding whether a firm belongs to a business group. Along with industry variables, we therefore introduce a dummy that takes the value one if a firm belongs to a business group in t-1, and zero otherwise.

As a second extension, we test whether growth autocorrelation varies with firm age by interacting growth t − 1 with age t − 1. This extended model is able to capture any linear relationships between age and growth autocorrelation. However, should the relationship be non-linear, the interaction term only presents a first approximation. We, therefore, in a third extension, repeat the analysis without the interaction term but restricting the sample to distinct age groups.

Our choice of estimator for Eq. (1) is based on the particularities of the error term (ϵ it ), which inherits the particular “tent-shape” that characterize the distribution of firm-growth rates, as seen in Fig. 2. It is well-established that firm growth tends to follow the Laplace distribution (Stanley et al. 1996; Bottazzi and Secchi 2006; Bottazzi et al. 2011), with most firms not growing at all while a few firms grow very fast. Since this violates the normality assumption of ϵ i , t , OLS becomes less attractive, whereas least absolute deviation (LAD) becomes more suitable (Fotopoulos and Louri 2004; Coad and Rao 2008; Reichstein et al. 2010; Bottazzi et al. 2011; Capasso et al. 2014).

Instead of a normal distribution, LAD (also known as median regression) assumes the error terms to be Laplace-distributed. Another advantage comes from its robust estimation in the presence of outliers on the dependent variable that tends to ±∞, which becomes relevant in the presence of very fast-growing firms. Since these firms’ growth rates can distort the mean but not the median, median regression is better suited and more robust than OLS (Bottazzi et al. 2011). Finally, Eq. (1) is estimated using robust standard errors.Footnote 10

5 Regression results

5.1 Main analysis

The results from the main analysis are presented in Table 3. The first column estimates Eq. (2) without control variables. These results capture the median autocorrelation among firms while controlling for firm age. To test our hypotheses, the second column incorporates an interaction term between age and lagged growth. This variable captures any linear relationship between firm-age and growth-autocorrelation. In the following columns (3) to (7), we introduce additional controls variables to see whether industry structure variables account for some variation in firm growth rates.

Table 3 Results from median regression on all firms in 2008

In the baseline model in column (1), the coefficient on lagged growth, 0.004, is small and positive, but highly significant, indicating that firm growth is overall marked by positive growth-rate autocorrelation.Footnote 11 This implies that positive growth rates tend to be followed by more positive growth rates, and that negative growth rates are followed by more negative growth rates. A 1 unit change in median sales growth in period t-1 translates into a 0.4 percentage point (0.004) increase in sales growth in period t. In terms of one standard deviation (0.608 as reported in Table 1) as the unit change, the corresponding increase in sales growth amounts to 0.24 percentage points (0.6080.004=0.0024). For the median firm with a growth rate of 5.2% (Table 1) at t-1, the latter change predicts a very subtle increase to 5.44%, which in terms of the median growth rate corresponds to a 4.62% (5.44/5.2 − 1 ) increase. Stated yet another way, the rate at which the median firm grows increased by 4.62% as a consequence of its lagged growth rate.

Gibrat’s (1931) prediction that size is independent of growth rates (LPE) can also be rejected. We observe a slightly positive relationship between growth rate and lagged size for the full sample (coefficient = 0.002), whereas firm age is negatively related to firm growth for the full sample (Table 2 column (1)), indicating that older firms have lower expected sales growth rates of 0.1% for each additional year.Footnote 12

The interaction term for age and lagged growth is important for the purposes of testing our hypotheses, but its interpretation needs some explanation. The inclusion of an interaction term in column 2 between lagged growth and firm age changes the estimate of growth autocorrelation to 0.02, up from 0.004. Since we enter firm age in (integer) levels, the direct effect of 0.02 applies to hypothetical firms aged zero years, but as age enters the models with one lag, the youngest firms are 1 year. As the interaction term is negative, it means that growth autocorrelation for firms aged 1 to 40 is predicted to decrease from 0.02 by 0.1% for each for each additional year as the median firm ages. Since the interaction term postulates a linear relationship between firm age and growth-rate autocorrelation, the model also predicts that autocorrelation is effectively zero for firms of 15 years of age. For older firms, the linear prediction results in negative autocorrelation rates, which for firms aged 40 becomes −0.035. This implies that positive growth rates tend to be followed by more negative growth rates, and that negative growth rates are followed by more positive growth rates. We illustrate this predicted linear relationship in Fig. 7 that plots the autocorrelation rates for firms aged 1 to 40 years together with 95% confidence intervals. Figure 7 shows that the correlation coefficient is estimated to change from significantly positive to zero at around age 15 and to become increasingly negative afterwards. Hence, Fig. 7 shows how the autocorrelation coefficient varies with age, in a way that is not clearly visible from the regression results shown in Table 3.

Fig. 7
figure 7

Average marginal effects of sales growth (t-1) as a linear function of age with 95% CIs. Note: The plot shows the estimated marginal effect on the vertical axis from initial (lagged) growth for firms with ages (lagged) going from 1 to 40, on the horizontal axis

Table 3 also indicates that the estimates of age and growth persistence are robust to the inclusion of a number of control variables in columns (3) to (7). These control variables provide additional insights into what influences median growth rates beyond the baseline specification of Eq. (2). According to the results from the full model presented in column (7), positive growth rates are more prevalent among firms that are active in growing industries or in industries with low market-concentration rates or low exit and entry rates. These findings indicate that firms who are active in industries with higher competition or less turbulence in the form of firm entries and exits are more likely to increase their growth further when accounting for initial sales growth rates. Finally, firms that belong to a business group are experiencing positive growth rates more often than single-establishment firms.

While the above analysis helps us to investigate any linear relationship between firm age and autocorrelation, it cannot account for non-linear segments in the relationship between autocorrelation and firm age. Our graphical analyses in Figs. 5 and 6 show that positive autocorrelation rates could be detected for young firms (who are more likely to have consecutive positive growth) and older firms (who are slightly more likely to have consecutive negative growth). To complement these earlier plots, and to investigate possible nonlinearities, we take an alternative regression specification that does not include an interaction term for age and autocorrelation, but that estimates the full model specification separately on five different age groups. This allows autocorrelation to vary freely between these subsamples of the age distribution.

Regression results for different age categories are presented in Table 4. Firms that are younger than 5 yearsFootnote 13 show a positive and significant autocorrelation coefficient for the median firm of 0.041, i.e., twice the rate than predicted by the linear trend in Table 3. Computed for a median growth rate of 8.6% for firms younger than 5 years in t-1, one standard deviation increase of 0.673 (reported in Table 1) corresponds to a 2.76 percentage points increase (0.6730.041 = 0.276) and thus a 32.09% increase ((8.6 + 2.76)/8.6 − 1 = 0.3209) in the speed of growth rates in t. Although the autocorrelation rate is twice as high for firms in the youngest age group than predicted in Table 3, the surge in growth rates of 32.09% found here is close to the prediction of 26.15%, which gives credence to the linear specification in Table 3 for the youngest firms.

Table 4 Results from median regression on firms divided by age group

We also observe from column 2 in Table 4 that autocorrelation turns negative for firms that are 5–9 years, rather than at the intercept of 15 years as predicted by the linear specification (implicit in Table 3 and emphasized in Fig. 7). Autocorrelation thereafter turns significant and increasingly more negative for firm in age groups 10–19 years and 20–39 years and finally becomes insignificant for firms that are aged 40+. Although the regression analysis does not support a strong U-shaped relationship in autocorrelation rates between firms across the age distribution (which was hinted at by the contour plots), the results in Table 4 at least contradict the strong negative autocorrelation rate that was predicted in Table 3 (and Fig. 7) for older firms.

The results presented in Table 4 also indicate that firm size has a different effect on growth persistence for young firms than for older firms. LPE for all age groups except for firms aged 5 to 9 years, for which autocorrelation is non-significant. For the youngest firms, the LPE is violated because smaller firms grow faster than larger firms, while for firms older than 9 years LPE is violated because larger firms within age groups grow faster than smaller firms. The results by age groups also indicate different results for the control variables. Noteworthy is that the negative results for the Herfindahl index only enter significantly for firms in the age groups 10–19 years and 20–39 years. Thus, for the youngest and oldest firms, competition (as captured by the level of concentration in the industry) does not factor into their growth rates. Also striking is the negative effect observed for turbulence in terms of the entry and exit rate in columns (1) to (4). It is only the oldest firms that grow faster in industries with a higher entry rate, and growth does not seem to respond to the industry’s exit rate.

As a final step in the analysis, we dispense with age groups altogether to investigate the autocorrelation rate for each individual age. Instead of showing a regression table with 39 columns, we present the regression coefficients graphically. Figure 8 plots the median regression estimate of the autocorrelation coefficient on the vertical axis, together with each coefficient’s 95% confidence intervals, while age is plotted on the horizontal axis. The figure is based on 39 data points (firms ages 2–40) obtained from 39 cross-sectional regressions (one per year) of the full model (excluding the interaction term). Figure 8 complements the results in Tables 3 and 4. Growth autocorrelation is positive and significant for start-ups with age 2–4 years, but turns insignificant shortly afterwards, and remains negative and significant until about age 17 years, after which it alternates between insignificant and significant, but never turns significantly positive again for older firms.

Fig. 8
figure 8

Estimate of growth autocorrelation (γ g ) from separate median regressions for ages 2–40. The figure is thus based on 39 estimated coefficients with 95%-CIs obtained from the regressions

5.2 Robustness checks

5.2.1 Repeating the analysis for different years

In order to investigate if our results are driven by year-specific effects, we re-estimate all models for each year during the study period (2000–2008) and the results are briefly summarized here.Footnote 14 We begin by estimating the series of regressions that form the basis of Fig. 7 for the preferred sample of firms in all years 2000–2007. For every year of the period, young firms “burst into life” with positive growth autocorrelation in their first observable year.Footnote 15 In the years after entry, autocorrelation turns insignificant and then significantly negative. When this happens, and for how long autocorrelation stays negative, differs slightly between years.Footnote 16 The average period, however, with sustained negative growth rates across all years was found to be roughly between 10 to 17 years. Based on the year-by-year analysis of the non-linear relationship between autocorrelation rates and firm age, we confirm Hypothesis 1b of positive growth autocorrelation for the youngest firms, and we cannot rule out that hypothesis 2a (no autocorrelation) holds for firms older than 20 years.

5.2.2 Selection bias from surviving firms

One possible explanation for our finding of positive autocorrelation in the early years relates to selection effects. More specifically, there might be a “shadow of death” effect, such that some short-lived firms experience two successive periods of decline (hence, positively-correlated growth across years) immediately before exiting. A contrasting explanation would be that our finding of positive growth autocorrelation in the early years could be due to internal developmental factors of firms, for example if young firms (that will survive a long time) struggle to grow in their early years (e.g., to reach a minimum efficient scale of production), and after this initial growth spurt their growth stabilizes as they reach a “steady-state” scale of operation or “optimal size.”Footnote 17 Ideally, we would investigate selection effects by tracking cohorts of firms for 40 years, conditional on knowing that they will survive for the 40-year period—but this is not possible because our data does not track firms for that long. Instead, the approach we take is to focus on firms that we know will survive several years into the future, thus eliminating observations from short-lived firms that might contribute to a “shadow of death” effect. More specifically, we measure the growth rate autocorrelation of firms of different ages, where the dependent variable is the annual growth rate measured into 2000 to 2004, and inclusion in the sample is conditional on survival into 2009.

The results are available upon request. Similar to our previous results in Fig. 7, we see that growth rate autocorrelation is positive in the first few years after start-up and decreases relatively slowly in the post-entry years, before again stabilizing at a level of negative growth rate autocorrelation until turning insignificant for the older firms. The initial growth spurt is qualitatively similar, and we can confirm that our main analysis is not driven by “shadow of death” selection effects, but instead that the observed patterns in growth rate autocorrelation over age relate to internal developmental factors in firms.

5.2.3 Disaggregating by firm size

One concern is that a large number of one-person firms (also known as solo self-employed individuals) might have been driving our results. We therefore excluded firms that started with one person employed in the period t − 1 or t − 2 and repeated the analysis. The initial growth spurt remains and becomes more emphasized when excluding firms of less than or equal to 1 and persists for firms as old as 8 years. After the initial growth spurt, the autocorrelation coefficient turns insignificant and remains insignificant for most older firms, in contrast to a pronounced period of negative and significant coefficients for middle-aged firms (as found in the main analysis).

It is clear that the initial growth spurt reflected in the aggregate is not exclusively a function of the smallest firms, but rather reflects firms of a larger size. This tendency is also confirmed if we estimate the full model for firms in three different size groups with average number of employees less than 5, 5 to 9, and 10 and above (Table 5). Descriptive statistics for growth rates in these size groups over the age groups is reported in Table 6 in the appendix. The regression results are in Table 7, which clearly shows that autocorrelation is negative for the smallest firms, and therefore suggests that the positive autocorrelation for young firms is not being driven by small firms.

5.2.4 Autocorrelation with a second lag

We also extend the model in Eq. (2) to allow for the case where longer lags of growth have an impact on current growth. More specifically, we repeat the analysis in Table 4 with a second growth lag (results available upon request). The results for the first lag are similar to what was observed before. The second lag is often significant and is generally of the same sign as the first lag (although the effect is smaller in magnitude). Overall, adding a second lag yields some interesting results, although a drawback is that we lose a year of observation (which is particularly problematic given our focus on the years immediately after entry).

6 Summary and conclusions

Firm age has been argued to be one of the most important determinants of firm growth (Fizaine 1968; Haltiwanger et al. 2013; Lawless 2014). However, we still know very little about how growth patterns change with age. The lack of studies can most likely be explained by the absence of data on firm age and the under-representation of young firms in many available longitudinal datasets. We overcome these shortcomings by using a dataset that includes information on the years since registration of all Swedish limited liability firms of all sizes.

While previous literature has investigated how growth varies with age, or how growth varies with previous growth, our contribution is to examine how growth autocorrelation varies with age. A coherent story emerges from a range of non-parametric and parametric techniques, that provide a rich, coherent and novel set of results regarding how growth patterns change with age. Our results indicate that young firms are characterized by positive autocorrelation in growth rates, suggesting that growth in one period is positively related to growth in the next. Positive autocorrelation in the growth of young firms is observed alongside a higher mean and standard deviation of the growth rates distribution for young firms. Graphical analyses, accompanied by statistical tests, reveal that young firms are more likely to experience positive growth in two successive periods. Older firms are less likely to experience two periods of positive growth, and very old firms are more likely to experience two successive periods of negative growth.

Sales growth autocorrelation turns negative in the years after entry. New firms strive to grow in order to achieve a minimum efficient scale and overcome their “liability of newness” (Cabral 1995), but once they have survived the first few years it seems that their growth loses its momentum. We thus found no support for notions that older firms should have a high degree of growth persistence due to learning effects, well-established capabilities, or from operating with longer strategic time horizons. Instead, the growth of mature firms would be better characterized by theoretical models that predict negative autocorrelation, for example if investment is lumpy, if firms’ resources are discrete, and if growth in one period is unlikely to be followed by growth in the following period (Coad and Planck 2012). Furthermore, our findings on the erratic growth of older firms can be related to theories of the liabilities of aging (Barron et al. 1994): i.e., the liability of senescence (i.e., how accumulated rules, routines and structures lead to organizational ossification and structural rigidity) and the liability of obsolescence (regarding the organization’s fit to the environmental context). Hence, older firms who suffer from liabilities of senescence and obsolescence might be more vulnerable to setbacks in their growth trajectories, as these rigid old organizations are buffeted around in ever-changing environments.

Our analysis does not come without limitations. One question is whether our results can potentially be generalized to other contexts and countries. We believe that this is likely since Coad et al. (2013b) reported similar results for Spain, although their study was vulnerable to selection bias due to data limitations, and the relationship between growth autocorrelation and age was not explored in much depth. Nevertheless, we would welcome further investigations in other contexts.

Future work could investigate more closely which specific mechanisms account for the positive growth autocorrelation among young firms. One explanation might relate to the legitimacy that young firms accumulate, due to the increasing returns to customer adoption that arises when the customer base grows rapidly before reaching satiation. Another explanation could be the increasing returns that arise from gains to routinization, as employee attention is freed up and productivity improves.

Further research could also investigate in more detail the patterns of growth for young firms and whether it differs among start-ups. One particularly interesting question is whether young firms should first seek profits, or first seek growth. Davidsson et al. (2009), for example, presented evidence that profitable firms are more likely to attain high profits and high growth in the future compared to those firms that seek growth before profits. Our findings might, however, indicate that young firms should first seek to grow, because there are positive feedback effects of early growth on subsequent growth, as well as other possible benefits such as survival benefits (Phillips and Kirchhoff 1989; Coad et al. 2013a) that come from growth per se and from the larger size that this begets.