1 Introduction

There is a wide consensus on the necessity of understanding productivity growth in order to reduce the efficiency gap and to ensure the convergence of productivity among industries and regions. This interest was largely motivated by recent empirical literature on economic growth showing that regional disparities in the productivity levels represent one of the key determinants of the income differences and inequality (Dettori et al. 2012; Easterly and Levine 2001; Caselli 2005; Rice et al. 2006, among others). The lion’s share of productivity research has tended to take either a regional-level (macro-level) approach, focusing on the characteristics of ecological units such as cities and countries, or a firm-level (micro-level) focusing on the characteristics of the firm. At the regional level, several explanations of the productivity gap have been put forward, but the key role appears to be related to intrinsic differences among regions such as infrastructure, human capital, and levels of research and development (Krugman 1991; Romer 1990; Lucas 1988; Moretti 2004; Ciccone and Peri 2006; Bronzini and Piselli 2009; Andersson and Lööf 2011; Glaeser et al. 2002).

At the firm level, scholars argue that firm-specific characteristics (age, size, type of economic activity, human capital and internal R&D), and industrial structure or external variables (knowledge spillovers, specialization, diversity, competition) explain firms’ performance (Audretsch and Feldman 2004; Henderson 2007). Some recent studies propose an alternative approach that allows micro-levels and macro-levels to be modeled simultaneously in order to explain the differences in the total factor productivity (henceforth TFP): It is the so-called the multilevel or the hierarchical model (e.g. Aiello et al. 2014; Fazio and Piacentino 2010; Raspe and Van Oort 2011).

By using multilevel modeling, it is possible to explain the differences in the TFP by providing a clear distinction between firm and region-specific effects. In addition, it is possible to show how contextual effects translate into individual behavior. Fazio and Piacentino (2010) argue that multilevel modeling can also reduce the ambiguity surrounding the agglomeration-firm performance relationship and address regional, sectoral and cross-level heterogeneity. Raspe and Van Oort (2011) believe that “existing single-level methodologies can be problematic and that alternative methodologies (such as multilevel analysis) provide a useful empirical framework to address potential ecological measurement fallacies”. If micro- and macro-factors affect productivity and interact with each other, their contribution can be properly measured only via a multilevel analysis that can solve the micro–macro-problem known as “ecological fallacy” (Robinson 1950) or “cross-level fallacy” (Alker 1969). If one of the relevant dimensions (individual or regional) is omitted, estimations of the determinants of TFP are bound to be biased.

In this paper, we use an unbalanced panel of more than 2843 Tunisian manufacturing firms over the period 1998–2004 to estimate how much of the observed firm-level performance due to firm-specific characteristics. In addition, we test how regional and sectoral characteristics affect the productivity of firms. In this respect, Tunisia provides a very relevant context to examine these issues. Indeed, Tunisian economic activities are characterized by large inter-regional and inter-sectoral productivity gaps. Nearly 56% of the total population and 92% of all industrial firms are concentrated in the three largest cities: Tunis, Sfax and Sousse. These three coastal towns, which form the core of economic activity, represent 85% of the national GDP. Moreover, large productivity gaps exist across sectors (Marouani and Mouelhi 2016). By considering the interaction of micro-data at the firm level and macro-data at the regional and sectoral levels, we are able to control the individual, regional and sectoral heterogeneity for the evaluation of firm-level productivity. We can also overcome the endogeneity and multicollinearity problems so critical in empirical studies that rely on aggregate data only to investigate the relevance of the socioeconomic context for economic activity (Fazio and Piacentino 2010). Moreover, the multilevel analysis allows the inclusion of macro-level (regional and/or sectoral) exploratory variables which otherwise would be absorbed by the fixed effects. In addition, the multilevel analysis, by using a single equation model, exploits the structures of data and properly addresses the issue of error correlation across firms that operate in the same region and in the same sector.

Most previous studies in Tunisia tend to analyze productivity either at the firm level or at the regional level. For example, Baccouche et al. (2008) use a firm-level data to examine the relationship between foreign direct investment (FDI) and total factor productivity (TFP) in Tunisian manufacturing sectors during the period 1998–2004. Amara and Thabet (2012) and Thabet (2015) test the impacts of the local industrial structure (specialization, diversity, competitiveness and the firm’s size) on the aggregated added value for five industrial sectors among 138 delegations of the coastal areas of Tunisia over the period 1998–2004. Amara and El Lahga (2015) have recently used a sample of manufacturing Tunisian firms to distinguish between the effects of own firm’s characteristics (direct effects) and average characteristics of their neighbors (endogenous and contextual effects) on its output level. Marouani and Mouelhi (2016) separately use sectoral and firm data to analyze the dynamics of sectoral productivity growth in Tunisia and assess the contribution of structural change to these dynamics. To the best of our knowledge, our study is among the first in Tunisia to consider micro- and macro-interaction to examine the productivity of manufacturing firms.Footnote 1

The remaining of the paper is organized as follows. We briefly conduct a comprehensive literature review in the field of predicting and understanding the determinants of TFP as well as a short description of the economic geography of Tunisia in Sect. 2. We then present the data and the methodology employed to estimate the contribution of each micro-level and macro-level factor on the firm-level TFP (Sect. 3) and an overview of the main results in Sect. 4. Section 5 concludes with some policy recommendations.

2 Firm productivity and regional disparity in Tunisia: brief review

In the literature reviews, productivity is considered at two different levels: the micro-level and the macro-level. In micro-level, and given the increasing availability of individual firm data, a growing number of studies have tried to identify what factors influence the productivity of firms. Most of these studies have been carried in advanced economies, such as the USA, Germany, France, Italy and the UK (Keller and Yeaple 2009; Wagner 2007; Martin et al. 2011; Parisi et al. 2006; Wakelin 2001). Firms are naturally influenced by their own attributes and resources (also known as internal factors) such as competencies, knowledge and human capital (Backman 2014). Human capital can impact the firm’s performance through several mechanisms (Ballot et al. 2001): (1) a firm which has substantial human capital will make better decisions than its rivals with lower human capital; (2) innovation will be stimulated by the quality and training of the personnel in the R&D department; (3) learning-by-doing is also higher if workers have high human capital. Using data from two panels of large French and Swedish firms for the same period 1987–1993, Ballot et al. (2001) show that firm-sponsored training and R&D are significant inputs in the two countries.

The empirical literature also suggests that firm size and firm age have a positive impact on productivity. Indeed, firm size largely determines a firm’s resource base, competencies and scale advantages. Due to internal economies of scale that reduce the per-unit costs over the number of units produced, efficiency advantages emerge from larger firm sizes, while small firms have to overcome these disadvantages (Jovanovic 1982; Raspe and Van Oort; 2011). In addition to size, a number of studies bring to the fore that learning process and firm experience (Majumdar 1997; Raspe and Van Oort 2011) and spin-offs (John and Ofek 1995; Berger and Ofek 1999; Chemmanur et al. 2014) are important for firm-level productivity.

In addition to internal factors and firms’ resources, the external factors are important for firm performance. Still remaining at micro-level, the concept of knowledge spillovers and firm productivity has received increasing interest over the past decades (Henderson et al. 1995; Audretsch and Feldman 1996; Ellison and Glaeser 1997; Rice et al. 2006). Recently theoretical developments have attempted to open the ‘black box’ of knowledge spillovers and to explain how these spillovers work at the micro-level (Duranton and Puga 2004; Henderson 2007). In other words, they seek to understand how local interactions, peer effects, spatial relationships, labor mobility, and social networks lead to better firm performance, such as productivity levels (Ciccone and Hall 1996; Cingano and Schivardi 2004; Stoyanov and Zubanov 2012) and innovation (Smit et al. 2015).

Taking the analysis to the regional-level, productivity gaps and regional convergence are issues of intense theoretical and empirical research since the development of New Growth theory and New Economic Geography (Rice et al. 2006; Ke 2010; Bronzini and Piselli 2009). These studies suggest that regional gap in productivity can be attributed to regional differences in various factors such as education endowment, foreign direct investment (FDI), producer’s market accessibility, customer’s market accessibility, and agglomeration economies. These factors contribute to the agglomeration of firms in urban areas. Indeed, Krugman (1991) showed that decline in transport costs, increases of economies of scale, and mobility of the specialized labor reinforce agglomeration of firms and increase regional disparities. The World Bank Annual Report (2009): Reshaping Economic Geography states also that “Markets favor some places over others, some places-cities, coastal area, and connected countries are favored by producers”.

Tunisia’s economic growth also fits this pattern. Although the Tunisian economy has shown robust economic growth over the past decade (the aggregate growth was about 5 percent per year since the late 1990s), wide-spread inequalities between coastal and inner regions persist. Private sector activity is heavily concentrated along the coast. In particular, almost all industrial firms are located close to the three coastal agglomerations of greater Tunis, Sfax, and Sousse. More than 90% of total employment is still generated in the coastal part of the country. Similarly, unemployment rates show considerable disparities across regions and are especially high in the interior regions. The interior areas have the highest unemployment rate (18.5%) as opposed to 13.1% in the coastal area (Amara and Ayadi 2014). The unemployment rate is higher for women (19% in 2010) than for average (11%), and twice as high for graduate women (33%) as for graduate mean (16%). Moreover, the investment incentives code, by favoring export-oriented production, has heavily favored investment in coastal areas, and may, therefore, have played a role in deepening regional disparities.

In addition, the social situation in Tunisia has dramatically worsened in recent years due to the rise of the informal sector, the pandemic growth of corruption, and the failure or the inability of the formal sector to guarantee the desired level of employment.

3 Data sources and methodology

3.1 Data

Data used in this paper are drawn from the National Annual Survey Reports on Firms (NASRF) conducted by the National Institute of Statistics (INS).Footnote 2 The dataset refers to an unbalanced panel of about 2843 firms between 1998 and 2004 from the agro-food (IAA), the textiles, wearing, leather and footwear (ITHC), the construction materials, ceramic and glass (IMCCV), the mechanic, electric and electronic (IME), the chemical (ICH) and the other manufacturing industries (ID). The firm’s activity is described by a one-digit Tunisian nomenclature of economic activities. The dataset was cleaned from outlier observations. More specifically, we exclude firms with fewer than six employees as well as these with negative added value or zero investment. The dataset includes added value, investment, firm’s birth date, capital stock, foreign capital participation, expenditure in information and communication technology, expenditure in R&D, exporting rate and labor (number of employees). The number of employees involves the number of engineers and managers used to approximate human capital.

Table 1 presents the annual average distribution of firm size and employment. Over the period 1998–2004, firms with 20–49 employees account for approximately 25% of all firms, and 7% of employment. On average, in each year there are only 13 firms that employed at least a thousand workers which represents no more than 1% of all firms. However, these firms account for 13.72% of all employment and are also the oldest with an average age of 25 years.

Table 1 Firm size and employment distributions (annual averages 1998–2004)

Table 2 shows the annual average of the number of firms as well as their employment by sector for the full sample of data. The distribution shows a concentration of firms in ITHC (49%) and in IME sectors (17%). Moreover, Table 2 indicates that more than half (55%) of employment is generated by the ITHC sector and 18% by the IME sector. Table 3 presents the annual average distribution of firms and manufacturing jobs by region. Firms and employment are largely concentrated in a small number of cities. The three coastal regions (Greater Tunis, North-East and the Center-East) account for around 95% of total firms and 95% of manufacturing jobs, and Center-East alone for 49.56% of total manufacturing firms. While, on average, the total number of manufacturing firms in the Center-West region does not exceed 15 firms (1.14% of all firms).

Table 2 Firm and employment distributions by sector (annual averages 1998–2004)
Table 3 Firm and employment distributions by region (annual averages 1998–2004)

3.2 Variable definitions

3.2.1 Firm-level variables

In this paper, the dependent variable is defined as the firm’s TFP. As a good robustness check, we use the labor productivity (measured as value added per employee) as the second dependent variable.Footnote 3 We use the structural approach developed by Olley and Pakes (1996) in response to simultaneity bias due to the instantaneous correlation between unobservable productivity shocks and inputs. Over a panel data and proceeding by a logarithmic transformation of the Cobb Douglass production function, the estimating equation is giving by:

$$\begin{aligned} y_{it} & = a_{0} + a_{k} k_{it} + a_{l} l_{it} + u_{it} \\ u_{it} & = \omega_{it} + \eta_{it} \\ \end{aligned}$$
(1)

where \(y_{it}\) is the log of output (value added) from firm \(i\) at time \(t\), \(k_{it}\) the log of its capital and \(l_{it}\) the log of its labor input; the \(a_{k}\) and \(a_{l}\) coefficients are the to-be-estimated parameters (interpreted also as output elasticity relative, respectively, to capital and labor). The error term \(u_{it}\) consists of two components: the stochastic term \(\eta_{it}\) and the productivity \(\omega_{it}\). \(\eta_{it}\) is a zero expected mean that uncorrelated with the input choices and unknown to firm and researcher. \(\omega_{it}\) is known to the firm but unknown to the researcher and acts as a state variable to which a firm adjusts its input choices (capital and labor).

Several types of bias emerge when TFP is estimated using the Ordinary Least Squares (OLS) estimator. First, the optimal firm’s choice of input quantities will be determined by prior beliefs about its productivity level. Hence, the productivity level and input choices are likely to be correlated. The existence of such dependence reflects a potential correlation between error term \(u_{it}\) and inputs (\(k_{it}\) and \(l_{it}\)) which, therefore, are not exogenous. This problem, known as simultaneity bias, violates the orthogonality conditions that make OLS provides a non-consistent estimation of the production function parameters.Footnote 4 Second, if no allowance is made for firm entry and exit, selection bias will emerge. In this paper, we don’t deal with the selection bias by using firm-level data because we do not have accurate information on entry and exit decisions.Footnote 5 We only consider the simultaneity bias, and we chose to apply the structural approach proposed in Olley and Pakes (1996) to solve this problem.Footnote 6 In addition, we exploit the availability of aggregated data on entry-exit patterns at sectoral and regional levels to minimize the selection bias.

Olley and Pakes (1996) suppose that at each time period \(t\), the firm aims to maximize the expected value of its current and future profits and must decide its investment level to survive. If there is no exit (firm continuous in operation), investment is a function of current state variables.

$$i_{it} = f_{t} \left( {\omega_{it} ,k_{it} } \right)$$
(2)

Olley and Pakes (1996) show that investment (if it is nonzero) is strictly increasing in productivity giving \(k_{it}\), so we have:

$$\omega_{it} = f_{t}^{ - 1} \left( {i_{it} , k_{it} } \right) = h_{t} \left( {i_{it} , k_{it} } \right)$$
(3)

Equation (3) expresses productivity as a function of capital and investment which are both observables. This fact allows us to correct the simultaneity problem as follows:

$$\begin{aligned} y_{it} & = a_{0} + a_{k} k_{it} + a_{l} l_{it} + h_{t} \left( {i_{it} ,k_{it} } \right) + \eta_{it} \\ & = a_{l} l_{it} + \phi \left( {i_{it} ,k_{it} } \right) + \eta_{it} \\ \end{aligned}$$
(4)

where \(\phi \left( {i_{it} ,k_{it} } \right) = a_{0} + a_{k} k_{it} + h_{t} \left( {i_{it} ,k_{it} } \right)\) is approximated by a higher-order polynomial in \(i_{it}\) and \(k_{it}\). This step provides a consistent estimate of the labor elasticity.

To estimate the coefficient on the capital variable, it is necessary to exploit information on firm dynamics. To do this, Olley and Pakes (1996) assume that productivity follows a first-order Markov process as:

$$\begin{aligned} \omega_{it} & = E\left[ {\omega_{it} |\omega_{i,t - 1} } \right] + \xi_{it} \\ & = g\left( {\omega_{i,t - 1} } \right) + \xi_{it} \\ \end{aligned}$$
(5)

where \(\xi_{it}\) is an innovation with zero mean uncorrelated with \(k_{it}\) (\(E\left( {\xi_{it} |k_{it} } \right) = 0\)). The function \(g\left( . \right)\) is unknown and it is always possible to be approximated by a polynomial function. This second step consists firstly to eliminate the contribution of labor to output which was estimated in the first step to obtain the following model (see Petrin et al. 2004 for more details):

$$\begin{aligned} y_{it} - a_{l} l_{it} & = a_{0} + a_{k} k_{it} + \omega_{it} + \eta_{it} \\ & = a_{k} k_{it} + E\left[ {\omega_{it} |\omega_{i,t - 1} } \right] + \xi_{it} + \eta_{it} \\ & = a_{k} k_{it} + g\left( {\omega_{i,t - 1} } \right) + \xi_{it} + \eta_{it} \\ & = a_{k} k_{it} + g\left( {\widehat{\phi }_{i,t - 1} - a_{0} - a_{k} k_{i,t - 1} } \right) + \xi_{it} + \eta_{it} \\ \end{aligned}$$
(6)

Once the production function parameters have been estimated, one can infer the total factor productivity using the following formula:Footnote 7

$$tfp_{it} = \log \left( {{\text{TFP}}_{it} } \right) = \omega_{it} = h_{it} \left( . \right)$$
(7)

The choice of the independent variables at the firm-level is based on the empirical and theoretical studies presented in section II. Most specifically, we expect TFP to rise with R&D intensity (\(R\& D_{ij}\)) (Hall and Mairesse 1995). Human capital (\(H_{ij}\)), foreign direct investment (\({\text{FDI}}_{ij}\))Footnote 8 and capital intensity (\({\text{capital}}_{ij}\)) were also tested as determinants of TFP. Human capital (the number of engineers and managers divided by the total number of employees) is expected to correlate positively with firms’ performance (Black and Lynch 1996, Girma 2005). In addition, we believe that the size of the firm (\({\text{size}}_{ij}\)), the size square (\({\text{sizesq}}_{ij}\)) its age (\({\text{age}}_{ij}\)), the age square (\({\text{agesq}}_{ij}\)) and the type of economic activity (\({\text{sector}}_{ij}\)) have an impact on TFP (Raspe and van Oort 2011).

3.2.2 Regional variables

To evaluate the agglomeration economies, we include two measures: the specialization index (\({\text{SPEC}}_{j}\)) that captures the degree of industrial specialization (MAR externalities) and the inverse of Hirschman–Herfindahl index (\({\text{DIV}}_{j}\)), which is the most common measure to account for Jacobs externalities (Beaudry and Schiffauerova 2009, Combes 2000). We use the Krugman specialization index as a relative measure of regional specialization, where the formula is the following:

$${\text{SPEC}}_{j} = \mathop \sum \limits_{s} \left| {\frac{{{\text{emp}}_{js} }}{{{\text{emp}}_{j} }} - \frac{{{\text{emp}}_{j} }}{\text{emp}}} \right|$$
(8)

where \({\text{emp}}_{j}\) and \({\text{emp}}\) are the total employment and \({\text{emp}}_{js}\) and \({\text{emp}}_{s}\) are the sectoral employment in governorate \(j\) and Tunisia, respectively. The index takes values in the interval [0, 2], where 0 indicates governorates with completely identical structure and 2 indicates governorates with a completely different industrial structure between the regional and the reference economy.

For each governorate \(j\) the Hirschman-Herfindahl index sums over all industries the square of the share of governorate \(j\)’s employment relative to total (national) employment in industry \(s\):

$${\text{DIV}}_{j} = 1 / \left( {\mathop \sum \limits_{s} \left( {\frac{{{\text{emp}}_{js} }}{{{\text{emp}}_{j} }}} \right)^{2} } \right)$$
(9)

\({\text{DIV}}_{j}\) = 1 if economic activity in the governorate under consideration is fully concentrated in one industry and increases as activities in the city become more diverse.

Information on wage differences across areas is also fundamental to explain total factor productivity (Krugman 1991). The New Economic Geography (NEG) identifies wage differences as one of the major determinants of firms’ location decisions and the emergence of a core-periphery structure. Combes et al. (2008) proposed three broad arguments to explain the origin of spatial wage disparities. First, spatial differences in the skill composition of the workforce directly affect wage disparities. Second, wage differences across areas are caused by differences in local nonhuman endowments (geographical features, natural resources, or some other local endowments like public or private capital, local institutions, and technology). The third interpretation considers that some interactions between workers or firms lead to productivity gains. In our analysis, we use the Gini coefficient (\({\text{Gini}}\_{\text{wage}}_{j}\)) to measure the regional wage disparities across governorates.

We also suppose that the presence of foreign direct investment within a region is an important factor of firms’ TFP. It has been argued that foreign investment is likely to be associated with the transfer of knowledge and spillovers, such as management skills and quality systems (Javorcik 2004). We use the share of FDI firms in the governorate (\({\text{FDI}}_{j}\)) to measure the presence of the FDI.

There is some evidence that the turnover of firms is higher in some regions than others. A high rate of firm turnover can positively affect regional productivity growth if it reflects a transfer of resources from less efficient (exiting firms) to more efficient producers (survivors). In order to test this idea, we use the regional volatility rate (\({\text{volatility}}_{j}\)) defined as:Footnote 9

$${\text{volatility}}_{j} = \left( {{\text{entry }}\;{\text{rate}}_{j} + {\text{exit}}\;{\text{rate}}_{j} } \right) - \left| {{\text{entry }}\;{\text{rate}}_{j} - {\text{exit}}\;{\text{rate}}_{j} } \right|$$
(10)

It is the sum of the entry and exit rate minus the absolute value of the net entry rate at the governorate level.Footnote 10

In addition to regional volatility rate, we control for regional education level, measured as the number of highly educated (university) employees in the total regional employment and the market size (population density and the number of industrial employees per 1000 inhabitants). Finally, the unemployment rate by governorate is included.

3.2.3 Industrial structure or industry-level variables

As discussed earlier, industrial agglomeration is helpful in generating information spillovers within the region. Following most existing studies, we use the agglomeration index developed by Ellison and Glaeser (1997), also known as the EG index, to examine the degree of industrial agglomeration. Compared to other agglomeration indices, such as the Gini index and Hoover’s coefficient of localization, the EG index purges the own firm size from industrial concentration. The EG index, can, therefore, distinguish between concentration arising from the industrial structure from concentration arising from agglomeration externalities (Rosenthal and Strange 2001). Indeed, industrial concentration can be simply due to the existence of a small number of large plants and that there is no agglomeration force. To address this problem, Ellison and Glaeser propose the following agglomeration index to measure the degree of each industry’s agglomeration:

$$\gamma_{s} = {\text{EG}} - {\text{index}}_{s} = \frac{{G_{s} - \left( {1 - \mathop \sum \nolimits_{j} X_{j}^{2} } \right)H_{s} }}{{\left( {1 - \mathop \sum \nolimits_{j} X_{j}^{2} } \right)\left( {1 - H_{s} } \right)}}, \;G_{s} = \mathop \sum \limits_{j} \left( {S_{sj} - X_{j} } \right)^{2} ,\; H_{s} = \mathop \sum \limits_{i} z_{is}^{2}$$
(11)

where \(G_{s}\) represents the raw geographical concentration, \(S_{sj}\) denotes the employment share of industry \(s\) in governorate \(j\) and \(X_{j}\) is the share of aggregate manufacturing employment (all industries) in the governorate. \(H_{s}\) is the Herfindahl–Hirschman index measured as the sum of squares of firm \(i\)’s employment to industry share (\(z_{is}\)). In addition to the industrial agglomeration variable, we also test the impact of the intra-industry wage differentials and the industrial volatility on TFP.

$${\text{volatility}}_{s} = \left( {{\text{entry}}\;{\text{rate}}_{s} + {\text{exit }}\;{\text{rate}}_{s} } \right) - \left| {{\text{entry }}\;{\text{rate}}_{s} - {\text{exit }}\;{\text{rate}}_{s} } \right|.$$
(12)

Table 4 gives descriptive statistics for dependent and independent variables.

Table 4 Descriptive statistics of dependent and independent variables

3.3 Methodology

In this study, we have used multilevel modeling that exploits the hierarchical structure of the data in order to determine the direct effect of the individual (firm) and group (governorate or sector) explanatory variables, as well as the interactions between them (Snijders and Bosker 1999). Thus, we can assess the extent to which variance in firms’ TFP can be attributed to between-firm variance, between-governorate variance, or between-industry variance (Van Oort et al. 2012). Considering an empty model that decomposes the variance of firm’s productivity \(Y_{ij}\) (measured by the log of total factor productivity of firm \(i\) nested at governorate \(j\)) into two independent components: \(\sigma_{e}^{2}\), the variance of the lowest level (firm level) errors \(e_{ij}\), and \(\sigma_{{u_{0} }}^{2}\), the variance of the highest level (regional or industrial level) errors \(\mu_{0j}\). The empty model, named also as the random intercept-only model or null model, is modeled as:

$$Y_{ij} = \gamma_{00} + \mu_{0j} + e_{ij}$$
(13)

where \(\gamma_{00}\) is the overall mean across governorates or industries. The intra-class correlation coefficient (ICC) measures the correlation among the individual observations (firms) within clusters (governorates or industries). When the ICC equals 0, there is no difference between OLS regression estimates and those obtained with the multilevel modeling. Formally, the ICC is calculated by the ratio of the between cluster variance to the total variance:

$$\rho = \sigma_{{u_{0} }}^{2} /\left( {\sigma_{{u_{0} }}^{2} + \sigma_{e}^{2} } \right)$$
(14)

The model in (Eq. 13) can be extended to consider both individual and regional or industrial factors. A separate regression model is defined in each level:

$$Y_{ij} = \beta_{0j} + \beta_{1j} X_{ij} + e_{ij}$$
(15)

where \(X_{ij}\) is an exploratory variable at the lowest level (firm). The variation of the regression coefficients \(\beta_{j}\) is modeled by a group-level regression model (governorate or industry):

$$\beta_{0j} = \gamma_{00} + \gamma_{01} Z_{j} + \mu_{0j} \quad {\text{and}}\quad \beta_{1j} = \gamma_{10} + \gamma_{11} Z_{j} + \mu_{1j}$$
(16)

Thus, the combined model follows:

$$Y_{ij} = \gamma_{00} + \gamma_{10} X_{ij} + \gamma_{01} Z_{j} + \gamma_{11} X_{ij} Z_{j} + (\mu_{0j} + \mu_{1j} X_{ij} + e_{ij} )$$
(17)

The deterministic part on the model, \(\gamma_{00} + \gamma_{10} X_{ij} + \gamma_{01} Z_{j} + \gamma_{11} X_{ij} Z_{j}\), contains all the fixed coefficients, while the stochastic component is in brackets. The individual or firm level residuals \(e_{ij}\) are assumed to have a normal distribution with mean zero and variance \(\sigma_{e}^{2}\). The group-level (regional or industrial level) \(\mu_{0j} + \mu_{1j}\) are assumed to have a multivariate normal distribution with an expected value of zero, and they are assumed to be independent of the individual-level residuals \(e_{ij}\) (Van Oort et al. 2012). The variances of the residual errors \(\mu_{0j}\) and \(\mu_{1j}\) are specified as \(\sigma_{{u_{0} }}^{2}\) and \(\sigma_{{u_{1} }}^{2}\).

4 Results

4.1 Empty model results

We start our analysis by fitting a two-level empty model of firm nested within governorates or sectors. We test two different specifications, A and B, of Eq. 13. In A, we have governorate as level 2 and firm as level 1. In specification B, we take sector as level 2 and firm for the first level. The purpose of this step is to test for significant intercept variance, which is a test of the need for mixed modeling. If the intercept variance is not significant, it can be fixed for future steps. For both specifications, we use the maximum likelihood methods [the maximum likelihood (ML) and restricted or residual maximum likelihood estimation (REML)] to obtain estimates of the empty model (Eq. 13). It is well known in the multilevel modeling literature that variance components based on the ML estimation are negatively biased, when the REML estimation is not (Schabenberger and Pierce 2010).

The results of ML and REML are shown in Table 5, where we report the second level intercept, \(\gamma_{00}\), its variance, \(\sigma_{{u_{0} }}^{2}\), and the variance of the lowest level, \(\sigma_{e}^{2}\). We have also included the intra-class correlation (ICC) and the likelihood ratio test (LR) to compare the mixed model to the linear regression model.

Table 5 Empty model

Table 5 shows that the estimates obtained from MLE and REML are very similar and sometimes equivalent. The LR tests indicate that mixed multilevel model is more appropriate than simple linear model (the LR tests are significant at the 0.01 level), which allows us to justify the use of the multilevel modeling approach. The ICCs indicate that 4% and 6% of the variability of firm-level productivity are due, respectively, to regional and industrial variations. So, the second level (regional or industrial) has a significant role on the firm’s TFP, but it is minor compared to the firm’s characteristics.

4.2 Fixed effects results with both firm and contextual characteristics

The results regarding the impact of firm characteristics on TFP by using REML are shown in Table 6 (column 1). We also include year dummies and industry dummies to control for fixed effects introduced, respectively, by time and sector classification (column 2 in Table 6). The results show that almost all firm-level explanatory variables (the fixed effects) have significant coefficients. The results in columns 1 and 2 suggest that size in terms of employment and age have a negative impact on TFP. However, for our sample where data on exiting firms are not available, the size and age variables have to be interpreted with caution. Hence, young firms tend to be smaller and less efficient than older and larger ones. This is consistent with the finding that new firms generally enter with low productivity levels in comparison with the existing firms. When controlling for nonlinear effects of firm’s size and firm’s age by using its square, only the effect of the age square on TFP becomes positive and significant at 1%. This positive relationship between productivity and age square can be explained by the fact that new firms need time to accommodate to the situation within which they operate (learning effects) in order to increase their productivity. These results show that the oldest small firms are more productive than larger firms.

Table 6 Hierarchical multilevel regression models of firm TFP

The estimated elasticity of R&D expenditure variable is significant at 1%. An increase in R&D expenditure of 10% would increase the firm’s TFP by nearly 0.15%. The estimates show also that FDI has a positive impact on the firm’s performance. This finding is in line with those of recent studies (Raspe and van Oort 2011; Amara and El Lahga 2015). The estimated coefficient of the human capital is significant and has a correct sign as usually found in the literature. Like in the case of R&D, an increase in human capital affects the ability of firms to learn and absorb new information. In addition, we found that firms operating in ICT industries display better TFP. Firms in a high-technology environment are more likely to absorb new developments quickly and to boost productivity additionally.

Our results show also that exporters are more productive than non-exporters. Indeed, trade liberalization induces greater competitive pushing firms to improve their productivity to remain active in the export markets. There are two alternative hypotheses on why exporters can be expected to be more productive than non-exporting firms: self-selection and learning-by-exporting (Bernard and Jensen 1999; Bernard and Wagner 1997). The first hypothesis refers to self-selection on the more productive firms into export markets. The additional costs of selling goods in foreign markets, the existence of sunk costs associated with selling abroad and fiercer competition in international markets provide an entry barrier that solely the successful firms can overcome. The second hypothesis refers to the role of learning-by-exporting and states that exporting makes firms more productive. Indeed, trade promotes knowledge transfer and provide an incentive for innovation. The exporting firms could then benefit from technologies, superior management practices and the exploitation of economies of scale induced by multiple foreign markets, and therefore productivity will be increased.

In Table 6, we also combine the firm-level variables with the governorate-level variables in order to predict the firm performance (column 3 and column 4). Among the governorate-level variables, the

Log of industrial density (capturing agglomeration effects) has a statistically significant positive effect on TFP. Ciccone and Hall (1996) have tried to explain the positive relationship between firm performance and industrial density. They argued that industrial density promotes productivity through externalities associated with physical proximity. In addition, the production of all goods within a particular geographical area can reduce transportation cost and improve productivity. By using the multilevel analysis that seriously considers micro–macro-linkages of firms in their spatial and sectoral contexts, we can explicitly clarify the importance of agglomeration economies to the performance of firms. Indeed, many studies using aggregated regional-level data provide only limited insights and weak support for the effects of agglomeration economies on firm performance (Van Oort et al. 2012).

An increase in the regional level of FDI would decrease the productivity level of the firm through negative spillovers. Indeed, the governorate is composed by firms from different sectors that can show a strong gap in terms of skilled workers and technology. Hence, educational and technological gaps between firms in the same governorate may have a negative impact on the firm’s performance and reduce, consequently, the capability to absorb spillovers. Similar results for the FDI are also found by Baccouche et al. (2008), Grima (2005) and Amara and El Lahga (2015). Baccouche et al. (2008) show, for example, that FDI spillovers can only be beneficial for companies with high absorption capacity and that Tunisian manufacturing firms are considered to have high absorptive capacity if they are operating close to the industry frontier. Girma (2005) argues that FDI-related productivity gains initially increase at an increasing rate, but the rate diminishes as the absorptive capacity of domestic firms rises.

Regional wage disparities approximated by the Gini index have a positive impact on firm performance. Wage differences across areas can reflect differences in workers skills and technology.Footnote 11 Combes et al. (2008) show that up to half of the spatial wage disparities can be traced back to differences in the skill composition of the workforce. In addition, they show that location matters for urban workers wages and larger cities would improve efficiency. As a result, TFP can be higher in cities where firms benefit from agglomeration externalities that increase the labor efficiency of their workers. The coefficient of regional wage disparities becomes insignificant at 5% level when controlling for sector classification.

The last two columns of Table 6 present the results of the hierarchical multilevel regression model with the sector as the second level. We use the ‘coastal zone’ variable that takes 1 if the firm is located at the coastal area and 0 otherwise to control for the spatial location effect. As we can see, firms from the coastal zone are more productive than those from lagging areas.

4.3 Fixed effects results with both firm, contextual and cross-level interactions

Another way to consider the effects of the regional or industrial variables is to examine their cross-level interaction effects. We examine whether a regional or sector variable has an effect on the productivity slopes of the firm (indirect effects). Table 7 presents the results of the interaction between contextual level (regional or sector) and firm characteristics such as age, size, human capital, and R&D. No significant cross-level interaction between governorate and firm was found. However, we find that the interaction effects between firm characteristics (age, size, human capital, and R&D) and industrial variables (industrial agglomeration and intra-industry wage differentials) are all significant and positive at 5% level. The negative effects of age and size on firm’s TFP become positive when the industrial agglomeration increases. From Table 7 (columns 3 and 4), we can see that the direct effect of industrial agglomeration is negative and significant at the 1% level. This finding of a negative industrial agglomeration effect can be explained by the fact that Tunisian manufacturing firms are less competitive. The positive interaction effects between industrial agglomeration (level 2) and firm’s size and firm’s age (level 1) show that agglomeration externalities are beneficial spillover effects for larger and older firms. So industrial agglomeration matters, however, the impact here is not with regard to its direct effect on TFP (which is negative), but with respect to its interaction effect.

Table 7 Hierarchical multilevel regression models of firm TFP with interaction terms

The positive effect of human capital on TFP becomes stronger when the Gini coefficient measuring the intra-industry wage differentials increases (the coefficient of the interaction effects for intra-industry wage gap by human capital). This result would be in line with sorting theories, according to which the quality of the human capital has an impact on productivity. Having schooled workers makes everyone more productive, raising the firm’s wage level which explains the intra-industry wage differentials. In addition, we find that the positive effect of the R&D becomes stronger when the intra-industry wage gap increases.

4.4 Robustness analysis

In the core of the paper, we used TFP as a proxy of firm productivity level. However, it is difficult to decide if TFP is the most appropriate measure of firm’s performance and it would be a good robustness check to estimate a multilevel mixed model using the labor productivity (LP) as a second proxy of firm performance.Footnote 12 Table 10 reports the result of the empty model by using the LP as the dependent variable. Compared to preview results reported in Table 5, the ICCs have increased by more than twice. About 10% and 16% of the variability of the firm-level labor productivity are due, respectively, to regional and sectoral variations. This result confirms again the utility of the multilevel analysis.

Table 11 reports the fixed effects results with both firm and contextual characteristics, where Table 12 added the cross-level interaction effects. It is easily seen that the results are very close to those under TFP (Tables 6, 7), except for the capital intensity variable which becomes significantly positive. The opposite sign of this variable can be caused by the complementarity between labor and capital. Indeed, when we use a partial productivity or single-factor productivity (e.g. LP), qualitative and quantitative changes in one factor can heavily impact the partial productivity of the other factor. It is also possible, during an important investment in capital goods, that labor productivity increased but the total factor productivity remains unchanged or even decrease given the investment cost.

5 Conclusion and policy implications

While most of the existing studies on firm productivity in Tunisia focused on the micro-aspects, the present study is the first to demonstrate the joint contribution of contextual-level (regional and sectoral) and individual-level firm’s characteristics on firm’s performance. To do this, we combine a dataset of Tunisian manufacturing firm with regional and sectoral variables and apply a multilevel analysis in order to discern the contextual effect of firm’s on its productivity level, after taking into account its individual characteristics.

Three main results can be derived from our analysis. The first is that firm characteristics greatly impact on both total factor productivity and labor productivity. More specifically, we find that about 95% of firms’ TFP is explained by internal firm characteristics, while the macro-level (regional or sectoral) effects just explain 5% of firms’ productivity, and that sectors play a more prominent role that region. This result confirms that the main sources of firm performance are different at the individual level. We find that the oldest small firms are more productive than larger firms. Results indicate that firms with a higher level of human capital, R&D expenditure, and FDI perform better in terms of productivity. We also found that firms operating in ICT industries and exporting firms are more productive. In this context, special stress must be placed on human capital and R&D expenditure to improve firms’ performance.

Secondly, the positive and significant relationship between firms’ productivity and industrial density at the governorate level clearly shows the essential role of location and contextual effects in promoting firms’ performance. The results of the ICCs and LR tests confirm the existence of significant between-governorate variation in TFP and LP that was not explained by individual firm level factors. We found that the governorate contributes almost 4% to TFP at the firm level. This seems modest, but the governorate contributions represent 10% when using labor productivity as a dependent variable. The positive and significant relationship between inter-firm wage dispersion and firm productivity provides evidence of technological gap and human capital intensity between sectors in the same governorate.

Finally, our results show that when we consider sector as the second level, the direct effects of sectoral variables are not significant. However, we find positive and significant interaction effects (indirect effects) of those variables through firm characteristics (age, size, human capital, and R&D). We find that the coefficient of the interaction effects for intra-industry wage gap by human capital is positive and significant. In addition, we find that the positive effect of the R&D becomes stronger when the intra-industry wage gap increases. The negative effects of age and size on firm’s TFP become positive when the industrial agglomeration increases.

Our results have important policy implications as well. One, the result shows that exportation, human capital, ICT and R&D generally benefit TFP and LP. This means that government needs to implement measures that aim to consolidate the export volume, improve terms of trade to increase access to foreign capital, and advocate investment in human capital to enhance the absorptive capacity in order to facilitate technology transfer. Our results show also that sectors matter more than the region; so it is necessary for the government to implement industrial policies. The reconsideration of competitiveness poles, covering the key sectors, such as mechanical and electrical industries, textiles-leather and footwear, agro-food and ICT, is one of these policies. These poles generate a competitive atmosphere, support the culture of innovation, and stimulate the transfer of knowledge and technologies between firms, workers, and universities.