1 Introduction

The rapidly growing field of productivity analysis in health economics tends to neglect heterogeneity in hospital technology by focusing on mean tendencies or on most efficient hospitals (Hollingsworth 2008). Accordingly, evidence on input productivity, defined as marginal product of output with respect to a given input (Jensen and Morrisey 1986), is barely documented in the literature for hospitals with different output levels. However, variations in practices for managing human resources, for providing clinical procedures and for operating a hospital as an economic unit were recently quantified at large samples of hospitals in different countries (Bloom et al. 2014, 2015; Dorgan et al. 2010). Combined with the finding of a direct association between management and productivity (Bloom et al. 2007; Bloom 2019; Cleverley and Harvey 1992; Hames 1991; Lee et al. 2013; Otto 1996; Tersigni 1992), heterogeneity in managerial practices points to different values of input productivity at hospitals with different level of output.

The recent technique of quantile regression analysis may be viewed as a convenient tool for formal quantification of heterogeneity of technology in hospital production. The analysis is based on the premise that different types of technology (interpreted as productivity of hospital inputs and as partial effects of hospital characteristics) correspond to different output quantiles, so hospital technology has consequences for the ability to maximize output. Statistically different values of the coefficients for hospital inputs and hospital characteristics, obtained in regressions for low-output and high-output quantiles, would indicate the presence of different types of hospital technology (Koenker 2005). The quantile regression approach does not extrapolate the mean tendency to the tails of the distribution and thereby avoids bias, which is inherent to groupwise post-estimation under conventional mean regression (Hendricks and Koenker 1992; Koenker and Bassett 1978). Another merit of the approach is equivalence of the linear quantile regression to a monotonically increasing transformation, which is a useful feature for estimating log-linearized production functions.Footnote 1 Despite its advantages, the use of quantile regression for purposes of productivity analysis in health economics has to date been limited to studies of the highest output quantiles and has concentrated primarily on nursing homes, which are a very specific type of healthcare facility (Christensen 2004; Knox et al. 2007; Liu et al. 2008; Martin and Jérôme 2016). Another shortcoming of the existing health economics literature is the inability to deal with panel data quantile regression models without imposing any restrictions: either the data are only pooled (Knox et al. 2007) or hospital effects are introduced in the model as a function of a potentially misspecified list of covariates (Hsu et al. 2017).

The purpose of this paper is to measure heterogeneity in hospital productivity by estimating the longitudinal production function of hospitals and evaluating marginal products of labor, capital, and medicines across conditional quantiles of output. The novelty of the present paper is severalfold. To the best of our knowledge, the paper is the first application of a quantile regression approach for measuring a general form of longitudinal production function in the hospital industry. We use a panel data quantile regression model with fixed effects for estimating productivity of capital, medicines, physicians, nurses, technicians, administrators and other staff. Next, the paper assesses optimality of input mix and quantifies potential cost savings across conditional quantiles of hospital output. Finally, using a second-stage sensitivity analysis, the paper ties public regulation, demand patterns and heterogeneity to the production function. For this purpose, we examine the association between production efficiency and a range of regional and municipal variables.

Our sample consists of acute-care regional and municipal public hospitals in Japan in 1999–2018. It should be noted that many productivity studies estimate hospital cost functions, but their approach is based on the assumption of cost minimizing behavior which is not necessarily exhibited by public hospitals (Blank and van Hulst 2017). Therefore, our paper follows the strand in the health economics literature that estimates input productivity through the analysis of production function (Jensen and Morrisey 1986; Thurston and Libby 2002).

Japanese local public hospitals are often criticized for inefficient models of production, and in particular, for an excessive emphasis on capital and medicines. On average total factor productivity and labor productivity have changed negligibly or decreased at these hospitals in the past decade (Kaneko et al. 2018; Zhang et al. 2018) and there has been little if any improvement in the average measures of technical efficiency (Kawaguchi et al. 2014; Kawaguchi 2008). However, this analysis is vitiated by not focusing on productivity of capital and medicines and it lacks estimations that would quantitatively address suboptimality of the labor-capital or labor-medicines mix.

Our findings are novel in establishing technological heterogeneity of Japanese local public hospitals, reflected in different relationships between the quantile index of hospital output, on the one hand, and, on the other hand, hospital technology (proxied by productivity of labor specialties, physician capital and medicines), quality (the fact of hospital accreditation and the status of a designated hospital as a reflection of the high referral rate), and teaching activity (training of nurse students and internship of young doctors).

The results of the statistical tests show that there is a more efficient production path (high-output quantiles) and a less efficient production path (low-output quantiles), and that there is a statistical difference in the values of input elasticities, input productivities and the partial effects of hospital variables across high- and low-output hospitals. Our analysis demonstrates that technological heterogeneity may be linked to different values of potential cost savings in the changeover to optimal combination of hospital inputs.

A possible explanation for the technological heterogeneity is a higher degree of labor specialization and better assignment of labor tasks at high-output hospitals, which serve larger local markets. Technology differences are also associated with managerial opportunities for efficient production. Indeed, we find that the values and time profiles of many managerial performance indicators, which are monitored as part of the managerial reform of Japanese local public hospitals, differ statistically across high-output and low-output hospitals.

The findings may be employed for answering a range of questions on the role of physicians and nurses at high-output and low-output hospitals and on the effective use of the labor of medical specialties, physical capital and medicines. The results about excess (or lack) of certain hospital inputs and of potential cost savings at each quantile of hospital output in each year over the past two decades may provide recommendations for adjusting the policy reforms by diversifying the regulation for high-output and low-output hospitals.

The remainder of this paper is structured as follows. The section “Background on Japanese prefectural and municipal hospitals” reviews the institutional background for Japanese local public hospitals. The section “Methodology” outlines the quantile regression model with fixed effects for estimating the longitudinal hospital production function. The explanation of data and variables is provided in the section “Data”. The section “Results” presents the findings on the typology of hospitals technologies, which are discussed in the section “Discussion”.

2 Background on Japanese prefectural and municipal hospitals

Japanese local public hospitals are operated by local governments of prefecture, city, town, village or a union of towns and villages. Local public hospitals show the highest deficit of all industries run by local government in Japan. The poor financial performance of these healthcare institutions has two main causes. Firstly, in an attempt to compensate for lack of physicians, Japanese local public hospitals overinvest in capital by acquiring expensive equipment, which is often underutilized (Campbell and Ikegami 1998; Kawabuchi and Kajitani 2003; Yamada et al. 1997). Moreover, hospitals often spend funds for unnecessary improvements to facilities (Hisamichi 2010). Secondly, local public hospitals suffer from ineffective management (Ikegami and Campbell 1999; Iwane 1976), reflected in large potential to improve the efficiency of capital accounting (Hisamichi 2010), to review structure of hospital beds (Kanagawa 2008) and speed up healthcare provision (Higuchi 2010; Kumazawa 2010; Nabemi 2010).

In line with the hospital productivity literature, Japanese local public hospitals have attracted academic and policy attention in respect of their undesirable trends in productive efficiency (Besstremyannaya 2013; Kawaguchi et al. 2014; Kawaguchi 2008; Takatsuka and Nishimura 2008). However, estimates of input productivity are commonly based on mean tendencies (Kaneko et al. 2018; Morikawa 2010), which do not fully capture differences between high- and low-output hospitals.

In a regulatory attempt to improve managerial performance of Japanese local public hospitals round-table discussions were held by the Ministry of Internal Affairs and Communications in 2007 and led to the ratification of guidelines for a first wave of reform of these healthcare institutions, which was launched in 2008–2013 (Ministry of Internal Affairs and Communications 2007).

The reform aimed to raise productivity and weaken the soft budget constraints of local public hospitals. A list of managerial performance indicators was designed in order to monitor hospital improvement: (1) share of ordinary revenues in ordinary expenses; (2) share of medical revenues in medical expenses; (3) share of the cost of medicines and medical materials in medical revenues; (4) share of the labor cost in medical revenues; (5) bed occupancy. The share of ordinary revenues in ordinary expenses and the share of medical revenues in medical expenses, which varied at local public hospitals from 70 to 99%, were recommended to be raised to 100% (the level, achieved at private hospitals). Bed occupancy was to be increased. At the same time, hospitals were expected to cut the shares of cost of medicines/medical materials and labor costs in medical revenues.

The reform was only partially successful, so it was followed by a second wave in 2015–2020, and new guidelines emphasized the continuation of performance monitoring (Ministry of Internal Affairs and Communications 2015). Specifically, the share of the cost of capital depreciation in medical revenues and the share of consignment fee in medical revenues were added to the list of monitored indicators. The values of both indicators were to be lowered to the levels achieved at private hospitals.

It should be noted that both the first and the second reform waves put insufficient emphasis on heterogeneity in hospital production. Target values differed only based on hospital size (total number of beds) and acute-care status.

Each hospital was obliged to develop its own plan and set its own targets. A case-by-case evaluation of the early effect of reform in a survey by Ministry of Internal Affairs and Communications showed that most hospitals did not achieve their target values (Kawaguchi et al. 2014). The reform also had only a short-term effect on hospital solvency: the share of local public hospitals with deficits dropped from 71% in 2008 to 48% in 2011, but subsequent years saw a steady reverse trend, reaching 63% by 2016 (Mandai and Watanabe 2019).

3 Methodology

3.1 Panel data smoothed quantile regression with quantile-dependent fixed effects

In this paper we employ panel data quantile regression model with fixed effects. These fixed effects reflect the unobserved characteristics of Japanese local public hospitals, such as treatment styles or managerial practices (Ikegami and Campbell 1999; Ikegami et al. 2011; Kawabuchi and Kajitani 2003; Kodera and Yoneda 2019), which could not be fully captured by the covariates available in the data. The fixed effects may be expressed in different ways for more productive and less productive hospitals, so we adhere to a general model which allows for potentially different values of fixed effects across output quantiles (i.e., quantile-dependent fixed effects).

It should be noted that the asymptotic theory for the panel data quantile regression with quantile-dependent fixed effects requires long panels (Koenker 2004) and the ratio n/T of sample size to the length of panel must be small. In this case the bias of the estimator may be neglected in inference procedures. At the same time, relatively short panels are commonly available for analysis of hospital production, and as the number of hospitals in the industry tends to be large, the ratio n/T becomes large. So the bias of the estimator in quantile regression analysis in short panels would lead to incorrect inference. Accordingly, it becomes necessary to eliminate or reduce the bias. But the usual approach to modeling quantile-dependent fixed effects (e.g., Kato et al. 2012; Koenker 2004; Harding and Lamarche 2014) does not derive the rate of convergence of the bias to zero and hence does not provide the knowledge, which would enable reduction of the bias.

Therefore, most attempts to incorporate short panels in the quantile regression analysis with quantile-dependent fixed effects impose restrictions on the model. For instance, Li et al. (2003) require strong assumptions about the distribution of the dependent variable. Other approaches require that the expected value and standard deviation of the dependent variable were linear in covariates (Machado and Santos Silva 2019) or that the fixed effects were functions of covariates (Harding and Lamarche 2016; Hsu et al. 2017).

The methodology of Galvao and Kato (2016) offers a solution to the problem of estimating quantile-dependent fixed effects in case of short panels without imposing any restrictions on the model: the Koenker (2004) quantile regression objective function is modified through smoothing, and this method of estimation enables computation of the bias and provides an approach for eliminating bias. Specifically, the Dhaene and Jochmans (2015) jackknife split-panel correction is proposed for dealing with the problem of asymptotic bias, arising from the short length of panel.Footnote 2

In this paper we employ the Galvao and Kato (2016) approach which can be formulated as follows. Consider the model

$${y}_{it}={\alpha }_{i}({U}_{it})+{{{{\bf{x}}}}}_{it}^{{\prime} }{{{\boldsymbol{\beta }}}}({U}_{it}),\quad {U}_{it} \sim U[0,1],$$
(1)
$$\tau \mapsto {\alpha }_{i}(\tau )+{{{{\bf{x}}}}}_{it}^{{\prime} }{{{\boldsymbol{\beta }}}}(\tau )\quad {{{\rm{is}}}}\,{{{\rm{monotonically}}}}\,{{{\rm{increasing}}}},$$
(2)

where τ ∈ (0, 1), mapping (2) is the conditional quantile of the dependent variable yit, xit is a vector of covariates and αi(τ) are fixed effects, which vary across quantiles. The estimates of the fixed effects \({\hat{\alpha }}_{i}(\tau )\) and of the coefficients \(\hat{{{{\boldsymbol{\beta }}}}}(\tau )\) are found by solving the Galvao and Kato (2016) optimization problem:

$$\mathop{\min }\limits_{{\alpha }_{1},\ldots ,{\alpha }_{n},{{{\boldsymbol{\beta }}}}}\frac{1}{nT}\mathop{\sum }\limits_{i=1}^{n}\mathop{\sum }\limits_{t=1}^{T}({y}_{it}-{\alpha }_{i}-{{{{\bf{x}}}}}_{it}^{{\prime} }{{{\boldsymbol{\beta }}}})(\tau -G({y}_{it}-{\alpha }_{i}-{{{{\bf{x}}}}}_{it}^{{\prime} }{{{\boldsymbol{\beta }}}})/h),$$
(3)

where \(G(v)=\int\nolimits_{u}^{\infty }K(v)dv\) is a smoothed analog of the step function I(u ≤ 0), K(v) is a kernel function, and h is a suitable bandwidth.Footnote 3

In order to reduce the bias of the estimator above obtained in case of short panels Galvao and Kato (2016) suggest using the split-panel jackknife estimator from Dhaene and Jochmans (2015). In case of balanced panels the Dhaene and Jochmans (2015) procedure requires splitting the panel into two: i ∈ {1, … , n} in each panel, while the time index is t ∈ {1, … , T/2} in the first panel and t ∈ {T/2 + 1, … , T} in the second panel. The split-panel estimator is computed as

$${\hat{{{{\boldsymbol{\beta }}}}}}_{1/2}(\tau )=2\hat{{{{\boldsymbol{\beta }}}}}(\tau )-({\hat{{{{\boldsymbol{\beta }}}}}}_{1}(\tau )+{\hat{{{{\boldsymbol{\beta }}}}}}_{2}(\tau ))/2,$$

where \(\hat{{{{\boldsymbol{\beta }}}}}(\tau )\), \({\hat{{{{\boldsymbol{\beta }}}}}}_{1}(\tau )\), \({\hat{{{{\boldsymbol{\beta }}}}}}_{2}(\tau )\) are respectively, estimators for the full panel, the first part of the panel and the second part of the panel. The split-panel estimator under the Dhaene and Jochmans (2015) approach has the same asymptotic variance as the original \(\hat{{{{\boldsymbol{\beta }}}}}\) estimator and allows for reliable inference under short panels.

Dhaene and Jochmans (2015) generalize the approach for the unbalanced panels as follows. Suppose there are Ti longitudinal observations for each i. For each \(T\in \{\mathop{\min }\nolimits_{1\,\le \,i\,\le \,n}{T}_{i},\ldots ,\mathop{\max }\nolimits_{1\,\le \,i\,\le \,n}{T}_{i}\}\) a balanced panel PT = {(i, t): Ti = T} is chosen and an estimate \({\hat{{{{\boldsymbol{\beta }}}}}}_{1/2,T}(\tau )\) is computed for this panel. Note that each panel PT includes all individuals with observations over T periods. Weights, which are equal to the shares of observations of each panel PT in the total sample (i.e., NT/N), are applied to \({\hat{{{{\boldsymbol{\beta }}}}}}_{1/2,T}(\tau )\). The estimator becomes:

$${\hat{{{{\boldsymbol{\beta }}}}}}_{1/2}(\tau )=\frac{1}{N}\mathop{\sum}\limits_{T}{N}_{T}{\hat{{{{\boldsymbol{\beta }}}}}}_{1/2,T}(\tau ),$$

where NT is the number of observations in panel PT and N is the total number of observations. The asymptotic variance matrix of the estimator is computed as the sum of the covariance matrices for each estimate with weights equal to \({({N}_{T}/N)}^{2}\).

3.2 Production function, input elasticities, marginal products and input mix

It should be noted that public hospitals commonly operate under soft budget constraints. So models which focus on the hospital production function and do not impose solution of the cost optimization problem seem better suited for the analysis of these institutions (Biorn et al. 2003). This approach also permits us to test whether hospitals choose their inputs optimally under given input prices.

To estimate input elasticities in a multi-product hospital we use the Panzar and Willig (1977) production transformation function F(y, x) = 1, which specifies the production possibilities frontier. It can be treated as an output distance function under the Coelli and Perelman (1999) restriction of F(y, x) being homogeneous of degree 1 in y.Footnote 4 We employ the translogarithmic form of F(y, x), which is a widely used example of a flexible function for the analysis of hospital production (Vita 1990).

$$\begin{array}{ll}\ln F({{{\bf{y}}}},{{{\bf{x}}}})&=\mathop{\sum }\limits_{m=1}^{M}{\beta }_{m}\ln {y}_{m}+\mathop{\sum }\limits_{m=1}^{M}\mathop{\sum }\limits_{m{^\prime} =1}^{M}{\beta }_{mm{^\prime} }\ln {y}_{m}\ln {y}_{m{^\prime} }+\mathop{\sum }\limits_{k=1}^{K}{\gamma }_{k}\ln {x}_{k}\\ &+\mathop{\sum }\limits_{k=1}^{K}\mathop{\sum }\limits_{k{^\prime} =1}^{K}{\gamma }_{kk{^\prime} }\ln {x}_{k}\ln {x}_{k{^\prime} }+\mathop{\sum }\limits_{m=1}^{M}\mathop{\sum }\limits_{k=1}^{K}{\eta }_{mk}\ln {y}_{m}\ln {x}_{k}+\mathop{\sum }\limits_{j=1}^{J}{\delta }_{j}{h}_{j}+\alpha ,\end{array}$$
(4)
$$\begin{array}{l}{\beta }_{mm{^\prime} }={\beta }_{m{^\prime} m},\,\,{\gamma }_{kk{^\prime} }={\gamma }_{k{^\prime} k},\\ {\beta }_{1}+\cdots +{\beta }_{M}=1,\\ {\beta }_{1m}+\cdots +{\beta }_{Mm}=0,\,\,m=1,\ldots ,M,\\ {\eta }_{1k}+\cdots +{\eta }_{Mk}=0,\,\,k=1,\ldots ,K,\\ \end{array}$$
(5)

where m is the index for output, \(M=\dim ({{{\bf{y}}}})\), k indicates inputs, \(K=\dim ({{{\bf{x}}}})\), h are hospital control variables, and α is a constant. After rearranging terms in (4) under conditions (5) we obtain:Footnote 5

$$\begin{array}{ll}\ln {y}_{M}&=-\mathop{\sum }\limits_{m=1}^{M-1}{\beta }_{m}\ln \frac{{y}_{m}}{{y}_{M}}-\mathop{\sum }\limits_{m=1}^{M-1}\mathop{\sum }\limits_{m{^\prime} =1}^{M-1}{\beta }_{mm{^\prime} }\ln \frac{{y}_{m}}{{y}_{M}}\ln \frac{{y}_{m{^\prime} }}{{y}_{M}}-\mathop{\sum }\limits_{k=1}^{K}{\gamma }_{k}\ln {x}_{k}\\ &-\mathop{\sum }\limits_{k=1}^{K}\mathop{\sum }\limits_{k{^\prime} =1}^{K}{\gamma }_{kk{^\prime} }\ln {x}_{k}\ln {x}_{k{^\prime} }-\mathop{\sum }\limits_{m=1}^{M-1}\mathop{\sum }\limits_{k=1}^{K}{\eta }_{mk}\ln \frac{{y}_{m}}{{y}_{M}}\ln {x}_{k}-\mathop{\sum }\limits_{j=1}^{J}{\delta }_{j}{h}_{j}\\ &-\alpha +\ln F({{{\bf{y}}}},{{{\bf{x}}}}).\end{array}$$
(6)

Equation (6) can be used to calculate the elasticity of the production function with respect to a given input:

$${\epsilon }_{k}=\frac{\partial \ln {y}_{M}}{\partial \ln {x}_{k}}=-{\gamma }_{k}-2\mathop{\sum }\limits_{k{^\prime} =1}^{K}{\gamma }_{kk{ ^\prime} }\ln {x}_{k{^\prime} }-\mathop{\sum }\limits_{m=1}^{M-1}{\eta }_{mk}\ln \frac{{y}_{m}}{{y}_{M}}.$$
(7)

Marginal product of an input k, which is interpreted as productivity of input k, is derived as \({{{{\rm{MP}}}}}_{k}=-\frac{\partial {y}_{M}}{\partial {x}_{k}}=-\frac{\partial \ln {y}_{M}}{\partial \ln {x}_{k}}\cdot \frac{{y}_{M}}{{x}_{k}}={\epsilon }_{k}\cdot \frac{{y}_{M}}{{x}_{k}}\). The marginal rate of technical substitution between inputs \({x}_{{k}_{1}}\) and \({x}_{{k}_{2}}\), which is defined as \({{{{\rm{MRTS}}}}}_{{k}_{1},{k}_{2}}={{{{\rm{MP}}}}}_{{k}_{1}}/{{{{\rm{MP}}}}}_{{k}_{2}}\) (Mas-Colell et al. 1995, p. 130; Chiang 1984, p. 210), becomes

$${{{{\rm{MRTS}}}}}_{{k}_{1},{k}_{2}}=\frac{{{{{\rm{MP}}}}}_{{k}_{1}}}{{{{{\rm{MP}}}}}_{{k}_{2}}}=\frac{{\epsilon }_{{k}_{1}}}{{\epsilon }_{{k}_{2}}}\cdot \frac{{x}_{{k}_{2}}}{{x}_{{k}_{1}}}.$$
(8)

Under the assumption of cost minimization behavior by hospitals, the marginal rate of technical substitution equals the ratio of input prices (Mas-Colell et al. 1995, p. 137; Chiang 1984, p. 419). The inequality implies that Japanese hospitals do not minimize costs through the optimal choice of inputs. For instance, if \({{{{\rm{MRTS}}}}}_{{k}_{1},{k}_{2}} > \frac{{p}_{{k}_{1}}}{{p}_{{k}_{2}}}\), then the ratio \(\frac{{x}_{{k}_{1}}}{{x}_{{k}_{2}}}\) is less than optimal and input k2 is overutilized relative to input k1.

3.3 Empirical model

3.3.1 Specification for quantile regression analysis

For a given hospital i at time t the value Fit = F(yit, xit) ∈ (0, 1]. Treating \(\ln {F}_{it}\) as a random error term (Coelli and Perelman 2000), (6) can be transformed into the regression specification, where the higher the quantile index, the larger the hospital’s output:Footnote 6

$$\begin{array}{ll}{Q}_{\tau }(\ln {y}_{Mit}| {{{{\bf{x}}}}}_{it},{{{{\bf{h}}}}}_{it})&=\mathop{\sum }\limits_{m=1}^{M-1}{\beta }_{m}(\tau )\ln \frac{{y}_{mit}}{{y}_{Mit}}+\mathop{\sum }\limits_{m=1}^{M-1}\mathop{\sum }\limits_{m{\prime} =1}^{M-1}{\beta }_{mm{^\prime} }(\tau )\ln \frac{{y}_{mit}}{{y}_{Mit}}\ln \frac{{y}_{m{^\prime} it}}{{y}_{Mit}}+\mathop{\sum }\limits_{k=1}^{K}{\gamma }_{k}(\tau )\ln {x}_{kit}\\ &+\mathop{\sum }\limits_{k=1}^{K}\mathop{\sum }\limits_{k{\prime} =1}^{K}{\gamma }_{kk{^\prime} }(\tau )\ln {x}_{kit}\ln {x}_{k{^\prime} it}+\mathop{\sum }\limits_{m=1}^{M-1}\mathop{\sum }\limits_{k=1}^{K}{\eta }_{mk}(\tau )\ln \frac{{y}_{mit}}{{y}_{Mit}}\ln {x}_{kit}\\ &+\mathop{\sum }\limits_{j=1}^{J}{\delta }_{j}(\tau ){h}_{jit}+{\alpha }_{i}(\tau )+{d}_{t}(\tau ).\end{array}$$
(9)

Here \({Q}_{\tau }(\ln {y}_{M}| {{{\bf{x}}}},{{{\bf{h}}}})\) is the conditional τth quantile of \(\ln {y}_{M}\), αi(τ) denotes fixed effects and dt(τ) are annual dummies which capture time effects associated with technological progress in the hospital industry. Control variables hjit include the binary variable for the use of electronic data systems,Footnote 7. We also add the interaction terms for Tohoku region and 2010 year, as well as the interaction term for Tohoku region and 2011 year to capture potential impact on production of the Great East Japan Earthquake of March 2010.

3.3.2 Estimation and inference

Equation (9) is used to estimate the conditional quantile of the dependent variable for each value of τ.Footnote 8 This way we avoid a multiple testing issue inherent to multiple quantile models and the estimates at quantiles outside of the extreme of (0,1) are not influenced by potential failure of the standard asymptotic theory to provide an accurate representation of the finite sample distribution. We use 13 values of τ ∈ [0.2, 0.8], starting with τ = 0.2 at the 0.05–step for detailed analysis. The potential tendencies across quantiles are established by consideration of the relative values of coefficients at several quantile points, adjacent to the extremes of this interval. The size of our sample (5845 observations) together with the total count of parameters (382 comprising 32 coefficients for covariates in the translog specification, 330 fixed effects for each hospital and 20 time effects for each year) restrict the expansion of the interval’s bounds, so we are unable to consider τ < 0.2 or τ > 0.8.Footnote 9

The fit of each regression is assessed using an equivalent of the R2 statistic computed for pairs of quantile regressions: with a restricted set of covariates and with a full set of covariates (Koenker and Machado 1999). Owing to the panel data structure of our model, we compute two variants of this pseudo R2 statistic, which resemble the within R2 for the linear model for panel data: the first compares the full model with the model with only hospital fixed effects, and the second contrasts the full model to the model with hospital fixed effects and time effects.Footnote 10

The Galvao and Kato (2016) asymptotic variance matrix is used for making inference about the coefficients. Standard errors for elasticities and other indicators which are functions of the coefficients are computed using the delta method.

The Wald test is employed to assess hypotheses about the differences between coefficients estimated in regressions for different values of τ. Specifically, we use the Koenker (2005) approach for applying Wald tests in order to assess linear hypotheses in quantile regressions.Footnote 11

3.3.3 Second-stage analysis

The value of the log distance function \(\ln {F}_{it}\) can be interpreted as productive efficiency of hospital i at time t (Knox et al. 2007), and higher values represent higher efficiency. The approach provides more robust estimates than the classic methods of efficiency analysis, which model optimal technology with nonparametric or parametric frontier analysis (Bernini et al. 2004; Liu and Tone 2008).Footnote 12

We use the estimated distance function for high quantile values (0.75, 0.8) as an approximation of the production possibility frontier (Koenker 2005; Liu and Tone 2008), and calculate the residuals

$$\begin{array}{ll}{\hat{u}}_{it}(\tau )=\ln {y}_{Mit}&+\mathop{\sum }\limits_{m=1}^{M-1}{\hat{\beta }}_{m}(\tau )\ln \frac{{y}_{mit}}{{y}_{Mit}}+\mathop{\sum }\limits_{m=1}^{M-1}\mathop{\sum }\limits_{m{^\prime} =1}^{M-1}{\hat{\beta }}_{mm{^\prime} }(\tau )\ln \frac{{y}_{mit}}{{y}_{Mit}}\ln \frac{{y}_{m{^\prime} it}}{{y}_{Mit}}+\mathop{\sum }\limits_{k=1}^{K}{\hat{\gamma }}_{k}(\tau )\ln {x}_{kit}\\ &+\mathop{\sum }\limits_{k=1}^{K}\mathop{\sum }\limits_{k{^\prime} =1}^{K}{\hat{\gamma }}_{kk{^\prime} }(\tau )\ln {x}_{kit}\ln {x}_{k{^\prime} it}+\mathop{\sum }\limits_{m=1}^{M-1}\mathop{\sum }\limits_{k=1}^{K}{\hat{\eta }}_{mk}(\tau )\ln \frac{{y}_{mit}}{{y}_{Mit}}\ln {x}_{kit}+\mathop{\sum }\limits_{j=1}^{J}{\hat{\delta }}_{j}(\tau ){h}_{jit}.\end{array}$$

Note that the residuals correspond to the value of the log distance function together with the individual effects and annual effects, since these quantile-dependent effects cannot be identified separately in the analysis that has the general form of the fixed-effect panel data quantile regression model.

The second-stage analysis of hospital productive efficiency scores \({\hat{u}}_{it}\) employs covariates that are associated with competition, regulation and demand patterns in the hospital industry (Blank and Valdmanis 2010; Ferrier and Valdmanis 1996; Jacobs et al. 2006; Rosko 1999, 2004). The variables for regulation proxy the stringency of financial constraints of the hospital: specifically, we use the share of subsidies and transfers in hospital medical revenues (Yamada et al. 1997) (denoted in the equation below subsidiessh) and the binary variable dpc for involvement in the prospective payment system (Rosko 1999).Footnote 13 The available data do not allow construction of the Herfindahl–Hirschman index, but we proxy the index by the share of the hospital’s beds in the total number of beds in a municipality (bedshare) and by the share of the hospital’s doctors in the total number of doctors in the municipality (docshare). The demographic component of demand pattern is proxied by the share of the population over 65 (shareover65) in the total population of the municipality (Morikawa 2010) while the socio-economic and geographic components are proxied by the standard financial need, which is the product of unit cost of public services, demand for public services and an adjustment coefficient which accounts for socio-economic and geographic factors. (We use log of per capita standard financial need, which is denoted lpstfinneed.)

The estimated equation becomes:

$$\begin{array}{ll}{\hat{u}}_{it}(\tau )\,={\lambda }_{1}{subsidiessh}_{it}+{\lambda }_{2}{dpc}_{it}\\ \qquad\qquad+\,{\lambda }_{3}{docshare}_{it}+{\lambda }_{4}{bedshare}_{it}\\ \qquad\qquad +\, {\lambda }_{5}{shareover65}_{it} +{\lambda }_{6}{profitability}_{it}\\ \qquad\qquad +\,{\lambda }_{7}{lpstfinneed}_{it}+{\kappa }_{i}+{\mu }_{t}+{\varepsilon }_{it},\end{array}$$
(10)

where κi are hospital effects, μt are time effects and εit is a stochastic error term.

3.4 Counterfactual policy analysis: optimal values of inputs and potential cost savings

Suboptimal choice of inputs is associated with excessive costs of a hospital. So using the results of estimation of Eq. (9), we conduct a counterfactual policy analysis: an evaluation of potential cost savings in case of changeover from the values of inputs observed in the data to optimal values.

The optimal values are defined as follows: these values minimize total cost of inputs and provide for the production of a given amount of output under fixed elasticities of the production function and under fixed values of the hospital control variables.

Consider a quantile τ ∈ (0, 1). The following approach is used to find the optimal values of inputs.Footnote 14 Using Eq. (11), we compute the mean elasticities for each input in the regression for conditional τth quantile of hospital output.

Next we focus on a given hospital i and find its optimal amount of inputs through a two-step procedure. (For brevity we omit argument τ in all variables in the description.) First, we compute the residual \({\hat{A}}_{it}\) left after subtracting the linear combination of log inputs \(\ln {x}_{kit}\) with corresponding coefficients \({\hat{\epsilon }}_{k}\) from log output:Footnote 15

$${\hat{A}}_{it}=\ln {y}_{Mit}-\mathop{\sum }\limits_{k=1}^{K}{\hat{\epsilon }}_{k}\ln {x}_{kit}$$

(note that hospital controls are included in \(\hat{A}\)).

The mean elasticity \({\hat{\epsilon }}_{k}\) comes from the estimates of Eqs. (7) and (9) as follows.

$${\hat{\epsilon }}_{k}=-{\hat{\gamma }}_{k}-2\mathop{\sum }\limits_{k{^\prime} =1}^{K}{\hat{\gamma }}_{kk{^\prime} }\overline{\ln {x}_{k}}-\mathop{\sum }\limits_{m=1}^{M-1}\overline{\ln \frac{{y}_{m}}{{y}_{M}}},$$
(11)

Second, we solve the cost minimization problem:

$$\begin{array}{l}\mathop{\mathrm{min}}\limits_{{x}_{1},\, \ldots ,{x}_{K}}\quad \mathop{\sum }\limits_{k=1}^{K}{p}_{kit}{x}_{k}, \qquad \\ {{{\rm{s.t.}}}} \quad {\hat{A}}_{it}+\mathop{\sum }\limits_{k=1}^{K}{\hat{\epsilon }}_{k}\ln {x}_{k}\ge \ln {y}_{Mit},\end{array}$$
(12)

where p1it, … , pKit are the values of input prices at the hospital. Here the boundary condition ensures that the hospital produces the same amount of output as is found in the data.Footnote 16

The solution of the cost minimization problem constitutes the optimum tuple of inputs \({x}_{1it}^{* }\), … , \({x}_{Kit}^{* }\).

The optimum value of the hospital costs is calculated as:

$${C}_{it}^{* }=\mathop{\sum }\limits_{k=1}^{K}{p}_{kit}{x}_{kit}^{* }.$$

So the potential percentage reduction in hospital costs in case of changeover from the actually employed amounts of inputs to the optimal values becomes

$$\% {{\Delta }}{C}_{it}=\frac{{C}_{it}-{C}_{it}^{* }}{{C}_{it}},$$
(13)

where the cost of actually used inputs Cit is computed as

$${C}_{it}=\mathop{\sum }\limits_{k=1}^{K}{p}_{kit}{x}_{kit}.$$

Similarly to the input elasticities, we report the values of cost savings for 1999–2018 and for the whole period.

For each τ, the optimal costs are compared with the factual costs through the standard t-test. For this purpose, we find hospitals where production is close to production in the τth quantile. This is done by considering the group of hospitals in (τ − 0.1, τ + 0.1) according to residuals obtained in the regression for the conditional τth quantile of hospital output. For example, for τ = 0.2 we consider the interval of hospitals with residuals in (0.1, 0.3). The point of the intervals is to avoid unstable results due to small subsamples of hospitals. Similarly, the t-test is applied to hospitals in (τ − 0.1, τ + 0.1) to evaluate the hypothesis about the equality of %ΔC to zero.

4 Data

4.1 Sources

The analysis uses data for Japanese local public hospitals from several sources. The main data source is annual surveys of all local public hospitals in Japan, published in digital form by the Department of Local Finance of the Ministry of Internal Affairs and Communications (Chihou Kouei Kigyou Kessan), with available variables for fiscal years 1999–2018. Participation in the prospective payment system is taken from an administrative database by the Ministry of Health, Labor and Welfare (2020). We use the Japan Council for Quality Health Care (2020) data on hospital accreditation. The names of designated local hospitals (hospitals with high referral rate and an additional subsidy per inpatient)Footnote 17 and the dates when they obtained this status come from the Ministry of Health, Labor and Welfare (2012) and the annual Survey of Finances of Hospital Enterprises (Byouin Jigyou Kessan Joukyou) by the Ministry of Internal Affairs and Communications (2020). We use hospital participation on the market for medical school graduates, taking data from the web portal of the Japan Residency Matching Program (2020). The binary variable for the use of electronic data systems—electronic medical records or electronic ordering system—is constructed using the date of the introduction of these systems at hospitals, published on the web portal of Software Service, Inc. (2021).

The data for Japanese prefectures and municipalities (population, number of acute-care beds and number of physicians at prefectural and municipal healthcare facilities) come from the Statistics Bureau of Japan, the Portal Site of Official Statistics of Japan (2020). The standard financial need of the prefecture or municipality (Kijun Zaisei Juuyougaku)Footnote 18 is taken from the annual Surveys of Prefectural and Municipal Finances (Toudoufuken Kessan Joukyouchou. Shichouson-betsu) carried out by the Ministry of Internal Affairs and Communications (2020). Data on the budgets of prefectures and municipalities from the annual Surveys of Local Public Finance (Chihou Zaisei Joukyouchousa), reported by the Statistics Bureau of Japan, are used to construct the prefectural/municipal profitability variable.

Hospital addresses are retrieved from the Tokyo’s Institute for Health Economics and Policy database on healthcare providers in Japan (Zenkoku Hokensya Iryoukikan. Byouin, Shinryoujo), available since 2015. Websites of hospitals which has ceased to exist by 2015 were used to restore the historic data on hospital location.

4.2 Sample

The non-anonymous character of the databases allows us to merge them by year, hospital name and the name of the prefecture or municipality.

The universe of Japanese local public hospitals includes 884–1007 institutions in 1999–2018.Footnote 19 To guarantee a certain homogeneity of hospital production in the absence of any variable directly related to the prevalence of patients with different diagnoses (case mix) we follow the common strategy of taking only hospitals with acute-care beds (Fujii 2001; Kawaguchi et al. 2014; Yamada et al. 1997). The group of hospitals with only acute-care beds constitutes 615–636 healthcare institutions in 1999–2003, but only 310–426 institutions in subsequent years. This fall in numbers since 2003 reflects the introduction of a long-term care bed category, which was formally embedded into the acute-care beds (Table 1).

Table 1 Universe and samples of Japanese local public hospitals

We discard hospitals which fail to report the numbers of outpatients or hospital discharges. Next, we exclude data with missing numbers of doctors, nurses and other hospital staff (missing values for these hospital inputs account for 5–6% of observations), and where the values depreciable fixed capital or of on medicines are not reported (5–9% of observations in various years).

Furthermore, we omit data with average length of stay below 6 daysFootnote 20 or over 90 daysFootnote 21 (1% of the sample), and data with bed occupancy falling below 0.1.Footnote 22 Finally, we impose a technical condition by which a hospital must be present in the database of at least 11 years out of the analyzed 20. The condition enables identification of hospital fixed effects. Note that imposing this condition automatically adjusts for the change in the definition of acute-care beds in 2003: hospitals that where classified as having only acute-care beds in 1999–2002 but lost this classification in 2003 (after a part of these beds became long-term care) would not have T ≥ 11 and would not be included in our sample. Our final sample consists of 246–322 hospitals in various years (5845 longitudinal observations with 330 hospital clusters).

4.3 Variables

Since administrative databases commonly suffer from lack of hospital-level variables on the actual outputs of hospital activity (such as improvement of patient’s health after completion of medical treatment), we follow a common approach in the hospital productivity literature and use proxies for hospital outputs. A list of such proxies often includes outpatient visits, inpatient admissions and discharges (Jacobs et al. 2006; Rosko and Mutter 2008; Worthington 2004). The Japanese local public hospital database reports the daily number of outpatients which we use as an output in the absence of other, more detailed data. Following the approach by Takatsuka and Nishimura (2008) we compute another output as the mean of admissions and discharges for acute-care hospitals with acute-care beds.Footnote 23 It should be noted that hospitals may pursue other activities as well as treating patients, notably scientific research, and we follow the Jacobs et al. (2006) approach, which captures it by adding the share of research expenditure in hospital revenues.

Labor inputs are doctors, nurses and other hospital staff.Footnote 24 The number of nurses is the sum of licensed practical nurses (the Japanese term is “assistant nurses”) and registered nurses.Footnote 25 Other hospital staffFootnote 26 includes technicians (59%), administrative personnel (29%) and all other staff (12%).

Capital input is depreciable fixed capital (buildings and equipment). Owing to the emphasis of Japanese healthcare on pharmaceuticals, medicines (medical materials and medicines per se, proxied by their costs) are treated as an additional input (Motohashi 2009). This approach follows studies using data from the US and UK, where medicines are viewed as an input in the production function (Pauly 1980; Feldstein et al. 1974).

Regarding input prices, the proxy for labor price is total labor cost of a given labor input divided by the amount of this input. Capital price is proxied by the share of depreciation and real interest expenditure in the book value of depreciable fixed capital (Fujii 2001). The data allow us to construct the proxy for the price of a component of medical materials (medicines per se). The proxy is the retail cost divided by revenue – the inverse of the so-called drug margin rate (Yamada et al. 1997). We implicitly assume that managerial attitude towards medicines and medical materials is similar, so this proxy price for medicines is expanded in our analysis to relate to all medical materials.Footnote 27

As regards hospital characteristics, the quality of care can be proxied by the referral rate. The values of the variable are unavailable, but we use data for the binary status of designated local hospitals (such hospitals must have a referral rate above 60–80%). Other dichotomous hospital characteristics are binary variables for having student nurses and for attracting residents through participating in the Japan Residency Matching Program.

We use bed occupancy to proxy a hospital’s uncertainty about output and openness for emergency care. Bed occupancy may be viewed as “a measure of demand for hospital services” (Ferrier and Valdmanis 1996) and lower bed occupancy indicates willingness to accept more emergency patients (Doi et al. 2005; Yamada et al. 1997).

As regards patient characteristics, we follow Yamada et al. (1997) and use the mean number of examinations per patient as the only available proxy for the severity of diagnosis. The list of variable names and the descriptive statistics are given in Table 2.

Table 2 Descriptive statistics for Japanese acute-care local public hospitals in 1999–2018

We add binary variables for each year to account for annual effects. Annual changes in aggregate medical prices are negligible within the analyzed period, so explicit introduction of a medical consumer price index along with annual dummies would not affect our results.

5 Results

5.1 Typology of technologies

The results of estimating Eqs. (9), (11), (8), (12), (13) for different quantiles τ are given in Table A.1 (coefficients for inputs and hospital characteristics), Table A.2 (input elasticities), Table A.3 (marginal products), Table B.3 (marginal rate of technical substitution), Supplementary Tables S46S47, and Figs. 4 and 5 (optimal values of inputs and potential cost savings).

To establish technological heterogeneity we test for the statistical difference of the values of coefficients, obtained in quantile regression for pairs of low- and high-output quantiles (0.2 and 0.8, 0.25 and 0.75). We also test whether the coefficients in each of these low- and high-output quantiles differ from the estimates in the median regression. Similar comparison is conducted for input elasticities, input productivity (evaluated at sample means) and potential cost savings.

The results of the tests, which are given in Supplementary Tables S5S24 and summarized in Table 3, show technological distinctions across high-output and low-output hospitals, reflected in the effect of the hospital control variables (teaching activity, accreditation, the status of designated hospital, the use of electronic data), in elasticity and productivity of labor of technicians, administrators and other staff, capital and medicines, and in potential cost savings.

Table 3 Difference across hospitals in low-, high- and median-output quantiles

5.2 Input productivity

5.2.1 Labor specialties

Physicians are most productive among labor specialties at Japanese local public hospitals, which differs from findings in US and European data, where the highest productivity is observed among nurses (Blank and van Hulst 2017; Jensen and Morrisey 1986). Production at the highest quantiles of hospital output is associated with lower elasticity of physician labor but higher elasticity of technicians, administrators and other staff. However, the labor of technicians and other staff is insignificant in explaining hospital output in low-output quantiles.

5.2.2 Capital and medicines

There is an inverse relationship between productivity of capital and hospital output: the higher the output, the lower the returns to capital. An inverse relationship is observed between productivity of medicines and hospital output.

Arguably, high-output hospitals do not overinvest in capital and medicines for treating patients. Instead, they demonstrate higher labor productivity, which is largely attributed to the highest productivity of technicians, administrators and other staff.

5.2.3 Time profiles

Note that input elasticity as well as marginal product of each input are functions of the estimated coefficients in Eq. (11) and the time-varying variables (outputs, inputs, input prices, hospital characteristics). Evaluation of time-varying variables at annual values of the sample means enables the computation of input elasticities (Supplementary Tables S7, S10, S13, S19 and S22), input productivities (Fig. 1 and Supplementary Tables S26, S29, S32, S38 and S41), marginal rate of technical substitution (Table B.3), optimal inputs (Fig. 4 and Supplementary Table S47) and potential cost savings in 1999–2018 (Fig. 5 and Supplementary Table S46).

Fig. 1
figure 1

Input productivity by output quantile and year

As regards time profiles of productivity of physicians and nurses, the patterns below are generally observed at all quantiles of hospital output. The marginal product of physician labor goes up in 1999–2004.Footnote 28 The rise in productivity may be linked to the biennial revisions of the unified fee schedule in 2002 and 2004, which were the first historic instances of reduction of the price for healthcare services. However, the productivity of physicians has seen a steady fall since 2004.

Productivity of nurses declines over the analyzed period. Falling productivity of doctors and nurses at Japanese local public hospitals can be largely explained by shortage of specialized personnel and resulting excessive workloads. Number of nurses in Japanese local public hospitals in the 1900s and 2000s were insufficient by international standards and there was a particular shortage of highly qualified and experienced nurses with lengthy tenure (Ikegami and Buchan 2014). Shortages of doctors at Japanese local public hospitals has been long observed in the academic and policy literature (Ikegami 2014).

Hospitals vary as to annual trends in the productivity of technicians, administrators and other staff, as well as productivity of capital and medicines. The productivity of technicians, administrators and other staff at high- and median-output hospitals increases in 1999–2011 and then falls in subsequent years. However, the productivity of this input at high-output hospitals declines throughout the period 1999–2018. Nonetheless, the value at high-output hospitals is several times larger than at low-output hospitals.

Productivity of capital decreases in 1999–2002 but then remains flat in low-output quantiles. High-output quantiles show a rise in the productivity of this input in 1999–2004, followed by steady values in subsequent years (Supplementary Tables S38, S39, S41 and S42).

Productivity of medicines goes up in 1999–2002, but then falls in 2003 and remains steady at median- and low-output hospitals. At high-output hospitals, the fall in productivity of medicines continues from 2003 until 2018.

The reform of local public hospitals launched in 2008 must have had a temporary positive effect by increasing the elasticity of physicians, technicians, administrators and other staff in 2008–2013. But the elasticity either remained flat or fell in subsequent years. As regards input productivity, the reform only succeeded in boosting productivity of technicians, administrators and other staff at median- and low-output hospitals in 2008–2011. The temporary effect of the reform on productivity is similar to temporary improvement of solvency of local public hospitals in 2008–2013 (Mandai and Watanabe 2019).

Significant changes in technology over time are observed with respect to the productivity of physicians and aggregate labor at both high- and low-output quantiles: there is statistical difference in the values of productivity of these inputs in case of each group of hospitals.

The finding contrasts with absence of differences in labor productivity at US hospitals from the mid-1960s to the late 1980s (Thurston and Libby 2002).

In addition, statistically significant annual changes in productivity of nurses and capital are observed at low-output quantiles, while high-output quantiles show annual changes in productivity of technicians, administrators and other staff.Footnote 29

5.3 Effect of hospital characteristics

Estimation of the Eq. (9) shows that each of the teaching variables—the fact of having student nurses and participation in the Japan Residency Matching Program—has a negative effect on production in the first-stage Eq. (9). The result is in line with the findings for US hospitals and may be explained by additional workload on physicians and nurses, who have to act as supervisors (Jensen and Morrisey 1986). However, the negative effect is less pronounced in high-output hospitals than in low-output hospitals.

The use of electronic data systems has a positive and significant effect on production at low-output quantiles. But the effect of the variable is insignificant at high-output hospitals and at some hospitals with medium values of the quantile index. Arguably, these hospitals have already employed efficient patterns of medical treatment, so the introduction of electronic data systems did not have an additional impact on productivity. Our result that finds heterogeneity of the effect of the variable is in line with the findings about differential impact of the use of electronic medical records on labor productivity at groups of Japanese local public hospitals and absence of the positive mean effect on multi-factor productivity (Kaneko et al. 2018).

Research activity reduces hospital output in low- and median-output quantiles, but does not affect production at high-output quantiles. Similarly, binary variables for healthcare quality—hospital accreditation and designation—have negative estimated coefficients. Arguably, higher quality care requires additional workload and intensive diagnostics, so ceteris paribus the number of treated inpatients and outpatients may decrease.

The results of the second-stage regression (10) reveal that productive efficiency is negatively affected by the share of subsidies in hospital revenue and by profitability of local government (Table B.2). Concerning the demographic component of the demand for healthcare at local public hospitals, the share of people below 65 in the total population of a municipality is negatively significant in explaining productive efficiency of a hospital. This result may be linked to relatively higher severity of medical cases among the elderly population, leading to loss of productive efficiency. Per capita standard financial need, embracing the demand for public services, adjusted for socio-economic and geographic factors, has a negative effect on efficiency. So hospitals may demonstrate lower efficiency in conditions of higher demand for their services.

As regards the proxy variable for the Herfindahl–Hirschman index, there is a positive relationship between the share of the hospital’s beds in the total number of beds of a municipality and the hospital’s productive efficiency. The findings is in line with the results of studies using the US data (Blank and Valdmanis 2008; Rosko 1999) and may be explained by the fact that healthcare markets are seldom competitive (Hollinsworth and Peacock 2008).

5.4 Optimal amounts of inputs and potential cost savings

The difference between marginal rate of technical substitution and price ratios is not statistically significant for pairs of labor inputs (doctors, nurses, technicians, administrators and other staff), so the mix of labor specialties does not statistically differ from optimal in all quantiles. However, the input mix between physical capital and total labor staff, between physical capital and medicines, and between medicines and total labor staff is suboptimal (Table B.3).Footnote 30

Accordingly, there are substantial opportunities for cost savings through changeover to an optimal combination of inputs. Decrease of the amount of capital is required for this purpose at all quantiles (Supplementary Table S47). High-output hospitals need to increase the number of technicians, administrators and other staff, but cut the amount of medicines and slightly reduce the number of doctors. Low- and median-output hospitals needed to increase the number of doctors, reduce the number of technicians, administrators and other staff, reduce the amount of medicines in 1999–2014 but increase the amount of medicines in 2015–2018.

For each output quantile the number of nurses required in the optimal combination of inputs is smaller than the actual number, so the number of nurses has to be reduced. This apparently counterintuitive finding might be explained if, due to the shortage of physicians at Japanese local public hospitals,Footnote 31 nurses have to take on the role of physicians, at least in part. This could explain the shortage of nurses in the current situation. But in the optimum situation the number of doctors is increased, so the number of nurses may need to be reduced.

Potential reduction of hospital costs thanks to changeover to optimal inputs varies from 10 to 25%, and the largest cost reductions in percentage terms may be expected at the lowest output quantile. However, high-output quantiles also have vast opportunities for cost reduction. In some years the potential percentage reduction of their costs exceeds that of low- and median-output hospitals.

5.5 Performance indicators in the reform of local public hospitals

To examine the impact of technological heterogeneity in hospital response to the reform, we estimate average values of the performance indicators at high-, low- and median-output quantiles.

With the exception of the share of depreciation in medical revenues, differences across high- and low-output quantiles are observed for all managerial indicators (Figs. 2 and 3 and Table 4). The differences are most salient for the share of medical revenues in medical expenses and the share of labor costs in medical revenues. For both indicators, the higher the output quantile, the closer the value of the indicator to the recommended target.

Fig. 2
figure 2

Performance indicators targeted since the 2008–2013 reform wave, by output quantile and year. Target values for bed occupancy are only available for all hospitals and not separately for hospitals with acute-care beds

Fig. 3
figure 3

Performance indicators added in the 2015–2020 reform wave, by output quantile and year

Table 4 Difference in managerial performance indicators across hospitals in low-, high- and median-output quantiles

As regards time profiles of the indicators, the reform of local public hospitals can be regarded as successful with respect to the share of cost of medicines in medical revenues. The indicator, which was considered to be excessively high, is decreasing and falls below the recommended target in the groups of high- and median-output hospitals in 2009 and in the group of low-output hospitals in 2016.

Desirable trends were observed for a time in the share of ordinary revenues in ordinary expenses, which was below 100% and needed to increase. It rose in all quantile groups in 2006–2011. However, the values remained below the recommended targets and started falling slightly in later years. As stated by the reform guidelines, the share of labor costs in medical revenues decreased in 2008–2012. But the trend was then reversed: the value of the indicator began to increase from 2011 to 2015 in different output quantiles.

5.6 Analysis for hospitals of different size

Since the fixed-effect model does not enable us to explicitly include time-invariant covariates, such as groups of hospitals by size, we classified hospitals according to the number of their beds for purposes of the post-estimation analysis: small (20–99), medium-sized (from 100 to 299) and large (300 and above).Footnote 32 We then estimated average values of input elasticities, input productivities and other parameters for high-, low- and median-output quantile in groups of small, medium-sized and large hospitals.

The results show that difference in the productivity of inputs at large hospitals exists across high- and low-output quantiles for the labor of technicians, administrators and other staff, and across high- and median-output quantiles for capital. Medium-sized hospitals show differences in the values of productivity of both capital and the labor of technicians, administrators and other staff inputs across high- and low-output hospitals as well as across high- and median-output hospitals (Supplementary Tables S59S64 and Supplementary Figs. S1S3). The values of potential cost savings after changeover to an optimal amount of inputs differ between high-output and low-output hospitals within each category of small, medium-sized and large hospitals (Supplementary Tables S77, S79 and S81).

Statistically significant differences in the values and time profiles of five managerial indicators, monitored since the first wave of the reform of local public hospitals, are observed at high- and low-output hospitals in each category of hospitals, classified by their size. These indicators are: the share of ordinary revenues in ordinary expenses, the share of medical revenues in medical expenses, the share of labor costs in medical revenues, the share of costs of medicines in medical revenues, and bed occupancy. The finding supports the cause of further policy diversification, since currently the reform guidelines set different targets only for groups of hospitals of different size (Supplementary Tables S83S109 and Supplementary Figs. S10S15).

5.7 Limitations

We note several limitations of the analysis. Firstly, we follow the common approach in health productivity analysis to use a multi-output production function (Hollinsworth and Peacock 2008; Jacobs et al. 2006), which imposes assumptions about additivity across inputs. However, the alternatives of combining all outputs (e.g., adding inpatients and outpatients) or concentrating on one output only may not fully capture the multi-product character of hospital operations. Secondly, inability to directly observe the quality of labor may result in underestimation of labor returns in high-output hospitals, where human capital is substantial. Thirdly, there is no variable directly related to the case mix of Japanese hospitals, except for those which are involved in the prospective payment system. To proxy homogeneity of patient cases, we follow the general trend in the Japanese hospital literature and limit our sample to hospitals with acute-care beds.

6 Discussion

The results of the estimations point to technology distinctions across high- and low-output acute-care local public hospitals in Japan. Specifically, heterogeneity in hospital technology is reflected in statistically different values of productivity of physicians, technicians, administrators and other staff, physical capital and medicines; it is also apparent in partial effects on production of teaching variables, the status of accreditation and designation; and also in cost savings after changeover to optimal input mix.

The discrete typology of technologies associated with production at Japanese local public hospitals goes in line with findings about the discrete typology of hospital managerial practices (Bloom et al. 2015). Indeed, input productivity can be regarded as a function of managerial efforts (Bloom et al. 2017), and poor management and lack of information can be viewed as major reasons for the existence of inferior technologies (Tsionas 2002).

High-output hospitals are characterized by higher productivity of the labor of technicians, administrators and other staff, probably explained by managerial efforts such as organizational support for team work of physicians and technicians on data management and diagnosis coding (Saito 2007; Shima et al. 2006), revision of occupancy of operating theatres and of the workload at inpatient/outpatient divisions (Doi et al. 2005; Tomioka et al. 2008), and faster reporting of the results of medical tests (Suwabe 2004).

Lower productivity of physicians’ labor at high-output hospitals may be related to a higher degree of labor specialization, possibly owing to larger size of the local market. This supposition can be linked to the theory of local demand shifters of labor specialization in service industries, developed in Baumgardner (1988a) as an extension of the Stigler (1951) model of the impact of market size, originally stemming from the seminal theorem of Adam Smith. The empirical proof for the hospital industry can be found in Baumgardner (1988b).Footnote 33 Similarly, Acemoglu and Autor (2011) allow for an assignment of labor tasks, which may be linked to labor specialization.

High-output hospitals show better values of many managerial performance indicators that were monitored during the reform of local public hospitals in Japan. The values of these indicators are statistically different between groups of high- and low-output hospitals. The finding supports the argument about interrelation between management and production.

Low-output hospitals show higher productivity of capital and medicines, while the productivity of technicians, administrators and other staff is most often insignificant.

7 Conclusion

Innovations in hospital technology are often targeted at a particular input, so it is important to examine the productivity of labor specialties, capital and medicine at hospitals (Blank and van Hulst 2017). Moreover, the technology distinction caused by the productivity of labor specialties, capital and medicines is crucial for disentangling best and worst practices and for making judgments about the differential impact of policy reforms (Acemoglu and Finkelstein 2008; Weisbrod 1992). However, empirical evidence on the issue is insufficient and there is a gap in the literature as regards quantification of differences in input productivity across groups of hospitals.

Our paper addresses technology differences at hospitals using a conditional quantile regression approach applied to analysis of the production function. The novel finding of the paper, which employs longitudinal data for Japanese local public hospitals in 1999–2018, is technological heterogeneity expressed as different levels of productivity of labor, capital and medicines at high- and low-output hospitals. Secondly, changes in the values of productivity of labor and capital are discovered over the analyzed period. Finally, differences in the values of input productivity, potential cost savings in case of changeover to optimal amount of inputs, and managerial performance indicators across high- and low-output hospitals persist even after controlling for hospital size.

The establishment of technological heterogeneity underlines the importance of input productivity analysis in the hospital sector. We believe that the quantification of technological heterogeneity of Japanese local public hospitals offers helpful guidance for regulatory changes. In particular, different values of the productivity of labor, capital and medicines at high-output and low-output hospitals may imply different values for the marginal rate of technical substitution and hence different conclusions about the optimality of input mix and potential cost savings.