1 Introduction

Bank inefficiency is considered as one of the main reasons behind the financial crisis that has shaken the U.S. banking system during 2007–2009 (Assaf et al. 2019). Since the Bank Holding Companies Act of 1956, bank holding company (BHC hereafter) structures have become dominant in the US banking industry (Koutsomanoli-Filippaki et al. 2012). They controlled over 15 trillion USD in total assets, accounting for more than 95% of total US banking assets in 2015. Kane (1996) emphasizes that the holding company framework can re-engineer the organisations to squeeze a large product line and geographically dispersed network. The rapid increase in the number of BHC subsidiaries has raised concerns on whether BHC structures enhance the efficiency of banks at the subsidiary level (Kashian et al. 2019). On the one hand, Assaf et al. (2019) find that efficient banks are more resilient to financial crises than their inefficient peers. Specifically, banks with lower cost and revenue efficiency could suffer higher bank risk (Fiordelisi et al. 2011). On the other hand, Luo et al. (2011) and Barth et al. (2008) state that financial crises have harmed banking activities from the funding side and the lending side, significantly reducing bank efficiency. In fact, banks reduced their lending activities (De Haas and Van Horen 2013), while at the same time suffering from pressure due to the risk of withdrawal of deposits from customers (Martinez Peria and Schmukler 2001). While the difference in bank efficiency between diversified and focused bank structures is questionableFootnote 1, it is essential for the regulators to increase bank efficiency at different bank holding company structures, leading to more stable U.S. banking industry.

This paper applies Data envelopment analysis (DEA hereafter) to assess bank efficiency since this technique has been widely used as a tool to measure performance in the banking industry (Wang et al. 2014; Barth et al. 2013). The DEA approach is a commonly applied non-parametric method to compute efficiency scores. It is combined here with a second stage regression analysis to determine factors explaining the level of bank efficiency (Casu et al. 2011; Curi et al. 2013). More specifically, we use a fuzzy multi-objective two-stage DEA approach to measure bank efficiency. This technique has been used by Wang et al. (2014) for bank holding company efficiency. Different from Wang et al. (2014), we focus on the subsidiary level of BHCs and compare the efficiency between banks belonging to a single-BHC and those that belong to a multi-BHC.

This study adopts the fuzzy approach developed by Zimmermann (1978) that transforms a multi-objective programming problem into a single-objective programming problem. We choose to apply fuzzy multi-objective DEA because this technique has several advantages compared to conventional DEA. First, a conventional DEA methodology considers the production procedure as a ‘black box’ with insufficient details to identify sources of inefficiency. Indeed, the bank production function is complex with the interaction of different activities and divisions (Zimmermann 1978). Second, conventional DEA gives a relatively large number of efficient DMUs, implying weak discriminating power. Fuzzy multi-objective DEA combines all the efficiency functions of each DMU into one function, providing more accurate result (Wang et al. 2014).

By using commercial bank data from 1994 to 2018, this paper examines the difference in bank efficiency between a multi-BHC and a single-BHC at the subsidiary level.Footnote 2 On the one hand, multi-BHCs can strengthen their finance by diversifying their funding externally and creating internal capital funding (San-Jose et al. 2018). Multi-BHCs can also lessen financial difficulties and avoid bankruptcy for their affiliates by transmitting their source of finance to them. Moreover, one subsidiary within a multi-BHC can share resources with other subsidiaries. On the other hand, multi-bank holding’s affiliates have lower efficiency due to their structure (Makinen and Jones 2015). A multi-BHC has more than two subsidiaries; making it, therefore, relatively difficult for multi-BHCs to distribute their financial resources equally between their affiliates. In addition, there is a competition at the affiliation level that may lead to higher cost of raising capital and reduce bank efficiency. Using several estimation techniques, i.e., fixed effect, truncated regression, difference-in-difference regression based on propensity score matching and dynamic treatment, we show that multi-BHC affiliates exhibit higher efficiency scores than their single counterparts. This result is consistent with the internal capital market theory, suggesting that multi-BHC affiliates can receive resources both from their parents and other banks that belong to the same multi-BHC.

There is also a possibility that affiliates of a multi-BHC suffer from high risk despite their benefit from the internal capital market. Hughes et al. (1996) state that diversification causes an increase in the proportion of loan to assets, which leads to higher credit risk. Berrospide et al. (2016) explain the risk transmission channel between BHC affiliates. More specifically, BHC affiliates could endure negative spillover effect through internal capital market when their peers suffer from local economic or credit shocks.

There could also be a link between efficiency and bank concentration, bank size and bank structure (Demirgüç-Kunt and Levine 2000). For instance, in dynamic and expanding markets, banks may benefit from growing demand, increased activity in branch offices, and expanded networking that could improve efficiency and vice versa. However, dealing with more customers could generate inefficiencies because of the need to meet all of their diverse requirements. Gonzalez (2009) suggest that ignoring endogeneity leads to biased estimation given the endogeneity nature of bank structures. To solve a possible endogeneity issue, we apply a difference-in-difference method based on propensity score matching. We first match banks switching their status from single-BHC affiliates to multi-BHC affiliates with banks that hold the same status based on bank characteristics such as bank size and bank capital. We then estimate a difference-in-differences regression to consider whether and how banks affect their efficiency when they switch their status from single-BHC affiliates to multi-BHC affiliates. We find that single-BHC affiliates that switch to multi-BHC affiliates gain higher efficiency.

We also test whether corporate governance, especially CEO power could influence the effect of bank structure on bank efficiency. Prior literature does not show clear-cut findings regarding the relationship between CEO power and bank efficiency, especially within different banking structures. Stulz (1988) and Bhagat and Jefferis (2002) suggest that bank governance plays an important role in explaining bank efficiency. On the one hand, powerful CEOs are likely to dominate boards, affect their decisions, and encourage the adoption of risky activities, leading to lower efficiency. On the other hand, powerful CEOs are more inclined to reduce conflicts between board members, hence increasing bank efficiency. We find that multi-BHC with powerful CEOs are less efficient. This result is consistent with prior research (Bitar et al. 2018; Haque and Brown 2017) and provides practical implications for bank regulators, especially for bank activities within different BHC structures.

Our paper contributes to several strands of literature. First, it applies a fuzzy multi-objective two-stage DEA technique that considers the structure of US banks. This technique is an advanced performance measurement tool that combines efficiency functions of all DMUs into one function and increases discriminating power (Wang et al. 2014). Second, it relies on internal capital market theory to expand our knowledge regarding the efficiency of multi-BHC subsidiaries. Third, our paper contributes to the recently growing literature on the role of CEO power in the banking industry. Fourth, it expands the literature comparing the efficiency between multi- and single-BHC subsidiaries. Our evidence shows an inverted U-shape relationship between multi-BHC network and bank efficiency, suggesting the existence of an optimal number of subsidiaries in BHC structures to gain the highest efficiency levels.

The rest of the paper is organised as follows. Section 2 lays out the theoretical framework and develops the hypotheses. Section 3 presents the multi-objective two-stage DEA. Section 4 describes the data and regression models. Section 5 presents the analyses and explains the empirical results. Section 6 reports robustness checks. The last section concludes.

2 Theoretical framework and hypotheses development

2.1 BHC structures and bank efficiency

A multi-BHC structure is organised as a hierarchy structure in which a holding company’s parent is located at the top. The parent company controls a lead bank while other bank subsidiaries can work as full-service branches. Contrariwise, Watkins and West (1982) define a single-BHC as a structure that includes a single bank and a number of nonbank subsidiaries.

Diversification at the parent level may increase the parent’s capacity to create an internal capital market and acquire better financing deals. The internal capital market theory suggests that the creation of an internal capital market, where the parents allocate their resources across different projects, could reduce the need and the cost of external financing. This theory explains many benefits for subsidiaries. For instance, Houston and James (1998) find that affiliated banks have lower cash flow sensitivity of loan growth than stand-alone banks, implying that banks belonging to a banking group are more likely to reduce the cost of raising funds externally. Cremers et al. (2010) and Kashian et al. (2019) state that headquarters of banking groups can provide their affiliations with intertemporal insurance when experiencing shortfalls in funding. In addition, multi-BHC affiliates can access to internal secondary loan market of their parents, hence, the subsidiaries holding less capital can originate loans and sell them to better-capitalized affiliates. Therefore, the subsidiaries can mitigate any capital constraint on their loan production.

Internal capital markets are regarded as a “source-of-strength” (Mirzaei and Moore 2019; Chronopoulos et al. 2013). For instance, headquarters can divert resources from other affiliates to rescue troubled subsidiaries. The “too-big-to-fail” resolution demonstrates that counterparties of troubled corporations need to be protected to decrease the collateral damage that was caused by the bankruptcy of that firm (Evanoff and Ors 2008; Kaufman 2014). In addition, headquarters can reallocate resources or reduce earnings volatility, that lead to lower risk-taking at the affiliate level of more diversified bank groups (Ly et al. 2018). Overall, a multi-BHC has more subsidiaries than single-BHC and allows them to have more internal resources. From the above arguments, we draw our first hypothesis.

Hypothesis 1

Multi-BHC affiliates exhibit higher efficiency than single-BHC affiliates.

2.2 CEO power and bank efficiency

CEOs are more powerful and play a more important role in the decision-making process of their banks when they also chair the board of directors and have longer tenure at their position. In general, researchers find that CEO power has a detrimental effect on bank performance and efficiency. For instance, De Haan and Vlahu (2016) find that CEO power has a negative impact on bank performance as it leads to CEO entrenchment, hence, preventing other board members accessing information flows, influencing board decisions and undermining monitoring function of independent directors (Mollah and Zaman 2015). Lewellyn and Muller-Kahle (2012) shows that CEO power measured by CEO duality reduces banks’ efficiency partly due to excessive concentration of power in one person’s hands.

More specifically, CEO duality is the situation in which CEO is also the chair of the board. CEO duality may have a detrimental effect on bank performance, board monitoring and its influence on board decisions (Lasfer 2006). For example, a powerful CEO has the ability to influence the selection of board members with the appointment of non-executive directors who are unlikely to influence their decisions (Adams and Mehran 2012; Chen et al. 2018; De Jonghe et al. 2012). Nevertheless, a powerful CEO can have a positive impact on bank performance and bank efficiency. In particular, a combined role of CEO and chairman may prevent the agency problem within banks by reducing the likelihood of conflict between CEO and board members, thereby, improving banks’ performance and banks’ efficiency (Stoeberl and Sherony 1985). Moreover, CEO duality enhances banks’ leadership, hence, directing banks’ objective in a clear manner and enhancing bank stability (Anderson and Anthony 1986).

Several studies on banks have already addressed the issues regarding the impact of CEO power on bank performance and bank efficiency. Pi and Timme (1993) show that banks with non-duality CEO are more cost efficient than those with CEO duality. Grove et al. (2011) show that CEO tenure is negatively associated with bank performance and loan quality. However, Simpson and Gleason (1999) argue that US banks experience lower probability of financial distress when CEO is also a chair of board of directors because of better strategic vision and leadership. Therefore, it can be argued that

Hypothesis 2

CEO power affects bank efficiency at BHC subsidiary level.

3 Fuzzy multi-objective two-stage DEA for BHC affiliations

3.1 Fuzzy multi-objective two-stage DEA

The DEA method is a widely used technique in measuring efficiency in the banking industry. The conventional DEA ignores the production process and considers it a “black box” as it totally ignores what happens inside. This paper adopts a relational two-stage DEA model along with a fuzzy multiple objective programming design to analyse the organizational structure and production process of commercial banks. We focus on technical efficiency measured by dividing the weighted sum of outputs by the weighted sum of inputs. We consider input oriented instead of output oriented since bank managers have greater influence over bank inputs rather than bank outputs (Fethi and Pasiouras 2010).

We follow Wang et al. (2014) to construct the fuzzy multi-objective two-stage DEA. Model (1) evaluates the relative efficiency of \( {\mathrm{n}}DMU_{j} ({\mathrm{j}} = 1,2, \ldots {\mathrm{n)}} \), each with m inputs \( x_{ij} ({\mathrm{i}} = 1,2, \ldots ,{\mathrm{m)}} \), q intermediate product \( k_{pj} \left( {p = 1,2, \ldots ,q} \right) \) and s output \( y_{rj} \left( {r = 1,2, \ldots ,s} \right) \). If we consider the efficiency ratio of all DMUs, the multiple objectives program can be shown as

$$ \begin{aligned} & z_{1} = max\frac{{\mathop \sum \nolimits_{r = 1}^{s} u_{r} y_{r1} }}{{\mathop \sum \nolimits_{i = 1}^{m} v_{i} x_{i1} }} \\ & \cdots \\ & z_{n} = max\frac{{\mathop \sum \nolimits_{r = 1}^{s} u_{r} y_{rn} }}{{\mathop \sum \nolimits_{i = 1}^{m} v_{i} x_{in} }} \\ & {\rm s.t.} \\ & \frac{{\mathop \sum \nolimits_{{p} = 1}^{\it{q}} {\eta}_{\it{p}} \it{k}_{\it{pj}} }}{{\mathop \sum \nolimits_{{i} = 1}^{\it{m}} \it{v}_{\it{i}} \it{x}_{\it{ij}} }} \le 1,\;\; {j} = 1,2, \ldots ,\it{n} \\ & \frac{{\mathop \sum \nolimits_{r = 1}^{s} u_{r} y_{rj} }}{{\mathop \sum \nolimits_{p = 1}^{q} \eta_{p} k_{pj} }} \le 1,\;\; j = 1,2, \ldots ,n \\ & \frac{{\mathop \sum \nolimits_{r = 1}^{s} u_{r} y_{rj} }}{{\mathop \sum \nolimits_{i = 1}^{m} v_{i} x_{ij } }} \le 1, \;\;j = 1,2 \ldots ,n \\ & u_{r} ,\eta_{p} ,v_{i} \ge \varepsilon > 0\;\;r = 1, \ldots ,s\;\;i = 1, \ldots ,m\;\;p = 1, \ldots ,q \\ \end{aligned} $$
(1)

In model (1), \( z_{1} \) is the efficiency of \( DMU_{1} \) while \( u_{r} ,\eta_{p} {\mathrm{and}} v_{i} \) are the factor weights. However, for computational convenience, the model can be re-expressed as

$$ \begin{aligned} & z_{1} = max\frac{{\mathop \sum \nolimits_{r = 1}^{s} u_{r} y_{r1} }}{{\mathop \sum \nolimits_{i = 1}^{m} v_{i} x_{i1} }} \\ & \cdots \\ & z_{n} = max\frac{{\mathop \sum \nolimits_{r = 1}^{s} u_{r} y_{rn} }}{{\mathop \sum \nolimits_{i = 1}^{m} v_{i} x_{in} }} \\ & {\rm s.t.} \\ & \mathop \sum \limits_{{p} = 1}^{q} \eta_{p}{k}_{pj} - \mathop \sum \limits_{{i} = 1}^{m} {v}_{i} x_{ij} \le 0 , \;\;j = 1,2, \ldots ,{n} \\ & \mathop \sum \limits_{r = 1}^{s} u_{r} y_{rj} - \mathop \sum \limits_{p = 1}^{q} \eta_{p} k_{pj} \le 0 , \;\;j = 1,2,\ldots n \\ & \mathop \sum \limits_{r = 1}^{s} u_{r} y_{rj} - \mathop \sum \limits_{i = 1}^{m} v_{i} x_{ij} \le 0 , \;\;j = 1,2, \ldots ,n \\ & u_{r} ,\eta_{p} ,v_{i} \ge \varepsilon > 0 \;\;r = 1, \ldots ,s\;\;i = 1, \ldots ,m\;\;p = 1, \ldots ,q \\ \end{aligned} $$
(2)

To solve this model, Zimmermann (1978) fuzzy approach has been adopted, which transforms a multiple objective program into a single objective program. With regard to its objective function, each DMU illustrates its level of achievement through means of the membership function. The membership function is illustrated as follows

$$ h_{j} \left( {z_{j} } \right) = \left\{ {\begin{array}{*{20}l} 0 \hfill &\quad { if\;\; z_{j} \le z_{j}^{l} } \hfill \\ {\frac{{z_{j} - z_{j}^{l} }}{{z_{j}^{u} - z_{j}^{l} }}} \hfill & \quad{if\;\;z_{j}^{l} \le z_{j} \le z_{j}^{u} } \hfill \\ 1 \hfill &\quad { if\;\;z_{j} \ge z_{j}^{u} } \hfill \\ \end{array} } \right. $$
(3)

where \( z_{j} \) is the efficiency of \( DMU_{j} \), \( z_{j}^{l} \;\;{\mathrm{and}}\;\; z_{j}^{u} \) illustrate the lower bound and upper bound of the objective function, respectively. The membership function of \( z_{j} \) is denoted by \( h_{j} \left( {z_{j} } \right) \). Within this function, the highest value of \( h_{j} \left( {z_{j} } \right) \) equals 1 and the lowest value equals 0. We solve the model by maximizing the minimum of the membership function in Model (3), which can be written as follows

$$ \begin{aligned} & \mathop {\mathrm{max }}\limits_{u,v, \eta } \mathop {\mathop { {\mathrm{min}}}\limits^{{ {\mathrm{n}}}}\limits_{ j } h_{j} \left( {z_{j} } \right)} \\ & {\mathrm{s}} . {\mathrm{t}} .\\ & \mathop \sum \limits_{p = 1}^{q} \eta_{p} k_{pj} - \mathop \sum \limits_{i = 1}^{m} v_{i} x_{ij} \le 0,\;\;j = 1,2, \ldots ,n \\ & \mathop \sum \limits_{r = 1}^{s} u_{r} y_{rj} - \mathop \sum \limits_{p = 1}^{q} \eta_{p} k_{pj} \le 0,\;\;j = 1,2, \ldots n \\ & \mathop \sum \limits_{r = 1}^{s} u_{r} y_{rj} - \mathop \sum \limits_{i = 1}^{m} v_{i} x_{ij} \le 0,\;\;j = 1,2, \ldots ,n \\ & u_{r} ,\eta_{p} ,v_{i} \ge \varepsilon > 0\;\;r = 1, \ldots ,s\;\;i = 1, \ldots ,m \;\;p = 1, \ldots ,q \\ \end{aligned} $$
(4)

Since efficiency of \( DMU_{j} \) ranges from 0 to 1, the membership function \( h_{j} \left( {z_{j} } \right) \) can be simplified as \( z_{j} \). For computational convenience, the auxiliary variable \( \psi \) is introduced as follows

$$ \psi = \mathop {\mathop { {\mathrm{min}}}\limits^{\mathrm{n}} \limits_{j} z_{j} } $$
(5)

We can rewrite Eq. (5) as

$$ \frac{{\mathop \sum \nolimits_{r = 1}^{s} u_{r} y_{rj} }}{{\mathop \sum \nolimits_{i = 1}^{m} v_{i} x_{ij } }} \ge \psi , \;\;j = 1,2 \ldots ,n $$
(6)

Combining Eqs. (4) and (6), we can rewrite the following mathematical programming model

$$ \begin{aligned} & \mathop {\mathrm{max }}\limits_{u,v, \eta } \psi \\ & {\mathrm{s}} . {\mathrm{t}} .\\ & \mathop \sum \limits_{r = 1}^{s} u_{r} y_{rj} - \psi \mathop \sum \limits_{i = 1}^{m} v_{i} x_{ij} \ge 0 ,\;\; j = 1,2, \ldots ,n \\ & \mathop \sum \limits_{p = 1}^{q} \eta_{p} k_{pj} - \mathop \sum \limits_{i = 1}^{m} v_{i} x_{ij} \le 0,\;\; j = 1,2, \ldots ,n \\ & \mathop \sum \limits_{r = 1}^{s} u_{r} y_{rj} - \mathop \sum \limits_{p = 1}^{q} \eta_{p} k_{pj} \le 0,\;\; j = 1,2,\ldots n \\ & \mathop \sum \limits_{r = 1}^{s} u_{r} y_{rj} - \mathop \sum \limits_{i = 1}^{m} v_{i} x_{ij} \le 0,\;\; j = 1,2, \ldots ,n \\ & u_{r} ,\eta_{p} ,v_{i} \ge \varepsilon > 0 \;\; r = 1, \ldots ,s\;\;i = 1, \ldots ,m\;\;p = 1, \ldots ,q \\ \end{aligned} $$
(7)

Applying bisection method to solve this model, we get the value of factor weight \( u_{r}^{*} ,\eta_{p}^{*} ,v_{i}^{*} \) to measure efficiency as follows

$$ z_{j}^{h} = \frac{{\mathop \sum \nolimits_{r = 1}^{s} u_{r}^{*} y_{rj} }}{{\mathop \sum \nolimits_{i = 1}^{m} v_{i}^{*} x_{ij} }} = \frac{{\mathop \sum \nolimits_{p = 1}^{q} \eta_{p}^{*} k_{pj} }}{{\mathop \sum \nolimits_{i = 1}^{m} v_{i}^{*} x_{ij} }}*\frac{{\mathop \sum \nolimits_{r = 1}^{s} u_{r}^{*} y_{rj} }}{{\mathop \sum \nolimits_{p = 1}^{q} \eta_{p}^{*} k_{pj} }} = z_{j}^{h1} * z_{j}^{h2} $$
(8)

In which, \( z_{j}^{h1} , z_{j}^{h2} {\mathrm{and}} z_{j}^{h} \) are the efficiency of first stage, second stage and the whole process, respectively.

3.2 Production process

As banks play an intermediate role in transforming deposits into lending and investments we follow the intermediate approach to construct input, output and intermediate products (Favero and Papi 1995). The production process includes two stages. For the first stage, the inputs are interest expense and non-interest expense (Harris et al. 2013). Deposits play an intermediate role between the first and second stage. Holod and Lewis (2011) state that deposits should be considered as the intermediate product of the process as deposits play a dual role in bank production procedure. Outputs include loans, interest income and non-interest income (Harris et al. 2013). The outputs of the first stage are inputs for the second stage.

4 Data and regression model

4.1 Sample

This study uses panel data of BHC affiliates for the period 1994–2018. Each bank subsidiary that belongs to a BHC is treated as a DMU. We exclude stand-alone banks, banks with foreign ownership and keep only banks belonging to holding companies. We also discard banks with missing values for the inputs or outputs needed to run DEA and those with no data available for at least half of the study period. After cleaning the data, we use the “Jackstrap” methodology to obtain a homogenous dataset (Chortareas et al. 2013) by applying bootstrap and calculating each DMU efficiency score relative to all other DMUs when a DMU is removed from the dataset. By doing so, outlier banks with data errors can be detected and removed from the dataset. The final sample consists of 3853 banks from 1994 to 2018.

We primarily obtain the input and output data from Call report and CEO data from Boardex and SNL Financial that is part of S&P Global Market Intelligence. We match Boardex data with financial data from Call Report. CEO details are from Boardex (e.g., CEO tenure and CEO duality). When CEO information is not available in Boardex, we look at it in Bloomberg and bank annual reports.

4.2 Variables and regression model

To evaluate the effect of bank type on bank efficiency, we estimate the following model

$$ {Bank\;efficiency}_{it} = \beta_{0} + \beta_{1} *{\textit{MBHC\;affiliate}}_{it} + \mathop \sum \limits_{i = 1}^{n} \beta_{i} *Controls_{it} + \lambda_{t} + \varepsilon_{it} $$
(9)

where the dependent variable is Bank efficiency of BHC affiliates measured by fuzzy multi-objective two-stage DEA. MBHC affiliate is a dummy variable that takes the value of 1 when banks are multi-BHC affiliates; and 0 if they are single-BHC affiliates. Controls are control variables that are deemed to affect bank efficiency. \( \lambda_{t} \) is a year dummy variable that captures year-fixed effects.

We use different control variables that affect bank efficiency such as Bank size, Bank capital, Bank non-performing loan, and Bank profit. Ly et al. (2017) find that the likelihood of BHC affiliates being acquired targets in mergers and acquisitions changes across various asset size, therefore, Bank size is measured as the natural logarithm of total asset and is included to capture the size effects of BHC affiliates. Benston (1965) and Miller and Noulas (1996) show that large banks can take advantage of economies of scale and are more efficient than others. Bank capital is measured as total bank capital divided by total assets. There are opposite views on the effect of Bank capital on bank performance and efficiency. On the one hand, by a study of 72 countries during the period 1999–2007, Barth et al. (2013) find that well-capitalized banks exhibit higher efficiency. On the other hand, bank capital can negatively affect bank performance and efficiency, encouraging banks to take excessive risks (Altunbas et al. 2007). Bank non-performing loan is measured as total bank non-performing loan divided by total assets. The importance of non-performing loans has been discussed by Berger and DeYoung (1997), who consider that non-performing loans have a detrimental effect on banks’ efficiency and stability because of asset quality deterioration. Karadima and Louri (2020) also argue that financial and debt crises in the euro area highlight the serious problem of non-performing loans faced by majority of banks. Bank profit is measured by net income divided by total assets. We expect bank profit ratio to have a positive impact on bank efficiency as highly profitable banks are preferred by clients, attracting more deposit and better customers (Miller and Noulas 1996).

We apply ordinary least square, fixed effect and truncated regression approaches. The latter is common in estimating the factors affecting bank efficiency as efficiency lies between 0 and 1. We also apply parametric bootstrapping to enhance the reliability of the results.

5 Analysis and results

5.1 Descriptive statistics and correlation analysis

Table 1 summarizes descriptive statistics of variables used in the DEA model for single-BHC affiliates, multi-BHC affiliates and the whole sample. We winsorize all continuous variables at the 1st and 99th percentile to minimize the effect of outliers.

Table 1 Descriptive statistics

The descriptive statistics from Table 1 show that multi-BHC affiliates seem to operate more efficiently than single-BHC affiliates with an average efficiency score of 0.50 and 0.43, respectively. Multi-BHCs tend to be larger, have higher capital ratio, higher profit ratio but lower deposit ratio.Footnote 3

5.2 The impact of bank structure on bank efficiency

Table 2 reports the results of our main regression. We adjust standard errors for heteroscedasticity and cluster them at the bank level. Year dummies are included in all models to control for year-fixed effects.

Table 2 Efficiency comparisons between single-bank holding company’s affiliates and multi-bank holding company’s affiliates

The coefficient of our main variable, MBHC affiliate, is positively and statistically significant at the 1% levels, implying that multi-BHC affiliates have higher efficiency than single-BHC affiliates. The results are consistent for OLS, fixed effect and truncated regression. According to internal capital market theory, headquarters of multi-BHC can diversify and obtain better finance, creating internal capital for banks at both parents and affiliate level. Multi-BHCs have more subsidiaries than single-BHCs and, therefore, allow affiliates to access more internal resources than their single-BHC counterparts. The subsidiaries, therefore, can take advantage of better financing with lower cost and reduce the effect of undesirable output by sharing risks between subsidiaries (Lamont 1997; Stein 1997).

For control variables, we find that large banks perform better, which is consistent with Barth et al. (2013), suggesting that larger banks may get benefit from economies of scale. Demsetz and Strahan (1995) state that large banks can take advantage of diversification. Diversification effect mostly dominates the internalization effect in multi-BHC structure, therefore, the multi-BHC subsidiaries can gain more benefits from the diversified structure (Ly and Shimizu 2018). Bank capital has a positive impact on bank efficiency, which is consistent with Barth et al. (2013). Bank non-performing loan has a negative effect on bank efficiency, implying that non-performing loans have a detrimental effect on banks’ efficiency and stability because of asset quality deterioration (Berger and DeYoung 1997). Bank profit exhibits a positive relationship with bank efficiency, indicating that profitable banks have higher efficiency. High profitable banks are preferred by clients, attracting more deposit and best potential borrowers (Miller and Noulas 1996).

5.3 Difference-in-difference based on propensity score matching analysis

From the previous analysis, multi-BHC affiliates are found to be more efficient than their single-BHC counterparts. It can be argued that the difference in bank efficiency may not be caused by the difference in bank types, i.e. either single-BHC or multi-BHC structure, however, due to the endogenous decision made by CEO to become such bank type or due to omitted bank characteristics.

To control for endogeneity, our test focuses on banks switching their parents from single-BHC affiliates to multi-BHC affiliates. We make an assumption that banks that change their status may not change their characteristics during such a short period, but their efficiency changes after switching their status. Difference-in-differences approach eliminates the unobserved heterogeneity and increases the evaluation quality (Blundell and Costa Dias 2000). This section, thus, tests whether banks switching from single-BHC affiliates to multi-BHC affiliates exhibit higher efficiency levels by applying a difference-in-difference regression based on propensity score matching method.

We conduct the test in two steps. First, we divide our sample into two sub-samples, namely, banks that change their status from single-BHC affiliates to multi-BHC affiliates (treated group) and those that do not change their status (control group). We use propensity score matching with nearest neighbour matching to match banks that do not change their status with those that change their status based on bank-specific characteristics such as bank size, capital ratio, non-performing loan ratio and profit ratio. We match the groups in the same year to rule out the difference between macroeconomics conditions across different years.

We then estimate differences in bank efficiency between treated banks and non-treated banks by the running a following difference-in-difference regression.

$$\begin{aligned} {Bank\;efficiency}_{it} & = \beta_{0} + \beta_{1} Treat_{i} + \beta_{2} Post_{i} + \beta_{3} Treat_{i} *Post_{i} \\ & \quad + \mathop \sum \limits_{i = 1}^{n} \beta_{i} *Controls_{it} + \lambda_{t} + \varepsilon_{it}\end{aligned} $$
(10)

\( Treat_{i} \) is equals to 1 for banks that change the status; and 0 for banks that do not change the status. This variable is time-invariant. \( Post_{i} \) is a dummy variable equal to 1 for the time after banks change their status and 0 for other periods. The most important variable is the interaction between \( Treat_{i} \) and \( Post_{i} \), indicating changes in difference between treated and non-treated bank before and after status changes. We include year fixed effect in the equation.

Table 3 reports the difference-in-difference regression results for two windows: (i) [− 1, + 1] that includes 1 year before and 1 year after the banks change their status; and (ii) [− 2, +2] that captures the effects of 2 year before and 2 year after the banks change their status. In all regressions shown in Table 3, the interaction term between \( Treat_{i} \) and \( Post_{i} \) is positively and statistically significant at the 1% threshold level, implying that single-BHC affiliates tend to increase their efficiency after switching to multi-BHC affiliates compared to single-BHC affiliates that keep the same status.Footnote 4

Table 3 Difference in difference based on propensity score matching

We follow Leung et al. (2019) to apply a dynamic treatment method with this sample to see how the difference between treated banks and non-treated banks changes over different periods based on the following equation:

$$\begin{aligned} Bank\;efficiency_{it} & = \lambda_{0} + \lambda_{1} * Before_{it}^{ - 2 or - 1} + \lambda_{2} * Current_{it}^{0} \\ & \quad + \lambda_{3} * After_{it}^{ + 1} + \lambda_{4} * After_{it}^{ + 2} + \mathop \sum \limits_{i = 1}^{n} \lambda_{i} *Controls_{it} + \alpha_{t} + \varepsilon_{it}\end{aligned} $$
(11)

where \( Before_{it}^{ - 2 or - 1} \) is a dummy variable that is equal to 1 for each of the two years before banks switch from single-BHC affiliate to multi-BHC affiliate. \( Current_{it}^{0} \) is a dummy variable that is equal to 1 for the year when banks switch their status. \( After_{it}^{ + 1} \) (\( After_{it}^{ + 2} \)) is a dummy variable that is equal to one year (two years) after changing their status, respectively. We include year fixed effect in the equation.

Table 4 reports the results of the dynamic treatment method. The coefficient of \( Before^{ - 2 or - 1} \) is small and insignificant, suggesting no systematic differences in pre-trend between the treated and control banks and the parallel assumption is likely satisfied (Roberts and Whited 2013). Compared to the pre-treatment years, we observe an increase in bank efficiency when banks change their status from single-BHC affiliates to multi-BHC affiliates. The effect is especially stronger for 1 and 2 years after banks change their status. This quasi-natural experiment reaffirms our main finding that multi-BHC affiliates are more efficient than single-BHC affiliates.

Table 4 Dynamic treatment

5.4 Optimal structure of BHC and bank efficiency

The previous results suggest that multi-BHC affiliation has higher efficiency than single-BHC affiliation. This section tests whether and how the network size of multi-BHC can affect bank efficiency. In other words, is there an optimal network size that can maximize bank efficiency? More specifically, network size of BHC is measured by a total number of bank subsidiaries and non-bank subsidiaries. To assess the existence of this optimal point, we consider a quadratic model to test a potential U shaped form of bank network where \( MBHC\_network_{it} \) is measured as the natural logarithm of total number of BHC subsidiaries including bank and non-bank subsidiaries

$$\begin{aligned} Bank efficiency_{it} & = \beta_{0} + \beta_{1} *MBHC\_network_{it} + \beta_{2} \nonumber \\ & \quad *MBHC\_network_{it}^{2} + \mathop \sum \limits_{i = 1}^{n} \beta_{i} *Controls_{it} + \lambda_{t} + \varepsilon_{it}\end{aligned} $$
(12)

An optimal point is obtained by taking the derivative of the efficiency score with respect to MBHC_network and setting it to zero. The impact of MBHC_network on Bank efficiency has been depicted in the Table 5.

Table 5 The impact of bank holding company network on bank efficiency

The coefficient of MBHC_network is positive and statistically significant, while the coefficient of its square is negative and significant at the 1% level for fixed effect and truncated regressions. This finding confirms that there is an inverted U curve relationship and an optimal number of subsidiaries for multi-BHC to obtain the highest efficiency. As it can be seen from Table 5, the optimal number of subsidiaries for multi-BHC is 20.Footnote 5 This finding suggests that although multi-BHC affiliate is more efficient than single-BHC affiliates, there is an optimal number of subsidiaries that multi-BHC should consider to enhance their affiliates’ efficiency.

5.5 CEO power and bank efficiency

This section assesses the effect of CEO power on bank efficiency. It proxies for CEO power using CEO tenure and CEO duality. CEOs are expected to have more power to influence the decisions of the bank when they stay longer in the bank or chair the board (Pathan 2009). On the one hand, CEOs with more power better monitor the bank. On the other hand, according to Fama and Jensen (1983), the presence of powerful CEOs often signal the absence of internal control mechanisms that can adversely affect bank efficiency (Table 6).

Table 6 The impact of CEO power on bank efficiency

The results regarding the effect of CEO power on bank efficiency are portrayed in Table 6 and are consistent with our expectation. They show that CEO power (proxied by CEO duality and CEO tenure) has a negative impact on bank efficiency. The results are statistically significant at the 1% level and are consistent with Grove et al. (2011), De Jonghe et al. (2012), and De Haan and Vlahu (2016). For instance, De Haan and Vlahu (2016) suggest that banks that combine CEO and chairman positions underperform their peers in terms of performance and cost efficiency as powerful CEOs tend to take higher risks and reduce bank efficiency. The interaction between CEO tenure and MBHC affiliate is negative and statistically significant, implying that when CEOs stay longer at the helm of a multi-BHC affiliate, bank efficiency is significantly reduced. This is because CEOs in multi-BHC affiliates tend to take more risk in particular when they manage the bank for a long time. The same results are found with CEO duality as a proxy for CEO power.

Table 7 The effect of bank structure on bank efficiency measured by conventional DEA

This result has important implications for regulators, policymakers and bank managers since CEO power is detrimental to bank efficiency. Therefore, regulators and bank managers should consider the risk of giving CEOs more power.

6 Robustness test

6.1 Alternative measures of bank efficiency

This section applies a conventional DEA technique as an alternative measure for bank efficiency. Although conventional DEA ignores the bank operational procedure, this technique has been applied widely in the banking literature. Conventional DEA is easy to apply and especially effective to simultaneously combine inputs and outputs of different natures. We choose deposit ratio, interest expense ratio and non-interest expense ratio as inputs while loan ratio, interest income ratio and non-interest income ratio as outputs (Harris et al. 2013). We use input oriented rather than output oriented since it is difficult for banks to enhance their outputs given certain level of inputs.

Table 7 shows that our results are robust irrespective of the efficiency measure, which confirms that multi-BHC affiliates have higher efficiency than single-BHC affiliates. In addition, Bank size, Bank capital and Bank profit have a positive and significant impact on bank efficiency. Bank profit has a large economic impact on bank efficiency, implying that the bank that generates a high level of profit often operates more efficiently than others.

6.2 Impact of organizational complexity on bank efficiency

There is a possibility that organizational complexity might affect bank efficiency. According to Stein (2002), organizations with centralized structure are more complex and have tendency to rely on hard information, while organizations with decentralized structure are less complex and tend to rely more on soft information. In addition, there is an incentive for small organizations to produce soft information due to the centralization in decision making of the authority. Meanwhile, large organizations could acquire hard information due to broader scope for resource allocation. Berger et al. (2005) find that large banks mainly lend to larger firms with good account records while small banks tend to lend to more difficult credits. Therefore, complexity of bank structure would have an effect on bank efficiency.

We follow Assaf et al. (2019) to capture organizational complexity with ratio of total active subsidiaries over bank total asset times one thousand. Bank holding companies with more affiliations per value of assets could have more complex structures.

Table 8 shows that complexity of bank organization has a positive and significant impact on bank efficiency at a threshold level of 1%. Banks with more complex structure tend to achieve better efficiency due to internal capital market regarded as “source of strength”. For example, the affiliations of multi-bank holding companies could receive capital injection in case of funding shortfall. In addition, those banks can access federal funds and large CDs markets easier (Ashcraft 2006) and lend mainly to large firms with higher amount of loans (Berger et al. 2005). Therefore, bank efficiency could be enhanced due to lower cost of capital and higher amount of loans produced.

Table 8 The impact of organizational complexity on bank efficiency

6.3 Additional test

We next conduct a number of cross-sectional tests. These additional tests allow us to identify factors that strengthen or weaken the effect of BHC structure on bank efficiency. By adopting this approach we can assess the relevance of alternative explanations of the relationship between BHC affiliation and bank efficiency such as bank capital, bank soundness (bank profitability), asset quality and bank size. As it can be seen from Table 9, the results are robust regardless of bank size, asset quality, bank performance, and bank age, implying that multi-BHC affiliations have higher efficiency than SBHC affiliations irrespective of the bank characteristics. The results also do not depend on whether the analysis is conducted before or after the global financial crisis of 2007–2008.

Table 9 Additional test

7 Conclusion

This paper examines and compares bank efficiency between single-BHC affiliates and multi-BHC affiliates. It measures bank efficiency by applying a fuzzy multi-objective two-stage data envelopment analysis technique. Using a sample of US commercial banks data from 1994 to 2018, it shows that multi-BHC affiliates have higher efficiency than single-BHC affiliate. By applying difference-in-differences estimation technique based on propensity score matching approach, the empirical results suggest that banks enhance their efficiency when they change their status from single- to multi-BHC affiliate, which reinforces our conclusions. We use internal capital market theory to explain why multi-BHC affiliates have higher efficiency than single-BHC affiliates. When banks switch from single-BHC to multi-BHC, they can access to more funds in internal capital markets, expanding their operation, and attracting more deposits at a lower cost. As a result, multi-BHC affiliates exhibit higher efficiency levels than single-BHC affiliates.

One limitation of this study is the fact that the decision maker preferences over the potential adjustments of various inputs and outputs are not considered (Golany 1988). To the extent that the DMUs are efficient or inefficient, the assessment relies on the uncertainty over the choice of inputs and outputs (Stolp 1990). Therefore, Peykani et al. (2019) suggest customizing fuzzy DEA models according to properties of DMUs.