1 Introduction

DEA (Data Envelopment Analysis) is well known as the method to measure efficiency of DMUs (decision making units) with multiple inputs and outputs [1, 2]. This method calculates efficiency score of each DMU based on Pareto optimal line which is called efficiency frontier. Then DEA shows a plan for improvement to inefficient DMUs. The procedure of this method consists of following four steps: (1) data item selection which shows the character of objective DMU’s activity, (2) DMUs selection by which a relative comparison is carried out to measure the objective DMU’s performance, (3) DEA model selection and calculation, and (4) adjustment of DEA parameters if analyst does not obtain the desirable results. This study focuses on step (4).

2 Traditional Data Envelopment Analysis

2.1 Output-based CCR model

To incorporate the subjective information, assurance region (AR) method is developed by DEA researchers [3, 4].

However, as analyst’s experience or intuition is necessary in calculation and there are some infeasible cases, it is difficult to use. Therefore, this research expands CCR model which is the most basic DEA model [1] to a new DEA model which is embedded subjectivity information.

In explaining the CCR model, it is defined here as n DMUs (\({{\text{DMU}}_1},{{\text{DMU}}_2}, \ldots ,{{\text{DMU}}_k}, \ldots ,{{\text{DMU}}_n}\)), where each DMU is characterized by m inputs (\({x_{1k}},{x_{2k}}, \ldots ,{x_{ik}}, \ldots {x_{mk}}\)) and s outputs (\({y_{1k}},{y_{2k}}, \ldots ,{y_{rk}},{y_{sk}}\)). Output-based CCR model can be mathematically formulated by

$$\begin{gathered} \hbox{min} \;\sum\limits_{{i=1}}^{m} {v_{i}^{{(k)}}{x_{ik}}} =({\varphi _k}) \hfill \\ {\text{s.t.}}\; - \sum\limits_{{i=1}}^{m} {v_{i}^{{(k)}}{x_{ij}}} +\sum\limits_{{r=1}}^{s} {u_{r}^{{(k)}}{y_{rj}}} \le 0 \hfill \\ \quad \quad (j=1,2, \ldots ,n) \hfill \\ \quad \sum\limits_{{r=1}}^{s} {u_{r}^{{(k)}}{y_{rk}}} =1 \hfill \\ \quad \quad v_{i}^{{(k)}} \ge 0{\text{ }},u_{r}^{{(k)}} \ge 0. \hfill \\ \end{gathered}$$
(1)

Here \(u_{{{r^r}}}^{{(k)}}\) is multiplier weight given to the \(r{\text{th}}\) output, and \(v_{i}^{{(k)}}\) is multiplier weight given to the \(i{\text{th}}\) input. Then, first restriction condition (or constraint) represents that the productivity of all DMU becomes 100% or less. And the objective function represents the minimization of the virtual inputs of \({\text{DM}}{{\text{U}}_k}\), setting that the virtual outputs of \({\text{DM}}{{\text{U}}_k}\) is equal to 1 which is formulated in second restriction. Therefore, the optimal solution of \((v_{i}^{{(k)}},u_{r}^{{(k)}})\) represents the convenient weight for \({\text{DM}}{{\text{U}}_k}\). Especially, the optimal objective function value indicates the evaluation value for \({\text{DM}}{{\text{U}}_k}\). This evaluation value used the convenient weight called “efficiency score” in the manner that \({\varphi _k}=1\,(100\% )\) means the state of efficiency, while \({\varphi _k}<1\,(100\% )\) means the state of inefficiency.

The dual form of (1) becomes

$$\begin{gathered} \hbox{max} \;{\varphi _k} \hfill \\ s.t.\;{x_{ik}} \ge \sum\limits_{{j=1}}^{n} {{x_{ij}}\lambda _{j}^{{(k)}}} \quad (i=1,2, \ldots ,m) \hfill \\ \quad {\varphi _k}{y_{rk}} \le \sum\limits_{{j=1}}^{n} {{y_{rj}}\lambda _{j}^{{(k)}}} \quad (r=1,2, \ldots ,s) \hfill \\ \quad \quad {\varphi _k}:{\text{free,}}\;\lambda _{j}^{{(k)}} \ge 0. \hfill \\ \end{gathered}$$
(2)

Here, variable \(\lambda _{j}^{{(k)}}\) is considered to make a convex combination of the data. \({\varphi _k}\) is regarded as the ratio of maximized data and current data. Especially, \(\lambda _{j}^{{(k)}}\) denotes weights which indicate efficiency frontier and the position of kth DMU is calculated by combining the weight of \(\lambda _{j}^{{(k)}}\). A set of \(\lambda _{j}^{{(k)}}{\text{ }}(\lambda _{j}^{{(k)}}>0)\) are called “the reference set for kth DMU”.

The current position of a DMU is indicated by its own reference set. In general, any DMU has difference reference set from others. This difference causes the difficulty of ranking DMUs. This research illustrates this problem using simple situation. In Fig. 1, the efficiency scores of K, L, and M are assumed to be 0.6, 0.7, and 0.8, respectively. It does not always mean M is superior to others even if its efficiency score is highest. It is because the efficiency score of K is based on B and C, while that of L is based on A and B, and that of M is based on C and D. Then, the rank among three DMUs is not clear if a researcher uses only efficiency score based on original DEA.

Fig. 1
figure 1

Reference set

2.2 Assurance region (AR)

Although, there are some cases only specific input/output items such as extremely small input items or extremely large output items are assigned weights in (1) and weights of the rest input/output items are zero equally. As a result, it cannot be evaluated by (1) how much these items weighted as zero contribute to the efficiency. Although it is not a problem mathematically, it may be problems in practical application. To overcome these problems, assurance regions (AR) was developed as the method to adjust dataset [5, 6]. Reflecting subjective opinions of analysts to relations among input/output items, AR enables analysts to examine the influences of input/output items weighted as zero by DEA.

In particular, the following constraint was added to (1) [7].

$$\begin{gathered} \underline {{{\alpha _i}}} \le \frac{{v_{i}^{{(k)}}}}{{v_{1}^{{(k)}}}} \le \overline {{{\alpha _i}}} (i=2, \ldots ,m) \hfill \\ \underline {{{\beta _r}}} \le \frac{{u_{r}^{{(k)}}{y_{rk}}}}{{\sum\limits_{{r=1}}^{s} {u_{r}^{{(k)}}{y_{rk}}} }} \le \overline {{{\beta _r}}} (r=1, \ldots ,s), \hfill \\ \end{gathered}$$
(3)

where \({\alpha _i}\) and \({\beta _r}\) are parameters for reflecting subjective opinions of analysts. This approach is the most basic one and although there are other approaches, a lot of them are based on this approach. In AR, considering subjecting opinions by adding some importance of input/output items to their weights, it is attempted to resolve “zero weights”. Although, depending on the parameters \({\alpha _i}\) and \({\beta _r}\), the second constraint of (1) is not satisfied and this model is infeasible.

3 DEA cone ratio model based on a paired comparison

3.1 Overview

To support the framework of the re-calculation incorporating an analyst’s intention, this study proposed a new application of DEA to correct an efficient frontier, not operate a parameter directly like AR. The proposed method consists of the following steps: (1) measurement of the subjectivity value by a paired comparison, (2) correction of a specific efficient frontier based on the subjectivity value.

3.2 Measurement of the subjectivity value by a paired comparison

To take in an analyst’s subjective information to DEA framework, this study considers the weight presumption problem of each I/O item to a paired comparison procession such as AHP (analytic hierarchy process). Here, this study aims at the paired comparison procession about the reference set (RS) about each output item. The paired comparison procession A about s output items becomes

$$A=({a_{ij}})\left\{ { \in {R^{s \times s}}|{a_{ij}}>0,{a_{ji}}={1 \mathord{\left/ {\vphantom {1 {{a_{ij}}}}} \right. \kern-0pt} {{a_{ij}}}}} \right\}.$$
(4)

Then, this method presumes the weight using a following eigenvalue problem:

$$A{\mathbf{P}}={\lambda _{\hbox{max} }}{\mathbf{P}}.$$
(5)

Let the vector \(P={({p_1},{p_2}, \ldots ,{p_s})^{\text{T}}}\) be an importance vector of the analyst about an output item. Where, the vector \(P^{\prime}\) which took the reciprocal of the element of the vector P is set as follows:

Similarly, let the vector \(Q={({q_1},{q_2}, \ldots ,{q_n})^{\text{T}}}\) be an importance vector of analyst about n DMUs. Then, let us set the following vector:

$$Q^{\prime}={(1/{q_1},1/{q_2}, \ldots ,1/{q_n})^{\text{T}}}.$$
(6)

3.3 Correction of a specific efficient frontier

Setting an output data procession to \(Y=({y_{rj}}) \in {R^{s \times n}}\), this study considers the following corrected output data procession using the parameters P and Q.

$$Y^{P} = \left( {\begin{array}{*{20}l} {1/P_{1} } \hfill & 0 \hfill & \cdot \hfill & \cdot \hfill & \cdot \hfill & 0 \hfill \\ 0 \hfill & {1/P_{2} } \hfill & 0 \hfill & \cdot \hfill & \cdot \hfill & 0 \hfill \\ \cdot \hfill & 0 \hfill & \cdot \hfill & {} \hfill & {} \hfill & \cdot \hfill \\ \cdot \hfill & \cdot \hfill & {} \hfill & \cdot \hfill & {} \hfill & \cdot \hfill \\ \cdot \hfill & \cdot \hfill & {} \hfill & {} \hfill & \cdot \hfill & \cdot \hfill \\ 0 \hfill & 0 \hfill & \cdot \hfill & \cdot \hfill & \cdot \hfill & {1/P_{s} } \hfill \\ \end{array} } \right)Y ,$$
(7)
$$Y^{Q} = Y\left( {\begin{array}{*{20}l} {1/Q_{1} } \hfill & 0 \hfill & \cdot \hfill & \cdot \hfill & \cdot \hfill & 0 \hfill \\ 0 \hfill & {1/Q_{2} } \hfill & 0 \hfill & \cdot \hfill & \cdot \hfill & 0 \hfill \\ \cdot \hfill & 0 \hfill & \cdot \hfill & {} \hfill & {} \hfill & \cdot \hfill \\ \cdot \hfill & \cdot \hfill & {} \hfill & \cdot \hfill & {} \hfill & \cdot \hfill \\ \cdot \hfill & \cdot \hfill & {} \hfill & {} \hfill & \cdot \hfill & \cdot \hfill \\ 0 \hfill & 0 \hfill & \cdot \hfill & \cdot \hfill & \cdot \hfill & {1/Q_{n} } \hfill \\ \end{array} } \right).$$
(8)

Setting the element of the vector \({Y^P}\) to \(y_{{rj}}^{p} \in {R^{s \times n}}\), let us to correct second restriction of formula (2).

$${\varphi _k}{y_{rk}} \le \sum\limits_{{j=1}}^{n} {y_{{rj}}^{P}\lambda _{j}^{{(k)}}} \quad (r=1,2, \ldots ,s).$$
(9)

Formula (9) has incorporated the data corrected to the coefficient of \(\lambda _{j}^{{(k)}}\) which is a variable for forming the efficient frontier about an output. Formula (9) can be replaced by following formula (10) and (11) mathematically:

$$\begin{gathered} {\varphi _k}{y_{rk}} \le \sum\limits_{{j=1}}^{n} {y_{{rj}}^{P}\lambda _{j}^{{(k)}}} =\sum\limits_{{j=1}}^{n} {(1/{p_r}){y_{rj}}\lambda _{j}^{{(k)}}} , \hfill \\ \quad (r=1,2, \ldots ,s) \hfill \\ \end{gathered}$$
(10)
$$\begin{gathered} ({p_r}{\varphi _k}){y_{rk}} \le \sum\limits_{{j=1}}^{n} {{y_{rj}}\lambda _{j}^{{(k)}}} . \hfill \\ \quad (r=1,2, \ldots ,s) \hfill \\ \end{gathered}$$
(11)

Formula (11) corrects the magnifying power of each output. Using Fig. 2, this study describes about a graphical interpretation. Figure 2 assumes that there is one inefficient DMU (A) with two output element and an efficiency frontier which represents the best practice frontier. An evaluation value of DMU (A) is measured by an efficient frontier.

Fig. 2
figure 2

Search direction

Because traditional DEA is treating the output element as the same rank at this time, ideal activity (A′) which touches an efficiency frontier with search direction enlarged by the square is chosen.

On the other hand, because proposed method is used, corrected data by the importance vector (“y1 is more important than y2”) about each output, ideal activity (A″) is chosen. It has the search direction which inclines to the output item y1 to think as important. Thus, the proposed method does not add restrictions to a variable directly, and since it is formulized in the form where it corrects to the search direction, “No solution” does not come out of it.

3.4 Correction for importance vector about the DMUs

Setting the element of the vector \({Y^Q}\) to \(y_{{rj}}^{Q} \in {R^{s \times n}}\) lets us to correct the first restriction of formula (1).

$$\begin{gathered} - \sum\limits_{{i=1}}^{m} {v_{i}^{{(k)}}{x_{ij}}} +\sum\limits_{{r=1}}^{s} {w_{r}^{{(k)}}y_{{rj}}^{Q}} <0. \hfill \\ \quad (j=1,2, \ldots ,n) \hfill \\ \end{gathered}$$
(12)

Formula (12) can be replaced by following formulas (13), (14), and (15) mathematically:

$$\begin{gathered} \frac{{\sum\limits_{{r=1}}^{s} {w_{r}^{{(k)}}y_{{rj}}^{Q}} }}{{\sum\limits_{{i=1}}^{m} {v_{i}^{{(k)}}{x_{ij}}} }}<1, \hfill \\ (j=1,2, \ldots ,n) \hfill \\ \end{gathered}$$
(13)
$$\begin{gathered} \frac{{\sum\limits_{{r=1}}^{s} {w_{r}^{{(k)}}y_{{rj}}^{Q}} }}{{\sum\limits_{{i=1}}^{m} {v_{i}^{{(k)}}{x_{ij}}} }}=\frac{{\sum\limits_{{r=1}}^{s} {w_{r}^{{(k)}}(1/{q_j}){y_{rj}}} }}{{\sum\limits_{{i=1}}^{m} {v_{i}^{{(k)}}{x_{ij}}} }}<1, \hfill \\ \quad (j=1,2, \ldots ,n) \hfill \\ \end{gathered}$$
(14)
$$\begin{gathered} \frac{{\sum\limits_{{r=1}}^{s} {w_{r}^{{(k)}}{y_{rj}}} }}{{\sum\limits_{{i=1}}^{m} {v_{i}^{{(k)}}{x_{ij}}} }}<{q_j}. \hfill \\ (j=1,2, \ldots ,n) \hfill \\ \end{gathered}$$
(15)

Formulas (13) and (14) represent the condition that productivity (virtual-corrected output/virtual input) of all DMU is made 100% or less.

On the other hand, formula (15) represents the condition that restricts the upper limit of the productivity in each DMU by the important vector about the DMUs. For example, if the analyst assumes that DMU “o” is more important than DMU “p” (\({q_{\text{o}}}>{q_{\text{p}}}\)), then following formulation can be obtained:

$$\frac{{\sum\limits_{{r=1}}^{s} {w_{r}^{{(k)}}{y_{{\text{ro}}}}} }}{{\sum\limits_{{i=1}}^{m} {v_{i}^{{(k)}}{x_{i{\text{o}}}}} }}>\frac{{\sum\limits_{{r=1}}^{s} {w_{r}^{{(k)}}{y_{r{\text{p}}}}} }}{{\sum\limits_{{i=1}}^{m} {v_{i}^{{(k)}}{x_{i{\text{p}}}}} }}.$$
(16)

Formula (16) represents that analyst’s assumption is expressed correctly. Thus, proposed method based on the formula (8) can be interpreted with the model which adds restrictions to the order relation of each DMU.

4 Experimental result

4.1 Data set and DEA result

To confirm the effectiveness of the proposed method, experiments are conducted using the sample data in Table 1.

Table 1 Data set

Each DMU has two inputs and three outputs.

Table 2 is the calculation result applied to Eq. (1).

Table 2 Experimental result

For example, the evaluation value of DMU 1 is obtained as shown in formula (17).

$$\theta =\frac{{0 \times 10.47+0 \times 97.37+0.001 \times 799.7}}{{0.037 \times 27.37+0 \times 28.20}}=0.832,$$
(17)

v2 and u3 are almost 0. An extreme value appears, causing problems that v2 and u3 are not reflected. It is impossible to find characteristic elements by not being able to analyze some items. To solve this problem, we will conduct experiments using AR of the conventional method and the proposed method.

4.2 Experimental result of AR

Table 3 shows the results obtained based on the constraint equation of AR. The constraint equation is shown in formula (18).

Table 3 Experimental result of AR
$$\begin{gathered} v2>0.001 \hfill \\ u3>0.001 \hfill \\ \end{gathered}$$
(18)

With AR of the conventional method, there are some no solution cases. Therefore, the items that the analysts want to emphasize are not reflected. Also, the subjectivity of the analyst is necessary for setting.

4.3 Proposed method

Table 4 shows the results of fitted formula (19). v2 and u3 are multiplied by magnification, respectively. The efficiency value θ may exceed 1 in some cases.

Table 4 Correction of a specific efficient frontier

As a result, calculation becomes impossible, value has come out.

$$P={(1/{p_1},2/{p_2},1/{p_3},1/{p_4},0.5/{p_5})^{\text{T}}}.$$
(19)

5 Conclusion

We could expand the DEA model by extracting subjective information based on pair comparison. The proposed DEA model, developed by creating frameworks and modifying specific efficient frontiers, reflected analysts’ intentions without taking out unfeasible solutions.

Moreover, it was able to confirm the effectiveness of the proposed method by numerical experiment.