1 Introduction

Providing better hospital service quality is one of the major concerns of healthcare industry over the world (Chang 2014; Akdag et al. 2014; Taşkin et al. 2015). Delivery of healthcare services in Turkey is performed in a very competitive environment. For that reason, to make a better choice, the services provided by the public and private hospitals should be monitored and evaluated according to the viewpoint of stakeholders such as patients, medical staff, and hospital executives in terms of more satisfaction. In recent years, great effort has been spent to improve the health system in Turkey. Key elements of the reforms under the health transformation program implemented include the development of private healthcare institutions and the improvement in the quality of services provided by both public and private hospitals (Demirer and Bülbül 2014). The increase in the number of private healthcare facilities in the health sector, despite the huge role of public health sector, is one of the indicators of the efforts to improve the healthcare system. In 2015, the number of hospitals in Turkey reached 1533. In 2014, the number of hospitals in the Ministry of Health was 866, while in 2015 it reached to 865. The number of university hospitals, which was 69 in 2014, reached 70 in 2015. Also, the number of private hospitals, which was 562 in 2014, increased to 556 in 2015 (Ministry of Health 2016). In 2015, number of medical staff per 100.000 employees increased compared to the previous year. In 2014, the number of specialist physicians per 100,000 people was 97, but in 2015 it reached 99. The number of nurses and the number of midwives per 100,000 persons increased from 251 in 2014 to 261 in 2015, which is an increase compared to 2014 in all titles. The number of nurses and midwives per 100,000 persons (251 in 2014 to 261 in 2015) increased in all the titles compared to 2014 (Ministry of Health 2016). Regarding the improvement in public health sector, public and university hospital admissions have increased since 2014. When the admission rates are examined, admissions to public and university hospitals have increased in 2016 compared to 2014 (while public hospital admissions are changed from approximately 292 million to 340 million, university hospital admissions are 32 million to 36 million). Contrary to this figure, regarding positive service quality provided by public hospitals in recent periods, in 2014 admissions to private healthcare institutions decreased with a small change (72 to 71 million) (Ministry of Health 2017; Ministry of Health 2015). Despite the continuous increase in the number of patients admitted and the number of hospitals in Turkey, there is lack of enough attempts about service quality evaluation in terms of hospital types. On the other hand, millions of people continue to receive health services from public hospitals, although private hospitals are increasingly located in the Turkish healthcare system. For this reason, a reliable and accurate approach is needed in order to measure and improve both public and private hospital service quality.

Therefore, in this study, a novel approach is proposed based on the concept of Pythagorean fuzzy sets, AHP, and TOPSIS method to provide an accurate decision-making process for evaluating the hospital service quality. Multicriteria decision-making (MCDM) is an important field of management science, which clearly reflects multiple criteria in decision-making environments. It includes several kinds of methodologies for decision makers and practitioners. MCDM-based methods heavily include human participation and judgments (Kubler et al. 2016). It deals with evaluating, prioritizing, or selecting alternatives under conflicting criteria with respect to decision maker(s) preferences (Gul et al. 2016). The main mechanisms of a MCDM method are itemized as alternatives, criteria against evaluated alternatives, scores of alternatives on the criteria, and criteria weights reflecting relative importance of each criterion as compared with others (Gul et al. 2016). One of the most important MCDM methods widely used is AHP. In the healthcare service quality evaluation literature, researchers apply AHP mostly to determine scores/weights of service quality criteria in a hierarchical manner (Min et al. 1997; Wu et al. 2008; Afkham et al. 2012; Akdag et al. 2014; Aktas et al. 2015; Lupo 2016). To this end, in the literature, the fuzzy set theory is used to take vagueness and uncertainty of subjective evaluations into consider. AHP proposed by Thomas L. Saaty is based on the hierarchical MCDM problem consisting of a goal, criteria, and alternatives. In each hierarchical level, pairwise comparisons are made with judgments using numerical values taken from the Saaty’s scale of 1–9 (Saaty 1990). The process of AHP provides to weight criteria and defines a ranking of the alternatives. The decision-making process begins with a comparison of the alternatives with respect to the criteria. The evaluation continues up to the criteria of the first level and then these are compared to the goal. AHP has the advantages of hierarchical structure definition, demonstration of the problem in a structural manner and integration of all the judgments with structured links. After the hierarchy is structured, linguistic terms are employed by experts to make pairwise comparisons. These linguistic terms are converted to numerical values by using fuzzy sets which are able to handle uncertainty and vagueness of evaluation processes. Therefore, it is possible to say that AHP is quite useful for modeling problems in the absence of certain measures.

In this study, Pythagorean fuzzy AHP is used to make the developed hospital service quality evaluation model more effectively. Pythagorean fuzzy sets are an extension of intuitionistic fuzzy sets. They provide more autonomy to decision makers in articulating their ideas about the vagueness and uncertainty of the considered MCDM problem. Pythagorean fuzzy sets achieve this purpose because decision makers do not have to assign membership and non-membership degrees whose sum is at most 1 (Ilbahar et al. 2018). However, the sum of squares of these degrees must be at most 1. The weights of 32 hospital service quality evaluation criteria are obtained through Pythagorean fuzzy analytic hierarchy process (PFAHP). In the health service quality evaluation literature, PFTOPSIS is used by researchers in order to determine priorities of the healthcare institutions like hospitals (Akdag et al. 2014). The obtained values for above-mentioned 32 criteria will be used as inputs for PFTOPSIS, and it will provide a ranking order among hospital types as an output.

The rest of this paper is structured as follows: A literature review on the general view of healthcare service quality and MCDM-based hospital service quality evaluation is given in Sect. 2. Preliminaries on Pythagorean fuzzy sets, linguistic variables parameterized by Pythagorean fuzzy numbers, steps of PFAHP, and PFTOPSIS are examined in Sect. 3. Application of the PFAHP and PFTOPSIS-integrated model proposal and conclusions are presented in Sects. 4 and 5, respectively.

2 Review of literature

In the context of this study, a broad literature review has been done to show current attempts about healthcare service quality evaluation. The related literature is divided into two parts as follows: (1) studies applied a SERVQUAL-based healthcare service quality evaluation model, and (2) studies applied a single or hybrid MCDM-based healthcare service quality evaluation model. SERVQUAL is a common used method to measure service quality in the literature (Parasuraman et al. 1988). This method is a survey analysis based on evaluation of perceived and expected service quality by customers. With the results of the evaluation, the gap between service expectations (e.g., required quality as important–unimportant) and the performance level of the service (e.g., good–poor perceived quality) determines the quality of the service. In other words, evaluation of service quality in SERVQUAL analysis is based on calculating the difference between the scores which the customers respond to the pairs of expressions “expectationperception” (Demir et al. 2018). SERVQUAL is applied to many service industries such as healthcare (Altuntas et al. 2012), transportation (Kayapinar and Erginel 2017), education (Lupo 2013), banking (Ali and Raza 2017), and safety (Demir et al. 2018). Regarding healthcare service quality, SERVQUAL is mentioned by many researchers. Altuntas et al. (2012) measured hospital service quality by using analytic hierarchy process (AHP) and analytic network process (ANP) to acquire the relationship and the level of the importance among SERVQUAL dimension. They evaluated the perceived service quality with respect to different hospital classes in Turkey and compared AHP- and ANP-based weighted SERVQUAL scales with the unweighted SERVQUAL scale for public hospital service quality in terms of different hospital classes (A–B and C). Afkham et al. (2012) proposed a fuzzy-based method. SERVQUAL model was used to evaluate the respondents’ judgments of service quality. Fuzzy AHP was applied to obtain weights of SERVQUAL dimensions and TOPSIS for ranking the hospitals. Chakravarty (2011) evaluated service quality of hospital outpatient departments using SERVQUAL. Service quality gaps were identified across all the five dimensions of the survey instrument. Significant gaps across the dimensions of “tangibles” and “responsiveness” were found. The quality gaps were further validated by a total unweighted SERVQUAL score of (–) 1.63. That study is the first attempt for service quality evaluation of hospital outpatient departments. Similarly, Teng et al. (2007) focused on surgical hospitalization using SERVQUAL. Lee et al. (2000) proposed three different methods for measuring healthcare service quality. These methods were not addressed in the previous studies. The performance of the constant-sum rating method, the single-item global rating method, and multi-item rating method in measuring the healthcare service quality was compared. That study also included two additional dimensions to SERVQUAL as core medical service and professionalism/skill assessed by physicians. Demirer and Bülbül (2014) comparatively explored the suitability of SERVQUAL and the relationship between perceived service quality, patient satisfaction, and patient preference for the public and private hospitals in Turkey. Lupo (2016) analyzed public healthcare sector of the Sicily region (Italy) via a SERVQUAL-based framework. Also, FAHP was employed to obtain reliable estimations of service quality expectations. Handayani et al. (2015) used a different dimension to classical SERVQUAL called professionalism. Using entropy method, strategic hospital service quality criteria were weighted. Shieh et al. (2010) combined SERVQUAL and decision-making trial and evaluation laboratory (DEMATEL) methods in identifying key success factors of hospital service quality. Firstly, a survey based on SERVQUAL model was conducted to identify seven major criteria from patients’ or their families’ viewpoints at a hospital in Taiwan. Then, a second survey was developed for applying DEMATEL method to evaluate the importance of criteria and construct the causal relations among the criteria.

Unlike SERVQUAL-based studied, limited number of studies stand in the literature in conjunction with MCDM, fuzzy sets and healthcare service quality evaluation. Aktas et al. (2015) developed a service quality index in order to present a basis for classification of hospitals by using MCDM tools with respect to the criteria in the literature. In the scope of this study, service quality criteria were determined and weights of them were obtained by AHP from the viewpoint of service providers and patients. Then, a formulation was developed to obtain service quality index (SQI) for hospitals. Lin et al. (2013) integrated hierarchical balanced scorecard with fuzzy linguistic for evaluating operating room performance in hospitals. Initially, a model was developed for measuring the acceptable performance of operating room based on the interaction financial, customers, internal business process, and learning and growth perspective. Then, balanced scorecard integrated with fuzzy linguistic was proposed for measuring the service. Taşkin et al. (2015) and Chang (2014) benefited from VIKOR method in hospital service quality evaluation. Both studies applied the fuzzy MCDM approach to determine the importance weights of evaluation criteria and the VIKOR method to prioritize feasible alternatives. For each of them, an empirical case involving 33 evaluation criteria and two public and three private medical centers in Turkey and Taiwan assessed by 18 evaluators from various fields of medical industry was presented. Akdag et al. (2014) applied TOPSIS method separated from Taşkin et al. (2015) and Chang (2014) to evaluate the service quality of some Turkish hospitals. In this study, importance weights of criteria were found with AHP. Then, the TOPSIS and Yager’s min–max approach were applied to rank hospitals. As a follow-up study, an aggregation of performance criteria with ordered weighted averaging (OWA) and compensatory AND operators are looked at instead of the TOPSIS method and min–max approach, and the obtained results were compared.

After examining the relevant literature, it is believed and concluded that the current study contributes to the knowledge of hospital service quality evaluation by some innovations: (1) First time in hospital environments, a PFAHP-based hybrid MCDM approach was used to obtain importance weights to the service quality criteria by using interval-valued Pythagorean fuzzy linguistic scale and pairwise comparison manner. (2) In the current literature, there is still no attempt to evaluate service quality performance of public and private hospitals from the viewpoint of various expert groups. Therefore, this paper aims to fill the gap in this area using the proposed approach (PFAHP–PFTOPSIS). This approach offers the opportunity to make assessments considering experts’ subjective judgments that are closer to the human mind than other options in the hospital performance evaluation. (3) On conclusion of this study, it is expected to represent a basis for decisions and policies that must be taken by any hospital authorities as part of their health delivery processes.

3 Proposed methodology

3.1 Pythagorean fuzzy sets

Intuitionistic fuzzy sets first proposed by Atanassov (1986) and have been used by many researchers in different fields to address uncertainty. These sets can be expressed in terms of membership functions, non-membership function and hesitancy degree. However, in some cases, it fails to fulfill the condition when there are times the degree of membership and non-membership is bigger than 1. Obviously, intuitionistic fuzzy sets are unable to capture the situation. As a result, Yager (2014) developed Pythagorean fuzzy sets. These sets are the generalization to the intuitionistic fuzzy sets in some condition intuitionistic fuzzy sets cannot address the uncertainty. This achievement makes Pythagorean fuzzy sets more powerful and flexible to solve problems involving uncertainty (Mohd and Abdullah 2017; Ilbahar et al. 2018; Gul 2018; Gul and Ak 2018; Oz et al. 2018; Mete 2018; Gul et al. 2018).

In Pythagorean fuzzy sets, unlike the intuitionistic fuzzy sets, the sum of membership and non-membership degrees can exceed 1 but the sum of squares cannot (Ilbahar et al. 2018; Zeng et al. 2016; Zhang and Xu 2014). This situation is shown below in Definition (1).

Definition 1

Let a set X be a universe of discourse. A Pythagorean fuzzy set P is an object having the form (Zhang and Xu 2014):

$$ P = \{ < x, \, P(\mu_{P} (x),v_{P} (x)) > \left| {x \in X} \right.\} $$
(1)

where \( \mu_{P} (x):X \mapsto [0,1] \) defines the degree of membership and \( v_{P} (x):X \mapsto [0,1] \) defines the degree of non-membership of the element \( x \in X \) to P, respectively, and, for every \( x \in X \), it holds:

$$ 0 \le \mu_{P} (x)^{2} + v_{P} (x)^{2} \le 1 $$
(2)

For any PFS P and \( x \in X \), \( \pi_{P} (x) = \sqrt {1 - \mu^{2}_{P} (x) - v^{2}_{P} (x)} \) is called the degree of indeterminacy of x to P.

Definition 2

Let \( \beta_{1} = P(\mu_{{\beta_{1} }} ,v_{{\beta_{1} }} ) \) and \( \beta_{2} = P(\mu_{{\beta_{2} }} ,v_{{\beta_{2} }} ) \) be two Pythagorean fuzzy numbers, and λ > 0, then the operations on these two Pythagorean fuzzy numbers are defined as follows (Zeng et al. 2016; Zhang and Xu 2014):

$$ \beta_{1} \oplus \beta_{2} = P\left( {\sqrt {\mu_{{\beta_{1} }}^{2} + \mu_{{\beta_{2} }}^{2} - \mu_{{\beta_{1} }}^{2} \mu_{{\beta_{2} }}^{2} } , \, v_{{\beta_{1} }} v_{{\beta_{2} }} } \right) $$
(3)
$$ \beta_{1} \otimes \beta_{2} = P\left( {\mu_{{\beta_{1} }} \mu_{{\beta_{2} }} , \, \sqrt {v_{{\beta_{1} }}^{2} + v_{{\beta_{2} }}^{2} - v_{{\beta_{1} }}^{2} v_{{\beta_{2} }}^{2} } } \right) $$
(4)
$$ \lambda \beta_{1} = P\left( {\sqrt {1 - \left( {1 - \mu_{{\beta_{1} }}^{2} } \right)^{\lambda } } ,(v_{{\beta_{1} }} )^{\lambda } } \right),\lambda > 0 $$
(5)
$$ \beta_{1}^{\lambda } = P\left( {(\mu_{{\beta_{1} }} )^{\lambda } ,\sqrt {1 - \left( {1 - v_{{\beta_{1} }}^{2} } \right)^{\lambda } } } \right),\lambda > 0 $$
(6)

Definition 3

Let \( \beta_{1} = P\left( {\mu_{{\beta_{1} }} ,v_{{\beta_{1} }} } \right) \) and \( \beta_{2} = P\left( {\mu_{{\beta_{2} }} ,v_{{\beta_{2} }} } \right) \) be two Pythagorean fuzzy numbers, a nature quasi-ordering on the Pythagorean fuzzy numbers is defined as follows (Zhang and Xu 2014):

$$ \beta_{1} \ge \beta_{2} \;{\text{if}}\;{\text{and}}\;{\text{only}}\;{\text{if}}\; \, \mu_{{\beta_{1} }} \ge \mu_{{\beta_{2} }} \;{\text{and}}\;v_{{\beta_{1} }} \le v_{{\beta_{2} }} $$

A score function is proposed to compare two Pythagorean fuzzy numbers by (Zhang and Xu 2014) as follows:

$$ s(\beta_{1} ) = (\mu_{{\beta_{1} }} )^{2} - (v_{{\beta_{1} }} )^{2} $$
(7)

Definition 4

Based on the score functions proposed above, the following laws are defined to compare two Pythagorean fuzzy numbers (Zhang and Xu 2014):

$$ \begin{aligned} ({\text{i}})\;{\text{If}}\;s(\beta_{1} ) < s(\beta_{2} ),\;{\text{then}}\;\beta_{1} \prec \beta_{2} \hfill \\ ({\text{ii}})\;{\text{If}}\;s(\beta_{1} ) > s(\beta_{2} ),\;{\text{then}}\;\beta_{1} \succ \beta_{2} \hfill \\ ({\text{iii}})\;{\text{If}}\;s(\beta_{1} ) = s(\beta_{2} ),\;{\text{then}}\;\beta_{1} \sim \beta_{2} \hfill \\ \end{aligned} $$

3.2 PFAHP and related linguistic terms

In this sub-section, we will give the steps of PFAHP method.

  • Step 1 The compromised pairwise comparison matrix \( A = (a_{ik} )_{m \times m} \) is structured based on linguistic evaluation of experts using the scale proposed by (Ilbahar et al. 2018) in Table 1.

    Table 1 Weighting scale for PAHP (Ilbahar et al. 2018)
  • Step 2 The difference matrices \( D = (d_{ik} )_{m \times m} \) between lower and upper values of the membership and non-membership functions are calculated using Eqs. (8) and (9):

    $$ d_{{ik_{L} }} = \mu_{{ik_{L} }}^{2} - v_{{ik_{U} }}^{2} $$
    (8)
    $$ d_{{ik_{U} }} = \mu_{{ik_{U} }}^{2} - v_{{ik_{L} }}^{2} $$
    (9)
  • Step 3 Interval multiplicative matrix \( S = (s_{ik} )_{m \times m} \) is computed using Eqs. (10) and (11):

    $$ s_{{ik_{L} }} = \sqrt {1000^{{d_{L} }} } $$
    (10)
    $$ s_{{ik_{U} }} = \sqrt {1000^{{d_{U} }} } $$
    (11)
  • Step 4 The determinacy value \( \tau = (\tau_{ik} )_{m \times m} \) is calculated using Eq. (12):

    $$ \tau_{ik} = 1 - \left( {\mu_{{ik_{U} }}^{2} - \mu_{{ik_{L} }}^{2} } \right) - \left( {v_{{ik_{U} }}^{2} - v_{{ik_{L} }}^{2} } \right) $$
    (12)
  • Step 5 The determinacy degrees are multiplied with \( S = (s_{ik} )_{m \times m} \) matrix for obtaining the matrix of weights, \( T = (t_{ik} )_{m \times m} \) before normalization using Eq. (13).

    $$ t_{ik} = \left( {\frac{{s_{{ik_{L} }} + s_{{ik_{U} }} }}{2}} \right)\tau_{ik} $$
    (13)
  • Step 6 The normalized priority weights \( w_{i} \) is computed by using Eq. (14).

    $$ w_{i} = \frac{{\sum\nolimits_{k = 1}^{m} {t_{ik} } }}{{\sum\nolimits_{i = 1}^{m} {\sum\nolimits_{k = 1}^{m} {t_{ik} } } }} $$
    (14)

4 PFTOPSIS

The TOPSIS was developed by Hwang and Yoon (1981) to find out the best alternative based on the compromise solution concept. The compromise solution concept can be regarded as selecting the solution with the shortest distance from the ideal solution and the farthest distance from the negative ideal solution. Since the ratings while evaluating alternatives against criteria usually refer to the subjective uncertainty, TOPSIS is extended to consider the situation of fuzzy numbers (Chen 2000; Tzeng and Huang 2011; Celik et al. 2012). The steps of FTOPSIS can be reached in (Tzeng and Huang 2011; Kutlu and Ekmekçioğlu 2012; Gul and Guneri 2018; Carpitella et al. 2018).

Pythagorean fuzzy sets are an extension of usual and intuitionistic fuzzy sets. It provides more freedom to experts in expressing their opinions about the vagueness and uncertainty of the considered problem. In Pythagorean fuzzy sets experts assign membership and non-membership degrees. On the other hand, TOPSIS method is applied to a variety of problems in the literature. It has many advantages as follows: It allows the experts to assign judgments by means of linguistic terms, which are better interpreted by humans, fuzzy in nature, and then transferred into Pythagorean fuzzy numbers. It has more capability in handling uncertainties, simultaneous consideration of the positive and the negative ideal points, simple computation, and logical concept.

Based on definitions given above, in the following, PFTOPSIS algorithm is presented with its routine steps (Oz et al. 2018).

  • Step 1 Initially, decision matrix under Pythagorean fuzzy sets \( R = (C_{j} (x_{i} ))_{m \times n} \) is constructed. Here, \( C_{j} (j = 1,2, \ldots ,n) \) and \( x_{i} (i = 1,2, \ldots ,m) \) refer to values of criteria and alternatives. The matrix form is as follows:

    $$ R = (C_{j} (x_{i} ))_{mxn} = \left( {\begin{array}{*{20}l} {P(u_{11} ,v_{11} )} \hfill & {P(u_{12} ,v_{12} )} \hfill & \ldots \hfill & {P(u_{1n} ,v_{1n} )} \hfill \\ {P(u_{21} ,v_{21} )} \hfill & {P(u_{22} ,v_{22} )} \hfill & \ldots \hfill & {P(u_{2n} ,v_{2n} )} \hfill \\ \vdots \hfill & \vdots \hfill & \vdots \hfill & \vdots \hfill \\ {P(u_{m1} ,v_{m1} )} \hfill & {P(u_{m2} ,v_{m2} )} \hfill & \ldots \hfill & {P(u_{mn} ,v_{mn} )} \hfill \\ \end{array} } \right) $$
  • Step 2 Secondly, Pythagorean fuzzy positive ideal solution (PIS) and negative ideal solutions (NIS) are determined using Eqs. (15, 16) as follows:

    $$ x^{ + } = \left\{ {C_{j} ,\mathop {\hbox{max} }\limits_{i} \left\langle {s(C_{j} (x_{i} ))} \right\rangle \left| {j = 1,2, \ldots ,n} \right.} \right\} = \left\{ {\left\langle {C_{1} ,P(u_{1}^{ + } ,v_{1}^{ + } )} \right\rangle ,\left\langle {C_{2} ,P(u_{2}^{ + } ,v_{2}^{ + } )} \right\rangle , \ldots ,\left\langle {C_{n} ,P(u_{n}^{ + } ,v_{n}^{ + } )} \right\rangle } \right\} $$
    (15)
    $$ x^{ - } = \left\{ {C_{j} ,\mathop {\hbox{min} }\limits_{i} \left\langle {s(C_{j} (x_{i} ))} \right\rangle \left| {j = 1,2, \ldots ,n} \right.} \right\} = \left\{ {\left\langle {C_{1} ,P(u_{1}^{ - } ,v_{1}^{ - } )} \right\rangle ,\left\langle {C_{2} ,P(u_{2}^{ - } ,v_{2}^{ - } )} \right\rangle , \ldots ,\left\langle {C_{n} ,P(u_{n}^{ - } ,v_{n}^{ - } )} \right\rangle } \right\} $$
    (16)
  • Step 3 Thirdly, distances from Pythagorean fuzzy PIS and NIS are determined using Eqs. (17, 18) as follows:

    $$ D\left( {x_{i} ,x^{ + } } \right) = \sum\limits_{j = 1}^{n} {w_{j} d} (C_{j} (x_{i} ),C_{j} (x^{ + } )) = \frac{1}{2}\sum\limits_{j = 1}^{n} {w_{j} \left( {\left| {(\mu_{ij} )^{2} - (\mu_{j}^{ + } )^{2} } \right| + \left| {(v_{ij} )^{2} - (v_{j}^{ + } )^{2} } \right| + \left| {(\pi_{ij} )^{2} - (\pi_{j}^{ + } )^{2} } \right|} \right)} $$
    (17)
    $$ D\left( {x_{i} ,x^{ - } } \right) = \sum\limits_{j = 1}^{n} {w_{j} d} (C_{j} (x_{i} ),C_{j} (x^{ - } )) = \frac{1}{2}\sum\limits_{j = 1}^{n} {w_{j} \left( {\left| {(\mu_{ij} )^{2} - (\mu_{j}^{ - } )^{2} } \right| + \left| {(v_{ij} )^{2} - (v_{j}^{ - } )^{2} } \right| + \left| {(\pi_{ij} )^{2} - (\pi_{j}^{ - } )^{2} } \right|} \right)} $$
    (18)

    for Eqs. (17, 18) i = 1,2,…,n. In general, the smaller \( D(x_{i} ,x^{ + } ) \) the better the alternative \( x_{i} \) and the bigger \( D(x_{i} ,x^{ - } ) \) the better the alternative \( x_{i} \) and let \( D_{\hbox{min} } (x_{i} ,x^{ + } ) = \mathop {\hbox{min} }\limits_{1 \le i \le m} D(x_{i} ,x^{ + } ) \) and \( D_{\hbox{max} } (x_{i} ,x^{ - } ) = \mathop {\hbox{max} }\limits_{1 \le i \le m} D(x_{i} ,x^{ - } ) \).

  • Step 4 Fourthly, the revised closeness \( \xi (x_{i} ) \, \) of the alternative \( x_{i} \) is computed using Eq. (19) as follows:

    $$ \xi (x_{i} ) = \frac{{D(x_{i} ,x^{ - } )}}{{D_{\hbox{max} } (x_{i} ,x^{ - } )}} - \frac{{D(x_{i} ,x^{ + } )}}{{D_{\hbox{min} } (x_{i} ,x^{ + } )}} $$
    (19)
  • Step 5 Finally, the best ranking order of the alternatives is determined. The alternative with the highest revised coefficient value is the best alternative.

4.1 PFAHP–PFTOPSIS-integrated model proposal for evaluating hospital service quality

The PFAHP–PFTOPSIS-integrated model proposal consists of several steps as shown in Fig. 1. First, the hospital service quality criteria and their sub-criteria are structured hierarchically according to the literature and hospital environments. After constructing the evaluation criteria hierarchy, the criteria weights are computed by applying interval-valued PFAHP method. Then, PFTOPSIS method is conducted to achieve the final ranking results. Finally, a broad discussion is presented in terms of showing how each hospital perform in terms of the handled criteria both independently of each other and all together using circumference of centroids method and area of expertise-based evaluation procedure. The detailed descriptions of each step are explained in each of the following sub-section.

Fig. 1
figure 1

Framework of the PFAHP–PFTOPSIS-integrated model proposal

5 Case study: healthcare service quality evaluation in public and private hospitals

5.1 Hierarchical structure of hospital service quality evaluation criteria

The hierarchical structure of the hospital service quality evaluation criteria is described based on a detailed overview of the relevant literature (Chang 2014; Akdag et al. 2014; Taşkin et al. 2015). In addition to the related literature, the main and sub-criteria of hospital service quality is finalized by the aid of consultation involving healthcare managers, experts and academicians. In the lights of all these, the structure is constructed under six main criteria and 32 sub-criteria as shown in Fig. 2.

Fig. 2
figure 2

Hierarchical structure of hospital service quality evaluation criteria

5.2 Linguistic scales and their corresponding fuzzy numbers

In this study, we benefit two linguistic scales and their corresponding fuzzy numbers. First, in evaluating service quality main and sub-criteria using pairwise comparison of PFAHP, we use the scale proposed by Ilbahar et al. (2018). That scale is based on interval-valued Pythagorean fuzzy numbers (Table 1). Second, in evaluating hospitals with respect to service quality criteria using PFTOPSIS, we apply the scale of Pérez-Domínguez et al. (2018). In Pérez-Domínguez et al. (2018), a nine-point Pythagorean fuzzy linguistic scale is defined as a set of linguistic variables which can be represented as Pythagorean fuzzy numbers. The prioritization of hospitals is performed by using five members’ Pythagorean fuzzy linguistic scale (Table 2).

Table 2 Pythagorean fuzzy linguistic scale used in PFTOPSIS

5.3 Data collection process

This study is carried out in three hospitals located in Black Sea Region of Turkey, in 2018. While two of them are public hospitals that serve as education and research hospital in Turkey, one is a private hospital. The chosen alternatives are evaluated with respect to 32 evaluation criteria. These two public hospitals and one private hospital are selected due to their great reputation in the studied region. Two questionnaires have been circulated among the experts in order to (1) determine importance levels of hospital service quality criteria by using pairwise comparison of PFAHP method and (2) prioritize hospitals with respect to these criteria by using PFTOPSIS (Appendix). The first questionnaire has contained a total number of 6 main and 32 sub-criteria. Within the data collection process from the expert group, we contact a total of 38 evaluators (36 for the first questionnaire and 2 for the second questionnaire). From the conducted questionnaires, 32 of them (30 for the first questionnaire and 2 for the second questionnaire) are found valid and suitable for use in the study. The evaluation process takes about 2 weeks (from the end of January and the beginning of February, 2018). The expert group includes experts from medical, administrative and auxiliary services such as governmental medical department, hospital administration, quality deployment department, medical staff from emergency department, intensive care unit, audiometry, and other inpatient units (doctor, nurse, technician, etc.) and ward service provider. Also, a number of patients are included to the first questionnaire as they are main stakeholders of hospitals. This additional questionnaire execution process takes about 10 days (last 10 days of October, 2018). These 32 evaluators are employers who are well experienced in the healthcare delivery and workflow in the hospitals. The information about the expert team and corresponding working experience is set out in Table 3. Due to anonymity reasons, experts’ identity has not been revealed herein, and therefore, they have been demonstrated as E1, E2, … and E32.

Table 3 Expert team information

5.4 Weighting calculation of main evaluation criteria and sub-criteria using PFAHP

Six main evaluation criteria including 32 sub-criteria are considered in this study to evaluate hospital service quality. Weighs for these 32 sub-criteria are obtained via PFAHP computations of 30 evaluators. The procedure explained in Sect. 3.2 shows the computational processes to derive the importance weights of evaluation criteria by using the proposed approach.

Thirty evaluators are asked to express their pairwise comparisons about the importance weight of each evaluation criterion by using the linguistic variables defined in Table 1. In this stage, the linguistic variables are transferred into corresponding interval-valued Pythagorean fuzzy numbers. Since the ratings of these evaluators are different, it is required to aggregate their subjective judgments toward a compromised pairwise comparison matrix A as indicated in Step 1 of Sect. 3.2. The aggregated compromised pairwise comparison matrix for the main criteria is given in Table 4. The difference matrix D and Interval multiplicative matrix S are also given in Tables 5 and 6, respectively.

Table 4 Aggregated compromised pairwise comparison evaluation of experts in matrix form
Table 5 The difference matrix
Table 6 The interval multiplicative matrix

The determinacy value matrix as stated in Eq. (11) and matrix of weights before normalization as in Eq. (12) are given in Tables 7 and 8, respectively.

Table 7 Determinacy value matrix (T)
Table 8 Matrix of weights before normalization (t)

Finally, the normalized priority weights of main criteria are computed using Eq. (13) as shown in Fig. 3. Due to space limitations, all of the computations related to the sub-criteria weighting using PFAHP are not provided inside the paper.

Fig. 3
figure 3

Priority weights of main criteria

Similarly, the local and global importance weights of sub evaluation criteria are listed in Table 9 following the PFAHP procedure as in weighting six main criteria.

Table 9 Importance weights of hospital service quality evaluation criteria

Table 9 provides the importance weight and rank of each evaluation criterion assessed by 30 experts. The results show that the five most important criteria for evaluating hospital service quality are: medical staff with professional abilities (C41), medical equipment level of the hospital (C11), trusted medical staff with professional competence of healthcare (C42), service personnel with immediate problem-solving abilities (C43), and detailed description of the patient’s illness by the doctor (C35). On the contrary, the five least important criteria are as follows: patient meals services (C53), being user friendly of hospital Web sites (C56), cordial attitude of service staff (C24), patient admission procedures (C55), and concern shown by medical staffs toward illness (C27).

5.5 Prioritization of hospitals using PFTOPSIS

Two evaluators who are experienced in healthcare management and know well of each of three hospitals use the linguistic variables defined in Table 2 to assess the service quality performance of the hospitals with respect to evaluation criteria. Since their evaluation ratings are different, their opinions are aggregated as Pythagorean fuzzy performance ratings of these five hospitals. The results are given in Table 10.

Table 10 Aggregated Pythagorean fuzzy performance ratings of three hospitals

The linguistic variables are transformed into Pythagorean fuzzy numbers. This is the first stage of the FTOPSIS analysis (determination of Pythagorean fuzzy decision matrix). The weights of criteria which are computed in PFAHP stage are then added into the calculation in PFTOPSIS analysis. Using the formulas in steps 2–4 of PFTOPSIS algorithm, Pythagorean fuzzy PIS & NIS, distances from Pythagorean fuzzy PIS & NIS, and revised closeness values are calculated. The resulting closeness coefficients values of hospitals are reported in Table 11. According to the PFTOPSIS method, the hospital closest to 1 is the hospital which is the closest to the positive ideal solution, and the least close to the negative ideal solution. Thus, the hospital which has the biggest ξ(Xi) value has performed best in terms of service quality. According to Table 11, Hospital 1 has shown the best performance.

Table 11 PFTOPSIS ξ(Xi) values

5.6 Discussion

In this section, a discussion on evaluation of hospital alternatives with respect to 32 service quality criteria is presented. To make a critique for individual performance of 3 hospitals, nonfuzzy performance (BNP) service quality performance rating values of three hospitals are provided in Table 12 using Circumference of Centroids method. In transforming linguistic terms to trapezoidal fuzzy numbers scale for assessing hospitals with respect to service quality criteria, the five-point scale provided by Samantra et al. (2017) is used.

Table 12 BNP service quality performance rating values of three hospitals

A generalized trapezoidal fuzzy number is defined as \( \tilde{A} = \left( {a,b,c,d;w} \right) \). The Circumcenter \( S_{{\tilde{A}}} (\bar{x}_{0} ,\bar{y}_{0} ) \) of the generalized trapezoidal fuzzy number is computed as in Eq. (20):

$$ S_{{\tilde{A}}} (\bar{x}_{0} ,\bar{y}_{0} ) = \left( {\frac{a + 2b + 2c + d}{6},\frac{{(2a + b - 3c)(2d + c - 3b) + 5w^{2} }}{12w}} \right) $$
(20)

The ranking function of the trapezoidal fuzzy number \( \tilde{A} = \left( {a,b,c,d;w} \right) \) which maps the set of all fuzzy numbers to a set of real numbers is defined as: \( R(\tilde{A}) = \sqrt {\bar{x}_{0}^{2} + \bar{y}_{0}^{2} } \) where \( R(\tilde{A}) \) is the Euclidean distance from the circumcenter of the centroids and the original point.

From Table 12, it is stated that Hospital 1 has better service quality performances in: cordial attitude of service staff, consideration of individual needs of patients by medical staff, and taking advice of medical staff to patients staying at home. On the other hand, the sufficiency of outpatient treatment areas and concern taken by medical staff in terms of illness are the two worst service quality performance aspects.

Four best service aspects in Hospital 2 are: tidy appearance of service staff, the attitude of nurses and medical staff to patients, being user friendly of hospital Web sites, and number & quality of available bathrooms in the hospital. However, it is not well in medical personnel with immediate problem-solving abilities, medical equipment level of the hospital, confidence to provided medical services, being user friendly of hospital appointment system, and patient flow procedures of emergency department.

Hospital 3 has several service quality aspects that have same rating values such as availability of marked signs in the hospital, service staff with good communication skills, cordial attitude of service staff, consideration of individual needs of patients by medical staff, taking advice of medical staffs to patients staying at home, trusted medical staff with professional competence of healthcare, lighting condition of the ward, and number & quality of available bathrooms in the hospital. This hospital, however, still has two worst aspects: the quality and cleanliness of the bed linen and ventilation and sanitation of the ward.

When evaluated physical capacities of the hospitals, it has clearly stated that H1 has more space and medical resources than others. H1 Hospital is a public hospital and has 500 beds and 200 doctors. The second public hospital evaluated in this study is Hospital 3, and it serves with 300 beds and 83 doctors. The only private hospital “Hospital 2” serves with 105 beds and 42 doctors. As an overall evaluation, H1 hospital is a better position in providing service quality form the viewpoints of the health experts. The experts who evaluated these institutions with respect to the criteria are more experienced staff in their hospital.

This study also proved that service quality of public hospitals is better than that of private hospitals. Although private hospitals are rarely subsidized from governmental departments, and they must provide better services to retain patients’ loyalty, innovations, and developments that the government has made in health in recent years have strengthened the serviceability of public hospitals. Unlike private hospitals, the operations budgets of public hospitals are mostly guaranteed by the governmental subsidization and taxpayers’ tax. They have the financial and managerial power to provide health care at a good level.

As a creative contribution to this application case study, the weights of main and sub-criteria and ranking orders for different evaluation groups are analyzed. In this case study, evaluators have four main area of expertise as follows: (1) medical, (2) administrative, (3) auxiliary services, and (4) patient. For this aim, weights of main and sub-criteria are determined according to these four evaluator groups. Importance weights of hospital service quality criteria across four evaluation groups are given in Table 13.

Table 13 Weights of hospital service quality criteria across four evaluation groups

On conclusion of analyzing the weight values and ranking orders of evaluation criteria for three evaluation groups (see Table 13), the medical staff group regards medical equipment level of the hospital (C11), medical staff with professional abilities (C41), and trusted medical staff with professional competence of healthcare (C42) as being more important than another evaluation criterion. Administrative staff group is more concerned about trusted medical staff with professional competence of healthcare (C42), medical staff with professional abilities (C41), and service personnel with immediate problem-solving abilities (C43). The auxiliary services staff group concludes three most important criteria which are: medical equipment level of the hospital (C11), service personnel with immediate problem-solving abilities (C43), and trusted medical staff with professional competence of healthcare (C42). Since the patients are among the stakeholders of hospitals, their viewpoint on importance weights of service quality criteria is very crucial. This group mostly concerns with medical equipment level of the hospital (C11), medical staff with professional abilities (C41), and detailed description of the patient’s illness by the doctor (C35).

When the ranking orders of hospitals in terms of service quality performance are considered, it is concluded that Hospital 1 with the maximum of ξ(Xi) has the best hospital service quality among these three hospitals from each of four evaluation group viewpoints. According to the viewpoints of medical, administrative staff and patients, while Hospital 2 stands in second place, and Hospital 3 is in third. Overall ranking orders of hospitals as shown in Table 11 give the same results with ranking orders from the viewpoint of three evaluator groups except auxiliary services staff group as shown in Table 14.

Table 14 PFTOPSIS scores and ranking orders in terms of four evaluation groups

5.7 Comparative analysis

The integrated model is compared with the results of PFAHP–FTOPSIS (Gul and Ak 2018), PFAHP–FVIKOR (Gul 2018), PFAHP–PFVIKOR, and PFAHP–PFMOORA (Mete 2018) as given in Table 15.

Table 15 Ranking results of the compared approaches

The first comparison analysis is conducted between the integrated model and PFAHP–FTOPSIS model (Gul and Ak 2018). In evaluation of hospitals with respect to the criteria, usual trapezoidal fuzzy numbers-based TOPSIS is used unlike the current study. The comparison shows that, the ranking orders of hospitals are partially different from the integrated model. The ranking orders of Hospital 2 and Hospital 3 are different. It has the same result with PFAHP–PFTOPSIS model under auxiliary services staff’s point of view.

The second comparison is performed between the current model and PFAHP–FVIKOR (Gul 2018). It has resulted in the same output (ranking orders of hospitals) with PFAHP–FTOPSIS model (Gul and Ak 2018). The third and fourth comparisons are regarding Pythagorean fuzzy sets-based approaches like our current approach. Two approaches yield same ranking orders in hospital service quality evaluation with our current approach. In addition, a Spearman correlation analysis is applied to measure the ratio correlation between the ranking orders of compared approaches. The outputs of correlation analysis are demonstrated in Table 16. According to this correlation analysis, there is a significant and positive correlation between the current model and others (50%, 50%, 100%, and 100%, respectively). Moreover, we have applied a Pearson correlation analysis between approaches using final scores of each approach. Results of this second correlation analysis are provided in Table 17.

Table 16 Spearman correlation coefficient results of the compared approaches
Table 17 Pearson correlation coefficient results of the compared approaches

According to results in Table 17, the relationships between ranking results are very strong. In VIKOR-based approaches, a higher index value shows a lower ranking order. Hence, the correlation coefficient between PFAHP–FVIKOR and PFAHP–PFVIKOR approaches and the remaining approaches is a negative, high value as shown in Table 17. The correlation coefficient between the proposed approach and PFAHP–PFMOORA approach is positive and the highest of all approaches (0.988). The lowest correlation coefficient values are obtained from PFAHP–FVIKOR with PFAHP–FTOPSIS (− 0.766 and 0.900). In sum, the proposed approach can produce reasonable results for evaluating hospital service quality using the advantage of Pythagorean fuzzy sets that reflects uncertainty in more powerful level than usual fuzzy sets. Considering the ranking results of the two compared Pythagorean fuzzy set-based approaches, it is proved that the proposed approach can provide suitable information to assist healthcare issues in the hospital services.

6 Conclusion

Measuring hospital service quality is one of the major concerns of healthcare industry over world. Since healthcare delivery in Turkey are provided in a very competitive environment, for making a better choice, the services delivered by the public and private hospitals should be evaluated according to the viewpoint of stakeholders such as medical staff, hospital executives, and auxiliary services personal in terms of satisfaction. In this study, a successful application of a hospital service quality evaluation approach including PFAHP and PFTOPSIS is presented. The case study is performed under fuzzy environment to reduce uncertainty and vagueness, and linguistic variables parameterized by Pythagorean fuzzy numbers are used. The proposed PFAHP–PFTOPSIS approach is separated from others with the integration of Pythagorean fuzzy sets, AHP and TOPSIS methods, in a way providing a systematic decision-making process. Through the case study, six main hospital service quality evaluation criteria and 32 sub-criteria are used to assess two public and one private hospitals in Turkey by two questionnaires filled by four expert groups called (1) medical staff, (2) administrative staff, (3) auxiliary services staff, and (4) patients. Results show that the five most important criteria for evaluating hospital service quality are: medical staff with professional abilities, medical equipment level of the hospital, trusted medical staff with professional competence of healthcare, service personnel with immediate problem-solving abilities, and cleanliness of the toilets. It is also proved that service quality of public hospitals discussed is better than that of private hospitals. An additional analysis is made in determination of the weights of main and sub-criteria and ranking orders for different evaluation groups. According to the each of four evaluation groups, medical equipment level of the hospital, medical staff with professional abilities, and trusted medical staff with professional competence of healthcare are regarded as being the most important three evaluation criteria among others. Overall ranking orders of hospitals are found similar.

The contributions and strengths of the paper are threefold: (1) It presents a new integrated model proposal based on PFAHP and PFTOPSIS; (2) it offers an additional analysis in determination of the weights of main and sub-criteria and ranking orders across four evaluation groups; (3) by the aid of additional analysis results of the integrated model proposal, it can help any hospital apart from the observed hospitals in this case study to identify its own strengths and weaknesses in their services, and provide better information to allow executives to improve their own areas of weakness to further improve service quality. Four approaches are benchmarked with the proposed approach in terms of correlation in both ranking orders and final scores.

As a limitation of the study, relevant to the second stage of the model proposal, it is important to keep in mind that the other preference-based MCDM methods (ELECTRE, PROMETHEE, TODIM, etc.) and/or their extension through the usage of other fuzzy set types such as hesitant fuzzy sets and neutrosophic sets can be alternatively used for the future extensions of this study.