1 Introduction

Capability indices are widely used in practice to evaluate the performance of a process and the information about the process is used to improve the capability [20,21,22]. As capability indices can only be estimated in most of the cases, confidence intervals can be used to predict a range in which the capability index lies with high probability. Therefore, industry and science benefit from trustable confidence intervals for the capability indices. In recent years, process capability indices (PCIs) have drawn much attention in industries. Process capability (PC) analysis is a method of combining the statistical tools to find out how well a given process meets a set of specification limits [25]. The purpose of the process capability analysis is to find the effect of time on both the average and the spread of the process. Before evaluating the process capability, it has to be shown that the process is under the statistical process control [33]. Although there are several process capability indices such as, Cp, Cpk, Cpm and Cpmk, the most commonly applied process capability index is Cp [21, 36]. In this paper, we focus only on the widely used process capability index, Cp, defined by Juran [20] and Kane [21], as follows:

$${\text{C}}_{\text{p}} = \frac{{{\text{USL}} - {\text{LSL}}}}{6\sigma }$$
(1)

where LSL and USL are the lower and upper specification limits, respectively and σ is the process standard deviation. The numerator gives the size of the range over which the process measurements is allowed to vary. The denominator gives the size of the range over which the process actually varies [23]. It follows that evaluation of the process capability can be based on the following three criteria: (i) variability in process; (ii) degree of departure of the process mean from the target value, and (iii) location of the process mean in the interval (LSL, USL). The process capability index, Cp, takes into account criterion (i) only, as it depends on spread for given specification limits. The larger a capability index value for a process is, the more capable the process. Most experts recommended different values for existing and new processes. The quality conditions and the corresponding Cp values are given by Chao and Lin [11] and are reported in Table 1.

Table 1 Quality conditions and Cp values for centered process

The process capability index Cp greatly depends on the assumption that the underlying quality characteristic measurements are independent and normally distributed. However, the assumption of normal distribution is not always valid and thus the non-normal distribution process is also being practiced in an industrial environment. Therefore, the classic PCI may not always be available [24, 27, 30]. When the population standard deviation (σ) in Eq. (1) is unknown, it should be estimated from the sample values. Thus, the point estimate of Cp is given as follows:

$${\hat{\text{C}}}_{\text{p}} = \frac{{{\text{USL}} - {\text{LSL}}}}{{6{\text{S}}}}$$
(2)

where S is the sample standard deviation. When normal distribution assumption is not assured or when there are extreme values in the data, robust estimators may be used [22]. An estimator, is said to be robust if it is fully efficient or nearly so for an assumed distribution but maintains high efficiency for plausible alternatives, for example normal distribution [3, 32]. Most known for robust measures are the pseudo-standard deviation (Sps), the average absolute deviation from the sample median (AADM), the median absolute deviation from the sample median (MAD), the Gini’s mean difference (GMD) and the Rousseeuw and Croux [29] estimators, Sn and Qn. There are studies on the estimation of σ that show better performance of robust estimators in the literature [1, 3, 4, 26]. For estimating the population mean with a confidence interval, the coverage probability (CP) is closer to the nominal confidence level when the data are normally distributed but far from nominal level when data are from the skewed distribution [5,6,7, 37].

Since the normality assumption about the data is not guaranteed or may not be feasible with some real life data, in fact, it would be interesting to construct some confidence intervals (CIs) for Cp based on various robust scale estimators.

The rest of this paper is organized as follows: Sect. 2 present the classical and proposed modified CIs for the classical PCI. A comprehensive Monte-Carlo simulation study has been conducted to compare the performance of confidence intervals in Sect. 3. In Sect. 4, two real-life data sets from the industry sector are presented and analyzed to illustrate the performance and the implementation of the considered CIs for Cp. Finally, some concluding remarks are given in Sect. 5.

2 The confidence interval (CI) and modified confidence intervals for Cp

Suppose that X1, X2,…,Xn are from a normal distribution with mean μ and standard deviation σ, then a \((1 - \alpha )\,100\,\%\) confidence interval for the classical process capability index, Cp, is constructed by using a pivotal quantity: \(Q = {{(n - 1)\,S^{2} } \mathord{\left/ {\vphantom {{(n - 1)\,S^{2} } {\sigma^{2} }}} \right. \kern-0pt} {\sigma^{2} }}\) and is given as follows:

$$\begin{aligned} &{\text{LCL }} = \frac{{{\text{USL}} - {\text{LSL}}}}{{6{\text{S}}}}\sqrt {\frac{{\chi_{{\left( {\frac{\alpha }{2}\,,\,{\text{n}} - 1} \right)}}^{2} }}{{({\text{n}} - 1)}}} \,{\text{and }}\,\\&{\text{UCL }} = \frac{{{\text{USL}} - {\text{LSL}}}}{{6{\text{S}}}}\sqrt {\frac{{\chi_{{\left( {1 - \frac{\alpha }{2}\,,\,{\text{n}} - 1} \right)}}^{2} }}{{({\text{n}} - 1)}}} \end{aligned}$$

where LCL = lower confidence limit and UCL = upper confidence limit, \(\chi_{{\left( {\frac{\alpha }{2}\,,\,n - 1} \right)}}^{2}\) and \(\chi_{{\left( {1 - \frac{\alpha }{2}\,,\,n - 1} \right)}}^{2}\) are the “α/2 and 1 − α/2” quantiles of the central Chi squared distribution with \(n - 1\) degrees of freedom respectively. The confidence interval for Cp shown above is to be used for normal distribution data. The underlying process distributions are non-normal in many industrial processes (e.g., Chen and Pearn [13], Bittanti and Moiraghi [9], Wu and Messimer [35], Chang et al. [10], Ding [15]). For a given nominal level of the confidence interval, a high coverage probability is desirable [7]. However, for a non-normal distribution, the coverage probability of the confidence interval is quite low (Balamurali and Kalyanasundaram [8]).

The main goal of this paper is to obtain some modified confidence intervals for Cp by comparing the performance of confidence intervals based on the robust scale estimators as an alternative to the sample standard deviation (S) instead of bootstrap method. In this study, we proposed six modified confidence intervals for estimating the classical process capability index, Cp, for non-normal distributions based on the robust methods. Since the process capability index, Cp greatly depends on the population standard deviation (σ), we want to estimate it by robust method so that the proposed confidence intervals can be used when data are not from normal distribution. Following Wooluru et al. [34], we will propose some confidence intervals for the process capability index, Cp, in this section follow.

2.1 CI based on Sps

The Sps is defined as \({\text{S}}_{\text{ps}} = \frac{\text{IQR}}{1.349}\), where IQR stands for inter-quartile range. Thus, the (1 − α) 100% confidence interval for the classical process capability index, Cp, based on Sps is given as follows:

$$\begin{aligned}& {\text{LCL}} = \frac{{{\text{USL}} - {\text{LSL}}}}{{6 {\text{S}}_{\text{ps}} }} \sqrt {\frac{{\chi_{{\left( {\frac{\alpha }{2}, n - 1} \right)}}^{2} }}{{\left( {{\text{n}} - 1} \right)}}} \,{\text{and}}\,\\&{\text{UCL}} = \frac{{{\text{USL}} - {\text{LSL}}}}{{6 {\text{S}}_{\text{ps}} }} \sqrt {\frac{{\chi_{{\left( {1 - \frac{\alpha }{2}, n - 1} \right)}}^{2} }}{{\left( {{\text{n}} - 1} \right)}}} \end{aligned}$$

where \(\chi_{{\left( {\frac{\alpha }{2}\,,\,n - 1} \right)}}^{2}\) and \(\chi_{{\left( {1 - \frac{\alpha }{2}\,,\,n - 1} \right)}}^{2}\) are α/2th and 1 − α/2th quintiles of the Chi squared distribution with \(n - 1\) degrees of freedom.

2.2 CI based on AADM

The average absolute deviation from the sample median (AADM) is a robust scale estimator that measures the deviation of the data from the sample median, MD, which is less influenced by outliers. It is defined as \(AADM\, = \,\frac{{\sqrt {{\pi \mathord{\left/ {\vphantom {\pi 2}} \right. \kern-0pt} 2}} }}{n}\sum\nolimits_{i = 1}^{n} {\left| {X_{i} - MD} \right|}\). The median is best known for being insensitive to outliers and has a maximal 50% breakdown point [29]. For main properties of MD, see for example Abu-Shawiesh and Kibria [2]. As stated in Gastwirth [17], AADM is a consistent estimate of σ and is asymptotically normally distributed. The (1 − α) 100% confidence interval for the classical process capability index, Cp, based on AADM is given as follows:

$$\begin{aligned} &{\text{LCL}} = \frac{{{\text{USL}} - {\text{LSL}}}}{{6 {\text{AADM}}}} \sqrt {\frac{{\chi_{{\left( {\frac{\alpha }{2}, n - 1} \right)}}^{2} }}{{\left( {{\text{n}} - 1} \right)}}} \quad {\text{and}}\quad\\& {\text{UCL}} = \frac{{{\text{USL}} - {\text{LSL}}}}{{6 {\text{AADM}}}} \sqrt {\frac{{\chi_{{\left( {1 - \frac{\alpha }{2}, n - 1} \right)}}^{2} }}{{\left( {{\text{n}} - 1} \right)}}} \end{aligned}$$

where \(\chi_{{\left( {\frac{\alpha }{2}\,,\,n - 1} \right)}}^{2}\) and \(\chi_{{\left( {1 - \frac{\alpha }{2}\,,\,n - 1} \right)}}^{2}\) are α/2th and 1 − α/2th quintiles of the Chi squared distribution with \(n - 1\) degrees of freedom.

2.3 CI based on MAD

The mean absolute deviation from the sample median (MAD) was first introduced by Hampel [19] and is widely used in various applications as an alternative to S. The MAD is defined as\(MAD = 1.4826MD\,\left\{ {\,\left| {X_{i} - MD} \right|} \right\}\,,\,\,i = 1,2,3, \ldots ,n\). The 1.4826 factor given in MAD adjusts the scale for maximum efficiency when the data comes from a normal distribution. The \((1 - \alpha )\,100\,\%\) confidence interval for the classical process capability index, Cp, based on MAD is given as follows:

$$\begin{aligned}& {\text{LCL }} = \frac{{{\text{USL}} - {\text{LSL}}}}{{6{\text{MAD}}}}\sqrt {\frac{{\chi_{{\left( {\frac{\alpha }{2}\,,\,{\text{n}} - 1} \right)}}^{2} }}{{({\text{n}} - 1)}}} \quad {\text{and }}\quad \\&{\text{UCL}} = \frac{{{\text{USL}} - {\text{LSL}}}}{{6{\text{MAD}}}}\sqrt {\frac{{\chi_{{\left( {1 - \frac{\alpha }{2}\,,\,{\text{n}} - 1} \right)}}^{2} }}{{({\text{n}} - 1)}}} \end{aligned}$$

where \(\chi_{{\left( {\frac{\alpha }{2}\,,\,n - 1} \right)}}^{2}\) and \(\chi_{{\left( {1 - \frac{\alpha }{2}\,,\,n - 1} \right)}}^{2}\) are α/2th and 1 − α/2th quintiles of the Chi squared distribution with \(n - 1\) degrees of freedom.

2.4 CI based on GMD

The Gini’s mean difference (GMD) was developed by Gini [16] and defined as follows:

$$GMD = \frac{{\begin{array}{*{20}c} {\sum\nolimits_{i\, = \,1}^{n} {\sum\nolimits_{j\, = \,1}^{n} {\left| {X_{i} - X_{j} } \right|} } } \\ {i < j} \\ \end{array} }}{{\left( {\begin{array}{*{20}c} n \\ 2 \\ \end{array} } \right)}}\, = \frac{2}{n\,(n - 1)}\left[ {\begin{array}{*{20}c} {\sum\limits_{i\, = \,1}^{n} {\sum\limits_{j\, = \,i + 1}^{n} {\left| {X_{i} - X_{j} } \right|} } } \\ {} \\ \end{array} } \right]$$

Gini’s mean difference may be more appropriate in case of a small departure from normality as it has an asymptotic relative efficiency 98% at the normal distribution [31]. It is more efficient than S if the normal distribution is contaminated by a small fraction [14, 18]. The \((1 - \alpha )\,100\,\%\) confidence interval for the classical process capability index, Cp, based on GMD is given as follows:

$$\begin{aligned}& {\text{LCL }} = \frac{{{\text{USL}} - {\text{LSL}}}}{{6{\text{GMD}}}}\sqrt {\frac{{\chi_{{\left( {\frac{\alpha }{2}\,,\,{\text{n}} - 1} \right)}}^{2} }}{{({\text{n}} - 1)}}} \;{\text{and}}\;\\&{\text{ UCL}} = \frac{{{\text{USL}} - {\text{LSL}}}}{{6{\text{GMD}}}}\sqrt {\frac{{\chi_{{\left( {1 - \frac{\alpha }{2}\,,\,{\text{n}} - 1} \right)}}^{2} }}{{({\text{n}} - 1)}}} \end{aligned}$$

where \(\chi_{{\left( {\frac{\alpha }{2}\,,\,n - 1} \right)}}^{2}\) and \(\chi_{{\left( {1 - \frac{\alpha }{2}\,,\,n - 1} \right)}}^{2}\) are α/2th and 1 − α/2th quintiles of the Chi squared distribution with \(n - 1\) degrees of freedom.

2.5 CI based on Sn

The Sn estimator was proposed by Rousseeuw and Croux [29] and is defined as the median of the n medians of the absolute differences between values. The Sn estimator can be defined as follows:

$$\begin{aligned} &{\text{S}}_{\text{n}} = 1.1926\,\,\,{\text{MD}}_{\text{i}} \left\{ {{\text{MD}}_{\text{j}} \left| {{\text{X}}_{\text{i}} - {\text{X}}_{\text{j}} } \right|} \right\},\,\,\\&{\text{i}} = 1,2,3, \ldots ,{\text{n}},\,\,{\text{j}} = 1,2,3, \ldots ,{\text{n}} \end{aligned}$$

A \((1 - \alpha )\,100\,\%\) confidence interval for the classical process capability index, Cp, based on Sn is given as follows:

$$\begin{aligned}& {\text{LCL}} = \frac{{{\text{USL}} - {\text{LSL}}}}{{6 {\text{S}}_{\text{n}} }} \sqrt {\frac{{\chi_{{\left( {\frac{\alpha }{2}, n - 1} \right)}}^{2} }}{{\left( {{\text{n}} - 1} \right)}}} \,{\text{and}}\,\\&{\text{UCL}} = \frac{{{\text{USL}} - {\text{LSL}}}}{{6 {\text{S}}_{\text{n}} }} \sqrt {\frac{{\chi_{{\left( {1 - \frac{\alpha }{2}, n - 1} \right)}}^{2} }}{{\left( {{\text{n}} - 1} \right)}}} \end{aligned}$$

where \(\chi_{{\left( {\frac{\alpha }{2}\,,\,n - 1} \right)}}^{2}\) and \(\chi_{{\left( {1 - \frac{\alpha }{2}\,,\,n - 1} \right)}}^{2}\) are α/2th and 1 − α/2th quintiles of the Chi squared distribution with \(n - 1\) degrees of freedom.

2.6 CI based on Qn

The Qn estimator was proposed by Rousseeuw and Croux [29], which is also a powerful alternative to the MAD. It is defined as follows:

$$\begin{aligned}& Q_{n} = 2.2219\,\,\left\{ {\,\left| {X_{i} - X_{j} } \right|;\,\,\text{i}\, < \,\text{j}} \right\}_{(g)} ,\,\,\,\\&\text{i} = 1,\,2,\,3, \ldots \,,\,\text{n};\,\,\,\text{j} = 1,\,2,\,3,\, \ldots \,,\,\text{n}\,\,\,\,\, \end{aligned}$$

where \(g = \left( {\begin{array}{*{20}c} h \\ 2 \\ \end{array} } \right) = \frac{h\,(h - 1)}{2}\), \(h = \left[ {\frac{n}{2}} \right] + 1\) and \(\left[ {\frac{n}{2}} \right]\) is the integer part of the fraction \(\frac{n}{2}\). Here the symbol (.) represents the combination. The Qn estimator is 2.2219 times the g-th order statistic of the \(\left( {\begin{array}{*{20}c} n \\ 2 \\ \end{array} } \right)\) distances between data points. A \((1 - \alpha )\,100\,\%\) confidence interval for the classical process capability index, Cp, based on Qn is given as follows:

$$\begin{aligned}& {\text{LCL}} = \frac{{{\text{USL}} - {\text{LSL}}}}{{6 {\text{Q}}_{\text{n}} }} \sqrt {\frac{{\chi_{{\left( {\frac{\alpha }{2}, n - 1} \right)}}^{2} }}{{\left( {{\text{n}} - 1} \right)}}} \,{\text{and}}\,\\&{\text{UCL}} = \frac{{{\text{USL}} - {\text{LSL}}}}{{6 {\text{Q}}_{\text{n}} }} \sqrt {\frac{{\chi_{{\left( {1 - \frac{\alpha }{2}, n - 1} \right)}}^{2} }}{{\left( {{\text{n}} - 1} \right)}}} \end{aligned}$$

where \(\chi_{{\left( {\frac{\alpha }{2}\,,\,n - 1} \right)}}^{2}\) and \(\chi_{{\left( {1 - \frac{\alpha }{2}\,,\,n - 1} \right)}}^{2}\) are α/2th and 1 − α/2th quintiles of a Chi squared distribution with \(n - 1\) degrees of freedom.

3 Simulation study

In this section, a Monte-Carlo simulation study is conducted using the statistical software MATLAB to compare performances of the classical and proposed confidence intervals given in this paper. Since, our obejctive is to find some good intervals for non-normal cases, we consider normal, t, Chi square, Exponnetial, Gamma, Lognorma and Beta distribitions, which cover a wide range of non-normal distributions. The simulation study was designed as follows:

  1. (i)

    Normal distribution, N(50, 1)

  2. (ii)

    Student-t distribution, t(5).

  3. (iii)

    Chi Square distribution, \(\chi_{{^{(1)} }}^{2}\).

  4. (iv)

    Exponential distribution, Exp (2).

  5. (v)

    Gamma distribution, G(1, 6).

  6. (vi)

    Lognormal distribution, LN(0, 1).

  7. (vii)

    Beta distributions, Beta (3, 3) and Beta (1, 10).

The number of simulation replications was M = 50,000 for each case. Random samples were generated from each of the above mentioned distributions with Cp = 1.0 and samples sizes n = 10, 25, 50 and 100. Coverage probability (CP) and average width (AW) of the selected confidence intervals were measured for each case. The most common 95% confidence interval (α = 0.05) is used for measuring coverage probability and average width of the confidence intervals. When (α = 0.05), an interval has perfect performance in terms of coverage probability that will capture the true process capability index, Cp, between the lower and upper limits 95% of the times. The estimated coverage probability and estimated average width for this simulation study are respectively given as:

$$\widehat{CP} = \frac{{\# \left( {L \le C_{p} \le U} \right)}}{M}\,\,{\text{and}}\,\widehat{AW} = \frac{{\mathop \sum \nolimits_{i = 1}^{M} \left( {U_{i} - L_{i} } \right)}}{M}$$

where \(\# \left( {{\text{L}} \le {\text{C}}_{\text{p}} \le {\text{U}}} \right)\) denotes the number of simulation runs for which Cp lies within confidence interval. The simulated coverage probabilities and average widths for each of the distributions described above are presented in Tables 2 and  3, respectively. For clear understanding, coverage probabilities and average widths for considered sample sizes are presented in Figs. 1a, 2b, respectively. In the Table 2, we have reported coverage probabilities for selected confidence intervals of Cp for the various distributions and also for all considered values of n. For clear understanding, in Fig. 1a, b, we have presented coverage probabilities and average widths values for sample size n = 10 to observe effects on all considered distributions. Our observations from Fig. 1a are follows:

Table 2 Estimated coverage probabilites of 95% CI for Cp = 1.00
Table 3 Estimated average widths of 95% CI for Cp = 1.00
Fig. 1
figure 1

a CP for n = 10; b AW for n = 10

Fig. 2
figure 2

a CP for n = 25; b AW for n = 25

  1. (i)

    Coverage probability for the classical interval, the AADM interval, the Sn interval and the Qn interval are close to the nominal level 0.95, while others are not when data are generated from the distributions for skewness 0.

  2. (ii)

    Coverage probability for the classical interval, the AADM interval and the MAD interval are close to the nominal level 0.95 compared to other considered intervals when data are generated from the Chi square distribution with df 1 and skewness 2.8284.

  3. (iii)

    Coverage probability only for the GMD interval is close to the nominal level 0.95 compared to other considered intervals when data are generated from exponential distribution for skewness 2.

  4. (iv)

    Coverage probability only for the GMD interval is very close to the nominal level 0.95 compared to other considered intervals when data are generated from the gamma distribution for skewness 0.8165.

  5. (v)

    Coverage probability for the classical interval, the AADM interval and the GMD interval is close to the nominal level 0.95 compared to other considered intervals when data are generated from the lognormal distribution for skewness 8.1074. It is very interesting to note that the GMD confidence interval performed best compared to other intervals.

  6. (vi)

    Coverage probability for the classical interval, the AADM interval, the GMD interval, the Sn interval and the Qn interval are very close to the nominal level 0.95 compared to other considered intervals when data are generated from the Beta (3,3) distribution for skewness 0. When data are generated from the Beta (1, 10) distribution for skewness 1.5170, it is noted that only the GMD interval has coverage probability very close to the nominal value 0.95 compared to other intervals.

Figure 1b shows that AWs for various confidence intervals and distributions for small sample size n = 10. From Fig. 1b, we noted the following performances:

  1. (i)

    The Qn interval has the lowest average width compared to other intervals, followed by the Sn interval, the classical interval and so on when random samples were generated from the N(50, 1) distribution.

  2. (ii)

    The Sn interval has the lowest average width compared to other intervals, followed by the Qn interval, the classical interval and so on when random samples were generated from the t(5) distribution.

  3. (iii)

    The GMD interval has the lowest average width compared to other intervals, followed by the Qn interval, the classical interval and so on when random samples were generated from the \(\chi_{{^{(1)} }}^{2}\) distribution.

  4. (iv)

    The GMD interval has the lowest average width compared to other intervals, followed by the classical interval, the AADM interval and so on when random samples were generated from the exponential distribution. It is very interesting to note that compared to N(50, 1), t(5) and \(\chi_{{^{(1)} }}^{2}\) distributions, for all intervals have small AWs when random samples were taken from this non-normal distribution.

  5. (v)

    The GMD interval has the lowest average width compared to other intervals, followed by the Qn interval, the classical interval and so on when random samples were generated from the gamma distribution.

  6. (vi)

    The GMD interval has the lowest average width compared to other intervals, followed by the classical interval, the AADM interval and so on when random samples were generated from the lognormal distribution.

  7. (viii)

    The GMD interval has the lowest average width compared to other intervals, followed by the Sn interval, the classical interval and so on when random samples were generated from the Beta (3, 3) distribution. For Beta (1, 10) distribution, the GMD interval has the lowest average width compared to other intervals, followed by the Qn interval, the classical interval and so on.

From above discussions, we may conclude that the classical interval, the AADM interval, the GMD interval, the Sn interval and the Qn interval have performed better with respect to coverage probability for all considered distributions. In terms of measure AW, the above-mentioned confidence intervals observed smaller values compared to other considered intervals.

Figure 2a, b present CP and AW values for sample size n = 25 to see influences on all selected distributions. We observed the following:

  1. (i)

    Coverage probability of all intervals are closer to the nominal level. Among them again the classical interval, the AADM interval, the Sn interval and the Qn interval have the nominal level close to 0.95 compared to other considered intervals when data are generated from N(50, 1).

  2. (ii)

    Coverage probability of classical interval, the AADM interval and the MAD interval are closer to the nominal level compared to other considered intervals when data are generated from the Chi square distribution.

  3. (iii)

    Coverage probability only for the GMD interval is closer to the nominal level 0.95 compared to other considered intervals when data are generated from the exponential distribution.

  4. (iv)

    We observed that with increasing size of the random samples there is a tendency of increasing coverage probability for all selected confidence intervals. Among them, only for the GMD interval is very close to the nominal level when data are generated from the gamma distribution.

  5. (v)

    Coverage probability for the classical interval, the AADM interval and the GMD interval are more close to the nominal level when data are generated from the lognormal distribution for skewness 8.1074. It is very interesting to note that again the GMD interval performed best compared to other intervals.

  6. (vi)

    Coverage probability for the classical interval, the AADM interval, the GMD interval, the Sn interval and the Qn interval are more close to the nominal level compared to sample size n = 10, when data are generated from the Beta (3, 3) distribution and also the Beta (1, 10) distribution.

Figure 2b shows that average widths for various intervals and for various considered distributions for small sample size n = 25. We noted that with increasing size of the samples, AWs decrease and observed almost same performances, discussed as follows:

  1. (i)

    The Qn interval has the lowest average width compared to other intervals, followed by the Sn interval, the classical interval and so on when random samples were generated from the N(50, 1) distribution.

  2. (ii)

    The Sn interval has the lowest average width compared to other intervals, followed by the Qn interval, the classical interval and so on when random samples were generated from the t(5) distribution.

  3. (iii)

    The GMD interval has the lowest average width compared to other intervals, followed by the Qn interval, the classical interval and so on when random samples were generated from the \(\chi_{{^{(1)} }}^{2}\) distribution.

  4. (iv)

    The GMD interval has the lowest average width compared to other intervals, followed by the classical interval, the AADM interval and so on when random samples were generated from the exponential distribution. It is very interesting to note that compared to N(50, 1), t(5) and \(\chi_{{^{(1)} }}^{2}\) distributions, for all intervals have small AWs when random samples were taken from this non-normal distribution.

  5. (v)

    The GMD interval has the lowest average width compared to other intervals, followed by the Qn interval, the classical interval and so on when random samples were generated from the gamma distribution. It is also very interesting to note that compared to all considered distributions, for all intervals observed smallest AWs when random samples were taken from this non-normal distribution.

  6. (vi)

    The GMD interval has the lowest average width compared to other intervals, followed by the classical interval, the AADM interval and so on when random samples were generated from the lognormal distribution.

  7. (vii)

    The GMD interval has the lowest average width compared to other intervals, followed by the Sn interval, the classical interval and so on when random samples were generated from the Beta (3, 3) distribution. For Beta (1, 10) distribution, the GMD interval has the lowest AW compared to other intervals, followed by the Qn interval, the classical interval and so on.

From the above discussions, again we may conclude that the classical interval, the AADM interval, the GMD interval, the Sn interval and the Qn interval have performed better with respect to coverage probability for all considered distributions. In terms of measure AW, we noted again the above-mentioned confidence intervals observed smaller values compared to n = 10 and other confidence intervals.

In Figs. 3a, 4b, we presented coverage probabilities and AWs for n = 50 and n = 100 under our simulation flowchart. We observed that with increasing samples sizes, coverage probabilities are closer to the nominal level compared to small sample sizes. The average widths observed smaller for large sample sizes compared to small sample sizes for all selected confidence intervals and also under all considered distributions.

Fig. 3
figure 3

a CP for n = 50; b AW for n = 50

Fig. 4
figure 4

a CP for n = 100; b AW for n = 100

4 Applications

In this section, two real-life data examples from the industry sector are presented to illustrate the implementation and performance of the classical and modified confidence intervals for the classical process capability index, Cp.

4.1 Example I

The first data set was obtained from Rezaie et al. [28]. Their case study involved a manufacturer and supplier of audio-speaker components. The data represents the weight (in grams) of the rubber edge, which is one of the key components that reflect the sound quality of drive unit, has been studied (Table 4).

Table 4 The weight (in gram) of the rubber edge data

The company decided that the process has upper and lower specifications at USL = 8.94 g and LSL = 8.46 g. If the weight of the rubber edge falls outside the specification limits, it is unacceptable. A summary of the location and scale statistics values are calculated and given in Table 5. The normality has been examined by using the Kolmogorov–Smirnov (K–S) goodness-of-fit test. The histogram, density plot, and normal probability plot from the data was obtained using Minitab® Release 14 (Minitab Inc., 2012) and shown below in Fig. 5.

Table 5 Summary statistics for the weight of the rubber edge data
Fig. 5
figure 5

Plots for the weight of the rubber edge data

As it can be observed, the Kolmogorov–Smirnov (K–S) goodness-of-fit test for normality have a p value greater than α = 0.05. In addition, the histogram and the normal probability plot show a normal distribution. Thus, it may be concluded that the sample data can be regarded as taken from a normal process. The estimated capability indices, the resulting 95% confidence interval and the corresponding confidence interval width for all confidence intervals of Cp are reported in Table 6.

Table 6 The 95% CIs for Cp for the weight rubber edge data

From Table 6, we observe that the CIGMD interval has the smallest width followed by the classical confidence interval, CISps and CIAAMD. The CIQn interval has the highest width. Both CIMAD and CISn have the shorter widths compared to the corresponding interval CIQn. Thus the CIGMD interval performs the best in the sense of having smaller width than the classical confidence interval and the other modified confidence intervals. In addition, according to the quality conditions for Cp value given in Table 1, we observe from Table 6 that when S, Sps, AAMD and GMD are used to estimate σ, then the process is satisfactory capable of meeting the given specifications, but when MAD, Sn and Qn are used to estimate σ, then the process is excellent capable of meeting the given specifications. The Qn gives the best value followed by MAD and Sn. The results of this example supported the simulation study results to some extent. The results showed that the process is being capable and the normal distribution is adequate for modeling this data.

4.2 Example II

The second data set was obtained from Chen and Ding [12], which represents the inner diameter (in mm) for roller bearing and presented in Table 7.

Table 7 The inner diameter (in mm) for roller bearing data

According to the results, the upper specification limit is 60.004 mm and lower specification limit is 59.981 mm. A summary of location and scale statistics values are given in Table 8.

Table 8 Summary statistics for inner diameter of roller bearing data

The normality has been examined by using the Kolmogorov–Smirnov (K–S) goodness-of-fit test. The histogram, density plot, and normal probability plot are shown in Fig. 6. The Kolmogorov–Smirnov (K–S) goodness-of-fit test has a p value less than α = 0.01, which indicates that the data do not follow normality assumption. The histogram and the normal probability plot show a non-normal distribution. Thus, it may be concluded that the inner diameter of roller bearing data can be regarded as taken from a non-normal process.

Fig. 6
figure 6

Plots for the inner diameter of roller bearing data

The estimated capability indices, the resulting 95% confidence interval and the corresponding confidence interval width for all confidence intervals of Cp are presented in Table 9. From Table 9, we observe that the CISps has the smallest width followed by the CIQn, CIMAD, CISn, CIAAMD, CIGMD and classical confidence interval. The classical confidence interval has the highest width. Also, widths for all modified CIs are very close to each other while the width for the classical confidence interval is very far from other widths and has a large value. Thus all the modified confidence intervals perform better than the classical confidence interval in the sense of having smaller width. Thus, according to the quality conditions for the Cp value given in Table 1, we observe from Table 9 that all values based on S and the other robust estimators are less than 0.67. However, the Cp value based on S is the largest one of them, then the process is poor capable of meeting the given specifications. The results are not surprising because the data is skewed. The results of this example supported the simulation study results to some extent. The results of data set under non-normality showed that the process is not capable and the normal distribution is not adequate for modeling this data.

Table 9 95% CIS for Cp for the inner diameter of roller bearing data

5 Conclusions

Most of the standing confidence intervals for Cp (process capability index) are based on the confidence interval for the standard deviation, which is based on a normal distribution. However, the underlying distribution of the observed data may or may not be normally distributed. In the presented study, we proposed some interval estimators for the classical process capability index, Cp, based on some robust estimators for scale parameter. A Monte-Carlo simulation study has been conducted to compare the performance of our proposed confidence intervals with the classical existing confidence interval based on coverage probability and average width. Random samples were generated from a variety of symmetric and non-symmetric distributions. Two real-life data sets have been analyzed which supported the findings of our simulation study to some extent. From our study, it may be concluded that the classical confidence interval, the AADM confidence interval, the GMD confidence interval, the Sn confidence interval and the Qn confidence interval have performed very well with respect to high coverage probability for all considered distributions. In terms of measure of average width, the above mentioned confidence intervals also performed very well compared to other confidence intervals. The findings of this paper are consistent with the results of Rousseeuw and Croux [29], Abu-Shawiesh and Kibria [2] and Piña-Monarrez et al. [27] among others.