1 Introduction

In structural design, uncertainties in the system inputs are considered the root cause of poor product performance, since they lead to variation in the system output responses. These uncertainties typically occur in material properties, manufacturing parameters, externally applied forces, etc. (Jin et al. 2003; Lee et al. 2009; Ramu and Arul 2016; Voinov et al. 2016; Lee et al. 2019). Irrespective of the variability in the inputs, a designer wishes to achieve a system response that satisfies the design objectives, preferably with less variability (Ramu et al. 2017). It is common statistical practice to quantify uncertainties by conventional moments (C-moments), namely the mean (μ) and variance (σ²) (Hosking 1990). Higher order C-moments such as skewness (γ1) and kurtosis (α4) are also used to describe the system outputs more precisely (Mekid and Vaja 2008; Anderson and Mattson 2012). In order to design under uncertainty, it is important to quantify the uncertainty of the system output, which results from the uncertain inputs being propagated through the system. This is referred to as uncertainty propagation (Mekid and Vaja 2008; Lee and Chen 2009; Anderson and Mattson 2012; Jayaraman et al. 2018; Liu et al. 2019). Sometimes, the input distributions are known and their moments are computed and propagated through the system model to obtain the statistics of the output response. When moments of distributions are used for propagation, it is imperative to compute the higher order moments accurately (Lee and Chen 2009).

Most engineering applications have scarce data, and a description of the uncertainties associated with the input variables is usually not readily available (Ramu and Arul 2016). One way of quantifying this uncertainty is to use nonparametric methods such as histograms, kernel density estimation (KDE) techniques, and interval analysis (Lee et al. 2019). Though the histogram is a nonparametric density estimation approach, it is less robust and the challenge lies in choosing the bin size; changing the bin size leads to different inferences from the data (Rudemo 1982; Silverman 1986; Lee et al. 2019). In KDE, the smoothing of the density curve is controlled by the bandwidth, and selecting the right bandwidth is a major challenge (Hall et al. 1991; Kang et al. 2018). Often, a higher bandwidth leads to over-smoothing and a lower bandwidth to under-smoothing of the density curve (Lee et al. 2019). Bandwidth selection for heavy tailed distributions is relatively difficult (Buch-Larsen et al. 2005). Integrated squared error, mean integrated squared error, and least squares cross validation error are used to choose the optimal bandwidth (Silverman 1986; Hall et al. 1991; Shirahata and Is 1992; Turlach 1993). These error metrics are sensitive to the extremes present in small data sets and suffer from sampling variability (Silverman 1986; Park and Marron 1990; Hall et al. 1991). KDE can also yield a very irregular distribution shape for small data (Kang et al. 2018). Interval analysis represents the uncertainty as an interval, or as a variable with lower and upper bounds; fuzzy set theory and evidence theory are generally used to estimate the interval. A major drawback of this approach is that it requires interval arithmetic for statistical model comparison and validation (Rokne 2001; Gao et al. 2010; Lee et al. 2019). Though nonparametric methods are robust, they require sufficient data for accurate modelling (Lee et al. 2019). When the interest is in extreme probabilities, tail modeling techniques are also used to quantify the uncertainties. These techniques approximate the tail portion of the cumulative distribution function (CDF) using limited samples. However, the tail itself is highly volatile and very sensitive to the number of data points (Ramu P 2013; Acar and Ramu 2014). These techniques are usually suitable for estimating low probabilities and not for modeling the entire probability space, which is the general interest in uncertainty quantification.

A widely used parametric approach is the Pearson system, which is based on the hypothesis that higher order moments provide a good representation of the PDF. However, researchers (Hosking 1990) observe that there is a lack of clarity on what information the higher order moments impart about the shape of the distribution. Also, moments computed from scarce data can be markedly different from those of the probability distribution from which the sample was drawn. The choice of the underlying probability distribution plays a very crucial role. When the choice is influenced by scarce samples, it is likely to contain errors, which, when propagated through the model, result in amplified errors in the system output. In addition, the scarce samples might also include extremes, which only worsen the errors. With scarce data, extremes have a large influence on moment estimation (Moon et al. 2020). C-moments are sensitive to extremes present in the data, leading to large variation in the computed moments. Extremes can sometimes be classified as outliers and excluded from the statistical study (Jayaraman et al. 2018; Jayaraman and Ramu 2019). This again leads to a large error between the computed moments and the actual moments. Since the available data itself is scarce, excluding one or a few data points leads to significant information loss. Therefore, it is desirable to develop an approach that can:

(i) Identify an appropriate probability distribution from scarce data.

(ii) Estimate moments accurately from scarce data with possible extremes. This translates to the estimation technique being insensitive, or only mildly sensitive, to the extremes.

To overcome the hurdle of computing robust moments while accounting for extreme realizations, researchers recommend using linear moments (L-moments), which are linear combinations of order statistics. L-moments are known to be robust and less sensitive to extremes or outliers in scarce samples. In addition, L-moments are less subject to bias in estimation, and their asymptotic approximations to the normal distribution are better when the samples are finite (Sillitto 1969; David 1981; Hosking 1989, 1990, 1992; Gubareva and Gartsman 2010). L-moments are widely used in hydrology, water resource applications, and regional frequency analysis (Hosking and Wallis 1997; Sankarasubramanian and Srinivasan 1999; Adamowski 2000; Kumar and Chatterjee 2005; Atiem and Harmancioǧlu 2006). The L-moment approach is preferable for insufficient or limited data in flood or rainfall analysis (Smithers and Schulze 2001; Haddad et al. 2011). L-moments are used to identify the probability distribution of censored data in the fields of environmental quality and quantity monitoring (Zafirakou-Koulouris et al. 1998; Sankarasubramanian and Srinivasan 1999; Elamir and Seheult 2004). In reliability analysis, L-moments are used to analyze and characterize lifetime data (Nair and Vineshkumar 2010). However, based on the discussions presented above, the use of L-moments in the context of design with scarce samples that might include extremes has not been reported. Hence, the authors propose a framework using L-moments to quantify and propagate uncertainties in an optimization framework, when the available data is scarce and could include potential extremes.

To this end, the purpose of this study is to identify the PDF of the response from scarce samples, in the presence of extremes. We propose using L-moment ratios to identify the PDF. The identified PDF is compared to the exact PDF using the Jensen-Shannon divergence. This comparison is only to validate the proposed approach, because in reality one will not have information about the exact PDF. Higher order L-moments are used to compute the L-moment ratios, namely L-skewness (τ3) and L-kurtosis (τ4). We compare L-moment estimates against C-moment estimates and their effects on identifying the underlying distribution, using the L-moment ratio diagram and the Pearson system, respectively. The novelty lies in aspects such as the following: (i) a comprehensive understanding of the performance of L-moments on different types of tails when the data is scarce and contains potential extremes; (ii) combining scarce data, extremes, and L-moments in a design context; (iii) using L-moments to quantify the uncertainty and propagate it in a design framework, implemented on real-life engineering examples. The proposed approach is demonstrated on a suite of distributions that covers all types of tails and on several engineering examples.

The rest of the paper is organized as follows. Section 2 explains how the PDFs are identified using the Pearson system and the L-moment ratio diagram. Section 3 discusses the investigation strategy of the proposed approach, the comparison of the obtained PDFs with the exact PDF using the Jensen-Shannon divergence, and how extremes are generated and incorporated in the data. The proposed approach is demonstrated on a suite of statistical distributions and engineering examples in Section 4, followed by conclusions in Section 5.

2 C-moments and L-moments

In this study, we focus on scarce data that might include extremes. The goal is to identify the underlying PDF and use its moments for characterising uncertainties. The two approaches compared here (the C- and L-moment approaches) compute the respective moments and identify the PDFs using the Pearson system and the L-moment ratio diagram, respectively. The specific objective is to compare the performance of the approaches in the presence of extremes. The process of investigation is presented in Fig. 1 and its elements are discussed below.

Fig. 1
figure 1

Generalized distribution identification procedure

2.1 Identification of distribution using C-moments and Pearson system

The Pearson system (Pearson 1916) is a method of choosing an appropriate distribution, from a family of distributions called the Pearson distributions, based on the first four C-moments of the system response. The Pearson family contains several distributions such as the beta, normal, and gamma. Each PDF in the Pearson family satisfies the generalized differential equation given in (1) (Kenney and Keeping 1947; Craig 1991; Weisstein et al. 2004).

$$ \frac{dy}{dx}=\frac{y(m-x)}{a_{0}+a_{1}x+a_{2}x^{2}} $$
(1)

where y is the PDF and a0, a1, a2, and m are parameters expressed in terms of the C-moments of the system, as given in (2).

$$ \begin{array}{ll} a_{0} &= \frac{2+\delta}{2(1+2\delta)}\\ a_{1} &= -m = \frac{\gamma_{1}}{2(1+2\delta)}\\ a_{2} &= \frac{\delta}{2(1+2\delta)} \end{array} $$
(2)

where \(\delta =\frac{2\alpha_{4}-3\gamma_{1}^{2}-6}{\alpha_{4}+3}\), γ1 is the skewness, and α4 is the kurtosis of the data.

The roots c1, c2 and the coefficients of a0 + a1x + a2x² provide information that can be used to classify the distributions. A few possible types of distributions are (i) a1 = a2 = 0, a0 > 0: normal distribution, (ii) \({a_{1}}^{2}/4{a_{0}}{a_{2}}<0, c_{1} \leq {x} \leq c_{2}\): beta distribution, (iii) \({a_{1}}^{2}/4{a_{0}}{a_{2}}=0, {a_{2}}>0, -\infty <x< \infty \): Student's t-distribution, etc. The solution of (1) needs to satisfy the following conditions:

i. the distribution curve should vanish at the ends of the range, i.e., as \(y\rightarrow 0, dy/dx \rightarrow 0\)

ii. when \(x=m\), \(dy/dx = 0\) (the numerator of (1) vanishes)

In this study, the Pearson system is adopted to identify the distribution using the C-moments computed from the scarce samples.
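For illustration, the classification described above can be sketched in Python as follows. This is a minimal sketch written for this discussion (the function name and tolerance handling are ours, not the authors'); it computes δ and the coefficients in (2) from sample skewness and kurtosis and applies only the few classification rules quoted above, whereas a full Pearson-system implementation distinguishes all types.

```python
# Minimal sketch: classify a Pearson type from sample C-moments using (1)-(2).
import numpy as np
from scipy.stats import skew, kurtosis

def pearson_classify(x, tol=1e-3):
    g1 = skew(x)                      # sample skewness, gamma_1
    a4 = kurtosis(x, fisher=False)    # sample kurtosis, alpha_4 (non-excess)
    delta = (2*a4 - 3*g1**2 - 6) / (a4 + 3)
    a0 = (2 + delta) / (2*(1 + 2*delta))
    a1 = g1 / (2*(1 + 2*delta))       # note a1 = -m
    a2 = delta / (2*(1 + 2*delta))
    if abs(a1) < tol and abs(a2) < tol and a0 > 0:
        return "normal"
    kappa = a1**2 / (4*a0*a2)         # discriminant-type criterion
    if kappa < 0:
        return "beta (Type I)"
    if abs(kappa) < tol and a2 > 0:
        return "Student's t (Type VII)"
    return "other Pearson type"

print(pearson_classify(np.random.default_rng(1).normal(size=50)))
```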

2.2 L-moments

L-moments (Hosking 1989, 1990, 1992; Hosking and Wallis 1997; Hosking 2006) are expectations of certain linear combinations of order statistics. These combinations provide information about the location, scale, and shape of the distribution from which the samples are drawn. Since L-moments are linear functions of the data, they suffer less from the effects of sampling variability and are reported to be robust in the presence of extremes. L-moments, which are modifications of probability weighted moments (PWM), can be used to define the shape of the probability distribution (Greenwood et al. 1979). The PWM, βr (Hosking and Wallis 1997), of a probability distribution with cumulative distribution function F and quantile function x(F) is given by \(\beta_{r}=\int_{0}^{1}x(F)F^{r}\,dF\). The population L-moment is

$$ \begin{array}{ll} \lambda_{r+1}={\sum}_{k=0}^{r}p_{r,k}^{*}\beta_{k},r=0,1,\dots,n-1; \end{array} $$
(3)

where λ is the population L-moment, \(p^{*}_{r,k}=(-1)^{r-k}\binom{r}{k}\binom{r+k}{k}\), and n is the sample size.

Equation (3) can be rewritten in terms of the expectations of order statistics of a random variable, X, as

$$ \begin{aligned} \lambda_{r}&=\frac{1}{r}\sum\limits_{k=0}^{r-1}(-1)^{k}\binom{r-1}{k}E(X_{r-k:r}),\\r&=1,2,\dots,n; \end{aligned} $$
(4)

where E(⋅) denotes the expectation operator. The first four L-moments, expressed in terms of PWMs using (3), are presented in (5)

$$ \begin{array}{ll} \lambda_{1_{r=0}}&=\beta_{0},\\ \lambda_{2_{r=1}}&=2\beta_{1}-\beta_{0},\\ \lambda_{3_{r=2}}&=6\beta_{2}-6\beta_{1}+\beta_{0},\\ \lambda_{4_{r=3}}&=20\beta_{3}-30\beta_{2}+12\beta_{1}-\beta_{0} \end{array} $$
(5)

In practice, L-moments are estimated from the ordered sample, \(x_{1:n}\leq x_{2:n}\leq \dots \leq x_{n:n}\). The sample PWM, br is presented in (6).

$$ \begin{aligned} b_{r} &= \frac{1}{n} \sum\limits_{j=r+1}^{n}\frac{(j-1)(j-2)\dots(j-r)}{(n-1)(n-2){\dots} (n-r)} x_{j:n},\\r&=0,1,2,\dots,n-1; \end{aligned} $$
(6)

The general form of the sample L-moment (l) in terms of the sample PWMs is

$$ l_{r+1}=\sum\limits_{k=0}^{r}p_{r,k}^{*}b_{k},r=0,1,2,\dots,n-1; $$
(7)

The first sample L-moment (l1) is the sample mean, a measure of location. The dispersion of the sample is measured by the second L-moment (l2), a scalar multiple of Gini's mean difference statistic (Hosking 1990; Ceriani and Verme 2012). Sample L-moment ratios (tr) are obtained by dividing the higher order sample L-moments by the dispersion measure l2, i.e., tr = lr/l2. These are dimensionless quantities, independent of the units of the measured data. The L-moment analogue of the coefficient of variation (σ/μ) is the L-coefficient of variation, \(\tau _{_{L-CV}} =\lambda _{2}/\lambda _{1}\), which satisfies \(0\leq \tau _{_{L-CV}} <1\) (Hosking 1990; Hosking and Wallis 1997).
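For concreteness, a minimal Python sketch of the sample L-moment computation in (5)-(7) is given below. It is our illustration rather than code from the paper; the function name and the Gumbel test sample are arbitrary.

```python
# Minimal sketch: sample PWMs b_r from (6), sample L-moments from (5)/(7),
# and the L-moment ratios t3 (L-skewness) and t4 (L-kurtosis).
import numpy as np

def sample_lmoments(x):
    x = np.sort(np.asarray(x, dtype=float))     # ordered sample x_{1:n} <= ... <= x_{n:n}
    n = len(x)
    j = np.arange(1, n + 1)
    b = np.zeros(4)
    for r in range(4):
        num = np.ones(n)                         # (j-1)(j-2)...(j-r); empty product for r = 0
        den = 1.0                                # (n-1)(n-2)...(n-r)
        for k in range(1, r + 1):
            num *= (j - k)
            den *= (n - k)
        b[r] = np.mean(num / den * x)            # equation (6); terms with j <= r vanish
    l1 = b[0]                                    # location (sample mean)
    l2 = 2*b[1] - b[0]                           # dispersion (L-scale)
    l3 = 6*b[2] - 6*b[1] + b[0]
    l4 = 20*b[3] - 30*b[2] + 12*b[1] - b[0]
    return l1, l2, l3/l2, l4/l2                  # l1, l2, t3, t4

print(sample_lmoments(np.random.default_rng(0).gumbel(size=25)))
```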

2.3 Identification of distribution using L-moment ratio

Similar to conventional moments, L-moments are used to summarize the characteristics of sample data. Many researchers (Hosking 1990; Hosking and Wallis 1997; Zafirakou-Koulouris et al. 1998; Adamowski 2000; Kumar and Chatterjee 2005) have used the L-moment ratio diagram to describe the PDF. The L-moment ratio diagram, shown in Fig. 2, is a plot of L-skewness vs L-kurtosis for standard distributions such as the uniform, normal, exponential, log-normal, logistic, Gumbel, generalized extreme value (GEV), generalized Pareto (GP), and gamma. Simple explicit expressions for τ4 (L-kurtosis) in terms of τ3 (L-skewness) are available for some widely used three-parameter distributions such as the lognormal, gamma, GEV, and GP (Hosking and Wallis 1997). The L-moment ratios of two-parameter distributions are shown in Fig. 2 as single dots and listed in Table 1. Polynomial approximations of the form \(\tau _{4} = A_{0}+A_{1}\tau _{3}+A_{2}{\tau _{3}^{2}}+\dots +A_{8}{\tau _{3}^{8}}\) have also been obtained, and the coefficients are provided in Hosking and Wallis (1997). The overall bound is the lower bound on the L-moment ratios of all distributions, obtained while satisfying the constraints λ2 ≥ 0, − 1 < τ3 < 1, and \(\frac {1}{4}(5{\tau _{3}^{2}}-1)\leq \tau _{4} < 1 \).

Fig. 2
figure 2

L-moment ratio diagram (Hosking 1990)

Table 1 L-moment ratio of distributions

The approximations yield values of τ4 for a given τ3 that are accurate to within 0.0005, provided τ3 lies in the range −0.9 to 0.9. For the GEV distribution, this accuracy is achieved only if τ3 lies in the range −0.6 to 0.9 (Hosking and Wallis 1997).

The L-moment ratio diagram permits a visual indication of which distribution may be a good fit to the data. The L-moment ratios t3 and t4 computed from the sample can be plotted on the L-moment ratio diagram to see which distribution is the closest. It is to be noted that our interest in the current work lies in computing the probabilities correctly rather than in identifying the true parent distribution. Hence, we select the distribution that is closest to the sample L-moment ratio plotted in Fig. 2. To understand the performance of the proposed approach, we compare the distribution identified through t3 and t4 to the actual distribution. In order to select a suitable distribution from the L-moment ratio diagram, the Euclidean norm in (8) is used for the distributions that are represented as points.

$$ {\left\lVert{Ed_{i}}\right\rVert}_{2} = \sqrt{({\tau_{4}^{i}}-t_{4})^{2}+({\tau_{3}^{i}}-t_{3})^{2}} $$
(8)

where \({\left \lVert {Ed_{i}}\right \rVert }_{2} \) is the Euclidean norm, i denotes the i-th distribution in the L-moment ratio diagram, (.)4 is the L-kurtosis, (.)3 is the L-skewness, τr is the population L-moment ratio, and tr is the sample L-moment ratio. t3 and t4 are computed from the available scarce data. For the distributions that are represented as curves in the L-moment ratio diagram, the distance of (t3, t4) from the respective curve is computed as the value of the curve's implicit expression evaluated at (t3, t4), i.e., the difference between t4 and the polynomial approximation evaluated at t3. The distribution with the least distance is chosen as the best possible distribution. The algorithm for the best distribution selection is provided in the Appendix. Once the PDF is chosen from the L-moment ratio diagram, the Jensen-Shannon divergence is used to compare the predicted PDF to the actual PDF.
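The selection step can be sketched as follows. The L-moment ratios of the point distributions are standard values (see Table 1); the CURVES dictionary is deliberately left empty because the polynomial coefficients A0, ..., A8 must be taken from Hosking and Wallis (1997). This is an illustrative sketch, not the authors' implementation.

```python
# Minimal sketch: choose the closest distribution on the L-moment ratio diagram.
import numpy as np

# (tau3, tau4) of distributions plotted as points
POINTS = {"uniform": (0.0, 0.0), "normal": (0.0, 0.1226),
          "exponential": (1/3, 1/6), "Gumbel": (0.1699, 0.1504)}
# distributions plotted as curves: {"GEV": [A0, ..., A8], ...}; coefficients
# to be filled in from Hosking and Wallis (1997)
CURVES = {}

def closest_distribution(t3, t4):
    best, best_d = None, np.inf
    for name, (tau3, tau4) in POINTS.items():
        d = np.hypot(tau3 - t3, tau4 - t4)       # Euclidean norm of (8)
        if d < best_d:
            best, best_d = name, d
    for name, A in CURVES.items():
        tau4_curve = np.polyval(A[::-1], t3)     # tau4 = A0 + A1*t3 + ... + A8*t3^8
        d = abs(tau4_curve - t4)                 # distance from the curve
        if d < best_d:
            best, best_d = name, d
    return best, best_d

print(closest_distribution(0.17, 0.15))          # sample (t3, t4) -> closest to Gumbel
```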

3 Investigation strategy

The hypothesis in the current work is that L-moments are robust compared to C-moments from an estimation perspective and that L-moments provide a better characterization of the response PDF. We are interested in investigating this for scarce samples (small sample sizes), especially in the presence of extremes. We thus proceed as follows: for scarce samples, PDFs are obtained using the C- and L-moment approaches and compared with the actual population PDF. Extremes are then introduced into the scarce samples and the PDFs are recomputed and compared with the actual population PDF. Sample sizes considered in this study are 10, 25, 50, and 100. The Jensen-Shannon divergence is used as the comparison metric. The process is repeated 100 times to account for sampling variability.

3.1 Jensen-Shannon divergence

The Jensen-Shannon divergence (DJS) is a widely used distance measure for studying the similarity between a finite number of probability distributions (Melville et al. 2005). DJS can be understood as a symmetrized extension of the Kullback-Leibler divergence (DKL) with finite bounds; it is sometimes also called the smoothed version of DKL. DKL is likewise a similarity measure between two probability distributions, but it is unbounded. DJS is presented in (9)

$$ D_{JS}(P||Q) =\frac{1}{2} D_{KL}(P||M)+\frac{1}{2} D_{KL}(Q||M)\\ $$
(9)

where \(M = \frac{1}{2}(P+Q)\) and \(D_{KL}(P||M) = {\sum}_{x\in X}P(x) \log_{2} \left(\frac{P(x)}{M(x)}\right)\); P is the probability of the actual distribution and Q is that of the distribution to be compared with P. DJS is bounded between 0 and 1 when the base-2 logarithm is used, and DJS = 0 indicates identical distributions.
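A minimal sketch of this computation is given below, with the two PDFs discretized on a common grid; the grid and the two distributions are placeholders chosen only to make the example run.

```python
# Minimal sketch: discrete Jensen-Shannon divergence of (9) with a base-2 logarithm.
import numpy as np
from scipy.stats import norm, lognorm

def js_divergence(p, q):
    p = p / p.sum()                               # normalize discretized PDFs
    q = q / q.sum()
    m = 0.5 * (p + q)
    def kl(a, b):                                 # Kullback-Leibler divergence, base 2
        mask = a > 0
        return np.sum(a[mask] * np.log2(a[mask] / b[mask]))
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

x = np.linspace(0.01, 5, 500)
p = norm.pdf(x, loc=1.0, scale=0.3)               # "actual" PDF (placeholder)
q = lognorm.pdf(x, s=0.3, scale=1.0)              # identified PDF to be compared (placeholder)
print(js_divergence(p, q))                        # 0 means identical; 1 is the upper bound
```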

3.2 Incorporation of extremes

Extreme events, or extremes, are rare events that have a very low probability of occurrence but cannot be dismissed as outliers (Abarbanel et al. 1992; Moustapha and Sudret 2019). The impact of extremes is crucial in many areas of application, such as environmental disasters (floods, climate change, forest fires, etc.), finance (risk analysis, insurance losses), engineering (structural and traffic analysis), and biomedicine (side effects of drugs) (Abarbanel et al. 1992; Alvarado et al. 1998; Davison and Huser 2015). In real-life scenarios or in any engineering application, often only small samples are available to perform the analysis. These samples need not necessarily come as a result of a design of experiments (DoE). Sometimes the small sample might include extremes, which are prone to being classified as outliers (erroneous data). But extremes (belonging to the parent distribution) cannot be excluded from the data set; in a small sample, omitting data leads to significant information loss. Hence, data points that are significantly different from the other samples, or that have low probability, need to be accounted for in the data analysis. To understand the effect of extremes and the performance of the proposed approach, we deliberately introduce extremes into the scarce data for the examples discussed. \(10^{6}\) samples are generated from the parent distribution and the maximum value is considered as the extreme. This extreme is then incorporated into the scarce samples.
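This step can be sketched as follows, assuming a standard normal parent purely for illustration.

```python
# Minimal sketch: append an extreme (maximum of 10^6 parent draws) to a scarce sample.
import numpy as np

rng = np.random.default_rng(42)
scarce = rng.normal(0.0, 1.0, size=10)                 # scarce sample from the parent
extreme = rng.normal(0.0, 1.0, size=10**6).max()       # extreme = max of 10^6 draws
scarce_with_extreme = np.append(scarce, extreme)       # e.g., sample size 11
```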

4 Results and discussion

In this section, we demonstrate the proposed approach on (i) a suite of distributions that covers all types of tails, (ii) uncertainty quantification of a sheet metal manufacturing process, (iii) design of a speed reducer, and (iv) probabilistic fatigue life assessment. The study of the performance of the proposed approach on different types of tails provides a comprehensive perspective on the superiority of the approach for data from any type of distribution. Section 4.2.1 applies the proposed approach to a single-variable problem, where uncertainty quantification and propagation are also discussed. Section 4.2.2 applies the proposed approach to a higher dimensional example. Section 4.2.3 contains real-life data on which the proposed approach is demonstrated to arrive at critical decisions.

4.1 Statistical distributions

The tails of any probability distribution can be approximated by a generalized Pareto distribution (GPD). The scale parameter ψ and the shape parameter ξ characterize the GPD (Ramu et al. 2010). ξ plays a significant role in quantifying the weight of the tail, or tail heaviness. Probability distributions can be classified as light tail (ξ < 0), medium tail (ξ = 0), and heavy tail (ξ > 0) distributions based on the tail heaviness. A detailed description of ξ and its relation to the tail heaviness of probability distributions is given in the Appendix. Since it is impractical to test the approach on all possible distributions, we classify the distributions based on tail heaviness and demonstrate the superiority of the method on all possible types of tails. The distributions and their shape (ξ) and scale parameters are listed in Table 2. The PDFs and CDFs of the distributions listed in Table 2 are presented in Fig. 3a and b, respectively. Here, we study the effect of extremes in scarce data on identifying the PDFs using samples of sizes 10, 25, 50, and 100.

Fig. 3
figure 3

PDF, CDF, and GPD of distributions. (a) PDF of distributions (b) CDF of distributions (c) GPD fit for selected distributions

Table 2 Details of distributions

Samples from all the distributions listed in Table 2 are generated and the corresponding C- and L-moments are computed. With the help of the Pearson system and the L-moment ratio diagram, the PDFs are predicted. The predicted PDFs are compared to the parent PDF using DJS. An extreme is then inserted into the sample data to study its effect, and the PDFs are re-predicted with the extreme and compared. This process is repeated 100 times. Figure 4 presents the performance of the proposed approach compared to the C-moment approach for the light tail. The figure is separated into four columns, representing the different sample sizes. In each column, the first two boxes are the results for the sample with extremes (e.g., sample size 11) and the remaining two boxes are the results for the sample with extremes removed (e.g., sample size 10).

From Fig. 4, it can be observed that the L-moment approach, while considering the extreme, has the minimum DJS compared to its C-moment counterpart. This implies that the L-moment approach is robust to extremes. When the sample size is increased, the variability in DJS decreases, as expected. However, with samples as scarce as 10 or 25, L-moments certainly work far better while including extremes. The values corresponding to the median and quartiles are listed in the Appendix for the interested reader. For a sample size of 10, the median value of DJS based on the L-moment approach is lower than that of the C-moment approach. This clearly shows that PDF identification using the L-moment approach works better. The interquartile range is smaller for the L-moment approach, implying it performs better under sampling variability. For larger sample sizes, the interquartile range remains smaller for the L-moment approach compared to the C-moment approach. This indicates that the L-moment approach continues to work well as the sample size is increased.

The results for the medium tail are presented in Fig. 5. The performance of the L-moment approach is similar to that in the light tail case. However, the effect of the extreme on the C-moment approach is large, even for a sample of size 100. The results for heavy tail I and heavy tail II are presented in Figs. 6 and 7, respectively. In the heavy tail II case, there is a large difference in the performance of the C-moment and L-moment approaches, even for samples as large as 100. The performance of the L-moment approach is similar to that for the other distribution types, establishing that L-moments perform better in identifying the distribution from scarce data that includes extremes. Though the L-moment approach works better than its C-counterpart, the difference between the L-moment approach with and without the extremes is marginal. For heavy tail II, it can be seen that the L-moment approach with extremes performs best in terms of variability. Hence, using the L-moment instead of the C-moment approach is clearly the better choice, but within L-moments, whether or not to leave out the extreme might depend on tail heaviness. Nevertheless, it is important to note that one does not possess information on tail heaviness with scarce data, and one does not lose much by considering the extremes while using L-moments. Information about the quartile values for all distributions is presented in the Appendix. The effect of two extremes (instead of the one discussed above) in the scarce samples is also studied, and the corresponding PDF comparison results are presented in the Appendix. The results indicate that the PDF identified using the L-moment ratio diagram has a lower DJS, strengthening our earlier conclusion on the performance of the L-moment approach.

Fig. 4
figure 4

JS divergence for light tail

Fig. 5
figure 5

JS divergence for medium tail

Fig. 6
figure 6

JS divergence for heavy tail I

Fig. 7
figure 7

JS divergence for heavy tail II

4.2 Engineering examples

In this section, we discuss three engineering examples to demonstrate the proposed approach.

4.2.1 A flat rolling process: sheet metal manufacturing

The flat rolling process is used to manufacture metal sheets or plates. In this process, the metal is fed between the working rolls (rollers) and the distance between the rolls is set to obtain the desired thickness of the metal sheet. The material is pulled through by the rollers, which results in elongation and reduced thickness. The uncertainty in the friction between the rollers and the material affects the outcome of the manufacturing process. The process, illustrated in Fig. 8, can be performed either below (cold rolling) or above (hot rolling) the recrystallization temperature of the material.

Fig. 8
figure 8

Flat rolling metal sheet manufacturing process

The aim of this process is to minimize the number of passes required to achieve the final plate thickness (Anderson and Mattson 2012). The total deformation possible in a single pass depends on the friction between the rollers and the material at the interface. Factors such as the rolling speed (V), material properties, and the surface finish of the rollers influence the friction, and lubricants are used to control the friction during the process. The maximum change in thickness that can be achieved in a single pass, ΔH, is given in (10)

$$ {\varDelta}{H}={\mu_{f}^{2}}R $$
(10)

where μf is the friction coefficient between the working rollers and the sheet metal, and R is the radius of the rollers. In this example, we treat μf as a random variable that follows a generalized Pareto distribution (GPD). The GPD of μf is depicted in Fig. 9 and its parameters are location μ = 0.282, scale σ = 0.103, and shape ξ = −0.325. We consider R = 40.6 cm.

Fig. 9
figure 9

Distribution of coefficient of friction

We intend to study the effect of extremes on uncertainty quantification and propagation in this example. For uncertainty quantification, we pretend that the distribution of μf is not known and only scarce samples are available. Using scarce samples of μf, we compute ΔH and predict its distribution using the C- and L-moment approaches. This process is repeated 100 times to account for sampling variability. One out of the 100 repetitions is presented in Fig. 10 for illustration purposes. The PDFs obtained for various sample sizes in the presence of extremes, based on the L-moment ratio diagram and the Pearson system, are presented in Fig. 10. The plots clearly indicate the superiority of the L-moment approach in obtaining approximations closest to the exact PDF. The DJS results are presented in Fig. 11. As observed in the earlier examples, the L-moment approach is superior to the C-moment approach in identifying the distributions. For uncertainty propagation, we predict the distribution of μf using the limited samples, draw \(10^{6}\) samples from the predicted distribution, propagate them through (10), and compute the corresponding ΔH. In this example, we know the explicit expression of the system through which to propagate the uncertainties. When the expression is not known (implicit), metamodels or response surfaces can be constructed and the uncertainties propagated through them.
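The quantification and propagation steps for this example can be sketched as follows, using scipy's parameterization of the GPD and the parameter values quoted above. For brevity, the sketch reuses the true GPD as a stand-in for the distribution that would be identified from the scarce samples.

```python
# Minimal sketch: uncertainty quantification and propagation for the flat rolling example.
import numpy as np
from scipy.stats import genpareto

R = 40.6                                             # roller radius in cm
# scipy's genpareto: c = shape (xi), loc = location (mu), scale = sigma
mu_f_dist = genpareto(c=-0.325, loc=0.282, scale=0.103)
rng = np.random.default_rng(7)

# Uncertainty quantification: scarce sample of the friction coefficient
scarce_mu_f = mu_f_dist.rvs(size=10, random_state=rng)
scarce_dH = scarce_mu_f**2 * R                       # equation (10)

# Uncertainty propagation: draw 10^6 samples from the (identified) distribution
mu_f_big = mu_f_dist.rvs(size=10**6, random_state=rng)
dH_big = mu_f_big**2 * R
print(np.percentile(dH_big, [5, 50, 95]))            # summary statistics of Delta H
```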

Fig. 10
figure 10

PDFs of maximum change in thickness, ΔH. (a) 10 samples (b) 25 samples (c) 50 samples (d) 100 samples (e) 10,000 samples

Fig. 11
figure 11

JS divergence - uncertainty quantification of ΔH of flat rolling sheet metal manufacturing process

Finally, the PDF of ΔH is obtained from these \(10^{6}\) samples and compared against the original PDF. The DJS of the L-moment approach shows a trend similar to the previous examples and is presented in Fig. 12. Similarly, the effect of two extremes present in the scarce data is studied. The results, presented in the Appendix, indicate the superiority of the L-moment approach.

Fig. 12
figure 12

JS divergence - uncertainty propagation of ΔH of flat rolling sheet metal manufacturing process

4.2.2 Design of speed reducer with 7 variables

Speed reducers are used in industrial and domestic machinery to safely and efficiently reduce the speed of electric motors. The speed reducer is also an integral part of the gearbox of a mechanical system; it adjusts the motor speed to ensure that a machine operates correctly (Lin et al. 2013; Shi et al. 2013; Chatterjee et al. 2017). Figure 13 presents the variables of the speed reducer, which are the face width (x1), module of teeth (x2), number of teeth on the pinion (x3), length of the first shaft between bearings (x4), length of the second shaft between bearings (x5), diameter of the first shaft (x6), and diameter of the second shaft (x7). The aim is to minimize the total weight of the speed reducer, which is computed as presented in (11). In the speed reducer design, all the design variables are considered to follow a normal distribution. The description of all variables is given in Table 3 and the coefficient of variation (CoV) is 10%.

$$ \begin{aligned} W & = 0.7854x_{1}{x_{2}^{2}}\times(3.3333{x_{3}^{2}} \\ & \quad+14.9334x_{3} -43.0934) \\ & \quad-1.508x_{1}({x_{6}^{2}}+{x_{7}^{2}}) +7.4777({x_{6}^{3}}+{x_{7}^{3}})\\ & \quad+0.7854(x_{4}{x_{6}^{2}}+x_{5}{x_{7}^{2}}) \end{aligned} $$
(11)
Fig. 13
figure 13

Speed reducer design

Table 3 Description of random variables

In this example, different numbers of extremes are incorporated at a time. The variables in which the extremes are generated are randomly picked. That is, when only one extreme is added, it is randomly added to one of the 7 variables. Similarly, extremes are added to 3 variables at a time, 5 variables at a time, and all the random variables at a time. Figure 14 presents the results of the distributions obtained using C- and L-moments when extremes are added to one random variable. It indicates that the results obtained using L-moments are better than those using C-moments for all sample sizes. Within the L-moment results, the median and variability of DJS are lower when the extremes are considered. Similar observations are made for the other cases, presented in Figs. 15, 16, and 17. In all cases, the L-moment approach is robust to the presence of outliers. The median and variability of DJS for the L-moment approach are lower than those of its C-moment counterpart in all cases. Within the L-moment approach, the sample with extremes sometimes performs better than the sample without extremes. In particular, the difference between the L- and C-moment approaches is large when extremes are considered and included in more variables. The results for two extremes in the scarce data are presented in the Appendix, and it is clear that the L-moment approach works better than the C-moment approach in all cases.
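A sketch of the sampling and extreme-injection scheme for this example is given below. The mean values are placeholders standing in for Table 3, and attaching the extreme to one randomly picked variable of an extra sample point is one illustrative realization of the procedure described above, not necessarily the authors' exact implementation.

```python
# Minimal sketch: propagate 7 normal variables (CoV = 10%) through the weight (11)
# with an extreme injected into one randomly picked variable.
import numpy as np

def weight(x):
    x1, x2, x3, x4, x5, x6, x7 = x
    return (0.7854*x1*x2**2*(3.3333*x3**2 + 14.9334*x3 - 43.0934)
            - 1.508*x1*(x6**2 + x7**2) + 7.4777*(x6**3 + x7**3)
            + 0.7854*(x4*x6**2 + x5*x7**2))

rng = np.random.default_rng(3)
mean = np.array([3.5, 0.7, 17.0, 7.3, 7.7, 3.35, 5.29])   # placeholder means (see Table 3)
std = 0.10 * mean                                          # CoV = 10%

X = rng.normal(mean, std, size=(25, 7))                    # scarce sample of the inputs
var = rng.integers(7)                                      # randomly picked variable
extreme = rng.normal(mean[var], std[var], size=10**6).max()
x_ext = mean.copy()
x_ext[var] = extreme                                       # extra point carrying the extreme
X = np.vstack([X, x_ext])

W = np.array([weight(x) for x in X])                       # scarce sample of the response
```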

Fig. 14
figure 14

JS divergence - extreme at 1 variable at a time

Fig. 15
figure 15

JS divergence - extreme at 3 variables at a time

Fig. 16
figure 16

JS divergence - extreme at 5 variables at a time

Fig. 17
figure 17

JS divergence - extreme at 7 variables at a time

In the examples discussed above, we have prior knowledge of the exact PDF to evaluate the performance of the two approaches. However, in reality, one cannot estimate DJS. Nevertheless, it is established that the L-moment approach does better than, or at least as well as, the C-moment approach, because it relies on linear combinations of order statistics as against the moment computation in the classical approach. The variability in the estimates could be computed using approaches such as the bootstrap.

4.2.3 Probabilistic fatigue life assessment

The stress-life (S-N) curve is used to describe fatigue life; it is measured experimentally but is usually dispersed along the life axis. Hence, it is preferable to characterize the fatigue life probabilistically. Often, a probabilistic stress-life (P-S-N) curve is used to characterize the uncertainty in the fatigue life. The P-S-N curve can be interpreted as a family of S-N curves in which each curve reflects a different percentage of survival of the specimens. Researchers usually model the dispersion by a log-normal distribution (Hu et al. 2014; Ramu and Arul 2016).

In the current work, P-S-N curves for ultra-high cycle fatigue (UHCF) vibration tests are derived from scarce samples (few experimental data points) of screws of strength class 8.8. The experimental data for the load cases 2.5 kN, 4 kN, and 7 kN are provided in Schäfer (2008), and the corresponding experimental data sizes are 101, 500, and 103, respectively. In Schäfer (2008), the experimental data is assumed to follow a normal distribution, but as per the Bayesian information criterion (BIC) and the Akaike information criterion (AIC), the experimental data in fact follows a lognormal distribution. Hence, assuming a distribution a priori might lead to erroneous results. Here, we pretend that only scarce samples are available and are interested in finding the different percentiles.

The sample sizes used in this study are 10, 15, 20, and 25. Samples are randomly drawn from the experimental data; the detailed sample selection process is described in the Appendix. The maximum value in the experimental data is considered as the extreme for each load case. Probability distributions are identified using the C- and L-moment approaches, and the 90% and 95% survival percentiles of the sample S-N data are computed. The survival percentiles obtained from the scarce samples with extremes are compared with the ones obtained from the population. The error metric used in this study is the ratio of the sample percentile (PS) to the population percentile (PP), RP = PS/PP. This process is repeated 100 times to account for sampling variability.
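A sketch of this percentile-ratio computation is given below, with an illustrative lognormal stand-in for the experimental population and a plain lognormal refit standing in for the C-/L-moment identification step.

```python
# Minimal sketch: ratio R_P = P_S / P_P of the 90% survival percentile (10th
# percentile of life) estimated from a scarce sample with an extreme.
import numpy as np
from scipy.stats import lognorm

rng = np.random.default_rng(11)
parent = lognorm(s=0.4, scale=2.0e6)            # placeholder for the experimental population

pop_90 = parent.ppf(0.10)                       # population 90% survival life, P_P
sample = parent.rvs(size=15, random_state=rng)  # scarce sample
extreme = parent.rvs(size=10**6, random_state=rng).max()
sample = np.append(sample, extreme)             # include the extreme

# stand-in for the identification step (C-/L-moment route in the paper)
s_hat, loc_hat, scale_hat = lognorm.fit(sample, floc=0)
samp_90 = lognorm(s=s_hat, loc=loc_hat, scale=scale_hat).ppf(0.10)   # P_S

print(samp_90 / pop_90)                         # R_P close to 1 means a good estimate
```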

Figure 18 presents the ratio of the 90% survival percentile obtained from the sample distribution to that of the parent distribution for different sample sizes. The L-moment approach works well when extremes are present in the scarce samples and gives a number of cycles closer to the population value for all percentiles. While designing with fatigue life as a design criterion, one needs to account for the variability in the life estimates and usually requires percentiles of life whose level depends on how critical the application is. Similar observations are made for the 95% survival percentile case and the results are presented in the Appendix. The median and the confidence bounds (25th and 75th percentiles of the survival estimates) for all cases are provided in the Appendix for both the 90% and 95% survival percentiles. The fatigue life predicted by the L-moment approach is closer to the population estimate than that of the C-moment approach.

Fig. 18
figure 18

Comparison of 90th survival % of sample distribution to parent distribution with different sample sizes

5 Conclusion

An L-moments based approach is proposed to identify the PDF and estimate probabilistic quantities for uncertainty quantification with scarce data and in the presence of extremes. It is found through numerical examples that L-moments are very robust to extremes, compared to C-moments, in estimating probabilistic quantities. The L-moment ratio diagram is adopted to identify the closest possible distribution. The identified distributions are compared to the original distribution using the JS divergence. The proposed approach is demonstrated on a suite of statistical distributions covering all types of tails and on three engineering examples. In all examples, the L-moment approach performs better than the C-moment approach. Though the variability of the estimates cannot be obtained in real-life problems, approaches such as the bootstrap can be used to obtain confidence bounds, which is a scope for future work. The scope of application of the proposed approach was discussed on several engineering examples covering quantile estimation and uncertainty propagation with a larger number of variables.