Statistical Analysis of Various Optimal Latin Hypercube Designs

Jalal Uddin Jamali, A. R. M.; Asadul Alam, Md.; Aziz, Abdul

doi:10.1007/978-981-16-1919-9_13

A. R. M. Jalal Uddin Jamali³,
Md. Asadul Alam⁴ &
Abdul Aziz⁵

541 Accesses
1 Citations

Abstract

Among several Design of Experiments (DoEs), Latin Hypercube Design (LHD) is one of the most frequently used methods in the field of physical experiments and in the field of computer simulations to find out the behavior of response surface of the surrogate model with respect to design points. A good experimental design should have three important characteristics, namely (i) non-collapsing, (ii) space-filling and (iii) orthogonal properties. Though inherently LHD preserves non-collapsing property, but randomly generated LHDs have poor space-filling in terms of minimum pair-wise distance. In order to ensure the last two properties in LHD, researchers are frequently looking for finding optimal LHD in the sense of space-filling and orthogonal properties. Moreover, researchers are frequently encountered the question, which distance measure is the best in the case of optimal designs? In the literature, several types of optimal LHDs are available such as Maximin LHD, Orthogonal LHD, Uniform LHD, etc. On the other hand, two distance measures namely Euclidean and Manhattan distance measures are used frequently to find optimal DoEs. But which one of the two distance measures is better, is still unknown. In this article, intensive statistical analysis has been carried out on numerical instances to explore the deep scenario of each optimal LHD. The main goal of this research is to find out a scenario of the well-known optimal designs from statistical point of view. From this elementary experimental study, it seems to us that in the sense of space-filling, Euclidean distance measure-based Maximin LHD is the best. But if one needs space-filling along with better orthogonal property, then multi-objective (Maximin with approximate orthogonal)-based optimal LHD is relatively better than Maximin LHD.

Access provided by Autonomous University of Puebla. Download chapter PDF

Heuristic Search Algorithms for Constructing Optimal Latin Hypercube Designs

A Novel Flexible Experiment Design Method

Isovolumetric adaptations to space-filling design of experiments

Article Open access 14 June 2022

Keywords

1 Introduction

Computer simulation-based DoEs are used in a wide range of applications to learn about the effect of input variables x on a response of interest y. Fang et al. (2006) said, as simulation programs are usually deterministic so the output of a computer experiment is not subject to random variations, which makes the design of computer experiments different from that of physical experiments. Many simulation models involve several hundred factors or even more. It is desirable to avoid replicates when projecting the design onto a subset of factors. This is because a few out of the numerous factors in the system usually dominate the performance of the product. Thus, a good model can be fitted using only these few important factors. Therefore, when we project the design on to these factors, replication is not required. As it is recognized by several authors, the choice of the design points for computer experiments should at least fulfill two requirements namely non-collapsing and space-filling (Johnson et al. 1990; Morris and Mitchell 1995). In addition, in order to find out the individual effect of factors on response surface, researchers hunt DoEs with orthogonal property among the factors. LHD inherently preserves non-collapsing property, i.e. factors are evenly spread whenever projected on any coordinate. McKay et al. (1979) first introduced LHD. Later, authors (Morris and Mitchel 1995; Grosso et al. 2009) defined LHD which are a bit different. In a LHD, there are N points (design points) and each point has k distinct coordinates (parameters/factors). The points are placed in such a way that they are uniformly distributed when projected on every single coordinate (factor) axis. In a LHD, the range of each parameter/factor is normalized to the interval [1, N] (actually authors Grosso et al. 2009) considered the interval [0, N-1] but for the symmetry of other designs considered, here we have considered the range [1, N]). Mathematically, let us consider a set of N points in a uniform k-dimensional grid {1, 2, …, N}^k:

$$X = \left( \begin{gathered} \user2{X}_{1} \hfill \\ \vdots \hfill \\ \user2{X}_{\user2{N}} \hfill \\ \end{gathered} \right) = \left( {\begin{array}{*{20}c} {x_{{11}} } & \ldots & {x_{{1k}} } \\ \vdots & \ldots & \vdots \\ {x_{{N1}} } & \ldots & {x_{{Nk}} } \\ \end{array} } \right)\forall x_{{ij}} = \left\{ {1,2, \ldots ,N} \right\}.$$

Then X is a LHD iff $x_{{pj}} \, \ne \,x_{{qj}} \,\forall \,j,p,q\, \in \,\left\{ {1,2, \ldots ,N} \right\}\,\exists \,p \ne q$, i.e. each column has no duplicate entries. Note that the number of possible LHDs is huge: there are (N!)^k possible LHDs.

It is known that randomly generated LHD has poor space-filling property. Moreover, it bears poor orthogonal property. Therefore, researchers search for optimal LHDs in the sense of space-filling, orthogonal, uniform etc., Jin et al. (2005), and Johnson et al. (1990) proposed Maximin (maximize the minimum inter-site pair-wise distance) experimental design but it was not LHD. They measured the space-filling on the basis of minimum inter-site pair-wise distance “D₁” among all pair-wise distances of design points. That is if “D₁” value of any LHD is relatively smaller then we say that the LHD is not good enough in the sense of space-filling. Morris and Mitchell (1995) proposed ${\mathrm{\Phi }}_{p}$ optimal criterion and later Grosso et al. (2009) modified it for easy computation, which is given below:

$$\Phi _{p} \left( X \right) = \sum\limits_{{i = 1}}^{N} {\sum\limits_{{j = i + 1}}^{N} {\left[ {\frac{1}{{d_{{ij}}^{p} }}} \right]} } ^{{\frac{1}{p}}}$$

(1)

where d_ij = d(x_i, x_j) be the distance between points x_i and x_j and p is a positive integer.

Audze and Eglais (1977) proposed a new optimal criterion that is similar to the potential energy function (Potential (U)) and which is defined as below:

$$Min\,\,U = \min \sum\limits_{{i = 1}}^{N} {\sum\limits_{{j = i + 1}}^{N} {\frac{1}{{d_{{ij}}^{2} }}} } .$$

(2)

It is known that multicollinearity (orthogonal property) measures the linear dependency among the factors of the design points. Multicollinearity may be measured by the partial pair-wise correlations among the factors. Several ways are available in the literature to measure the pair-wise correlations. Here, we consider the measure of average pair-wise correlations and maximum correlation, respectively, as below:

$${\rho }^{2}=\frac{\sum _{i=2}^{k}\sum _{j=1}^{i-1}{\rho }_{ij}^{2}}{k(k-1)/2}$$

(3)

and

$$\rho _{{\max }} = \mathop {\max }\limits_{{1 \le i,j \le k}} \rho _{{ij}}$$

(4)

Here ρ_ij denotes the simple product–moment correlation between the factors i and j; k denotes the number of factors in the design considered. Butler (2001) found optimal LHD on the basis of this criterion. On the other hand, Joseph and Hung (2008) proposed the combination of the above two criteria. Moreover, Fang et al. (2000) defined a uniform design that allocates experimental points uniformly spread over the domain. It is noted that uniform design does not require orthogonal (multicollinearity) property. It considers projective uniformity over all sub-dimensions. But, in Fang et al. (2000), authors classified uniform design as space-filling design. A good literature review is available in Viana et al. (2014). It is worthwhile to mention here that Jamali et al. (2019) recently established some relations and bonds between Euclidian and Manhattan distance measures regarding LHD.

2 Experimental Result and Discussion

For the experimental study, we have considered a typical LHD of four factors (k = 4) with nine (N = 9) design points, which is optimized using several optimal criteria. The optimized LHDs are shown in Table 1. In Table 1, MLH-SA denotes Maximin LHD regarding Manhattan distance measure (L¹) obtained by Simulated Annealing (SA) approach (Husslage et al. 2011). OMLH-MSA denotes Orthogonal Maximin LHD regarding L¹ distance measure obtained by Modified Simulated Annealing (MSA) (Morris and Mitchell 1995), OLH-Y denotes Orthogonal LHD regarding L¹ distance measure obtained by Ye (1998), ULH-F denotes uniform LHD regarding L¹ distance measure obtained by Fang et al. (2000) and MLH-ILS indicates the Maximin LHD regarding Euclidean distance measure (L²) obtained by Iterated Local Search (ILS) approach (Grosso et al. 2009).

Table 1 Some optimal LHDs for (k,N) = (4,9)

Full size table

At first, we have extracted all important statistical characteristics of each optimal design given in Table 1. Then all the findings are shown in Table 2. The symbols SD, CoV, β₁, β₂, Min Distance, Max Distance denote Standard Deviation, Coefficient of Variation (%), Moment Coefficient of Skewness, Moment Coefficient of Kurtosis (${\alpha }_{1}+3$), minimum pair-wise inter-site distance among all design points of the LHD and maximum pair-wise inter-site distance among all design points of the LHD respectively. Besides, D₁^L1 and D₁^L2 denote the minimum pair-wise inter-site distance of design points measured in L¹ (Manhattan) distance measure and in L² (Euclidean) distance measure respectively. It is mentioned earlier that space-filling is defined on only D₁ value or $\mathrm{\Phi }$ value. Similarly, orthogonal LHD is defined by multi-collinearity and so on. But the insight views of several optimized LHDs are still unknown. Here, we have analyzed the optimal LHDs statistically and on distance measures. In addition, we have compared optimized LHDs in those issues.

Table 2 Statistical analysis among several optimal LHDs for (k, N) = (4, 9)

Full size table

We have also detected all pair-wise inter-site Euclidean distances (actually square of Euclidean distances) “D_i,j²” values for all pair of design points for each optimal LHDs. Now for each optimal LHD, the “D_i,j²” are plotted corresponding to each pair of design points (I = {1,2,..., 36}), which are shown in Fig. 1. It is noted that since we have taken 9 design points, we have obtained [(N (N – 1)/2] = 36 number of pair-wise distances. It is worthwhile to mention here that the sum of square of all pair-wise (Euclidian) distances,${\left(\mathrm{d}\left({\mathrm{x}}_{\mathrm{i}},{\mathrm{x}}_{\mathrm{j}}\right)\right)}^{2}$, of a fixed LHD is identical (=2160) and so average of the square distances of all LHDs (for k = 4, N = 9) are also identical. It is noted that there are (9!)⁴ such LHDs and for each such LHD the value of $\sum {\left(\mathrm{d}\left({\mathrm{x}}_{\mathrm{i}},{\mathrm{x}}_{\mathrm{j}}\right)\right)}^{2}$ is equal to 2160.

Now, we will discuss the comparison between two Maximin LHDs namely MLH-SA and MLH-ILS in which both are optimized by using same single optimal criterion called Maximin optimal criterion. But distance measures namely L¹ and L² are used in MLH-SA and MLH-ILS, respectively, which are different. In addition, different optimal heuristic algorithms namely SA and ILS approaches are applied to optimize the MLS-SA and MLH-ILS designs, respectively. It is observed in both MLH-SA and MLH-ILS that the values of SD, CoV, ${\beta }_{1,}$ ${\beta }_{2},$ $\rho$, ${\rho }_{Max}$, ${\mathrm{\Phi }}_{p}^{L2}$ and Potential (U) are not significantly different. It is also observed that the difference of D₁^L1 of the two DoEs is not significant, though MLH-SA searched optimal D₁^L1 value whereas MLH-ILS explored for D₁^L2 value rather than D₁^L1.

Now we will talk about the comparison between Maximin LHD and Uniform LHD mainly MLH-ILS and ULH-F. It is shown in Table 2 that uniform optimality criterion-based LHD is flatter and un-skewed compared to all other LHDs considered here. Moreover, the coefficient of correlation of ULH-F is relatively smaller than that of MLH-ILS. It is also observed that SD, CoV, Max Distance and Range values are not significantly different from the two optimal LHDs namely MLH-ILS and ULH-F. It is also clear from Table 2 that the D₁^L1 values of both DoEs are identical. On the other hand, for ${\mathrm{\Phi }}_{p}^{L2}$, Potential (U) and D₁^L2 values, MLH-ILS design is significantly better than those of ULH-F design. It is worthwhile to mention here that the D₁^L2 value of ULH-F is worst compared to other optimal designs considered here.

We will also discuss orthogonal-based DoEs mainly of OLH- Y. In addition, comparison will be drawn with Maximin-based LHD mainly MLH-ILS. Since the initial criterion of OLH-Y is $\rho =0$, so obviously the factors of the DoE are all uncorrelated. It is observed in Table 2 that the SD, CoV, ${\beta }_{1}$ and D₁^L1 values of OLH-Y are not significantly different with that of MLH-ILS. But it is noticed that for ${\beta }_{2}$, Max Distance, Range, ${\mathrm{\Phi }}_{p}^{L2}$, Potential (U) and D₁^L2 values, OLH-Y is significantly worse compared with the values of MLH-ILS. Moreover, though in OLH-Y, L¹ distance measure is used whereas, in MLH-ILS, L² distance measure is used but both D₁^L1 values are identical. On the other hand, D₁^L2 value of MLH-ILS is significantly better compared with not only OLH-Y design but also all other designs considered here.

Now we will discuss the properties of OMLH-MSA in which multi-objective function namely Maximin and multi-collinearity was considered. It is observed that the SD, the CoV, the ${\beta }_{1}$ and Max Distance values of OMLH-MSA are the best compared with all other optimal designs. But these values are not significantly different from those values of MLH-ILS.

It is observed in Table 2 that the average correlation as well as maximum correlation of OMLH-MSA is negligible and smaller than those values of MLH-ILS. But it is also noticed in Table 2 that the average correlation as well as maximum correlation of MLH-ILS is also not significantly larger. In addition, it is also observed in Table 2 that D₁^L1 value of OMLH-MSA is not significantly smaller than that of other optimal LHDs. On the other hand, D₁^L2, ${\mathrm{\Phi }}_{p}^{L2}$ and potential (U) values of MLH-ILS are the best compared with all other optimal LHDs. Moreover, these values are significantly better compared with OMLH-MSA. It is also observed in Fig. 1 that according to space-filling criterion MLH-ILS is relatively looking more homogeneous over the design space compared with other designs.

Now we will discuss the two distance measures considered here namely Euclidean and Manhattan. It is noticed in Table 2 that except MLH-ILS all other designs are optimized by using Manhattan distance measure whereas MLH-ILS is optimized by considering Euclidean distance measure. It is observed in Table 2 that D₁^L1 value of anyone LHD compared with the remaining other LHDs is not significantly better. On the other hand, the D₁^L2 value of MLH-ILS is the best and significantly larger than any other one.

3 Conclusion

When LHD is considered as an experimental design, the design inherently preserves good non-collapsing property. On the other hand, when Maximin optimal criterion is considered, then the DoE should have good space-filling property. Again, when we use multicollinearity as an optimal criterion, then the factors of the DoE are uncorrelated or approximately uncorrelated. Similarly, whenever the uniform optimal design is considered then design becomes flatter. Besides optimal criteria, another question is frequently raised; which distance measure is relatively better in case of an optimal design? In this elementary study, the main objective is to find out the schematic view of several well-known optimal designs from the statistical point of view. A rigorous statistical analysis has been carried out over some well-known optimal LHDs, which are optimized with different optimal criteria. From this elementary analysis, it seems to us that, if we prefer a DoE containing all three properties (good) then multi-objective-based LHD namely OMLH-MSA is relatively better. But if one considers only space-filling LHD, then MLH-ILS is the best in which Euclidean distance measure is considered. Similarly, if one needs the design with uncorrelated factors along with non-collapsing property then it is obvious that OLH-Y is the best. In addition, it seems to us that L² is relatively better to find out Maximin LHD. Though multi-objective-based optimal LHD is relatively better, it is obviously time-consuming. Moreover, in that optimal DoE, none of the individual optimal criterion (of multi-objective function) is found good enough. It is worthwhile to mention here that ILS is preferable to find Maximin LHD with cheaper computational time. Therefore, according to us, one may first find out an optimal LHD by considering Maximin optimal criterion with ILS approach to detect the important factors on response surface by using L² distance measure. Then orthogonal optimal criteria may be applied on the reduced DoE (eliminated unimportant factors) to get an uncorrelated-based DoE for further experiments. But it is worthwhile to mention here that, in this experimental research, only some optimal LHDs of (k, N) = (4, 9) are considered. At the same time, we would like to note here that in the existing literature only four optimal LHDs of (k, N) = (4, 9) viz MLH-SA, OMLH-MSA, OLH-Y, ULH-F, MLH-ILS are available, which means, some other instances that are optimized with all these five approaches are not yet available in the literature. So, further experiments (with different instances) should be carried out to draw a more concrete conclusion.

References

Audze, P., & Eglais, V. (1977). New approach for planning out of experiments. Problems of Dynamics and Strength, 35, 104–107.
Google Scholar
Butler, N. A. (2001). Optimal and orthogonal Latin hypercube designs for computer experiments. Biometrika, 88(3), 847–857.
Article MathSciNet Google Scholar
Fang, K. T., Lin, D. K. J., Winker, P., & Zhang, Y. (2000). Uniform design: Theory and application. Technimetrics, 42(3), 237–248.
Article MathSciNet Google Scholar
Fang, K. T., Li, R., & Sudjianto, A. (2006). Design and modeling for computer experiments. CRC Press.
MATH Google Scholar
Felipe, A. C. V., Simpson, T. W., Balabanov, V., & Toropov, V. (2014). Metamodeling in multidisciplinary design optimization: How far have we really come? AIAA (American Institute of Aeronautics and Astronautics) Journal, 52(4), 670–690.
Google Scholar
Grosso, A., Jamali, A. R. J. U., & Locatelli, M. (2009). Finding maximin latin hypercube designs by I. L. S. Heuristics . European Journal of Operation Research, 197, 541–547.
Article Google Scholar
Husslage, B. G. M., Rennen, G., van Dam, E. R., & den Hertog, D. (2011). Space-filling Latin hypercube designs for computer experiments. Journal of Optimization and Engineering, 12, 611–630. https://doi.org/10.1007/s11081-010-9129-8
Jamali, A. R. M. J. U., Alam, Md., & A. . (2019). Approximate relations between Manhattan and Euclidean distance regarding Latin hypercube experimental design . Journal of Physics: Conference Series IOP, 1366, 012–030. https://doi.org/10.1088/1742-6596/1366/1/012030
Article Google Scholar
Jin, R., Chen, W., & Sudjianto, A. (2005). An efficient algorithm for constructing optimal design of computer experiments. Journal of Statistical Planning and Inference, 134(1), 268–287.
Article MathSciNet Google Scholar
Johnson, M. E., Moore, L. M., & Ylvisaker, D. (1990). Minimax and maximin distance designs. Journal of Statistical Planning and Inference, 26, 131–148.
Article MathSciNet Google Scholar
Joseph, V. R., & Hung, Y. (2008). Orthogonal-maximin latin hypercube designs. Statistica Sinica, 18, 171–186.
MathSciNet MATH Google Scholar
McKay, M. D., Beckman, R. J., & Conover, W. J. (1979). A comparison of three methods for selecting values of input variables in the analysis of output from a computer code. Techno Metrics, 21, 239–245.
MathSciNet MATH Google Scholar
Morris, M. D., & Mitchell, T. J. (1995). Exploratory designs for computer experiments. Journal of Statistical Planning and Inference, 43, 381–402.
Article Google Scholar
Ye, K. Q. (1998). Orthogonal column Latin hypercube and their application in computer experiments. Journal of the American Statistical Association, 3, 1430–1439.
Article MathSciNet Google Scholar

Download references

Acknowledgements

This research work is supported by the Special Grant of Ministry of Science and Technology and Khulna University of Engineering and Technology, Khulna, Bangladesh.

Author information

Authors and Affiliations

Department of Mathematics, Khulna University of Engineering & Technology, Khulna, Bangladesh
A. R. M. Jalal Uddin Jamali
Department of Mathematics, Daud Public College, Jessore, Bangladesh
Md. Asadul Alam
Department of CSE, Khulna University of Engineering & Technology, Khulna, Bangladesh
Abdul Aziz

Authors

A. R. M. Jalal Uddin Jamali
View author publications
You can also search for this author in PubMed Google Scholar
Md. Asadul Alam
View author publications
You can also search for this author in PubMed Google Scholar
Abdul Aziz
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to A. R. M. Jalal Uddin Jamali .

Editor information

Editors and Affiliations

Indian Statistical Institute, Kolkata, West Bengal, India
Bikas Kumar Sinha
Department of Statistics, University of Rajshahi, Rajshahi, Bangladesh
Md. Nurul Haque Mollah

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Jalal Uddin Jamali, A.R.M., Asadul Alam, M., Aziz, A. (2021). Statistical Analysis of Various Optimal Latin Hypercube Designs. In: Sinha, B.K., Mollah, M.N.H. (eds) Data Science and SDGs. Springer, Singapore. https://doi.org/10.1007/978-981-16-1919-9_13

Download citation

DOI: https://doi.org/10.1007/978-981-16-1919-9_13
Published: 14 August 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-1918-2
Online ISBN: 978-981-16-1919-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics

Statistical Analysis of Various Optimal Latin Hypercube Designs

Abstract

Similar content being viewed by others

Heuristic Search Algorithms for Constructing Optimal Latin Hypercube Designs

A Novel Flexible Experiment Design Method

Isovolumetric adaptations to space-filling design of experiments

Keywords

1 Introduction

2 Experimental Result and Discussion

3 Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Statistical Analysis of Various Optimal Latin Hypercube Designs

Abstract

Similar content being viewed by others

Heuristic Search Algorithms for Constructing Optimal Latin Hypercube Designs

A Novel Flexible Experiment Design Method

Isovolumetric adaptations to space-filling design of experiments

Keywords

1 Introduction

2 Experimental Result and Discussion

3 Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation