Abstract
Exploratory Projection Pursuit (EPP) methods have been developed thirty years ago in the context of exploratory analysis of large data sets. These methods consist in looking for low-dimensional projections that reveal some interesting structure existing in the data set but not visible in high dimension. Each projection is associated with a real valued index which optima correspond to valuable projections. Several EPP indices have been proposed in the statistics literature but the main problem lies in their optimization. In the present paper, we propose to apply Genetic Algorithms (GA) and recent Particle Swarm Optimization (PSO) algorithm to the optimization of several projection pursuit indices. We explain how the EPP methods can be implemented in order to become an efficient and powerful tool for the statistician. We illustrate our proposal on several simulated and real data sets.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
Achard, V., Landrevie, A., Fort, J.-C.: Anomalies detection in hyperspectral imagery using projection pursuit algorithm, image and signal processing for remote sensing X. In: Bruzzone, L. (ed.) Proceedings of the SPIE, vol. 5573, pp. 193–202 (2004)
Art, D., Gnanadesikan, R., Kettenring, J.R.: Data-based metrics for cluster analysis. Util. Math. 21A, 75–99 (1982)
Caussinus, H., Ruiz-Gazen, A.: Metrics for finding typical structures by means of principal component analysis. In: Escoufier, Y. et al. (eds.) Data Science and its Applications, pp. 177–192. Academic (1995)
Caussinus, H., Ruiz-Gazen, A.: Classification and generalized principal component analysis. In: Brito et al. (eds.) Selected Contributions in Data Analysis and Classification, pp. 539–548. Springer (2007)
Caussinus, H., Ruiz-Gazen, A.: Exploratory projection pursuit. In: Govaert, G. (ed.) Data Analysis (Digital Signal and Image Processing series). ISTE (2009)
Chiang, S., Chang, C.: Unsupervised target detection in hyperspectral images using projection pursuit. IEEE Trans. Geosci. Remote Sens. 39(7), 1380–1391 (2001)
Clerc, M.: Particle swarm optimization. International Scientific and Technical Encyclopaedia. Wiley, Hoboken (2006)
Cook, D., Buja, A., Cabrera, J.: Projection pursuit indices based on orthogonal function expansions. J. Comput. Graph. Stat. 2(3), 225–250 (1993)
Cook, D., Caragea, D., Honavar, H.: Visualization in classification problems. In: Antoch, J. (ed.) Proceeding in Computational Statistics (COMPSTAT 2004), pp. 799–806. Springer, Berlin (2004)
Cook, D., Swayne, D.F.: Interactive and Dynamic Graphics for Data Analysis. Springer, New York (2007)
Crawford, S.L.: Genetic optimization for exploratory projection pursuit. In: Computer Science and Statistics: Proc. 23rd Symp. Interface (1992)
Eiben, A.E., Smith, J.E.: Introduction to Evolutionary Computing. Springer (2003)
Fogel, L.J., Owens, A.J., Walsh, M.J.: Artificial Intelligence Through Simulated Evolution. Wiley, New York (1966)
Friedman, J.H.: Exploratory projection pursuit. J. Am. Stat. Assoc. 82(1), 249–266 (1987)
Friedman, J.H., Tukey, J.W.: A projection pursuit algorithm for exploratory data analysis. IEEE Trans. Comput. C-23, 881–889 (1974)
Goldberg, D.E.: Genetic Algorithms in Search, Optimization and Machine Learning, 412 p. Addison Wesley Longman (1989)
Glover, D.M., Hopke, P.K.: Exploration of multivariate chemical data by projection pursuit. Chemometr. Intell. Lab. Syst. 16, 45–59 (1992)
Guo, Q., Wu, W., Massart, D., Boucon, S., de Jong, S.: Sequential projection pursuit using genetic algorithms for data mining of analytical data. Anal. Chem. 72(13), 2846–2855 (2000)
Hansen, N.: The CMA Evolution Strategy. Website http://www.lri.fr/∼hansen/cmaesintro.html (1996)
Holland, J.H.: Adaptation in Natural and Artificial Systems. University of Michigan Press, Ann Harbor (1975)
Huber, P.J.: Projection pursuit. Ann. Stat. 13(2), 435–475 (1985)
Jones, M.C., Sibson, R.: What is projection pursuit? (with discussion). J. R. Stat. Soc. A 150, 1–36 (1987)
Kennedy, J., Eberhart, R.C.: Swarm Intelligence. Yuhui Shi (1995)
Klinke, S.: Data Structures in Computational Statistics. Physica-Verlag (1997)
Krink, T., Vesterstrom, J., Riget, J.: Particle swarm optimization with spatial particle extension. In: Proceedings of the Fourth Congress on Evolutionary Computation (CEC-2002), vol. 2, pp. 1474–1479. IEE, Piscataway (2002)
Kruskal, J.B.: Toward a practical method which helps uncover the structure of a set of multivariate observations by finding the linear transformation which optimizes a new index of condensation. Statistical Computing, pp. 427–440. Academic, New York (1969)
Lee, E., Klinke, S., Cook, D., Thomas, L.: Projection pursuit for exploratory supervised classification. J. Comput. Graph. Stat. 14(4), 831–846 (2005)
Li, X.: A multimodal particle swarm optimizer based on fitness euclidean-distance ratio. In: Thierens, D. (ed.) Proceeding of Genetic and Evolutionary Computation Conference 2007 (GECCO’07), pp. 78–85. ACM (2007)
Lovbjerg, M., Krink, T.: Extending particle swarms with self-organized criticality. In: Proceedings of the Fourth Congress on Evolutionary Computation (CEC-2002), vol. 2, pp. 1588–1593. IEE, Piscataway (2002)
Lubischew, A.A.: On the use of discriminant functions in Taxonomy. Biometrics 18, 455–477 (1962)
Malpica, J.A., Rejas, J.G., Alonso, M.C.: A projection pursuit algorithms for anomaly detection in hyperspectral imagery. Pattern Recogn. 41, 3313–3327 (2008)
Martinez, W., Martinez, A.: Computational Statistics Handbook with Matlab. CRC (Taylor and Francis Group) (2001)
Nason, G.P.: Design and choice of projection indices. Ph.D. dissertation, University of Bath (1992)
Nason, G.P.: Three-dimensional projection pursuit. J. R. Stat. Soc. C 44, 411–430 (1995)
Peña, D., Prieto, F.: Cluster identification using projections. J. Am. Stat. Assoc. 96(456), 1433–1445 (2001)
Posse, C.: Tools for two-dimensional exploratory projection pursuit. J. Comput. Graph. Stat. 4(2), 83–100 (1995)
Rechenberg, R.I.: Evolutionsstrategie: Optimierung Technischer Systeme Nach Prinzipien Der Biologischen Evolution. Frommann-Holzboog, Stuttgart (1973)
Riani, M., Atkinson, A.C., Cerioli, A.: Finding an unknown number of multivariate outliers. J. R. Stat. Soc. B 71(2), 447–466 (2009)
Schwefel, H.P.: Numerical Optimization of Computer Models. Wiley, New York (1981)
Silverman, B.W.: Density Estimation for Statistics and Data Analysis. Chapman and Hall, New York (1986)
Storn, R., Price, K.: Differential evolution a simple and efficient heuristic for global optimization over continuous space. J. Glob. Optim. 11(4), 341–359 (1997)
Sun, J.: Some practical aspects of exploratory projection pursuit. SIAM J. Sci. Comput. 14(1), 68–80 (1993)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Berro, A., Larabi Marie-Sainte, S. & Ruiz-Gazen, A. Genetic algorithms and particle swarm optimization for exploratory projection pursuit. Ann Math Artif Intell 60, 153–178 (2010). https://doi.org/10.1007/s10472-010-9211-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10472-010-9211-0