Abstract
When dealing with classification and regression problems, there is a strong need for high-quality attributes. This is a capital issue not only in financial problems, but in many Data Mining domains. Constructive Induction methods help to overcome this problem by mapping the original representation into a new one, where prediction becomes easier. In this work we present GPPE: a GP-based method that projects data from an original data space into another one where data approaches linear behavior (linear separability or linear regression). Also, GPPE is able to reduce the dimensionality of the problem by recombining related attributes and discarding irrelevant ones. We have applied GPPE to two financial domains: Bankruptcy prediction and IPO Underpricing prediction. In both cases GPPE automatically generated a new data representation that obtained competitive prediction rates and drastically reduced the dimensionality of the problem.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
Shafti LS, Pérez E (2005) Constructive induction and genetic algorithms for learning concepts with complex interaction. In: Beyer H-G, O’Reilly U-M (eds) Genetic and evolutionary computation conference, GECCO 2005, proceedings, Washington, DC, USA, 25–29 June, 2005. ACM, New York, pp 1811–1818
Liu H, Motada H (eds) (1998) Feature extraction, construction and selection: a data mining perspective. Kluwer Academic, Norwell
Jolliffe IT (1986) Principal component analysis. Springer, Berlin
Altman EI (1984) Business failure prediction models: an international survey. J Bank Account Financ 8:171
Ritter JR, Welch I (2002) A review of ipo activity, pricing, and allocations. J Financ 57:1795–1828
Quintana D, Luque C, Isasi P (2005) Evolutionary rule-based system for ipo underpricing prediction. In: Proceedings of the genetic and evolutionary computation conference GECO 2005, vol 1
Koza JR (1992) Genetic programming: on the programming of computers by means of natural selection. MIT, Cambridge
Koza JR (1994) Genetic programming II: automatic discovery of reusable programs. MIT, Cambridge
Zongker D, Punch B (1998) lil-gp 1.1 www homepage. http://garage.cps.msu.edu/software/lil-gp/
Tam KY, Kiang MY (1992) Managerial applications of neural networks: the case bank failure predictions. Manag Sci 38:926–947
Han I, Jo H, Shin KS (1997) The hybrid systems for credit rating. J Korean Oper Res Manag Sci Soc 22(3):163–173
Fletcher D, Goss E (1993) Forecasting with neural networks: An application using bankruptcy data. Inf Man 24(3):159–167
Vieira A, Bas N (2003) A training algorithm for classification of high-dimensional data. Neurocomputing 50:461–472
Jain BA, Nag BN (1995) Artificial neural network models for pricing initial public offerings. Decis Sci 26:283–299
Vieira A, Ribeiro B, Neves JC (2005) A method to improve generalization of neural networks: application to the problem of bankruptcy prediction. In: Proceedings of 7th international conference on adaptive and natural, computing algorithms. ICANNGA 2005. Springer series on adaptive and natural computing algorithms, vol 1. Springer, Berlin, p 417
Quintana D, Isasi P (2005) Estructura de colocación y rendimiento inicial de salidas a bolsa: tecnológicas frente a no tecnológicas. Actual Contab Faces 11:80–86
Otero FEB, Silva MMS, Freitas AA, Nievola JC (2003) Genetic programming for attribute construction in data mining. In: Ryan C, Soule T, Keijzer M, Tsang E, Poli R, Costa E (eds) Genetic programming, proceedings of EuroGP’2003, Essex, 14–16 April 2003. Lecture notes in computer sciences, vol 2610. Springer, Berlin, pp 389–398
Krawiec K (2002) Genetic programming-based construction of features for machine learning and knowledge discovery tasks. Genet Program Evol Mach 3(4):329–343
Kuscu I (1999) A genetic constructive induction model. In: 1999 congress on evolutionary computation. IEEE Service Center, Piscataway, pp 212–217
Hu Y-J (1998) A genetic programming approach to constructive induction. In: Koza JR, Banzhaf W, Chellapilla K, Deb K, Dorigo M, Fogel DB, Garzon MH, Goldberg DE, Iba H, Riolo R (eds) Genetic programming 1998: proceedings of the third annual conference, University of Wisconsin, Madison, Wisconsin, USA, 22–25 July 1998. Kaufmann, Los Altos, pp 146–151
Howley T, Madden MG (2005) The genetic kernel support vector machine: description and evaluation. Artif Intel Rev 24(3–4):379–395
Eads D, Hill D, Davis S, Perkins S, Ma J, Porter R, Theiler J (2002) Genetic algorithms and support vector machines for time series classification. In: Proceedings SPIE 4787 conference on visualization and data analysis, pp 74–85
Szymanski JJ, Brumby SP, Pope P, Eads D, Esch-Mosher D, Galassi M, Harvey NR, McCulloch HDW, Perkins SJ, Porter R, Theiler J, Young AC, Bloch JJ, David N (2002) Feature extraction from multiple data sources using genetic programming. In: Shen SS, Lewis PE (eds) Algorithms and technologies for multispectral, hyperspectral, and ultraspectral imagery VIII. SPIE, vol 4725, pp 338–345
Harvey NR, Theiler J, Brumby SP, Perkins S, Szymanski JJ, Bloch JJ, Porter RB, Galassi M, Young AC (2002) Comparison of GENIE and conventional supervised classifiers for multispectral image feature extraction. IEEE Trans Geosci Remote Sens 40(2):393–404
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Estébanez, C., Valls, J.M. & Aler, R. GPPE: a method to generate ad-hoc feature extractors for prediction in financial domains. Appl Intell 29, 174–185 (2008). https://doi.org/10.1007/s10489-007-0048-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-007-0048-0