Abstract
Genetic programming represents a flexible and powerful evolutionary technique in machine learning. The use of genetic programming for rule induction has generated interesting results in classification problems. This paper proposes an evolutionary approach for logical rule induction, which is applied to clinical data. Since logical rules disclose knowledge from the analyzed data, we use such a knowledge to filter features from the target dataset. The results reached by the used dataset have been very promising when used in classification tasks and compared with other methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bandyopadhyay, S., Pal, S.K.: Classification and Learning Using Genetic Algorithms: Applications in Bioinformatics and Web Intelligence. Natural Computing Series. Springer, Heidelberg (2007). https://doi.org/10.1007/3-540-49607-6
Bonelli, P., Parodi, A.: An efficient classifier system and its experimental comparison with two representative learning methods on three medical domains. In: Proceedings of the 4th International Conference on Genetic Algorithms (ICGA), pp. 288–295 (1991)
Hong, J.H., Cho, S.B.: The classification of cancer based on DNA microarray data that uses diverse ensemble genetic programming. Artif. Intell. Med. 36, 43–58 (2006)
Kumar, T.P., Iba, H.: Prediction of cancer class with majority voting genetic programming classifier using gene expression data. IEEE/ACM Trans. Comput. Biol. Bioinf. 6, 353–367 (2009)
Kumar, R., Verma, R.: Classification rule discovery for diabetes patients by using genetic programming. Int. J. Soft Comput. Eng. (IJSCE) 2, 183–185 (2012)
Larraaga, P., et al.: Machine learning in bioinformatics. Briefings Bioinf. 7, 86–112 (2006)
Liu, K.H., Xu, C.G.: A genetic programming-based approach to the classification of multiclass microarray datasets. Bioinformatics 25, 331–337 (2009)
Maulik, U., Bandyopadhyay, S., Mukhopadhyay, A.: Multiobjective Genetic Algorithms for Clustering: Applications in Data Mining and Bioinformatics. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-16615-0
Pea-Reyes, C.A., Sipper, M.: Evolutionary computation in medicine: an overview. Artif. Intell. Med. 19, 1–23 (2000)
Podgorelec, V., Kokol, P., Stiglic, M.M., Hericko, M., Rozrnan, I.: Knowledge discovery with classification rules in a cardiovascular dataset. Comput. Methods Program. Biomed. 1, 539–549 (2005)
Soni, J., Ansari, U., Sharma, D., Soni, S.: Intelligent and effective heart disease prediction system using weighted associative classifiers. Int. J. Comput. Sci. Eng. (IJCSE) 3, 2385–2392 (2011)
Tsakonas, A., Dounias, G., Jantzen, J., Axer, H., Bjerregaard, B., von Keyserlingk, D.G.: Evolving rule-based systems in two medical domains using genetic programming. Artif. Intell. Med. 32, 195–216 (2004)
Vargas, C.M.B., Chidambaram, C., Hembecker, F., Silvério, H.L.: A comparative study of machine learning and evolutionary computation approaches for protein secondary structure classification. In: Computational Biology and Applied Bioinformatics, pp. 239–258. InTech (2011)
Wolberg, W.H., Mangasarian, O.L.: Multisurface method of pattern separation for medical diagnosis applied to breast cytology. Proc. Natl. Acad. Sci. USA 87, 9193–9196 (1990)
Lucas, P.: Analysis of notions of diagnosis. Artif. Intell. 12(105), 295–343 (1998)
Lucas, P.: Prognostic methods in medicine. Artif. Intell. 15, 105–119 (1999)
Ramos, J., Castellanos-Garzón, J.A., González-Briones, A., de Paz, J.F., Corchado, J.M.: An agent-based clustering approach for gene selection in gene expression microarray. Interdiscip. Sci. Comput. Life Sci. 9, 1–13 (2017)
Castellanos-Garzón, J.A., Ramos, J., González-Briones, A., de Paz, J.F.: A clustering-based method for gene selection to classify tissue samples in lung cancer. In: Saberi Mohamad, M., Rocha, M., Fdez-Riverola, F., Domínguez Mayo, F., De Paz, J. (eds.) PACBB 2016. AISC, vol. 477, pp. 99–107. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-40126-3_11
Castellanos-Garzón, J.A., Ramos, J.: A gene selection approach based on clustering for classification tasks in colon cancer. ADCAIJ Adv. Distrib. Comput. Artif. Intell. J. 4(3), 1–10 (2015)
González-Briones, A., Ramos, J., De Paz, J.F.: A drug identification system for intoxicated drivers based on a systematic review. ADCAIJ Adv. Distrib. Comput. Artif. Intell. J. 4(4), 83–101 (2015)
Espejo, P.G., Ventura, S., Herrera, F.: A survey on the application of genetic programming to classification. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 40(2), 121–144 (2010)
Pappa, G.L., Freitas, A.A.: Evolving rule induction algorithms with multi-objective grammar-based genetic programming. Knowl. Inf. Syst. 19(3), 283–309 (2009)
Alcalá-Fdez, J., et al.: KEEL: a software tool to assess evolutionary algorithms for data mining problems. Soft. Comput. 13, 307–318 (2009)
Fernández, A., García, S., Luengo, J., Bernadó-Mansilla, E., Herrera, F.: Genetics-based machine learning for rule induction: state of the art, taxonomy, and comparative study. IEEE Trans. Evol. Comput. 14(6), 913–941 (2010)
Oyebode, O.K., Adeyemo, J.A.: Genetic programming: principles, applications and opportunities for hydrological modelling. World Acad. Sci. Eng. Technol. Int. J. Environ. Ecol. Geol. Min. Eng. 8, 310–316 (2014)
Freitas, A.A.: A survey of evolutionary algorithms for data mining and knowledge discovery. In: Ghosh, A., Tsutsui, S. (eds.) Advances in Evolutionary Computation, pp. 819–845. Springer, Heidelberg (2002)
Freitas, A.A.: A review of evolutionary algorithms for data mining. In: Maimon, O., Rokach, L. (eds.) Soft Computing for Knowledge Discovery and Data Mining, Part II, pp. 79–111. Springer, Boston (2008). https://doi.org/10.1007/978-0-387-69935-6_4
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)
Flach, P.: Machine Learning: The Art and Science of Algorithms that Make Sense of Data. Cambridge University Press, Cambridge (2012)
Pappa, G.L., Freitas, A.A.: Automating the Design of Data Mining Algorithms: An Evolutionary Computation Approach. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-02541-9
Witten, I.H., Frank, E., Hall, M.A.: Data Mining: Practical Machine Learning, Tools and Techniques, 3rd edn. Elsevier Inc., Waltham (2011)
Bacardit, J., Goldberg, D.E., Butz, M.V.: Improving the performance of a pittsburgh learning classifier system using a default rule. In: Kovacs, T., Llorà, X., Takadama, K., Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2003-2005. LNCS (LNAI), vol. 4399, pp. 291–307. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-71231-2_20
Goldberg, D.E.: Genetic Algorithms in Search, Optimization and Machine Learning. Addison-Wesley, Reading (1989)
Blake, C., Merz, C.: Repository of machine learning databases (UCI). Center for Machine Learning and Intelligent Systems (1998)
Kononenko, I., Simec, E., Robnik-Sikonja, M.: Overcoming the myopia of inductive learning algorithms with RELIEFF. Appl. Intell. 7(1), 39–55 (1997)
Li, J., Wong, L.: Using rules to analyse bio-medical data: a comparison between C4.5 and PCL. In: Dong, G., Tang, C., Wang, W. (eds.) WAIM 2003. LNCS, vol. 2762, pp. 254–265. Springer, Heidelberg (2003). https://doi.org/10.1007/978-3-540-45160-0_25
Zhou, Z.H., Jiang, Y.: NeC4.5: neural ensemble based C4.5. IEEE Trans. Knowl. Data Eng. 16(6), 770–773 (2004)
Smirnov, E., Sprinkhuizen-Kuyper, I.G., Nalbantis, I.: Unanimous voting using support vector machines. Technical report, ERIM and Universiteit Rotterdam, IKAT, Universiteit Maastricht (2004)
Acknowledgments
This work has been carried out under the iCIS project (CENTRO-07-ST24-FEDER-002003), which has been co-financed by QREN, in the scope of the Mais Centro Program and European Union’s FEDER.
This work has also been partially supported by the Interreg V-A Spain-Portugal Program (PocTep) and the European Regional Development Fund (ERDF) under the IOTEC project (grant 0123_IOTEC_3_E).
The research of Juan Ramos González has been co-financed by the European Social Fund and Junta de Castilla y Len (Operational Programme 2014–2020 for Castilla y Len, BOCYL EDU/602/2016).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Castellanos-Garzón, J.A., Ramos, J., Martín, Y.M., de Paz, J.F., Costa, E. (2019). A Genetic Programming Approach Applied to Feature Selection from Medical Data. In: Fdez-Riverola, F., Mohamad, M., Rocha, M., De Paz, J., González, P. (eds) Practical Applications of Computational Biology and Bioinformatics, 12th International Conference. PACBB2018 2018. Advances in Intelligent Systems and Computing, vol 803. Springer, Cham. https://doi.org/10.1007/978-3-319-98702-6_24
Download citation
DOI: https://doi.org/10.1007/978-3-319-98702-6_24
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-98701-9
Online ISBN: 978-3-319-98702-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)