Abstract
In this paper, a novel feature selection method called kernel F-score is applied for Breast cancer diagnosis. In this method, feature selection for removing the irrelevant/redundant features is achieved in high dimensional spaces than the original spaces. Basically, the datasets in the input space are moved to high dimensional kernel spaces for clear separation of nonlinearity through kernel functions. Then the F-score values for all the features in the kernel space are computed and mean kernel F-score value is set as the threshold for selection or rejection of features. The features lesser than the threshold are removed from feature space. The features above and equal to the threshold are selected for classification and used in the classification of benign and malignant cases using Support Vector Machines (SVM). The results obtained from Wisconsin Breast Cancer Dataset (WBCD) have been satisfied as it produced efficient results than F-score. So, we conclude kernel F-score with SVM for WBCD is promising than F-score with SVM.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Blake, C.L., Merz, C.J.: UCI repository of machine learning database. University of California, Irvine (1998), http://www.ics.uci.edu/mlearn/MLRepository.html
Cao, B., Shen, D., Sun, J.-T., Yang, Q., Chen, Z.: Feature selection in a kernel space. In: International Conference on Machine Learning (ICML), Oregon, USA, June 20-24, pp. 121–128 (2007)
Chen, Y.-W., Lin, C.-J.: Combining SVMs with various feature selection strategies, NIPS 2003 feature selection challenge, 1–10 (2003)
Goodman, D.E., Boggess, L., Watkins, A.: Artificial immune system classification of multiple-class problems. In: Proceedings of the Artificial Neural Networks in Engineering, pp. 179–183
Hamilton, H.J., Shan, N., Cercone, N.: RIAC: A rule induction algorithm based on approximate classification. Technical Report CS 96-06, University of Regina (1996)
Polat, K., Güneş, S.: A new feature selection method on classification of medical datasets: Kernel F-score feature selection. Expert Systems with Applications 36(7), 10367–10373 (2009)
Akay, M.F.: Support vector machines combined with feature selection for breast cancer diagnosis. Expert Systems with Applications 36(2), Part 2, 3240–3247 (2009)
Karabatak, M., Cevdet Ince, M.: Expert Systems with Applications 36(2), Part 2, 3465–3469 (2009)
Scholkopf, B., Smola, A.J.: Learning with Kernels. The MIT Press, Cambridge (2002)
Subashini, T.S., Ramalingam, V., Palanivel, S.: Expert Systems with Applications 36(3), Part 1, 5284–5290 (2009)
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995), http://www.emeraldinsight.com
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jaganathan, P., Rajkumar, N., Nagalakshmi, R. (2011). A Kernel Based Feature Selection Method Used in the Diagnosis of Wisconsin Breast Cancer Dataset. In: Abraham, A., Lloret Mauri, J., Buford, J.F., Suzuki, J., Thampi, S.M. (eds) Advances in Computing and Communications. ACC 2011. Communications in Computer and Information Science, vol 190. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22709-7_66
Download citation
DOI: https://doi.org/10.1007/978-3-642-22709-7_66
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22708-0
Online ISBN: 978-3-642-22709-7
eBook Packages: Computer ScienceComputer Science (R0)