Abstract
Most accurate predictions are typically obtained by learning machines with complex feature spaces (e.g., as induced by kernels). Unfortunately, such decision rules are hardly accessible to humans and cannot easily be used to gain insight into the application domain. Therefore, one often resorts to linear models combined with variable selection, sacrificing some predictive power for presumed interpretability. Here, we introduce the Feature Importance Ranking Measure (FIRM), which, by retrospective analysis of arbitrary learning machines, achieves both excellent predictive performance and superior interpretability. In contrast to standard raw feature weighting, FIRM takes the underlying correlation structure of the features into account. It is thereby able to discover the most relevant features even if their appearance in the training data is entirely masked by noise. The desirable properties of FIRM are investigated analytically and illustrated in simulations.
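The key idea sketched in the abstract, scoring a feature by how much the model's expected output varies as that feature varies, can be illustrated with a small numerical example. The sketch below is an assumption-laden toy (a binned estimate of the conditional expected score, an OLS model as the "learning machine", and synthetic data); it is not the paper's exact estimator, but it shows how a correlated copy of the informative feature receives high importance even when the model itself assigns it no weight.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: feature 0 drives the label, feature 1 is a noisy
# correlated copy of feature 0, feature 2 is pure noise.
n = 5000
f0 = rng.normal(size=n)
X = np.column_stack([f0,
                     f0 + 0.5 * rng.normal(size=n),
                     rng.normal(size=n)])
y = 2.0 * X[:, 0] + rng.normal(scale=0.1, size=n)

# Any trained scoring function could be plugged in here;
# for the sketch we fit ordinary least squares.
w, *_ = np.linalg.lstsq(X, y, rcond=None)

def score(Z):
    return Z @ w

def firm_importance(X, score, j, bins=20):
    """FIRM-style importance of feature j: variability of the
    conditional expected score q_j(v) = E[score(x) | x_j = v],
    estimated here by quantile binning (an assumption of this sketch)."""
    s = score(X)
    edges = np.quantile(X[:, j], np.linspace(0.0, 1.0, bins + 1))
    idx = np.clip(np.searchsorted(edges, X[:, j]) - 1, 0, bins - 1)
    q = np.array([s[idx == b].mean() for b in range(bins)])
    return q.std()

imps = [firm_importance(X, score, j) for j in range(X.shape[1])]
```

Because feature 1 is correlated with feature 0, conditioning on its value shifts the expected score, so it scores well above the noise feature; a raw inspection of the fitted weights `w` would instead report it as nearly irrelevant.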
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
Cite this paper
Zien, A., Krämer, N., Sonnenburg, S., Rätsch, G. (2009). The Feature Importance Ranking Measure. In: Buntine, W., Grobelnik, M., Mladenić, D., Shawe-Taylor, J. (eds.) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2009. Lecture Notes in Computer Science, vol. 5782. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04174-7_45
Print ISBN: 978-3-642-04173-0
Online ISBN: 978-3-642-04174-7