Abstract
In this paper, a novel inverse random under sampling (IRUS) method is proposed for the class imbalance problem. The main idea is to severely under-sample the negative (majority) class, thereby creating a large number of distinct negative training sets. For each training set we then find a linear discriminant that separates the positive class from the negative class. By combining the multiple designs through voting, we construct a composite decision boundary between the positive class and the negative class. The proposed methodology is applied to 11 UCI data sets, and the experimental results show a significant increase in the Area Under the Curve (AUC) compared with many existing class-imbalance learning methods.
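The abstract's recipe can be illustrated with a minimal sketch. This is not the authors' implementation: the function name, the choice of a least-squares linear discriminant, and all parameter defaults (`n_sets`, `neg_per_set`) are assumptions made for illustration. The key IRUS idea it mimics is that every expert sees all positives but only a small random subset of negatives, and the experts' votes are averaged.

```python
import numpy as np

def irus_scores(X_pos, X_neg, X_test, n_sets=30, neg_per_set=None, seed=0):
    """Illustrative IRUS-style ensemble (names and defaults are assumptions).

    Each of `n_sets` linear discriminants is trained on ALL positive
    samples plus a small random subset of negatives (inverting the usual
    imbalance), and the per-sample votes are averaged into a score."""
    rng = np.random.default_rng(seed)
    if neg_per_set is None:
        # Inverse under-sampling: fewer negatives than positives per set.
        neg_per_set = max(2, len(X_pos) // 2)
    votes = np.zeros(len(X_test))
    for _ in range(n_sets):
        idx = rng.choice(len(X_neg), size=neg_per_set, replace=False)
        X = np.vstack([X_pos, X_neg[idx]])
        y = np.concatenate([np.ones(len(X_pos)), -np.ones(neg_per_set)])
        # Least-squares linear discriminant with a bias column appended.
        A = np.hstack([X, np.ones((len(X), 1))])
        w, *_ = np.linalg.lstsq(A, y, rcond=None)
        A_test = np.hstack([X_test, np.ones((len(X_test), 1))])
        votes += (A_test @ w > 0)
    # Fraction of experts voting "positive"; usable as a ranking score for AUC.
    return votes / n_sets
```

On synthetic data with, say, 20 positives and 500 negatives drawn from two separated clusters, points near the positive cluster receive scores close to 1 and points near the negative cluster scores close to 0, giving the composite boundary the abstract describes.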
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
Cite this paper
Tahir, M.A., Kittler, J., Mikolajczyk, K., Yan, F. (2009). A Multiple Expert Approach to the Class Imbalance Problem Using Inverse Random under Sampling. In: Benediktsson, J.A., Kittler, J., Roli, F. (eds) Multiple Classifier Systems. MCS 2009. Lecture Notes in Computer Science, vol 5519. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02326-2_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02325-5
Online ISBN: 978-3-642-02326-2