Tackling class imbalance in computer vision: a contemporary review

Saini, Manisha; Susan, Seba

doi:10.1007/s10462-023-10557-6

Tackling class imbalance in computer vision: a contemporary review

Published: 20 July 2023

Volume 56, pages 1279–1335, (2023)
Cite this article

Artificial Intelligence Review Aims and scope Submit manuscript

Manisha Saini¹ &
Seba Susan¹

1562 Accesses
9 Citations
Explore all metrics

Abstract

Class imbalance is a key issue affecting the performance of computer vision applications such as medical image analysis, objection detection and recognition, image segmentation, scene understanding, and many others. Class imbalance refers to the situation when the number of samples in the majority classes outnumber the minority class populations. The model might then get biased towards the majority classes while neglecting the minority classes, adversely affecting the classification performance. In this paper, an extensive literature survey has been conducted to discuss in depth about the class imbalance issues affecting various classification tasks in computer vision. The study analyzes the performance of several contemporary machine learning algorithms such as chi-square support vector machine and gradient boosted decision trees, and deep learning models such as deep pre-trained convolutional networks, generative adversarial networks and vision transformers, for effective learning from imbalanced computer vision datasets. Most of these models either perform data-level manipulation (data augmentation) or cost-sensitive learning (loss functions) or a combination of the two. This survey also includes a summary of novel deep learning frameworks customized to mitigate the effect of class imbalance. It has included recent advancement and new developments in this field such as Explainable AI. The scrutiny of various popular and benchmark imbalanced datasets in computer vision and performance evaluation metrics are also included as a part of this study. Along with that it has emphasized on the research gaps in contemporary literature which would contribute towards future artificial vision models that can learn effectively from imbalanced datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Understanding imbalanced data: XAI & interpretable ML framework

Article Open access 16 January 2024

The class imbalance problem in deep learning

Article 28 December 2022

Output Layer Multiplication for Class Imbalance Problem in Convolutional Neural Networks

Article 19 October 2020

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

Adadi A, Berrada M (2020) Explainable AI for healthcare: from black box to interpretable models. Embedded systems and artificial intelligence: proceedings of ESAI 2019, Fez, Morocco. Springer Singapore, Singapore, pp 327–337
Google Scholar
Afzal S, Maqsood M, Nazir F, Khan U, Aadil F, Awan KM, Mehmood I, Song O-Y (2019) A data augmentation-based framework to handle class imbalance problem for Alzheimer’s stage detection. IEEE Access 7:115528–115539
Google Scholar
Akhtar Z, Micheloni C, Foresti GL (2015) Biometric liveness detection: challenges and research opportunities. IEEE Secur Privacy 13(5):63–72
Google Scholar
Alaba SY, Nabi MM, Shah C, Prior J, Campbell MD, Wallace F, Ball JE, Moorhead R (2022) Class-aware fish species recognition using deep learning for an imbalanced dataset. Sensors 22(21):8268
Google Scholar
Alam TM, Shaukat K, Hameed IA, Luo S, Sarwar MU, Shabbir S, Li J, Khushi M (2020) An investigation of credit card default prediction in the imbalanced datasets. IEEE Access 8:201173–201198
Google Scholar
Alam TM, Shaukat K, Khan WA, Hameed IA, Almuqren LA, Raza MA, Aslam M, Luo S (2022) An efficient deep learning-based skin cancer classifier for an imbalanced dataset. Diagnostics 12(9):2115
Google Scholar
Alghamdi HS, Amoudi G, Elhag S, Saeedi K, Nasser J (2021) Deep learning approaches for detecting COVID-19 from chest X-ray images: a survey. Ieee Access 9:20235–20254
Google Scholar
Alia A, Maree M, Chraibi M (2022) A hybrid deep learning and visualization framework for pushing behavior detection in pedestrian dynamics. Sensors 22(11):4040
Google Scholar
Ali-Gombe A, Elyan E (2019) MFC-GAN: Class-imbalanced dataset classification using multiple fake class generative adversarial network. Neurocomputing 361:212–221
Google Scholar
Anand A, Kadian T, Shetty MK, Gupta A (2022) Explainable AI decision model for ECG data of cardiac disorders. Biomed Signal Process Control 75:103584
Google Scholar
Bai L, Wang L, Chen T, Zhao Y, Ren H (2022) Transformer-based disease identification for small-scale imbalanced capsule endoscopy dataset. Electronics 11(17):2747
Google Scholar
Bailey KO, Okolica JS, Peterson GL (2014) User identification and authentication using multi-modal behavioral biometrics. Comput Secur 43:77–89
Google Scholar
Barandela R, Rosa MV, Salvador Sánchez J, Francesc JF. (2004) The imbalanced training sample problem: under or over sampling? In structural, syntactic, and statistical pattern recognition: joint IAPR international workshops, SSPR 2004 and SPR 2004, Lisbon, Portugal. Proceedings, Springer Berlin Heidelberg, pp. 806–814
Bhowal P, Sen S, Velasquez JD, Sarkar R (2022) Fuzzy ensemble of deep learning models using choquet fuzzy integral, coalition game and information theory for breast cancer histology classification. Expert Syst Appl 190:116167
Google Scholar
Buda M, Maki A, Mazurowski MA (2018) A systematic study of the class imbalance problem in convolutional neural networks. Neural Netw 106:249–259
Google Scholar
Carbonell JG, Michalski RS, Mitchell TM (1983) An overview of machine learning. Mach Learn 1:3–23
Google Scholar
Carranza-García M, Lara-Benítez P, García-Gutiérrez J, Riquelme JC (2021) Enhancing object detection for autonomous driving by optimizing anchor generation and addressing class imbalance. Neurocomputing 449:229–244
Google Scholar
Castiglioni I, Rundo L, Codari M, Di Leo G, Salvatore C, Interlenghi M, Gallivanone F, Cozzi A, Claudia N (2021) AI applications to medical images: from machine learning to deep learning. Phys Med 83:9–24
Google Scholar
Cetinic E, Lipic T, Grgic S (2018) Fine-tuning convolutional neural networks for fine art classification. Expert Syst Appl 114:107–118
Google Scholar
Chamseddine E, Mansouri N, Soui M, Abed M (2022) Handling class imbalance in COVID-19 chest X-ray images classification: using SMOTE and weighted loss. Appl Soft Comput 129:109588
Google Scholar
Chawla NV, Bowyer KW, Hall LO, Philip Kegelmeyer W (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357
MATH Google Scholar
Chen Z, Duan J, Kang Li, Qiu G (2021) Class-imbalanced deep learning via a class-balanced ensemble. IEEE Trans Neural Networks Learn Syst 33(10):5626–5640
Google Scholar
Chen H, Li C, Yang W, Liu J, An X, Zhao Y (2022) Deep balanced cascade forest: an novel fault diagnosis method for data imbalance. ISA Trans 126:428–439
Google Scholar
Chen L-C, George P, Florian S, Hartwig A (2017) Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587
Ciccarelli M, Corradini F, Germani M, Menchi G, Mostarda L, Papetti A, Piangerelli M (2022) SPECTRE: a deep learning network for posture recognition in manufacturing. J Intell Manuf. https://doi.org/10.1007/s10845-022-02014-y
Article Google Scholar
Codella N, Veronica R, Philipp T, Emre Celebi M, Stephen D, David G, Brian H et al. (2019) Skin lesion analysis toward melanoma detection 2018: a challenge hosted by the international skin imaging collaboration (isic). arXiv preprint arXiv:1902.03368
Costa D, Arthur Z, Figueroa HEH, Fracarolli JA (2020) Computer vision based detection of external defects on tomatoes using deep learning. Biosyst Eng 190:131–144
Google Scholar
Creswell A, White T, Dumoulin V, Arulkumaran K, Sengupta B, Bharath AA (2018) Generative adversarial networks: an overview. IEEE Signal Process Mag 35(1):53–65
Google Scholar
Dai W, Li D, Tang D, Wang H, Peng Y (2022) Deep learning approach for defective spot welds classification using small and class-imbalanced datasets. Neurocomputing 477:46–60
Google Scholar
Databases–Laboratório Visão Robótica e Imagem, 2019, https://web.inf.ufpr.br/vri/databases/. (Accessed 28 Nov 2019).
Doersch C (2016) Tutorial on variational autoencoders. arXiv preprint arXiv:1606.05908
Dong Q, Shaogang G, Xiatian Z (2017) Class rectification hard mining for imbalanced deep learning. In Proceedings of the IEEE international conference on computer vision, pp. 1851–1860
Dumagpi JK, Jeong Y-J (2020) Evaluating gan-based image augmentation for threat detection in large-scale x-ray security images. Appl Sci 11(1):36
Google Scholar
Esgario JGM, Krohling RA, Ventura JA (2020) Deep learning for classification and severity estimation of coffee leaf biotic stress. Comput Electron Agric 169:105162
Google Scholar
Eyepacs and Kaggle. Diabetic retinopathy detection. 2015. url: https://www.kaggle.com/c/diabetic-retinopathy-detection/data
Feng W, Huang W, Bao W (2019) Imbalanced hyperspectral image classification with an adaptive ensemble method based on SMOTE and rotation forest with differentiated sampling rates. IEEE Geosci Remote Sens Lett 16(12):1879–1883
Google Scholar
Fernández A, García S, Galar M, Prati RC, Krawczyk B, Herrera F (2018) Learning from imbalanced data sets. Springer, Cham
Google Scholar
Fuqua D, Razzaghi T (2020) A cost-sensitive convolution neural network learning for control chart pattern recognition. Expert Syst Appl 150:113275
Google Scholar
Gandhi, Shreyansh, Samrat Kokkula, Abon Chaudhuri, Alessandro Magnani, Theban Stanley, Behzad Ahmadi, Venkatesh Kandaswamy, Omer Ovenc, and Shie Mannor. "Scalable detection of offensive and non-compliant content/logo in product images." In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 2247–2256. 2020.
Ghosh K, Bellinger C, Corizzo R, Branco P, Krawczyk B, Japkowicz N (2022) The class imbalance problem in deep learning. Mach Learn. https://doi.org/10.1007/s10994-022-06268-8
Article Google Scholar
Goodfellow I, Pouget-Abadie J, Mirza M, Bing Xu, Warde-Farley D, Ozair S, Courville A, Bengio Y (2020) Generative adversarial networks. Commun ACM 63(11):139–144
MathSciNet Google Scholar
Guo X, Yilong Y, Cailing D, Gongping Y, Guangtong Z (2008) On the class imbalance problem. In 2008 Fourth international conference on natural computation, IEEE, vol. 4, pp. 192–201
Gupta H, Verma OP (2022) Monitoring and surveillance of urban road traffic using low altitude drone images: a deep learning approach. Multimed Tools Appl 81:1–21
Google Scholar
Gupta A, Anpalagan A, Guan L, Khwaja AS (2021) Deep learning for object detection and scene perception in self-driving cars: survey, challenges, and open issues. Array 10:100057
Google Scholar
Haixiang G, Yijing Li, Jennifer Shang Gu, Mingyun HY, Bing G (2017) Learning from class-imbalanced data: review of methods and applications. Expert Syst Appl 73:220–239
Google Scholar
Hamad RA, Yang L, Woo WL, Wei B (2020) Joint learning of temporal models to handle imbalanced data for human activity recognition. Appl Sci 10(15):5293
Google Scholar
Han K, Wang Y, Chen H, Chen X, Guo J, Liu Z, Tang Y et al (2022) A survey on vision transformer. IEEE Trans Pattern Anal Mach Intell 45(1):87–110
Google Scholar
Han S, Chan L, Bonggeon C, Jongwuk L (2021) An empirical study for class imbalance in extreme multi-label text classification. In 2021 IEEE international conference on big data and smart computing (BigComp), IEEE, pp. 338–341
He H, Yang B, Edwardo AG, Shutao L (2008) ADASYN: Adaptive synthetic sampling approach for imbalanced learning. In 2008 IEEE international joint conference on neural networks (IEEE world congress on computational intelligence), IEEE, pp. 1322–1328
He K, Xiangyu Z, Shaoqing R, Jian S (2016) Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778
Heath M, Bowyer K, Kopans D, Kegelmeyer P, Moore R, Chang K, Munishkumaran S (1998) Current status of the digital database for screening mammography. Digit Mammogr Nijmegen 1998:457–460
Google Scholar
Hidayat AA, Purwandari K, Cenggoro TW, Pardamean B (2021) A convolutional neural network-based ancient sundanese character classifier with data augmentation. Procedia Comput Sci 179:195–201
Google Scholar
Hou R, Chen J, Feng Y, Liu S, He S, Zhou Z (2022) Contrastive-weighted self-supervised model for long-tailed data classification with vision transformer augmented. Mech Syst Signal Process 177:109174
Google Scholar
Huang Z, Pan Z, Lei B (2017b) Transfer learning with deep convolutional neural network for SAR target classification with limited labeled data. Remote Sensing 9(9):907
Google Scholar
Huang Y, Jin Yi, Li Y, Lin Z (2020) Towards imbalanced image classification: a generative adversarial network ensemble learning method. IEEE Access 8:88399–88409
Google Scholar
Huang G, Zhuang L, Laurens Van Der M, Kilian QW (2017a) Densely connected convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4700–4708
Hung J, Anne C (2017) Applying faster R-CNN for object detection on malaria images. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp. 56–61
Hussain E, Mahmudul H, Syed ZH, Tanzina HA, Md Anisur R, Mohammad ZP (2020) Deep learning based binary classification for alzheimer’s disease detection using brain MRI images. In 2020 15th IEEE conference on industrial electronics and applications (ICIEA), IEEE, pp. 1115–1120
Iglesias G, Talavera E, Díaz-Álvarez A (2023) A survey on GANs for computer vision: recent research, analysis and taxonomy. Comput Sci Rev 48:100553
MathSciNet MATH Google Scholar
Japkowicz N (2013) Assessment metrics for imbalanced learning. Imbalanced learning: Foundations, algorithms, and applications. John Wiley & Sons, Hoboken, pp 187–206
Google Scholar
Jing J, Wang Z, Rätsch M, Zhang H (2022) Mobile-Unet: an efficient convolutional neural network for fabric defect detection. Text Res J 92(1–2):30–42
Google Scholar
Johnson JM, Khoshgoftaar TM (2019) Survey on deep learning with class imbalance. J Big Data 6(1):1–54
Google Scholar
Kaselimi M, Voulodimos A, Daskalopoulos I, Doulamis N, Doulamis A (2022) A vision transformer model for convolution-free multilabel classification of satellite imagery in deforestation monitoring. IEEE Trans Neural Networks Learn Syst 34:3299
Google Scholar
Kaur H, Pannu HS, Malhi AK (2019) A systematic review on imbalanced data challenges in machine learning: applications and solutions. ACM Comput Surv (CSUR) 52(4):1–36
Google Scholar
Khan S, Naseer M, Hayat M, Zamir SW, Khan FS, Shah M (2022) Transformers in vision: a survey. ACM Comput Surv (CSUR) 54(10):1–41
Google Scholar
Kim Y, Lee Y, Jeon M (2021) Imbalanced image classification with complement cross entropy. Pattern Recogn Lett 151:33–40
Google Scholar
Kim P-K, Kil-Taek L. (2017) Vehicle type classification using bagging and convolutional neural network on multi view surveillance image. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp. 41–46
Kotsiantis S, Kanellopoulos D, Pintelas P (2006) Handling imbalanced datasets: a review. GESTS Int Trans Comput Sci Eng 30(1):25–36
Google Scholar
Krawczyk B (2016) Learning from imbalanced data: open challenges and future directions. Progress Artif Intell 5(4):221–232
Google Scholar
Kulatilleke, Gayan K (2022) Challenges and complexities in machine learning based credit card fraud detection. arXiv preprint arXiv:2208.10943
Kuo W, Häne C, Yuh E, Mukherjee P, Malik J (2018) Cost-sensitive active learning for intracranial hemorrhage detection. Medical image computing and computer assisted intervention–MICCAI 2018. Springer International Publishing, Cham, pp 715–723
Google Scholar
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444
Google Scholar
Lemaître G, Nogueira F, Aridas CK (2017) Imbalanced-learn: a python toolbox to tackle the curse of imbalanced datasets in machine learning. J Mach Learn Res 18(1):559–563
Google Scholar
Li T, Gao Y, Wang K, Guo S, Liu H, Kang H (2019) Diagnostic assessment of deep learning algorithms for diabetic retinopathy screening. Inf Sci 501:511–522
Google Scholar
Li L, Zhang S, Wang B (2022) Apple leaf disease identification with a small and imbalanced dataset based on lightweight convolutional networks. Sensors 22(1):173
Google Scholar
Li D, Zhang Z, Xiaotang C, Haibin L, Kaiqi H (2016) A richly annotated dataset for pedestrian attribute recognition. arXiv preprint arXiv:1603.07054
Li Y, Tao W, Bingyi K, Sheng T, Chunfeng W, Jintao L, Jiashi F (2020) Overcoming classifier imbalance for long-tail object detection with balanced group softmax. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 10991–11000
Lin T-Y, Priya G, Ross G, Kaiming H, Piotr D (2017) Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision, pp. 2980–2988
Liu W, Zhang M, Luo Z, Cai Y (2017) An ensemble deep learning method for vehicle type classification on visual traffic surveillance sensors. IEEE Access 5:24417–24425
Google Scholar
Liu Y, Gao H, Guo L, Qin A, Cai C, You Z (2019) A data-flow oriented deep ensemble learning method for real-time surface defect inspection. IEEE Trans Instrum Meas 69(7):4681–4691
Google Scholar
Liu Z, Ping L, Xiaogang W, Xiaoou T (2015) Deep learning face attributes in the wild. In Proceedings of the IEEE international conference on computer vision, pp. 3730–3738
Liu Z, Yutong L, Yue C, Han H, Yixuan W, Zheng Z, Stephen L, Baining G (2021) Swin transformer: hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF international conference on computer vision, pp. 10012–10022
Lu X, Chao M, Bingbing N, Xiaokang Y, Ian R, Ming-Hsuan Y (2018) Deep regression tracking with shrinkage loss. In Proceedings of the European conference on computer vision (ECCV), pp. 353–369
Mahesh B (2020) Machine learning algorithms-a review. Int J Sci Res (IJSR) 9:381–386
Google Scholar
Mikołajczyk A, Michał G (2018) Data augmentation for improving deep learning in image classification problem. In 2018 international interdisciplinary PhD workshop (IIPhDW), IEEE, pp. 117–122
Milletari F, Nassir N, Seyed-Ahmad A (2016) V-net: Fully convolutional neural networks for volumetric medical image segmentation. In 2016 fourth international conference on 3D vision (3DV), IEEE, pp. 565–571
Minaee S, Boykov Y, Porikli F, Plaza A, Kehtarnavaz N, Terzopoulos D (2021) Image segmentation using deep learning: a survey. IEEE Trans Pattern Anal Mach Intell 44(7):3523–3542
Google Scholar
MobileODT, “Intel and mobileodt cervical can cer screening.” [Online]. Available: https://www.kaggle.com/c/intel-mobileodt-cervical-cancer-screening/data
Mooney P (2020) Breast histopathology images. Kaggle. Accessed Oct 09, 2020. https://www.kaggle.com/datasets/paultimothymooney/breast-histopathology-images
Mushtaq Z, Shun-Feng Su, Tran Q-V (2021) Spectral images based environmental sound classification using CNN with meaningful data augmentation. Appl Acoust 172:107581
Google Scholar
Nazki H, Lee J, Yoon S, Park DS (2019) Image-to-image translation with GAN for synthetic data augmentation in plant disease datasets. Smart Med J 8(2):46–57
Google Scholar
Nesteruk S, Dmitrii S, Mariia P (2021) Image augmentation for multitask few-shot learning: Agricultural domain use-case. arXiv preprint arXiv:2102.12295
Ngo QT, Yoon S (2020) Facial expression recognition based on weighted-cluster loss and deep transfer learning using a highly imbalanced dataset. Sensors 20(9):2639
Google Scholar
Ortiz JAA (2021) COVID-19 X-ray Image classification: a transfer learning approach. University of California, Los Angeles
Google Scholar
Papathanasiou ND, Spyridonidis T, Apostolopoulos DJ (2020) Automatic characterization of myocardial perfusion imaging polar maps employing deep learning and data augmentation. Hell J Nucl Med 23:125–132
Google Scholar
Park S, Jongin L, Younghan J, Jin YC (2021) Influence-balanced loss for imbalanced visual classification. In Proceedings of the IEEE/CVF international conference on computer vision, pp. 735–744
Pérez-Hernández F, Tabik S, Lamas A, Olmos R, Fujita H, Herrera F (2020) Object detection binary classifiers methodology based on deep learning to identify small objects handled similarly: application in video surveillance. Knowl-Based Syst 194:105590
Google Scholar
Phan TH, Kazuma Y (2020) Resolving class imbalance in object detection with weighted cross entropy losses. arXiv preprint arXiv:2006.01413
Pouyanfar S, Chen S-C (2017) Automatic video event detection for imbalance data using enhanced ensemble deep learning. Int J Semant Comput 11(01):85–109
Google Scholar
Rácz A, Bajusz D, Héberger K (2021) Effect of dataset size and train/test split ratios in QSAR/QSPR multiclass classification. Molecules 26(4):1111
Google Scholar
Rahman MM, Davis DN (2013) Addressing the class imbalance problem in medical datasets. Int J Mach Learn Comput 3(2):224
Google Scholar
Rath A, Mishra D, Panda G, Satapathy SC (2021) Heart disease detection using deep learning methods from imbalanced ECG samples. Biomed Signal Process Control 68:102820
Google Scholar
Reddy C, Deepak S, Soroush M, Adriana R-S, Samira S, Sina H (2021) Benchmarking bias mitigation algorithms in representation learning through fairness metrics. In thirty-fifth conference on neural information processing systems datasets and benchmarks track (Round 1). url: https://paperswithcode.com/dataset/ci-mnist
Redmon J, Santosh D, Ross G, Ali F (2016) You only look once: unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 779–788
Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: towards real-time object detection with region proposal networks. Adv Neural Inform Process Syst 28:1
Google Scholar
Ren J, Mingyuan Z, Cunjun Y, Ziwei L (2022) Balanced mse for imbalanced visual regression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7926–7935
Reza MS, Ma J (2018) Imbalanced histopathological breast cancer image classification with convolutional neural network. In 2018 14th IEEE international conference on signal processing (ICSP), IEEE, pp. 619–624
Rezaei M, Haojin Y, Christoph M (2018) Conditional generative refinement adversarial networks for unbalanced medical image semantic segmentation. arXiv preprint arXiv:1810.03871
Ronneberger O, Philipp F, Thomas B (2015) U-net: Convolutional networks for biomedical image segmentation. In Medical image computing and computer-assisted intervention–MICCAI 2015: 18th international conference, Munich, Germany, October 5–9, 2015, Proceedings, Part III 18. Springer International Publishing, pp. 234–241
Saini M, Seba S (2022c) Cervical cancer screening on multi-class imbalanced cervigram dataset using transfer learning. In 2022c 15th International congress on image and signal processing, biomedical engineering and informatics (CISP-BMEI), IEEE, pp. 1–6
Saini M, Susan S (2019) Data augmentation of minority class with transfer learning for classification of imbalanced breast cancer dataset using inception-V3. Pattern recognition and image analysis. Springer International Publishing, Cham, pp 409–420
Google Scholar
Saini M, Susan S (2020) Deep transfer with minority data augmentation for imbalanced breast cancer dataset. Appl Soft Comput 97:106759
Google Scholar
Saini M, Susan S (2021) Bag-of-visual-words codebook generation using deep features for effective classification of imbalanced multi-class image datasets. Multimed Tools Appl 80(14):20821–20847
Google Scholar
Saini M, Susan S (2022a) Diabetic retinopathy screening using deep learning for multi-class imbalanced datasets. Comput Biol Med 149:105989
Google Scholar
Saini M, Susan S (2022b) Vggin-net: deep transfer network for imbalanced breast cancer dataset. IEEE/ACM Trans Comput Biol Bioinform 20:752
Google Scholar
Sambasivam GAOGD, Opiyo GD (2021) A predictive machine learning application in agriculture: Cassava disease detection and classification with imbalanced dataset using convolutional neural networks. Egypt Inform J 22(1):27–34
Google Scholar
Sampath V, Maurtua I, Martin JJA, Gutierrez A (2021) A survey on generative adversarial networks for imbalance problems in computer vision tasks. J Big Data 8:1–59
Google Scholar
Sandler M, Andrew H, Menglong Z, Andrey Z, Liang-Chieh C (2018) Mobilenetv2: inverted residuals and linear bottlenecks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4510–4520
Saood A, Hatem I (2021) COVID-19 lung CT image segmentation using deep learning methods: U-Net versus SegNet. BMC Med Imaging 21(1):1–10
Google Scholar
Sarafianos N, Xiang X, Ioannis AK (2018) Deep imbalanced attribute classification using visual attention aggregation. In Proceedings of the European conference on computer vision (ECCV), pp. 680–697
Seiffert C, Khoshgoftaar TM, Van Hulse J, Napolitano A (2009) RUSBoost: a hybrid approach to alleviating class imbalance. IEEE Trans Syst Man Cybern Part a: Syst Humans 40(1):185–197
Google Scholar
Shamshad F, Khan S, Zamir SW, Khan MH, Hayat M, Khan FS, Huazhu F (2023) Transformers in medical imaging: a survey. Med Image Anal 88:102802
Google Scholar
Shao S, Zeming L, Tianyuan Z, Chao P, Gang Y, Xiangyu Z, Jing L, Jian S (2019) Objects365: a large-scale, high-quality dataset for object detection. In Proceedings of the IEEE/CVF international conference on computer vision, pp. 8430–8439. url: https://github.com/nkicsl/DDR-dataset
Shaukat K, Luo S, Varadharajan V, Hameed IA, Chen S, Liu D, Li J (2020a) Performance comparison and current challenges of using machine learning techniques in cybersecurity. Energies 13(10):2509
Google Scholar
Shaukat K, Luo S, Varadharajan V, Hameed IA, Min Xu (2020b) A survey on machine learning techniques for cyber security in the last decade. IEEE Access 8:222310–222354
Google Scholar
Shaukat K, Luo S, Varadharajan V (2023) A novel deep learning-based approach for malware detection. Eng Appl Artif Intell 122:106030
Google Scholar
Shauka K, Luo S, Chen S, Liu D, (2020c) Cyber threat detection using machine learning techniques: a performance evaluation perspective. In 2020c international conference on cyber warfare and security (ICCWS), IEEE, pp. 1–6
Shaukat K, Suhuai L, Nadir A, Talha MA, Muhammad ET, Ibrahim AH (2021) An analysis of blessed Friday sale at a retail store using classification models. In 2021 The 4th international conference on software engineering and information management, pp. 193–198
Simonyan K, Andrew Z (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556
Sivapuram AK, Ravi V, Senthil G, Gorthi RK (2023) Visal—a novel learning strategy to address class imbalance. Neural Netw 161:178
Google Scholar
Soleymani M, Mahdi B, Hadi M, Farnad N (2021) Construction material classification on imbalanced datasets using vision transformer (ViT) architecture. arXiv preprint arXiv:2108.09527
Sori WJ, Feng J, Godana AW, Liu S, Gelmecha DJ (2021) DFD-Net: lung cancer detection from denoised CT scan image using deep learning. Front Comp Sci 15:1–13
Google Scholar
Sozykin K, Stanislav P, Adil K, Rasheed H, Jooyoung L (2018) Multi-label class-imbalanced action recognition in hockey videos via 3D convolutional neural networks. In 2018 19th IEEE/ACIS international conference on software engineering, artificial intelligence, networking and parallel/distributed computing (SNPD), IEEE, pp. 146–151
Srinivas C, Nandini Prasad KS, Zakariah M, Alothaibi YA, Kamran Shaukat B (2022) Deep transfer learning approaches in performance analysis of brain tumor classification using MRI images. J Healthcare Eng 2022:1–17
Google Scholar
Sun Y, Kamel MS, Wong AKC, Wang Y (2007) Cost-sensitive boosting for classification of imbalanced data. Pattern Recognit 40(12):3358–3378
MATH Google Scholar
Sundararajan K, Woodard DL (2018) Deep learning for biometrics: a survey. ACM Comput Surv (CSUR) 51(3):1–34
Google Scholar
Susan S, Ashu K (2022) Localized metric learning for large multi-class extremely imbalanced face database. Database systems for advanced applications. Springer International Publishing, Cham, pp 64–78
Google Scholar
Susan S, Kumar A (2019) SSOMaj-SMOTE-SSOMin: three-step intelligent pruning of majority and minority samples for learning from imbalanced datasets. Appl Soft Comput 78:141–149
Google Scholar
Susan S, Kumar A (2021) The balancing trick: optimized sampling of imbalanced datasets—a brief survey of the recent state of the art. Eng Rep 3(4):e12298
Google Scholar
Susan S, Sethi D, Arora K (2021) CW-CAE: pulmonary nodule detection from imbalanced dataset using class-weighted convolutional autoencoder. International conference on innovative computing and communications. Springer Singapore, Singapore, pp 825–833
Google Scholar
Susan S, Ankit K (2016) Auto-segmentation using mean-shift and entropy analysis. In 2016 3rd international conference on computing for sustainable global development (INDIACom), IEEE, pp. 292–296
Susan S, Amitesh K (2020) Hybrid of intelligent minority oversampling and PSO-based intelligent majority undersampling for learning from imbalanced datasets. In Intelligent systems design and applications: 18th international conference on intelligent systems design and applications (ISDA 2018) held in Vellore, India, December 6–8, Springer International Publishing, Vol. 2, pp. 760–769
Szegedy C, Ioffe S, Vanhoucke V, Alemi A (2017) Inception-v4, inception-resnet and the impact of residual connections on learning. Proc AAAI Conf Artif Intell. https://doi.org/10.1609/aaai.v31i1.11231
Article Google Scholar
Szegedy C, Vincent V, Sergey I, Jon S, Zbigniew W (2016) Rethinking the inception architecture for computer vision. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2818–2826
Tajbakhsh N, Shin JY, Gurudu SR, Todd Hurst R, Kendall CB, Gotway MB, Liang J (2016) Convolutional neural networks for medical image analysis: full training or fine tuning? IEEE Trans Med Imaging 35(5):1299–1312
Google Scholar
Tan M, Quoc L (2019) Efficientnet: rethinking model scaling for convolutional neural networks." In International conference on machine learning, PMLR, pp. 6105–6114
Tan M, Ruoming P, Quoc VL. (2020) Efficientdet: scalable and efficient object detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 10781–10790
Tanaka FHKS, Claus A (2019) Data augmentation using GANs. arXiv preprint arXiv:1904.09135
Tanha J, Abdi Y, Samadi N, Razzaghi N, Asadpour M (2020) Boosting methods for multi-class imbalanced data classification: an experimental review. J Big Data 7(1):1–47
Google Scholar
Tran GS, Nghiem TP, Nguyen VT, Luong CM, Burie J-C (2019) Improving accuracy of lung nodule classification using deep learning with focal loss. J Healthcare Eng 2019:1–9
Google Scholar
Trong VH, Gwang-hyun Y, Dang Thanh V, Jin-young K (2020) Late fusion of multimodal deep neural networks for weeds classification. Comput Electron Agric 175:105506
Google Scholar
Vicente S, Joao C, Lourdes A, Jorge B (2014) Reconstructing pascal voc. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 41–48
Wang Y, Weihao G, Jie Y, Wei W, Junjie Y (2019) Dynamic curriculum learning for imbalanced data classification. In Proceedings of the IEEE/CVF international conference on computer vision, pp. 5017–5026
Wang G, Ding H, Duan M, Yuanyuan Pu, Yang Z, Li H (2022) Fighting against terrorism: a real-time CCTV autonomous weapons detection based on improved YOLO v4. Digital Signal Process 132:103790
Google Scholar
Wardhani NWS, Masithoh YR, Atiek I, Agus DS, Prayudi L (2019) Cross-validation metrics for evaluating classification performance on imbalanced data. In 2019 International conference on computer, control, informatics and its applications (IC3INA), IEEE, pp. 14–18
Woźniak M, Wieczorek M, Siłka J (2023) BiLSTM deep neural network model for imbalanced medical data of IoT systems. Futur Gener Comput Syst 141:489–499
Google Scholar
Wu E, Kevin W, Cox D, Lotter W (2018) Conditional infilling GANs for data augmentation in mammogram classification. Image analysis for moving organ, breast, and thoracic images. Springer, Cham, pp 98–106
Google Scholar
Xu F, Uszkoreit H, Yangzhou D, Fan W, Zhao D, Zhu J (2019) Explainable AI: a brief survey on history, research areas, approaches and challenges. Natural language processing and chinese computing. Springer International Publishing, Dunhuang, pp 563–574
Google Scholar
Xu B, Zeng Z, Lian C, Ding Z (2021) Semi-supervised low-rank semantics grouping for zero-shot learning. IEEE Trans Image Process 30:2207–2219
Google Scholar
Xu M, Yoon S, Fuentes A, Park DS (2023) A comprehensive survey of image augmentation techniques for deep learning. Pattern Recogn 137:109347
Google Scholar
Yang L, Yuan G, Zhou H, Liu H, Chen J, Hao Wu (2022) RS-YOLOX: a high-precision detector for object detection in satellite remote sensing images. Appl Sci 12(17):8707
Google Scholar
Yang X, Matloob K, Kamran S (2020) Biomarker CA125 feature engineering and class imbalance learning improves ovarian cancer prediction. In 2020 IEEE Asia-Pacific conference on computer science and data engineering (CSDE), IEEE, pp. 1–6
Yeung M, Sala E, Schönlieb C-B, Rundo L (2022) Unified focal loss: generalising dice and cross entropy-based losses to handle class imbalanced medical image segmentation. Comput Med Imaging Graph 95:102026
Google Scholar
Yijing Li, Haixiang G, Xiao L, Yanan Li, Jinling Li (2016) Adapted ensemble classification algorithm based on multiple classifier system and feature selection for classifying multi-class imbalanced data. Knowl-Based Syst 94:88–104
Google Scholar
Zaidi SS, Abbas MS, Ansari AA, Kanwal N, Asghar M, Lee B (2022) A survey of modern deep learning based object detection models. Digit Signal Process 126:103514
Google Scholar
Zaitoun NM, Aqel MJ (2015) Survey on image segmentation techniques. Procedia Comput Sci 65:797–806
Google Scholar
Zhang Y, Siyuan Lu, Zhou X, Yang M, Lenan Wu, Liu B, Phillips P, Wang S (2016) Comparison of machine learning methods for stationary wavelet entropy-based multiple sclerosis detection: decision tree, k-nearest neighbors, and support vector machine. SIMULATION 92(9):861–871
Google Scholar
Zhang L, Zhang C, Quan S, Xiao H, Kuang G, Liu Li (2020) A class imbalance loss for imbalanced object recognition. IEEE J Select Top Appl Earth Observ Remote Sens 13:2778–2792
Google Scholar
Zhang Y, Bingyi K, Bryan H, Shuicheng Y, Jiashi F (2021) Deep long-tailed learning: a survey. arXiv preprint arXiv:2110.04596
Zhao C, Shuai R, Ma Li, Liu W, Menglin Wu (2022) Improving cervical cancer classification with imbalanced datasets combining taming transformers with T2T-ViT. Multimed Tools Appl 81(17):24265–24300
Google Scholar
Zhao H, Jianping S, Xiaojuan Q, Xiaogang W, Jiaya J (2017) Pyramid scene parsing network. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2881–2890
Zhao R, Buyue Q, Xianli Z, Yang L, Rong W, Yang L, Yinggang P (2020) Rethinking dice loss for medical image segmentation. In 2020 IEEE international conference on data mining (ICDM), IEEE, pp. 851–860
Zhou Z-H, Liu X-Y (2010) On multi-class cost-sensitive learning. Comput Intell 26(3):232–257
MathSciNet Google Scholar
Zhou JT, Zhang L, Fang Z, Jiawei D, Peng X, Xiao Y (2019) Attention-driven loss for anomaly detection in video surveillance. IEEE Trans Circuits Syst Video Technol 30(12):4639–4647
Google Scholar
Zhu X, Liu Y, Li J, Wan T, Qin Z (2018) Emotion classification with data augmentation using generative adversarial networks. Advances in knowledge discovery and data mining. Springer International Publishing, Cham, pp 349–360
Google Scholar

Download references

Author information

Authors and Affiliations

Delhi Technological University, Delhi, India
Manisha Saini & Seba Susan

Authors

Manisha Saini
View author publications
You can also search for this author in PubMed Google Scholar
Seba Susan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.S. wrote the main manuscript text and S.S. reviewed the manuscript.

Corresponding author

Correspondence to Manisha Saini.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Saini, M., Susan, S. Tackling class imbalance in computer vision: a contemporary review. Artif Intell Rev 56 (Suppl 1), 1279–1335 (2023). https://doi.org/10.1007/s10462-023-10557-6

Download citation

Published: 20 July 2023
Issue Date: October 2023
DOI: https://doi.org/10.1007/s10462-023-10557-6

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Tackling class imbalance in computer vision: a contemporary review

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Understanding imbalanced data: XAI & interpretable ML framework

The class imbalance problem in deep learning

Output Layer Multiplication for Class Imbalance Problem in Convolutional Neural Networks

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Tackling class imbalance in computer vision: a contemporary review

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Understanding imbalanced data: XAI & interpretable ML framework

The class imbalance problem in deep learning

Output Layer Multiplication for Class Imbalance Problem in Convolutional Neural Networks

Explore related subjects

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation