Skip to main content

Advertisement

Log in

Tackling class imbalance in computer vision: a contemporary review

  • Published:
Artificial Intelligence Review Aims and scope Submit manuscript

Abstract

Class imbalance is a key issue affecting the performance of computer vision applications such as medical image analysis, objection detection and recognition, image segmentation, scene understanding, and many others. Class imbalance refers to the situation when the number of samples in the majority classes outnumber the minority class populations. The model might then get biased towards the majority classes while neglecting the minority classes, adversely affecting the classification performance. In this paper, an extensive literature survey has been conducted to discuss in depth about the class imbalance issues affecting various classification tasks in computer vision. The study analyzes the performance of several contemporary machine learning algorithms such as chi-square support vector machine and gradient boosted decision trees, and deep learning models such as deep pre-trained convolutional networks, generative adversarial networks and vision transformers, for effective learning from imbalanced computer vision datasets. Most of these models either perform data-level manipulation (data augmentation) or cost-sensitive learning (loss functions) or a combination of the two. This survey also includes a summary of novel deep learning frameworks customized to mitigate the effect of class imbalance. It has included recent advancement and new developments in this field such as Explainable AI. The scrutiny of various popular and benchmark imbalanced datasets in computer vision and performance evaluation metrics are also included as a part of this study. Along with that it has emphasized on the research gaps in contemporary literature which would contribute towards future artificial vision models that can learn effectively from imbalanced datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  • Adadi A, Berrada M (2020) Explainable AI for healthcare: from black box to interpretable models. Embedded systems and artificial intelligence: proceedings of ESAI 2019, Fez, Morocco. Springer Singapore, Singapore, pp 327–337

    Google Scholar 

  • Afzal S, Maqsood M, Nazir F, Khan U, Aadil F, Awan KM, Mehmood I, Song O-Y (2019) A data augmentation-based framework to handle class imbalance problem for Alzheimer’s stage detection. IEEE Access 7:115528–115539

    Google Scholar 

  • Akhtar Z, Micheloni C, Foresti GL (2015) Biometric liveness detection: challenges and research opportunities. IEEE Secur Privacy 13(5):63–72

    Google Scholar 

  • Alaba SY, Nabi MM, Shah C, Prior J, Campbell MD, Wallace F, Ball JE, Moorhead R (2022) Class-aware fish species recognition using deep learning for an imbalanced dataset. Sensors 22(21):8268

    Google Scholar 

  • Alam TM, Shaukat K, Hameed IA, Luo S, Sarwar MU, Shabbir S, Li J, Khushi M (2020) An investigation of credit card default prediction in the imbalanced datasets. IEEE Access 8:201173–201198

    Google Scholar 

  • Alam TM, Shaukat K, Khan WA, Hameed IA, Almuqren LA, Raza MA, Aslam M, Luo S (2022) An efficient deep learning-based skin cancer classifier for an imbalanced dataset. Diagnostics 12(9):2115

    Google Scholar 

  • Alghamdi HS, Amoudi G, Elhag S, Saeedi K, Nasser J (2021) Deep learning approaches for detecting COVID-19 from chest X-ray images: a survey. Ieee Access 9:20235–20254

    Google Scholar 

  • Alia A, Maree M, Chraibi M (2022) A hybrid deep learning and visualization framework for pushing behavior detection in pedestrian dynamics. Sensors 22(11):4040

    Google Scholar 

  • Ali-Gombe A, Elyan E (2019) MFC-GAN: Class-imbalanced dataset classification using multiple fake class generative adversarial network. Neurocomputing 361:212–221

    Google Scholar 

  • Anand A, Kadian T, Shetty MK, Gupta A (2022) Explainable AI decision model for ECG data of cardiac disorders. Biomed Signal Process Control 75:103584

    Google Scholar 

  • Bai L, Wang L, Chen T, Zhao Y, Ren H (2022) Transformer-based disease identification for small-scale imbalanced capsule endoscopy dataset. Electronics 11(17):2747

    Google Scholar 

  • Bailey KO, Okolica JS, Peterson GL (2014) User identification and authentication using multi-modal behavioral biometrics. Comput Secur 43:77–89

    Google Scholar 

  • Barandela R, Rosa MV, Salvador Sánchez J, Francesc JF. (2004) The imbalanced training sample problem: under or over sampling? In structural, syntactic, and statistical pattern recognition: joint IAPR international workshops, SSPR 2004 and SPR 2004, Lisbon, Portugal. Proceedings, Springer Berlin Heidelberg, pp. 806–814

  • Bhowal P, Sen S, Velasquez JD, Sarkar R (2022) Fuzzy ensemble of deep learning models using choquet fuzzy integral, coalition game and information theory for breast cancer histology classification. Expert Syst Appl 190:116167

    Google Scholar 

  • Buda M, Maki A, Mazurowski MA (2018) A systematic study of the class imbalance problem in convolutional neural networks. Neural Netw 106:249–259

    Google Scholar 

  • Carbonell JG, Michalski RS, Mitchell TM (1983) An overview of machine learning. Mach Learn 1:3–23

    Google Scholar 

  • Carranza-García M, Lara-Benítez P, García-Gutiérrez J, Riquelme JC (2021) Enhancing object detection for autonomous driving by optimizing anchor generation and addressing class imbalance. Neurocomputing 449:229–244

    Google Scholar 

  • Castiglioni I, Rundo L, Codari M, Di Leo G, Salvatore C, Interlenghi M, Gallivanone F, Cozzi A, Claudia N (2021) AI applications to medical images: from machine learning to deep learning. Phys Med 83:9–24

    Google Scholar 

  • Cetinic E, Lipic T, Grgic S (2018) Fine-tuning convolutional neural networks for fine art classification. Expert Syst Appl 114:107–118

    Google Scholar 

  • Chamseddine E, Mansouri N, Soui M, Abed M (2022) Handling class imbalance in COVID-19 chest X-ray images classification: using SMOTE and weighted loss. Appl Soft Comput 129:109588

    Google Scholar 

  • Chawla NV, Bowyer KW, Hall LO, Philip Kegelmeyer W (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357

    MATH  Google Scholar 

  • Chen Z, Duan J, Kang Li, Qiu G (2021) Class-imbalanced deep learning via a class-balanced ensemble. IEEE Trans Neural Networks Learn Syst 33(10):5626–5640

    Google Scholar 

  • Chen H, Li C, Yang W, Liu J, An X, Zhao Y (2022) Deep balanced cascade forest: an novel fault diagnosis method for data imbalance. ISA Trans 126:428–439

    Google Scholar 

  • Chen L-C, George P, Florian S, Hartwig A (2017) Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587

  • Ciccarelli M, Corradini F, Germani M, Menchi G, Mostarda L, Papetti A, Piangerelli M (2022) SPECTRE: a deep learning network for posture recognition in manufacturing. J Intell Manuf. https://doi.org/10.1007/s10845-022-02014-y

    Article  Google Scholar 

  • Codella N, Veronica R, Philipp T, Emre Celebi M, Stephen D, David G, Brian H et al. (2019) Skin lesion analysis toward melanoma detection 2018: a challenge hosted by the international skin imaging collaboration (isic). arXiv preprint arXiv:1902.03368

  • Costa D, Arthur Z, Figueroa HEH, Fracarolli JA (2020) Computer vision based detection of external defects on tomatoes using deep learning. Biosyst Eng 190:131–144

    Google Scholar 

  • Creswell A, White T, Dumoulin V, Arulkumaran K, Sengupta B, Bharath AA (2018) Generative adversarial networks: an overview. IEEE Signal Process Mag 35(1):53–65

    Google Scholar 

  • Dai W, Li D, Tang D, Wang H, Peng Y (2022) Deep learning approach for defective spot welds classification using small and class-imbalanced datasets. Neurocomputing 477:46–60

    Google Scholar 

  • Databases–Laboratório Visão Robótica e Imagem, 2019, https://web.inf.ufpr.br/vri/databases/. (Accessed 28 Nov 2019).

  • Doersch C (2016) Tutorial on variational autoencoders. arXiv preprint arXiv:1606.05908

  • Dong Q, Shaogang G, Xiatian Z (2017) Class rectification hard mining for imbalanced deep learning. In Proceedings of the IEEE international conference on computer vision, pp. 1851–1860

  • Dumagpi JK, Jeong Y-J (2020) Evaluating gan-based image augmentation for threat detection in large-scale x-ray security images. Appl Sci 11(1):36

    Google Scholar 

  • Esgario JGM, Krohling RA, Ventura JA (2020) Deep learning for classification and severity estimation of coffee leaf biotic stress. Comput Electron Agric 169:105162

    Google Scholar 

  • Eyepacs and Kaggle. Diabetic retinopathy detection. 2015. url: https://www.kaggle.com/c/diabetic-retinopathy-detection/data

  • Feng W, Huang W, Bao W (2019) Imbalanced hyperspectral image classification with an adaptive ensemble method based on SMOTE and rotation forest with differentiated sampling rates. IEEE Geosci Remote Sens Lett 16(12):1879–1883

    Google Scholar 

  • Fernández A, García S, Galar M, Prati RC, Krawczyk B, Herrera F (2018) Learning from imbalanced data sets. Springer, Cham

    Google Scholar 

  • Fuqua D, Razzaghi T (2020) A cost-sensitive convolution neural network learning for control chart pattern recognition. Expert Syst Appl 150:113275

    Google Scholar 

  • Gandhi, Shreyansh, Samrat Kokkula, Abon Chaudhuri, Alessandro Magnani, Theban Stanley, Behzad Ahmadi, Venkatesh Kandaswamy, Omer Ovenc, and Shie Mannor. "Scalable detection of offensive and non-compliant content/logo in product images." In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 2247–2256. 2020.

  • Ghosh K, Bellinger C, Corizzo R, Branco P, Krawczyk B, Japkowicz N (2022) The class imbalance problem in deep learning. Mach Learn. https://doi.org/10.1007/s10994-022-06268-8

    Article  Google Scholar 

  • Goodfellow I, Pouget-Abadie J, Mirza M, Bing Xu, Warde-Farley D, Ozair S, Courville A, Bengio Y (2020) Generative adversarial networks. Commun ACM 63(11):139–144

    MathSciNet  Google Scholar 

  • Guo X, Yilong Y, Cailing D, Gongping Y, Guangtong Z (2008) On the class imbalance problem. In 2008 Fourth international conference on natural computation, IEEE, vol. 4, pp. 192–201

  • Gupta H, Verma OP (2022) Monitoring and surveillance of urban road traffic using low altitude drone images: a deep learning approach. Multimed Tools Appl 81:1–21

    Google Scholar 

  • Gupta A, Anpalagan A, Guan L, Khwaja AS (2021) Deep learning for object detection and scene perception in self-driving cars: survey, challenges, and open issues. Array 10:100057

    Google Scholar 

  • Haixiang G, Yijing Li, Jennifer Shang Gu, Mingyun HY, Bing G (2017) Learning from class-imbalanced data: review of methods and applications. Expert Syst Appl 73:220–239

    Google Scholar 

  • Hamad RA, Yang L, Woo WL, Wei B (2020) Joint learning of temporal models to handle imbalanced data for human activity recognition. Appl Sci 10(15):5293

    Google Scholar 

  • Han K, Wang Y, Chen H, Chen X, Guo J, Liu Z, Tang Y et al (2022) A survey on vision transformer. IEEE Trans Pattern Anal Mach Intell 45(1):87–110

    Google Scholar 

  • Han S, Chan L, Bonggeon C, Jongwuk L (2021) An empirical study for class imbalance in extreme multi-label text classification. In 2021 IEEE international conference on big data and smart computing (BigComp), IEEE, pp. 338–341

  • He H, Yang B, Edwardo AG, Shutao L (2008) ADASYN: Adaptive synthetic sampling approach for imbalanced learning. In 2008 IEEE international joint conference on neural networks (IEEE world congress on computational intelligence), IEEE, pp. 1322–1328

  • He K, Xiangyu Z, Shaoqing R, Jian S (2016) Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778

  • Heath M, Bowyer K, Kopans D, Kegelmeyer P, Moore R, Chang K, Munishkumaran S (1998) Current status of the digital database for screening mammography. Digit Mammogr Nijmegen 1998:457–460

    Google Scholar 

  • Hidayat AA, Purwandari K, Cenggoro TW, Pardamean B (2021) A convolutional neural network-based ancient sundanese character classifier with data augmentation. Procedia Comput Sci 179:195–201

    Google Scholar 

  • Hou R, Chen J, Feng Y, Liu S, He S, Zhou Z (2022) Contrastive-weighted self-supervised model for long-tailed data classification with vision transformer augmented. Mech Syst Signal Process 177:109174

    Google Scholar 

  • Huang Z, Pan Z, Lei B (2017b) Transfer learning with deep convolutional neural network for SAR target classification with limited labeled data. Remote Sensing 9(9):907

    Google Scholar 

  • Huang Y, Jin Yi, Li Y, Lin Z (2020) Towards imbalanced image classification: a generative adversarial network ensemble learning method. IEEE Access 8:88399–88409

    Google Scholar 

  • Huang G, Zhuang L, Laurens Van Der M, Kilian QW (2017a) Densely connected convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4700–4708

  • Hung J, Anne C (2017) Applying faster R-CNN for object detection on malaria images. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp. 56–61

  • Hussain E, Mahmudul H, Syed ZH, Tanzina HA, Md Anisur R, Mohammad ZP (2020) Deep learning based binary classification for alzheimer’s disease detection using brain MRI images. In 2020 15th IEEE conference on industrial electronics and applications (ICIEA), IEEE, pp. 1115–1120

  • Iglesias G, Talavera E, Díaz-Álvarez A (2023) A survey on GANs for computer vision: recent research, analysis and taxonomy. Comput Sci Rev 48:100553

    MathSciNet  MATH  Google Scholar 

  • Japkowicz N (2013) Assessment metrics for imbalanced learning. Imbalanced learning: Foundations, algorithms, and applications. John Wiley & Sons, Hoboken, pp 187–206

    Google Scholar 

  • Jing J, Wang Z, Rätsch M, Zhang H (2022) Mobile-Unet: an efficient convolutional neural network for fabric defect detection. Text Res J 92(1–2):30–42

    Google Scholar 

  • Johnson JM, Khoshgoftaar TM (2019) Survey on deep learning with class imbalance. J Big Data 6(1):1–54

    Google Scholar 

  • Kaselimi M, Voulodimos A, Daskalopoulos I, Doulamis N, Doulamis A (2022) A vision transformer model for convolution-free multilabel classification of satellite imagery in deforestation monitoring. IEEE Trans Neural Networks Learn Syst 34:3299

    Google Scholar 

  • Kaur H, Pannu HS, Malhi AK (2019) A systematic review on imbalanced data challenges in machine learning: applications and solutions. ACM Comput Surv (CSUR) 52(4):1–36

    Google Scholar 

  • Khan S, Naseer M, Hayat M, Zamir SW, Khan FS, Shah M (2022) Transformers in vision: a survey. ACM Comput Surv (CSUR) 54(10):1–41

    Google Scholar 

  • Kim Y, Lee Y, Jeon M (2021) Imbalanced image classification with complement cross entropy. Pattern Recogn Lett 151:33–40

    Google Scholar 

  • Kim P-K, Kil-Taek L. (2017) Vehicle type classification using bagging and convolutional neural network on multi view surveillance image. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp. 41–46

  • Kotsiantis S, Kanellopoulos D, Pintelas P (2006) Handling imbalanced datasets: a review. GESTS Int Trans Comput Sci Eng 30(1):25–36

    Google Scholar 

  • Krawczyk B (2016) Learning from imbalanced data: open challenges and future directions. Progress Artif Intell 5(4):221–232

    Google Scholar 

  • Kulatilleke, Gayan K (2022) Challenges and complexities in machine learning based credit card fraud detection. arXiv preprint arXiv:2208.10943

  • Kuo W, Häne C, Yuh E, Mukherjee P, Malik J (2018) Cost-sensitive active learning for intracranial hemorrhage detection. Medical image computing and computer assisted intervention–MICCAI 2018. Springer International Publishing, Cham, pp 715–723

    Google Scholar 

  • LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444

    Google Scholar 

  • Lemaître G, Nogueira F, Aridas CK (2017) Imbalanced-learn: a python toolbox to tackle the curse of imbalanced datasets in machine learning. J Mach Learn Res 18(1):559–563

    Google Scholar 

  • Li T, Gao Y, Wang K, Guo S, Liu H, Kang H (2019) Diagnostic assessment of deep learning algorithms for diabetic retinopathy screening. Inf Sci 501:511–522

    Google Scholar 

  • Li L, Zhang S, Wang B (2022) Apple leaf disease identification with a small and imbalanced dataset based on lightweight convolutional networks. Sensors 22(1):173

    Google Scholar 

  • Li D, Zhang Z, Xiaotang C, Haibin L, Kaiqi H (2016) A richly annotated dataset for pedestrian attribute recognition. arXiv preprint arXiv:1603.07054

  • Li Y, Tao W, Bingyi K, Sheng T, Chunfeng W, Jintao L, Jiashi F (2020) Overcoming classifier imbalance for long-tail object detection with balanced group softmax. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 10991–11000

  • Lin T-Y, Priya G, Ross G, Kaiming H, Piotr D (2017) Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision, pp. 2980–2988

  • Liu W, Zhang M, Luo Z, Cai Y (2017) An ensemble deep learning method for vehicle type classification on visual traffic surveillance sensors. IEEE Access 5:24417–24425

    Google Scholar 

  • Liu Y, Gao H, Guo L, Qin A, Cai C, You Z (2019) A data-flow oriented deep ensemble learning method for real-time surface defect inspection. IEEE Trans Instrum Meas 69(7):4681–4691

    Google Scholar 

  • Liu Z, Ping L, Xiaogang W, Xiaoou T (2015) Deep learning face attributes in the wild. In Proceedings of the IEEE international conference on computer vision, pp. 3730–3738

  • Liu Z, Yutong L, Yue C, Han H, Yixuan W, Zheng Z, Stephen L, Baining G (2021) Swin transformer: hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF international conference on computer vision, pp. 10012–10022

  • Lu X, Chao M, Bingbing N, Xiaokang Y, Ian R, Ming-Hsuan Y (2018) Deep regression tracking with shrinkage loss. In Proceedings of the European conference on computer vision (ECCV), pp. 353–369

  • Mahesh B (2020) Machine learning algorithms-a review. Int J Sci Res (IJSR) 9:381–386

    Google Scholar 

  • Mikołajczyk A, Michał G (2018) Data augmentation for improving deep learning in image classification problem. In 2018 international interdisciplinary PhD workshop (IIPhDW), IEEE, pp. 117–122

  • Milletari F, Nassir N, Seyed-Ahmad A (2016) V-net: Fully convolutional neural networks for volumetric medical image segmentation. In 2016 fourth international conference on 3D vision (3DV), IEEE, pp. 565–571

  • Minaee S, Boykov Y, Porikli F, Plaza A, Kehtarnavaz N, Terzopoulos D (2021) Image segmentation using deep learning: a survey. IEEE Trans Pattern Anal Mach Intell 44(7):3523–3542

    Google Scholar 

  • MobileODT, “Intel and mobileodt cervical can cer screening.” [Online]. Available: https://www.kaggle.com/c/intel-mobileodt-cervical-cancer-screening/data

  • Mooney P (2020) Breast histopathology images. Kaggle. Accessed Oct 09, 2020. https://www.kaggle.com/datasets/paultimothymooney/breast-histopathology-images

  • Mushtaq Z, Shun-Feng Su, Tran Q-V (2021) Spectral images based environmental sound classification using CNN with meaningful data augmentation. Appl Acoust 172:107581

    Google Scholar 

  • Nazki H, Lee J, Yoon S, Park DS (2019) Image-to-image translation with GAN for synthetic data augmentation in plant disease datasets. Smart Med J 8(2):46–57

    Google Scholar 

  • Nesteruk S, Dmitrii S, Mariia P (2021) Image augmentation for multitask few-shot learning: Agricultural domain use-case. arXiv preprint arXiv:2102.12295

  • Ngo QT, Yoon S (2020) Facial expression recognition based on weighted-cluster loss and deep transfer learning using a highly imbalanced dataset. Sensors 20(9):2639

    Google Scholar 

  • Ortiz JAA (2021) COVID-19 X-ray Image classification: a transfer learning approach. University of California, Los Angeles

    Google Scholar 

  • Papathanasiou ND, Spyridonidis T, Apostolopoulos DJ (2020) Automatic characterization of myocardial perfusion imaging polar maps employing deep learning and data augmentation. Hell J Nucl Med 23:125–132

    Google Scholar 

  • Park S, Jongin L, Younghan J, Jin YC (2021) Influence-balanced loss for imbalanced visual classification. In Proceedings of the IEEE/CVF international conference on computer vision, pp. 735–744

  • Pérez-Hernández F, Tabik S, Lamas A, Olmos R, Fujita H, Herrera F (2020) Object detection binary classifiers methodology based on deep learning to identify small objects handled similarly: application in video surveillance. Knowl-Based Syst 194:105590

    Google Scholar 

  • Phan TH, Kazuma Y (2020) Resolving class imbalance in object detection with weighted cross entropy losses. arXiv preprint arXiv:2006.01413

  • Pouyanfar S, Chen S-C (2017) Automatic video event detection for imbalance data using enhanced ensemble deep learning. Int J Semant Comput 11(01):85–109

    Google Scholar 

  • Rácz A, Bajusz D, Héberger K (2021) Effect of dataset size and train/test split ratios in QSAR/QSPR multiclass classification. Molecules 26(4):1111

    Google Scholar 

  • Rahman MM, Davis DN (2013) Addressing the class imbalance problem in medical datasets. Int J Mach Learn Comput 3(2):224

    Google Scholar 

  • Rath A, Mishra D, Panda G, Satapathy SC (2021) Heart disease detection using deep learning methods from imbalanced ECG samples. Biomed Signal Process Control 68:102820

    Google Scholar 

  • Reddy C, Deepak S, Soroush M, Adriana R-S, Samira S, Sina H (2021) Benchmarking bias mitigation algorithms in representation learning through fairness metrics. In thirty-fifth conference on neural information processing systems datasets and benchmarks track (Round 1). url: https://paperswithcode.com/dataset/ci-mnist

  • Redmon J, Santosh D, Ross G, Ali F (2016) You only look once: unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 779–788

  • Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: towards real-time object detection with region proposal networks. Adv Neural Inform Process Syst 28:1

    Google Scholar 

  • Ren J, Mingyuan Z, Cunjun Y, Ziwei L (2022) Balanced mse for imbalanced visual regression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7926–7935

  • Reza MS, Ma J (2018) Imbalanced histopathological breast cancer image classification with convolutional neural network. In 2018 14th IEEE international conference on signal processing (ICSP), IEEE, pp. 619–624

  • Rezaei M, Haojin Y, Christoph M (2018) Conditional generative refinement adversarial networks for unbalanced medical image semantic segmentation. arXiv preprint arXiv:1810.03871

  • Ronneberger O, Philipp F, Thomas B (2015) U-net: Convolutional networks for biomedical image segmentation. In Medical image computing and computer-assisted intervention–MICCAI 2015: 18th international conference, Munich, Germany, October 5–9, 2015, Proceedings, Part III 18. Springer International Publishing, pp. 234–241

  • Saini M, Seba S (2022c) Cervical cancer screening on multi-class imbalanced cervigram dataset using transfer learning. In 2022c 15th International congress on image and signal processing, biomedical engineering and informatics (CISP-BMEI), IEEE, pp. 1–6

  • Saini M, Susan S (2019) Data augmentation of minority class with transfer learning for classification of imbalanced breast cancer dataset using inception-V3. Pattern recognition and image analysis. Springer International Publishing, Cham, pp 409–420

    Google Scholar 

  • Saini M, Susan S (2020) Deep transfer with minority data augmentation for imbalanced breast cancer dataset. Appl Soft Comput 97:106759

    Google Scholar 

  • Saini M, Susan S (2021) Bag-of-visual-words codebook generation using deep features for effective classification of imbalanced multi-class image datasets. Multimed Tools Appl 80(14):20821–20847

    Google Scholar 

  • Saini M, Susan S (2022a) Diabetic retinopathy screening using deep learning for multi-class imbalanced datasets. Comput Biol Med 149:105989

    Google Scholar 

  • Saini M, Susan S (2022b) Vggin-net: deep transfer network for imbalanced breast cancer dataset. IEEE/ACM Trans Comput Biol Bioinform 20:752

    Google Scholar 

  • Sambasivam GAOGD, Opiyo GD (2021) A predictive machine learning application in agriculture: Cassava disease detection and classification with imbalanced dataset using convolutional neural networks. Egypt Inform J 22(1):27–34

    Google Scholar 

  • Sampath V, Maurtua I, Martin JJA, Gutierrez A (2021) A survey on generative adversarial networks for imbalance problems in computer vision tasks. J Big Data 8:1–59

    Google Scholar 

  • Sandler M, Andrew H, Menglong Z, Andrey Z, Liang-Chieh C (2018) Mobilenetv2: inverted residuals and linear bottlenecks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4510–4520

  • Saood A, Hatem I (2021) COVID-19 lung CT image segmentation using deep learning methods: U-Net versus SegNet. BMC Med Imaging 21(1):1–10

    Google Scholar 

  • Sarafianos N, Xiang X, Ioannis AK (2018) Deep imbalanced attribute classification using visual attention aggregation. In Proceedings of the European conference on computer vision (ECCV), pp. 680–697

  • Seiffert C, Khoshgoftaar TM, Van Hulse J, Napolitano A (2009) RUSBoost: a hybrid approach to alleviating class imbalance. IEEE Trans Syst Man Cybern Part a: Syst Humans 40(1):185–197

    Google Scholar 

  • Shamshad F, Khan S, Zamir SW, Khan MH, Hayat M, Khan FS, Huazhu F (2023) Transformers in medical imaging: a survey. Med Image Anal 88:102802

    Google Scholar 

  • Shao S, Zeming L, Tianyuan Z, Chao P, Gang Y, Xiangyu Z, Jing L, Jian S (2019) Objects365: a large-scale, high-quality dataset for object detection. In Proceedings of the IEEE/CVF international conference on computer vision, pp. 8430–8439. url: https://github.com/nkicsl/DDR-dataset

  • Shaukat K, Luo S, Varadharajan V, Hameed IA, Chen S, Liu D, Li J (2020a) Performance comparison and current challenges of using machine learning techniques in cybersecurity. Energies 13(10):2509

    Google Scholar 

  • Shaukat K, Luo S, Varadharajan V, Hameed IA, Min Xu (2020b) A survey on machine learning techniques for cyber security in the last decade. IEEE Access 8:222310–222354

    Google Scholar 

  • Shaukat K, Luo S, Varadharajan V (2023) A novel deep learning-based approach for malware detection. Eng Appl Artif Intell 122:106030

    Google Scholar 

  • Shauka K, Luo S, Chen S, Liu D, (2020c) Cyber threat detection using machine learning techniques: a performance evaluation perspective. In 2020c international conference on cyber warfare and security (ICCWS), IEEE, pp. 1–6

  • Shaukat K, Suhuai L, Nadir A, Talha MA, Muhammad ET, Ibrahim AH (2021) An analysis of blessed Friday sale at a retail store using classification models. In 2021 The 4th international conference on software engineering and information management, pp. 193–198

  • Simonyan K, Andrew Z (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556

  • Sivapuram AK, Ravi V, Senthil G, Gorthi RK (2023) Visal—a novel learning strategy to address class imbalance. Neural Netw 161:178

    Google Scholar 

  • Soleymani M, Mahdi B, Hadi M, Farnad N (2021) Construction material classification on imbalanced datasets using vision transformer (ViT) architecture. arXiv preprint arXiv:2108.09527

  • Sori WJ, Feng J, Godana AW, Liu S, Gelmecha DJ (2021) DFD-Net: lung cancer detection from denoised CT scan image using deep learning. Front Comp Sci 15:1–13

    Google Scholar 

  • Sozykin K, Stanislav P, Adil K, Rasheed H, Jooyoung L (2018) Multi-label class-imbalanced action recognition in hockey videos via 3D convolutional neural networks. In 2018 19th IEEE/ACIS international conference on software engineering, artificial intelligence, networking and parallel/distributed computing (SNPD), IEEE, pp. 146–151

  • Srinivas C, Nandini Prasad KS, Zakariah M, Alothaibi YA, Kamran Shaukat B (2022) Deep transfer learning approaches in performance analysis of brain tumor classification using MRI images. J Healthcare Eng 2022:1–17

    Google Scholar 

  • Sun Y, Kamel MS, Wong AKC, Wang Y (2007) Cost-sensitive boosting for classification of imbalanced data. Pattern Recognit 40(12):3358–3378

    MATH  Google Scholar 

  • Sundararajan K, Woodard DL (2018) Deep learning for biometrics: a survey. ACM Comput Surv (CSUR) 51(3):1–34

    Google Scholar 

  • Susan S, Ashu K (2022) Localized metric learning for large multi-class extremely imbalanced face database. Database systems for advanced applications. Springer International Publishing, Cham, pp 64–78

    Google Scholar 

  • Susan S, Kumar A (2019) SSOMaj-SMOTE-SSOMin: three-step intelligent pruning of majority and minority samples for learning from imbalanced datasets. Appl Soft Comput 78:141–149

    Google Scholar 

  • Susan S, Kumar A (2021) The balancing trick: optimized sampling of imbalanced datasets—a brief survey of the recent state of the art. Eng Rep 3(4):e12298

    Google Scholar 

  • Susan S, Sethi D, Arora K (2021) CW-CAE: pulmonary nodule detection from imbalanced dataset using class-weighted convolutional autoencoder. International conference on innovative computing and communications. Springer Singapore, Singapore, pp 825–833

    Google Scholar 

  • Susan S, Ankit K (2016) Auto-segmentation using mean-shift and entropy analysis. In 2016 3rd international conference on computing for sustainable global development (INDIACom), IEEE, pp. 292–296

  • Susan S, Amitesh K (2020) Hybrid of intelligent minority oversampling and PSO-based intelligent majority undersampling for learning from imbalanced datasets. In Intelligent systems design and applications: 18th international conference on intelligent systems design and applications (ISDA 2018) held in Vellore, India, December 6–8, Springer International Publishing, Vol. 2, pp. 760–769

  • Szegedy C, Ioffe S, Vanhoucke V, Alemi A (2017) Inception-v4, inception-resnet and the impact of residual connections on learning. Proc AAAI Conf Artif Intell. https://doi.org/10.1609/aaai.v31i1.11231

    Article  Google Scholar 

  • Szegedy C, Vincent V, Sergey I, Jon S, Zbigniew W (2016) Rethinking the inception architecture for computer vision. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2818–2826

  • Tajbakhsh N, Shin JY, Gurudu SR, Todd Hurst R, Kendall CB, Gotway MB, Liang J (2016) Convolutional neural networks for medical image analysis: full training or fine tuning? IEEE Trans Med Imaging 35(5):1299–1312

    Google Scholar 

  • Tan M, Quoc L (2019) Efficientnet: rethinking model scaling for convolutional neural networks." In International conference on machine learning, PMLR, pp. 6105–6114

  • Tan M, Ruoming P, Quoc VL. (2020) Efficientdet: scalable and efficient object detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 10781–10790

  • Tanaka FHKS, Claus A (2019) Data augmentation using GANs. arXiv preprint arXiv:1904.09135

  • Tanha J, Abdi Y, Samadi N, Razzaghi N, Asadpour M (2020) Boosting methods for multi-class imbalanced data classification: an experimental review. J Big Data 7(1):1–47

    Google Scholar 

  • Tran GS, Nghiem TP, Nguyen VT, Luong CM, Burie J-C (2019) Improving accuracy of lung nodule classification using deep learning with focal loss. J Healthcare Eng 2019:1–9

    Google Scholar 

  • Trong VH, Gwang-hyun Y, Dang Thanh V, Jin-young K (2020) Late fusion of multimodal deep neural networks for weeds classification. Comput Electron Agric 175:105506

    Google Scholar 

  • Vicente S, Joao C, Lourdes A, Jorge B (2014) Reconstructing pascal voc. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 41–48

  • Wang Y, Weihao G, Jie Y, Wei W, Junjie Y (2019) Dynamic curriculum learning for imbalanced data classification. In Proceedings of the IEEE/CVF international conference on computer vision, pp. 5017–5026

  • Wang G, Ding H, Duan M, Yuanyuan Pu, Yang Z, Li H (2022) Fighting against terrorism: a real-time CCTV autonomous weapons detection based on improved YOLO v4. Digital Signal Process 132:103790

    Google Scholar 

  • Wardhani NWS, Masithoh YR, Atiek I, Agus DS, Prayudi L (2019) Cross-validation metrics for evaluating classification performance on imbalanced data. In 2019 International conference on computer, control, informatics and its applications (IC3INA), IEEE, pp. 14–18

  • Woźniak M, Wieczorek M, Siłka J (2023) BiLSTM deep neural network model for imbalanced medical data of IoT systems. Futur Gener Comput Syst 141:489–499

    Google Scholar 

  • Wu E, Kevin W, Cox D, Lotter W (2018) Conditional infilling GANs for data augmentation in mammogram classification. Image analysis for moving organ, breast, and thoracic images. Springer, Cham, pp 98–106

    Google Scholar 

  • Xu F, Uszkoreit H, Yangzhou D, Fan W, Zhao D, Zhu J (2019) Explainable AI: a brief survey on history, research areas, approaches and challenges. Natural language processing and chinese computing. Springer International Publishing, Dunhuang, pp 563–574

    Google Scholar 

  • Xu B, Zeng Z, Lian C, Ding Z (2021) Semi-supervised low-rank semantics grouping for zero-shot learning. IEEE Trans Image Process 30:2207–2219

    Google Scholar 

  • Xu M, Yoon S, Fuentes A, Park DS (2023) A comprehensive survey of image augmentation techniques for deep learning. Pattern Recogn 137:109347

    Google Scholar 

  • Yang L, Yuan G, Zhou H, Liu H, Chen J, Hao Wu (2022) RS-YOLOX: a high-precision detector for object detection in satellite remote sensing images. Appl Sci 12(17):8707

    Google Scholar 

  • Yang X, Matloob K, Kamran S (2020) Biomarker CA125 feature engineering and class imbalance learning improves ovarian cancer prediction. In 2020 IEEE Asia-Pacific conference on computer science and data engineering (CSDE), IEEE, pp. 1–6

  • Yeung M, Sala E, Schönlieb C-B, Rundo L (2022) Unified focal loss: generalising dice and cross entropy-based losses to handle class imbalanced medical image segmentation. Comput Med Imaging Graph 95:102026

    Google Scholar 

  • Yijing Li, Haixiang G, Xiao L, Yanan Li, Jinling Li (2016) Adapted ensemble classification algorithm based on multiple classifier system and feature selection for classifying multi-class imbalanced data. Knowl-Based Syst 94:88–104

    Google Scholar 

  • Zaidi SS, Abbas MS, Ansari AA, Kanwal N, Asghar M, Lee B (2022) A survey of modern deep learning based object detection models. Digit Signal Process 126:103514

    Google Scholar 

  • Zaitoun NM, Aqel MJ (2015) Survey on image segmentation techniques. Procedia Comput Sci 65:797–806

    Google Scholar 

  • Zhang Y, Siyuan Lu, Zhou X, Yang M, Lenan Wu, Liu B, Phillips P, Wang S (2016) Comparison of machine learning methods for stationary wavelet entropy-based multiple sclerosis detection: decision tree, k-nearest neighbors, and support vector machine. SIMULATION 92(9):861–871

    Google Scholar 

  • Zhang L, Zhang C, Quan S, Xiao H, Kuang G, Liu Li (2020) A class imbalance loss for imbalanced object recognition. IEEE J Select Top Appl Earth Observ Remote Sens 13:2778–2792

    Google Scholar 

  • Zhang Y, Bingyi K, Bryan H, Shuicheng Y, Jiashi F (2021) Deep long-tailed learning: a survey. arXiv preprint arXiv:2110.04596

  • Zhao C, Shuai R, Ma Li, Liu W, Menglin Wu (2022) Improving cervical cancer classification with imbalanced datasets combining taming transformers with T2T-ViT. Multimed Tools Appl 81(17):24265–24300

    Google Scholar 

  • Zhao H, Jianping S, Xiaojuan Q, Xiaogang W, Jiaya J (2017) Pyramid scene parsing network. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2881–2890

  • Zhao R, Buyue Q, Xianli Z, Yang L, Rong W, Yang L, Yinggang P (2020) Rethinking dice loss for medical image segmentation. In 2020 IEEE international conference on data mining (ICDM), IEEE, pp. 851–860

  • Zhou Z-H, Liu X-Y (2010) On multi-class cost-sensitive learning. Comput Intell 26(3):232–257

    MathSciNet  Google Scholar 

  • Zhou JT, Zhang L, Fang Z, Jiawei D, Peng X, Xiao Y (2019) Attention-driven loss for anomaly detection in video surveillance. IEEE Trans Circuits Syst Video Technol 30(12):4639–4647

    Google Scholar 

  • Zhu X, Liu Y, Li J, Wan T, Qin Z (2018) Emotion classification with data augmentation using generative adversarial networks. Advances in knowledge discovery and data mining. Springer International Publishing, Cham, pp 349–360

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Contributions

M.S. wrote the main manuscript text and S.S. reviewed the manuscript.

Corresponding author

Correspondence to Manisha Saini.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Saini, M., Susan, S. Tackling class imbalance in computer vision: a contemporary review. Artif Intell Rev 56 (Suppl 1), 1279–1335 (2023). https://doi.org/10.1007/s10462-023-10557-6

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10462-023-10557-6

Keywords

Navigation