Abstract
Class imbalance is a key issue affecting the performance of computer vision applications such as medical image analysis, objection detection and recognition, image segmentation, scene understanding, and many others. Class imbalance refers to the situation when the number of samples in the majority classes outnumber the minority class populations. The model might then get biased towards the majority classes while neglecting the minority classes, adversely affecting the classification performance. In this paper, an extensive literature survey has been conducted to discuss in depth about the class imbalance issues affecting various classification tasks in computer vision. The study analyzes the performance of several contemporary machine learning algorithms such as chi-square support vector machine and gradient boosted decision trees, and deep learning models such as deep pre-trained convolutional networks, generative adversarial networks and vision transformers, for effective learning from imbalanced computer vision datasets. Most of these models either perform data-level manipulation (data augmentation) or cost-sensitive learning (loss functions) or a combination of the two. This survey also includes a summary of novel deep learning frameworks customized to mitigate the effect of class imbalance. It has included recent advancement and new developments in this field such as Explainable AI. The scrutiny of various popular and benchmark imbalanced datasets in computer vision and performance evaluation metrics are also included as a part of this study. Along with that it has emphasized on the research gaps in contemporary literature which would contribute towards future artificial vision models that can learn effectively from imbalanced datasets.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Adadi A, Berrada M (2020) Explainable AI for healthcare: from black box to interpretable models. Embedded systems and artificial intelligence: proceedings of ESAI 2019, Fez, Morocco. Springer Singapore, Singapore, pp 327–337
Afzal S, Maqsood M, Nazir F, Khan U, Aadil F, Awan KM, Mehmood I, Song O-Y (2019) A data augmentation-based framework to handle class imbalance problem for Alzheimer’s stage detection. IEEE Access 7:115528–115539
Akhtar Z, Micheloni C, Foresti GL (2015) Biometric liveness detection: challenges and research opportunities. IEEE Secur Privacy 13(5):63–72
Alaba SY, Nabi MM, Shah C, Prior J, Campbell MD, Wallace F, Ball JE, Moorhead R (2022) Class-aware fish species recognition using deep learning for an imbalanced dataset. Sensors 22(21):8268
Alam TM, Shaukat K, Hameed IA, Luo S, Sarwar MU, Shabbir S, Li J, Khushi M (2020) An investigation of credit card default prediction in the imbalanced datasets. IEEE Access 8:201173–201198
Alam TM, Shaukat K, Khan WA, Hameed IA, Almuqren LA, Raza MA, Aslam M, Luo S (2022) An efficient deep learning-based skin cancer classifier for an imbalanced dataset. Diagnostics 12(9):2115
Alghamdi HS, Amoudi G, Elhag S, Saeedi K, Nasser J (2021) Deep learning approaches for detecting COVID-19 from chest X-ray images: a survey. Ieee Access 9:20235–20254
Alia A, Maree M, Chraibi M (2022) A hybrid deep learning and visualization framework for pushing behavior detection in pedestrian dynamics. Sensors 22(11):4040
Ali-Gombe A, Elyan E (2019) MFC-GAN: Class-imbalanced dataset classification using multiple fake class generative adversarial network. Neurocomputing 361:212–221
Anand A, Kadian T, Shetty MK, Gupta A (2022) Explainable AI decision model for ECG data of cardiac disorders. Biomed Signal Process Control 75:103584
Bai L, Wang L, Chen T, Zhao Y, Ren H (2022) Transformer-based disease identification for small-scale imbalanced capsule endoscopy dataset. Electronics 11(17):2747
Bailey KO, Okolica JS, Peterson GL (2014) User identification and authentication using multi-modal behavioral biometrics. Comput Secur 43:77–89
Barandela R, Rosa MV, Salvador Sánchez J, Francesc JF. (2004) The imbalanced training sample problem: under or over sampling? In structural, syntactic, and statistical pattern recognition: joint IAPR international workshops, SSPR 2004 and SPR 2004, Lisbon, Portugal. Proceedings, Springer Berlin Heidelberg, pp. 806–814
Bhowal P, Sen S, Velasquez JD, Sarkar R (2022) Fuzzy ensemble of deep learning models using choquet fuzzy integral, coalition game and information theory for breast cancer histology classification. Expert Syst Appl 190:116167
Buda M, Maki A, Mazurowski MA (2018) A systematic study of the class imbalance problem in convolutional neural networks. Neural Netw 106:249–259
Carbonell JG, Michalski RS, Mitchell TM (1983) An overview of machine learning. Mach Learn 1:3–23
Carranza-García M, Lara-Benítez P, García-Gutiérrez J, Riquelme JC (2021) Enhancing object detection for autonomous driving by optimizing anchor generation and addressing class imbalance. Neurocomputing 449:229–244
Castiglioni I, Rundo L, Codari M, Di Leo G, Salvatore C, Interlenghi M, Gallivanone F, Cozzi A, Claudia N (2021) AI applications to medical images: from machine learning to deep learning. Phys Med 83:9–24
Cetinic E, Lipic T, Grgic S (2018) Fine-tuning convolutional neural networks for fine art classification. Expert Syst Appl 114:107–118
Chamseddine E, Mansouri N, Soui M, Abed M (2022) Handling class imbalance in COVID-19 chest X-ray images classification: using SMOTE and weighted loss. Appl Soft Comput 129:109588
Chawla NV, Bowyer KW, Hall LO, Philip Kegelmeyer W (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357
Chen Z, Duan J, Kang Li, Qiu G (2021) Class-imbalanced deep learning via a class-balanced ensemble. IEEE Trans Neural Networks Learn Syst 33(10):5626–5640
Chen H, Li C, Yang W, Liu J, An X, Zhao Y (2022) Deep balanced cascade forest: an novel fault diagnosis method for data imbalance. ISA Trans 126:428–439
Chen L-C, George P, Florian S, Hartwig A (2017) Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587
Ciccarelli M, Corradini F, Germani M, Menchi G, Mostarda L, Papetti A, Piangerelli M (2022) SPECTRE: a deep learning network for posture recognition in manufacturing. J Intell Manuf. https://doi.org/10.1007/s10845-022-02014-y
Codella N, Veronica R, Philipp T, Emre Celebi M, Stephen D, David G, Brian H et al. (2019) Skin lesion analysis toward melanoma detection 2018: a challenge hosted by the international skin imaging collaboration (isic). arXiv preprint arXiv:1902.03368
Costa D, Arthur Z, Figueroa HEH, Fracarolli JA (2020) Computer vision based detection of external defects on tomatoes using deep learning. Biosyst Eng 190:131–144
Creswell A, White T, Dumoulin V, Arulkumaran K, Sengupta B, Bharath AA (2018) Generative adversarial networks: an overview. IEEE Signal Process Mag 35(1):53–65
Dai W, Li D, Tang D, Wang H, Peng Y (2022) Deep learning approach for defective spot welds classification using small and class-imbalanced datasets. Neurocomputing 477:46–60
Databases–Laboratório Visão Robótica e Imagem, 2019, https://web.inf.ufpr.br/vri/databases/. (Accessed 28 Nov 2019).
Doersch C (2016) Tutorial on variational autoencoders. arXiv preprint arXiv:1606.05908
Dong Q, Shaogang G, Xiatian Z (2017) Class rectification hard mining for imbalanced deep learning. In Proceedings of the IEEE international conference on computer vision, pp. 1851–1860
Dumagpi JK, Jeong Y-J (2020) Evaluating gan-based image augmentation for threat detection in large-scale x-ray security images. Appl Sci 11(1):36
Esgario JGM, Krohling RA, Ventura JA (2020) Deep learning for classification and severity estimation of coffee leaf biotic stress. Comput Electron Agric 169:105162
Eyepacs and Kaggle. Diabetic retinopathy detection. 2015. url: https://www.kaggle.com/c/diabetic-retinopathy-detection/data
Feng W, Huang W, Bao W (2019) Imbalanced hyperspectral image classification with an adaptive ensemble method based on SMOTE and rotation forest with differentiated sampling rates. IEEE Geosci Remote Sens Lett 16(12):1879–1883
Fernández A, García S, Galar M, Prati RC, Krawczyk B, Herrera F (2018) Learning from imbalanced data sets. Springer, Cham
Fuqua D, Razzaghi T (2020) A cost-sensitive convolution neural network learning for control chart pattern recognition. Expert Syst Appl 150:113275
Gandhi, Shreyansh, Samrat Kokkula, Abon Chaudhuri, Alessandro Magnani, Theban Stanley, Behzad Ahmadi, Venkatesh Kandaswamy, Omer Ovenc, and Shie Mannor. "Scalable detection of offensive and non-compliant content/logo in product images." In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 2247–2256. 2020.
Ghosh K, Bellinger C, Corizzo R, Branco P, Krawczyk B, Japkowicz N (2022) The class imbalance problem in deep learning. Mach Learn. https://doi.org/10.1007/s10994-022-06268-8
Goodfellow I, Pouget-Abadie J, Mirza M, Bing Xu, Warde-Farley D, Ozair S, Courville A, Bengio Y (2020) Generative adversarial networks. Commun ACM 63(11):139–144
Guo X, Yilong Y, Cailing D, Gongping Y, Guangtong Z (2008) On the class imbalance problem. In 2008 Fourth international conference on natural computation, IEEE, vol. 4, pp. 192–201
Gupta H, Verma OP (2022) Monitoring and surveillance of urban road traffic using low altitude drone images: a deep learning approach. Multimed Tools Appl 81:1–21
Gupta A, Anpalagan A, Guan L, Khwaja AS (2021) Deep learning for object detection and scene perception in self-driving cars: survey, challenges, and open issues. Array 10:100057
Haixiang G, Yijing Li, Jennifer Shang Gu, Mingyun HY, Bing G (2017) Learning from class-imbalanced data: review of methods and applications. Expert Syst Appl 73:220–239
Hamad RA, Yang L, Woo WL, Wei B (2020) Joint learning of temporal models to handle imbalanced data for human activity recognition. Appl Sci 10(15):5293
Han K, Wang Y, Chen H, Chen X, Guo J, Liu Z, Tang Y et al (2022) A survey on vision transformer. IEEE Trans Pattern Anal Mach Intell 45(1):87–110
Han S, Chan L, Bonggeon C, Jongwuk L (2021) An empirical study for class imbalance in extreme multi-label text classification. In 2021 IEEE international conference on big data and smart computing (BigComp), IEEE, pp. 338–341
He H, Yang B, Edwardo AG, Shutao L (2008) ADASYN: Adaptive synthetic sampling approach for imbalanced learning. In 2008 IEEE international joint conference on neural networks (IEEE world congress on computational intelligence), IEEE, pp. 1322–1328
He K, Xiangyu Z, Shaoqing R, Jian S (2016) Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778
Heath M, Bowyer K, Kopans D, Kegelmeyer P, Moore R, Chang K, Munishkumaran S (1998) Current status of the digital database for screening mammography. Digit Mammogr Nijmegen 1998:457–460
Hidayat AA, Purwandari K, Cenggoro TW, Pardamean B (2021) A convolutional neural network-based ancient sundanese character classifier with data augmentation. Procedia Comput Sci 179:195–201
Hou R, Chen J, Feng Y, Liu S, He S, Zhou Z (2022) Contrastive-weighted self-supervised model for long-tailed data classification with vision transformer augmented. Mech Syst Signal Process 177:109174
Huang Z, Pan Z, Lei B (2017b) Transfer learning with deep convolutional neural network for SAR target classification with limited labeled data. Remote Sensing 9(9):907
Huang Y, Jin Yi, Li Y, Lin Z (2020) Towards imbalanced image classification: a generative adversarial network ensemble learning method. IEEE Access 8:88399–88409
Huang G, Zhuang L, Laurens Van Der M, Kilian QW (2017a) Densely connected convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4700–4708
Hung J, Anne C (2017) Applying faster R-CNN for object detection on malaria images. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp. 56–61
Hussain E, Mahmudul H, Syed ZH, Tanzina HA, Md Anisur R, Mohammad ZP (2020) Deep learning based binary classification for alzheimer’s disease detection using brain MRI images. In 2020 15th IEEE conference on industrial electronics and applications (ICIEA), IEEE, pp. 1115–1120
Iglesias G, Talavera E, Díaz-Álvarez A (2023) A survey on GANs for computer vision: recent research, analysis and taxonomy. Comput Sci Rev 48:100553
Japkowicz N (2013) Assessment metrics for imbalanced learning. Imbalanced learning: Foundations, algorithms, and applications. John Wiley & Sons, Hoboken, pp 187–206
Jing J, Wang Z, Rätsch M, Zhang H (2022) Mobile-Unet: an efficient convolutional neural network for fabric defect detection. Text Res J 92(1–2):30–42
Johnson JM, Khoshgoftaar TM (2019) Survey on deep learning with class imbalance. J Big Data 6(1):1–54
Kaselimi M, Voulodimos A, Daskalopoulos I, Doulamis N, Doulamis A (2022) A vision transformer model for convolution-free multilabel classification of satellite imagery in deforestation monitoring. IEEE Trans Neural Networks Learn Syst 34:3299
Kaur H, Pannu HS, Malhi AK (2019) A systematic review on imbalanced data challenges in machine learning: applications and solutions. ACM Comput Surv (CSUR) 52(4):1–36
Khan S, Naseer M, Hayat M, Zamir SW, Khan FS, Shah M (2022) Transformers in vision: a survey. ACM Comput Surv (CSUR) 54(10):1–41
Kim Y, Lee Y, Jeon M (2021) Imbalanced image classification with complement cross entropy. Pattern Recogn Lett 151:33–40
Kim P-K, Kil-Taek L. (2017) Vehicle type classification using bagging and convolutional neural network on multi view surveillance image. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp. 41–46
Kotsiantis S, Kanellopoulos D, Pintelas P (2006) Handling imbalanced datasets: a review. GESTS Int Trans Comput Sci Eng 30(1):25–36
Krawczyk B (2016) Learning from imbalanced data: open challenges and future directions. Progress Artif Intell 5(4):221–232
Kulatilleke, Gayan K (2022) Challenges and complexities in machine learning based credit card fraud detection. arXiv preprint arXiv:2208.10943
Kuo W, Häne C, Yuh E, Mukherjee P, Malik J (2018) Cost-sensitive active learning for intracranial hemorrhage detection. Medical image computing and computer assisted intervention–MICCAI 2018. Springer International Publishing, Cham, pp 715–723
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444
Lemaître G, Nogueira F, Aridas CK (2017) Imbalanced-learn: a python toolbox to tackle the curse of imbalanced datasets in machine learning. J Mach Learn Res 18(1):559–563
Li T, Gao Y, Wang K, Guo S, Liu H, Kang H (2019) Diagnostic assessment of deep learning algorithms for diabetic retinopathy screening. Inf Sci 501:511–522
Li L, Zhang S, Wang B (2022) Apple leaf disease identification with a small and imbalanced dataset based on lightweight convolutional networks. Sensors 22(1):173
Li D, Zhang Z, Xiaotang C, Haibin L, Kaiqi H (2016) A richly annotated dataset for pedestrian attribute recognition. arXiv preprint arXiv:1603.07054
Li Y, Tao W, Bingyi K, Sheng T, Chunfeng W, Jintao L, Jiashi F (2020) Overcoming classifier imbalance for long-tail object detection with balanced group softmax. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 10991–11000
Lin T-Y, Priya G, Ross G, Kaiming H, Piotr D (2017) Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision, pp. 2980–2988
Liu W, Zhang M, Luo Z, Cai Y (2017) An ensemble deep learning method for vehicle type classification on visual traffic surveillance sensors. IEEE Access 5:24417–24425
Liu Y, Gao H, Guo L, Qin A, Cai C, You Z (2019) A data-flow oriented deep ensemble learning method for real-time surface defect inspection. IEEE Trans Instrum Meas 69(7):4681–4691
Liu Z, Ping L, Xiaogang W, Xiaoou T (2015) Deep learning face attributes in the wild. In Proceedings of the IEEE international conference on computer vision, pp. 3730–3738
Liu Z, Yutong L, Yue C, Han H, Yixuan W, Zheng Z, Stephen L, Baining G (2021) Swin transformer: hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF international conference on computer vision, pp. 10012–10022
Lu X, Chao M, Bingbing N, Xiaokang Y, Ian R, Ming-Hsuan Y (2018) Deep regression tracking with shrinkage loss. In Proceedings of the European conference on computer vision (ECCV), pp. 353–369
Mahesh B (2020) Machine learning algorithms-a review. Int J Sci Res (IJSR) 9:381–386
Mikołajczyk A, Michał G (2018) Data augmentation for improving deep learning in image classification problem. In 2018 international interdisciplinary PhD workshop (IIPhDW), IEEE, pp. 117–122
Milletari F, Nassir N, Seyed-Ahmad A (2016) V-net: Fully convolutional neural networks for volumetric medical image segmentation. In 2016 fourth international conference on 3D vision (3DV), IEEE, pp. 565–571
Minaee S, Boykov Y, Porikli F, Plaza A, Kehtarnavaz N, Terzopoulos D (2021) Image segmentation using deep learning: a survey. IEEE Trans Pattern Anal Mach Intell 44(7):3523–3542
MobileODT, “Intel and mobileodt cervical can cer screening.” [Online]. Available: https://www.kaggle.com/c/intel-mobileodt-cervical-cancer-screening/data
Mooney P (2020) Breast histopathology images. Kaggle. Accessed Oct 09, 2020. https://www.kaggle.com/datasets/paultimothymooney/breast-histopathology-images
Mushtaq Z, Shun-Feng Su, Tran Q-V (2021) Spectral images based environmental sound classification using CNN with meaningful data augmentation. Appl Acoust 172:107581
Nazki H, Lee J, Yoon S, Park DS (2019) Image-to-image translation with GAN for synthetic data augmentation in plant disease datasets. Smart Med J 8(2):46–57
Nesteruk S, Dmitrii S, Mariia P (2021) Image augmentation for multitask few-shot learning: Agricultural domain use-case. arXiv preprint arXiv:2102.12295
Ngo QT, Yoon S (2020) Facial expression recognition based on weighted-cluster loss and deep transfer learning using a highly imbalanced dataset. Sensors 20(9):2639
Ortiz JAA (2021) COVID-19 X-ray Image classification: a transfer learning approach. University of California, Los Angeles
Papathanasiou ND, Spyridonidis T, Apostolopoulos DJ (2020) Automatic characterization of myocardial perfusion imaging polar maps employing deep learning and data augmentation. Hell J Nucl Med 23:125–132
Park S, Jongin L, Younghan J, Jin YC (2021) Influence-balanced loss for imbalanced visual classification. In Proceedings of the IEEE/CVF international conference on computer vision, pp. 735–744
Pérez-Hernández F, Tabik S, Lamas A, Olmos R, Fujita H, Herrera F (2020) Object detection binary classifiers methodology based on deep learning to identify small objects handled similarly: application in video surveillance. Knowl-Based Syst 194:105590
Phan TH, Kazuma Y (2020) Resolving class imbalance in object detection with weighted cross entropy losses. arXiv preprint arXiv:2006.01413
Pouyanfar S, Chen S-C (2017) Automatic video event detection for imbalance data using enhanced ensemble deep learning. Int J Semant Comput 11(01):85–109
Rácz A, Bajusz D, Héberger K (2021) Effect of dataset size and train/test split ratios in QSAR/QSPR multiclass classification. Molecules 26(4):1111
Rahman MM, Davis DN (2013) Addressing the class imbalance problem in medical datasets. Int J Mach Learn Comput 3(2):224
Rath A, Mishra D, Panda G, Satapathy SC (2021) Heart disease detection using deep learning methods from imbalanced ECG samples. Biomed Signal Process Control 68:102820
Reddy C, Deepak S, Soroush M, Adriana R-S, Samira S, Sina H (2021) Benchmarking bias mitigation algorithms in representation learning through fairness metrics. In thirty-fifth conference on neural information processing systems datasets and benchmarks track (Round 1). url: https://paperswithcode.com/dataset/ci-mnist
Redmon J, Santosh D, Ross G, Ali F (2016) You only look once: unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 779–788
Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: towards real-time object detection with region proposal networks. Adv Neural Inform Process Syst 28:1
Ren J, Mingyuan Z, Cunjun Y, Ziwei L (2022) Balanced mse for imbalanced visual regression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 7926–7935
Reza MS, Ma J (2018) Imbalanced histopathological breast cancer image classification with convolutional neural network. In 2018 14th IEEE international conference on signal processing (ICSP), IEEE, pp. 619–624
Rezaei M, Haojin Y, Christoph M (2018) Conditional generative refinement adversarial networks for unbalanced medical image semantic segmentation. arXiv preprint arXiv:1810.03871
Ronneberger O, Philipp F, Thomas B (2015) U-net: Convolutional networks for biomedical image segmentation. In Medical image computing and computer-assisted intervention–MICCAI 2015: 18th international conference, Munich, Germany, October 5–9, 2015, Proceedings, Part III 18. Springer International Publishing, pp. 234–241
Saini M, Seba S (2022c) Cervical cancer screening on multi-class imbalanced cervigram dataset using transfer learning. In 2022c 15th International congress on image and signal processing, biomedical engineering and informatics (CISP-BMEI), IEEE, pp. 1–6
Saini M, Susan S (2019) Data augmentation of minority class with transfer learning for classification of imbalanced breast cancer dataset using inception-V3. Pattern recognition and image analysis. Springer International Publishing, Cham, pp 409–420
Saini M, Susan S (2020) Deep transfer with minority data augmentation for imbalanced breast cancer dataset. Appl Soft Comput 97:106759
Saini M, Susan S (2021) Bag-of-visual-words codebook generation using deep features for effective classification of imbalanced multi-class image datasets. Multimed Tools Appl 80(14):20821–20847
Saini M, Susan S (2022a) Diabetic retinopathy screening using deep learning for multi-class imbalanced datasets. Comput Biol Med 149:105989
Saini M, Susan S (2022b) Vggin-net: deep transfer network for imbalanced breast cancer dataset. IEEE/ACM Trans Comput Biol Bioinform 20:752
Sambasivam GAOGD, Opiyo GD (2021) A predictive machine learning application in agriculture: Cassava disease detection and classification with imbalanced dataset using convolutional neural networks. Egypt Inform J 22(1):27–34
Sampath V, Maurtua I, Martin JJA, Gutierrez A (2021) A survey on generative adversarial networks for imbalance problems in computer vision tasks. J Big Data 8:1–59
Sandler M, Andrew H, Menglong Z, Andrey Z, Liang-Chieh C (2018) Mobilenetv2: inverted residuals and linear bottlenecks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4510–4520
Saood A, Hatem I (2021) COVID-19 lung CT image segmentation using deep learning methods: U-Net versus SegNet. BMC Med Imaging 21(1):1–10
Sarafianos N, Xiang X, Ioannis AK (2018) Deep imbalanced attribute classification using visual attention aggregation. In Proceedings of the European conference on computer vision (ECCV), pp. 680–697
Seiffert C, Khoshgoftaar TM, Van Hulse J, Napolitano A (2009) RUSBoost: a hybrid approach to alleviating class imbalance. IEEE Trans Syst Man Cybern Part a: Syst Humans 40(1):185–197
Shamshad F, Khan S, Zamir SW, Khan MH, Hayat M, Khan FS, Huazhu F (2023) Transformers in medical imaging: a survey. Med Image Anal 88:102802
Shao S, Zeming L, Tianyuan Z, Chao P, Gang Y, Xiangyu Z, Jing L, Jian S (2019) Objects365: a large-scale, high-quality dataset for object detection. In Proceedings of the IEEE/CVF international conference on computer vision, pp. 8430–8439. url: https://github.com/nkicsl/DDR-dataset
Shaukat K, Luo S, Varadharajan V, Hameed IA, Chen S, Liu D, Li J (2020a) Performance comparison and current challenges of using machine learning techniques in cybersecurity. Energies 13(10):2509
Shaukat K, Luo S, Varadharajan V, Hameed IA, Min Xu (2020b) A survey on machine learning techniques for cyber security in the last decade. IEEE Access 8:222310–222354
Shaukat K, Luo S, Varadharajan V (2023) A novel deep learning-based approach for malware detection. Eng Appl Artif Intell 122:106030
Shauka K, Luo S, Chen S, Liu D, (2020c) Cyber threat detection using machine learning techniques: a performance evaluation perspective. In 2020c international conference on cyber warfare and security (ICCWS), IEEE, pp. 1–6
Shaukat K, Suhuai L, Nadir A, Talha MA, Muhammad ET, Ibrahim AH (2021) An analysis of blessed Friday sale at a retail store using classification models. In 2021 The 4th international conference on software engineering and information management, pp. 193–198
Simonyan K, Andrew Z (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556
Sivapuram AK, Ravi V, Senthil G, Gorthi RK (2023) Visal—a novel learning strategy to address class imbalance. Neural Netw 161:178
Soleymani M, Mahdi B, Hadi M, Farnad N (2021) Construction material classification on imbalanced datasets using vision transformer (ViT) architecture. arXiv preprint arXiv:2108.09527
Sori WJ, Feng J, Godana AW, Liu S, Gelmecha DJ (2021) DFD-Net: lung cancer detection from denoised CT scan image using deep learning. Front Comp Sci 15:1–13
Sozykin K, Stanislav P, Adil K, Rasheed H, Jooyoung L (2018) Multi-label class-imbalanced action recognition in hockey videos via 3D convolutional neural networks. In 2018 19th IEEE/ACIS international conference on software engineering, artificial intelligence, networking and parallel/distributed computing (SNPD), IEEE, pp. 146–151
Srinivas C, Nandini Prasad KS, Zakariah M, Alothaibi YA, Kamran Shaukat B (2022) Deep transfer learning approaches in performance analysis of brain tumor classification using MRI images. J Healthcare Eng 2022:1–17
Sun Y, Kamel MS, Wong AKC, Wang Y (2007) Cost-sensitive boosting for classification of imbalanced data. Pattern Recognit 40(12):3358–3378
Sundararajan K, Woodard DL (2018) Deep learning for biometrics: a survey. ACM Comput Surv (CSUR) 51(3):1–34
Susan S, Ashu K (2022) Localized metric learning for large multi-class extremely imbalanced face database. Database systems for advanced applications. Springer International Publishing, Cham, pp 64–78
Susan S, Kumar A (2019) SSOMaj-SMOTE-SSOMin: three-step intelligent pruning of majority and minority samples for learning from imbalanced datasets. Appl Soft Comput 78:141–149
Susan S, Kumar A (2021) The balancing trick: optimized sampling of imbalanced datasets—a brief survey of the recent state of the art. Eng Rep 3(4):e12298
Susan S, Sethi D, Arora K (2021) CW-CAE: pulmonary nodule detection from imbalanced dataset using class-weighted convolutional autoencoder. International conference on innovative computing and communications. Springer Singapore, Singapore, pp 825–833
Susan S, Ankit K (2016) Auto-segmentation using mean-shift and entropy analysis. In 2016 3rd international conference on computing for sustainable global development (INDIACom), IEEE, pp. 292–296
Susan S, Amitesh K (2020) Hybrid of intelligent minority oversampling and PSO-based intelligent majority undersampling for learning from imbalanced datasets. In Intelligent systems design and applications: 18th international conference on intelligent systems design and applications (ISDA 2018) held in Vellore, India, December 6–8, Springer International Publishing, Vol. 2, pp. 760–769
Szegedy C, Ioffe S, Vanhoucke V, Alemi A (2017) Inception-v4, inception-resnet and the impact of residual connections on learning. Proc AAAI Conf Artif Intell. https://doi.org/10.1609/aaai.v31i1.11231
Szegedy C, Vincent V, Sergey I, Jon S, Zbigniew W (2016) Rethinking the inception architecture for computer vision. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2818–2826
Tajbakhsh N, Shin JY, Gurudu SR, Todd Hurst R, Kendall CB, Gotway MB, Liang J (2016) Convolutional neural networks for medical image analysis: full training or fine tuning? IEEE Trans Med Imaging 35(5):1299–1312
Tan M, Quoc L (2019) Efficientnet: rethinking model scaling for convolutional neural networks." In International conference on machine learning, PMLR, pp. 6105–6114
Tan M, Ruoming P, Quoc VL. (2020) Efficientdet: scalable and efficient object detection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 10781–10790
Tanaka FHKS, Claus A (2019) Data augmentation using GANs. arXiv preprint arXiv:1904.09135
Tanha J, Abdi Y, Samadi N, Razzaghi N, Asadpour M (2020) Boosting methods for multi-class imbalanced data classification: an experimental review. J Big Data 7(1):1–47
Tran GS, Nghiem TP, Nguyen VT, Luong CM, Burie J-C (2019) Improving accuracy of lung nodule classification using deep learning with focal loss. J Healthcare Eng 2019:1–9
Trong VH, Gwang-hyun Y, Dang Thanh V, Jin-young K (2020) Late fusion of multimodal deep neural networks for weeds classification. Comput Electron Agric 175:105506
Vicente S, Joao C, Lourdes A, Jorge B (2014) Reconstructing pascal voc. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 41–48
Wang Y, Weihao G, Jie Y, Wei W, Junjie Y (2019) Dynamic curriculum learning for imbalanced data classification. In Proceedings of the IEEE/CVF international conference on computer vision, pp. 5017–5026
Wang G, Ding H, Duan M, Yuanyuan Pu, Yang Z, Li H (2022) Fighting against terrorism: a real-time CCTV autonomous weapons detection based on improved YOLO v4. Digital Signal Process 132:103790
Wardhani NWS, Masithoh YR, Atiek I, Agus DS, Prayudi L (2019) Cross-validation metrics for evaluating classification performance on imbalanced data. In 2019 International conference on computer, control, informatics and its applications (IC3INA), IEEE, pp. 14–18
Woźniak M, Wieczorek M, Siłka J (2023) BiLSTM deep neural network model for imbalanced medical data of IoT systems. Futur Gener Comput Syst 141:489–499
Wu E, Kevin W, Cox D, Lotter W (2018) Conditional infilling GANs for data augmentation in mammogram classification. Image analysis for moving organ, breast, and thoracic images. Springer, Cham, pp 98–106
Xu F, Uszkoreit H, Yangzhou D, Fan W, Zhao D, Zhu J (2019) Explainable AI: a brief survey on history, research areas, approaches and challenges. Natural language processing and chinese computing. Springer International Publishing, Dunhuang, pp 563–574
Xu B, Zeng Z, Lian C, Ding Z (2021) Semi-supervised low-rank semantics grouping for zero-shot learning. IEEE Trans Image Process 30:2207–2219
Xu M, Yoon S, Fuentes A, Park DS (2023) A comprehensive survey of image augmentation techniques for deep learning. Pattern Recogn 137:109347
Yang L, Yuan G, Zhou H, Liu H, Chen J, Hao Wu (2022) RS-YOLOX: a high-precision detector for object detection in satellite remote sensing images. Appl Sci 12(17):8707
Yang X, Matloob K, Kamran S (2020) Biomarker CA125 feature engineering and class imbalance learning improves ovarian cancer prediction. In 2020 IEEE Asia-Pacific conference on computer science and data engineering (CSDE), IEEE, pp. 1–6
Yeung M, Sala E, Schönlieb C-B, Rundo L (2022) Unified focal loss: generalising dice and cross entropy-based losses to handle class imbalanced medical image segmentation. Comput Med Imaging Graph 95:102026
Yijing Li, Haixiang G, Xiao L, Yanan Li, Jinling Li (2016) Adapted ensemble classification algorithm based on multiple classifier system and feature selection for classifying multi-class imbalanced data. Knowl-Based Syst 94:88–104
Zaidi SS, Abbas MS, Ansari AA, Kanwal N, Asghar M, Lee B (2022) A survey of modern deep learning based object detection models. Digit Signal Process 126:103514
Zaitoun NM, Aqel MJ (2015) Survey on image segmentation techniques. Procedia Comput Sci 65:797–806
Zhang Y, Siyuan Lu, Zhou X, Yang M, Lenan Wu, Liu B, Phillips P, Wang S (2016) Comparison of machine learning methods for stationary wavelet entropy-based multiple sclerosis detection: decision tree, k-nearest neighbors, and support vector machine. SIMULATION 92(9):861–871
Zhang L, Zhang C, Quan S, Xiao H, Kuang G, Liu Li (2020) A class imbalance loss for imbalanced object recognition. IEEE J Select Top Appl Earth Observ Remote Sens 13:2778–2792
Zhang Y, Bingyi K, Bryan H, Shuicheng Y, Jiashi F (2021) Deep long-tailed learning: a survey. arXiv preprint arXiv:2110.04596
Zhao C, Shuai R, Ma Li, Liu W, Menglin Wu (2022) Improving cervical cancer classification with imbalanced datasets combining taming transformers with T2T-ViT. Multimed Tools Appl 81(17):24265–24300
Zhao H, Jianping S, Xiaojuan Q, Xiaogang W, Jiaya J (2017) Pyramid scene parsing network. In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2881–2890
Zhao R, Buyue Q, Xianli Z, Yang L, Rong W, Yang L, Yinggang P (2020) Rethinking dice loss for medical image segmentation. In 2020 IEEE international conference on data mining (ICDM), IEEE, pp. 851–860
Zhou Z-H, Liu X-Y (2010) On multi-class cost-sensitive learning. Comput Intell 26(3):232–257
Zhou JT, Zhang L, Fang Z, Jiawei D, Peng X, Xiao Y (2019) Attention-driven loss for anomaly detection in video surveillance. IEEE Trans Circuits Syst Video Technol 30(12):4639–4647
Zhu X, Liu Y, Li J, Wan T, Qin Z (2018) Emotion classification with data augmentation using generative adversarial networks. Advances in knowledge discovery and data mining. Springer International Publishing, Cham, pp 349–360
Author information
Authors and Affiliations
Contributions
M.S. wrote the main manuscript text and S.S. reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Saini, M., Susan, S. Tackling class imbalance in computer vision: a contemporary review. Artif Intell Rev 56 (Suppl 1), 1279–1335 (2023). https://doi.org/10.1007/s10462-023-10557-6
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10462-023-10557-6