1 Introduction

Breast cancer is the most common cancer among women in many countries and regions [7], and it is the leading cause of cancer death among women in 103 countries. New breast cancer cases were projected to account for 30 % of all new malignant tumor cases among US women in 2020 [29]. Many screening methods have been developed to diagnose breast cancer; mammography, breast ultrasound, and breast magnetic resonance imaging (MRI) examinations are currently the main ones [17]. In particular, mammography can detect abnormal areas that are not clinically palpable in the early stage of breast cancer, so it plays an indispensable role in improving the diagnosis rate of breast cancer. Radiologists usually browse mammograms in a subjective, visual way to locate and classify lesions. However, human visual error and other factors can cause misdiagnoses and missed diagnoses, which pose challenges to radiologists and add pressure on patients [11]. Computer-aided diagnosis technology is therefore indispensable, as it can offer doctors a second opinion for reference.

In recent years, with breakthroughs in computer hardware performance and the rapid development of deep learning algorithms, deep learning has been applied to mammogram classification more and more widely. However, due to the complexity of human tissue and the limited information collected by medical imaging equipment, breast tissue images often contain a great deal of noise, resulting in a relatively low signal-to-noise ratio (SNR). Noisy mammograms can mislead a deep learning model during training and cause misjudgments during testing. Reducing the effect of noise in mammograms therefore yields a more accurate learning direction and sounder judgments, improving the robustness of the model. Inspired by this insight, our research aims to further reduce the impact of mammography image noise and thereby promote the performance of the diagnostic task.

Building on state-of-the-art methods, this article focuses on multi-scale feature maps, lesion detection, and classification. We propose a computer-aided diagnosis system for mammograms based on YOLOv3 [24]. Figure 1 shows the workflow of our computer-aided diagnosis system. It is divided into three stages: preprocessing, the YOLOv3-based convolutional neural network, and evaluation.

Fig. 1
figure 1

Flow chart of our computer-aided diagnosis system

The remainder of this article is organized as follows: Section 2 introduces existing related research; Section 3 describes in detail each part of the computer-aided diagnosis system, including the dataset, the image preprocessing method, the neural network architecture, and the processing method for the network output; Section 4 presents our evaluation method, quantitatively evaluates the performance of our system through various metrics, and compares it with previous methods; finally, Section 5 summarizes this study and proposes measures for further improvement.

2 Related work

With the success of convolutional neural network (CNN) technology in the field of mammogram recognition, CNN-based deep learning models have attracted the attention of many researchers, and various efficient algorithms have been proposed. For the classification of mass lesions, Arevalo et al. [4], Kooi et al. [16], Sun et al. [31], Sun et al. [30] and Suzuki et al. [32] used manually labeled suspicious lesion areas and applied CNNs with different structures for feature extraction and recognition. In particular, Suzuki et al. presented a deep convolutional neural network (DCNN) with a transfer learning strategy for mass detection in mammographic images, and achieved a recall rate of 89.90 % on the DDSM [12] dataset [32]. Differently, Arfan et al. adopted a CNN to extract features from the entire image and then used a support vector machine (SVM) for classification, achieving 93 % AUC on the MIAS [12] and DDSM datasets [5]. Mordang et al. [20] and Bria et al. [8] both focused on the classification of microcalcification lesions, using different preprocessing methods and CNN structures that led to different results. Mordang et al. applied a hard negative mining strategy, which helps overcome the large class imbalance between pixels belonging to microcalcifications and other breast tissue [20]. Bria et al. proposed a preprocessing algorithm for defogging images, achieving a recall rate of 76.26 % [8]. Some researchers ignored the type of lesion and focused only on whether the lesion is benign or malignant. For example, Omonigho et al. [22] utilized a DCNN based on AlexNet [15] to extract features from the mammograms of the MIAS dataset and classify them into two classes, benign (normal) and malignant (abnormal) tumors. With augmentation techniques to improve classification accuracy, their system obtained an accuracy of 95.70 %.

The above methods either directly input the preprocessed whole image, which contains considerable noise that degrades classifier performance, or simply use the cropped lesion area, which depends heavily on manual annotation. To reduce the reliance on manual annotation, Ben-ari et al. turned to lesion detection. They proposed a new R-CNN method using a pretrained network on candidate regions guided by clinical observations to detect and classify lesions in the DDSM dataset [6]. Exploiting the multi-view nature of mammograms, Ma et al. used a faster-RCNN [26] based method, termed Cross-View Relation Region-based Convolutional Neural Networks (CVR-RCNN), to detect and classify lesions from two paired views, and achieved an F1 score of 73 % [19]. Sarath et al. proposed a two-stage Multi-Instance Learning (MIL) framework, with the first stage extracting local candidate patches in the mammograms and the second stage performing image-level benign vs. malignant classification; they achieved an accuracy of 76 % in the detection task and an AUC of 0.91 in the classification task on the INbreast dataset [27]. Jung et al. adopted the Facebook AI team's RetinaNet [18] as the deep learning network for mammogram lesion detection and classification, and achieved comparable or better performance [13] on the INbreast dataset [21]. Al-Masni et al. [1] and Platania et al. [23] also addressed mammogram lesion detection. They used the YOLOv1 [25] algorithm to detect and classify breast mass lesions within the same framework. In particular, Al-Masni et al. achieved a 96.33 % detection accuracy rate and an 85.52 % classification accuracy rate on a subset of CBIS-DDSM [1]. Later, they utilized data augmentation to further improve the detection accuracy rate for breast lesions to 99.7 % and the classification accuracy rate to 97 % [2]. The framework of Platania et al. achieved a detection accuracy of up to 90 % and a classification accuracy of 93.5 % (AUC of 92.315 %) [23]. However, these methods still have the following shortcomings. First, the recognition accuracy for potentially small lesions is relatively low. Second, they identify only breast mass lesions, although other types of mammogram lesions, such as microcalcification, also exist. These problems limit the applicability of this kind of method.

To overcome the limitations mentioned above, while attending to the detection and classification of mammograms, we further address the problems of diverse lesion types and small-sized lesions, and propose a YOLOv3-based computer-aided diagnosis system for mammograms. Our contributions are summarized as follows. According to the types of lesions in the mammograms (mass and microcalcification), we train three models using the mammograms from the CBIS-DDSM dataset [28]: a general model trained on all images, a mass model trained on mass images only, and a microcalcification model trained on microcalcification images only. The proposed computer-aided diagnosis system can learn the entire image in one network architecture and accomplish two tasks simultaneously: detecting the positions of lesions and classifying them. Compared with other state-of-the-art methods, we enhance the ability to detect small-sized lesions and account for the diversity of lesion types, so our system has better performance and can handle more tasks.

3 Computer-aided diagnosis system based on YOLOv3

3.1 Dataset

DDSM is a mammogram dataset released by the University of South Florida in 1997 [12]. It contains mammograms from 2620 patients. Each patient generally has four images from two views, the mediolateral oblique (MLO) view and the craniocaudal (CC) view, of the left and right breasts. To standardize the use of DDSM, the TCIA website [9] curated the CBIS-DDSM dataset [28], an updated, standardized version of a subset of DDSM that includes images of two lesion categories: mass and microcalcification. Each mammogram is marked with a label (benign or malignant) and provides an accurate bounding box of the lesion area. Figure 2 shows original mammograms and lesion outlines from CBIS-DDSM. The mass lesions are small and dense, while the microcalcification lesions are large and band-shaped. These differences pose great challenges to our computer-aided diagnosis system.

Fig. 2
figure 2

Mammograms from the CBIS-DDSM dataset. a and b show images of mass lesions; c and d show images of microcalcification lesions. The green region is the lesion

As shown in Table 1, after removing duplicate images, we use all the mass images and microcalcification images in the CBIS-DDSM dataset. Since deep learning algorithms often perform better on large amounts of data, we use data augmentation to increase the number of images. Starting from the original images, we rotate each image clockwise by 90°, 180°, and 270° to expand the dataset to four times its original size, and randomly shuffle these images. As shown in Table 2, we use a total of 12,040 images to train and test our computer-aided diagnosis system. Since a lesion appears as a complex curve on the image, to express its position conveniently, as shown in Fig. 3, we replace the complex curve with a rectangle described by its center coordinates, width, and height.
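The rotation-based expansion can be reproduced in a few lines of NumPy. The sketch below assumes each mammogram is a 2-D array paired with one (cx, cy, w, h) label box in pixel coordinates; file handling is omitted:

```python
import numpy as np

def rotate_cw(image, box):
    """Rotate an image and one (cx, cy, bw, bh) label box 90 degrees clockwise."""
    h = image.shape[0]
    cx, cy, bw, bh = box
    # clockwise 90 degrees: point (x, y) maps to (h - 1 - y, x); width/height swap
    return np.rot90(image, k=-1), (h - 1 - cy, cx, bh, bw)

def augment_by_rotation(image, box):
    """Expand one sample to four: the original plus 90/180/270-degree rotations."""
    samples = [(image, box)]
    for _ in range(3):
        image, box = rotate_cw(image, box)
        samples.append((image, box))
    return samples
```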

Table 1 Original dataset after removing the duplicate images
Table 2 Expanded dataset
Fig. 3
figure 3

Position label diagram. a is the original image; b is the image with a precise label; c is the image with a label frame

3.2 Image preprocessing method

During mammography scanning, the breast is often deformed by compression, which affects the gray values of the generated image. To reduce the impact of breast compression on correct diagnosis, we adopt the multi-threshold peripheral equalization technique from [3] with some adjustments for our setting. Our algorithm creates multiple images using multiple thresholds and then averages them, producing a smooth transition between the central and edge areas of the mammogram while enhancing the peripheral and lesion areas. As shown in Fig. 4, the algorithm consists of the following consecutive steps. First, we apply a Gaussian low-pass filter (GLPF) to the original mammography image Iorig shown in Fig. 4a, generating the blurred image Iblur shown in Fig. 4b. Second, we compute the average gray value ave of all non-zero pixels of Iblur and derive five thresholds Tk around it to estimate the normalized thickness profile (NTP) of the mammogram. Each threshold Tk is calculated as in formula (1):

$$ T_{k} = {ave} \times F_{k}; \quad k=1,2,...,5 $$
(1)
Fig. 4
figure 4

Image preprocessing. a is Iorig; b is Iblur; c is Intp; d is Ipeq

Here, Fk takes the values [0.8, 0.9, 1.0, 1.1, 1.2], which scale the thresholds proportionally around ave. This enhances the peripheral area of the mammogram and, at the same time, avoids the overly pronounced breast boundary that appears when a single threshold is used. We then create five \(\hat {I}_{blur}\) images from these five thresholds Tk, where pixel (i,j) of the k-th image is denoted \(\hat {I}_{blur}(k, i, j)\) and calculated as in formula (2):

$$ \hat{I}_{blur}(k,i,j)= \left\{ \begin{array}{rl} \frac{I_{blur}(i,j)}{T_{k}}&;\quad I_{blur}(i,j)\leq T_{k}\\ 1&; \quad \text{otherwise} \end{array} \right. $$
(2)

Here, k = 1,2,...,5; i = 1,2,...,M; j = 1,2,...,N; and M × N is the size of the mammogram. From these five images \(\hat {I}_{blur}(k)\), formula (3) computes the Intp image shown in Fig. 4c.

$$ I_{ntp}=\frac{1}{5}\sum\limits_{k=1}^{5}\hat{I}_{blur}{(k)} $$
(3)

Then, based on the Intp image and formula (4), we obtain the peripheral equalized image Ipeq shown in Fig. 4d.

$$ I_{peq}=\frac{I_{orig}}{I_{ntp}} $$
(4)

Finally, since the input size of our YOLOv3-based convolutional neural network is (416,416,3), we scale all images to this size to match the network input.
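The whole preprocessing pipeline can be sketched with OpenCV and NumPy as follows; the Gaussian blur strength sigma is an assumed parameter, since the paper does not specify the GLPF kernel:

```python
import cv2
import numpy as np

def peripheral_equalization(i_orig, sigma=15, factors=(0.8, 0.9, 1.0, 1.1, 1.2)):
    """Multi-threshold peripheral equalization, a sketch of formulas (1)-(4)."""
    img = i_orig.astype(np.float64)
    # Step 1: Gaussian low-pass filter -> I_blur (Fig. 4b)
    i_blur = cv2.GaussianBlur(img, (0, 0), sigma)
    # Step 2: average gray value of all non-zero pixels of I_blur
    ave = i_blur[i_blur > 0].mean()
    # Formulas (1)-(2): five thresholded images around ave
    stack = []
    for f in factors:
        t_k = ave * f
        stack.append(np.where(i_blur <= t_k, i_blur / t_k, 1.0))
    # Formula (3): average the five images -> I_ntp (Fig. 4c)
    i_ntp = np.mean(stack, axis=0)
    # Formula (4): peripheral equalized image (epsilon guards division by zero)
    i_peq = img / np.maximum(i_ntp, 1e-8)
    # Scale to the network input and replicate the channel to (416, 416, 3)
    resized = cv2.resize(i_peq, (416, 416))
    return np.repeat(resized[..., None], 3, axis=2)
```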

3.3 Cluster anchor boxes

As shown in Fig. 5, in our computer-aided diagnosis system each image is divided into N × N non-overlapping grid cells. Following YOLOv3's multi-scale feature maps, the convolutional network divides the entire image into 13 × 13, 26 × 26, and 52 × 52 grid cells. Each grid cell is preset with three bounding box vectors, and each bounding box vector is responsible for predicting a potential lesion whose geometric center falls in that grid cell. The geometric center, width, and height of the potential lesion are described by x, y, w, h; the confidence of the potential lesion is represented by conf; and the conditional probability of each category is represented by class (classi is the conditional probability of the i-th class given that the object is a lesion). Here, class is a multidimensional vector. Therefore, if the length of class is len(class), the length of the bounding box vector is len(p) = 4 + 1 + len(class), and the network output is a list of tensors with shapes (m,13,13,3,len(p)), (m,26,26,3,len(p)), and (m,52,52,3,len(p)).

Fig. 5
figure 5

Sketch map of predicted bounding box vectors

To improve accuracy, we make the large-scale grid cells responsible for predicting large lesions and the small-scale grid cells responsible for predicting small lesions, and we set different widths and heights for the bounding boxes at different scales. These preset boxes with different widths and heights are called anchor boxes. To make the preset anchor boxes fit the widths and heights of as many label boxes as possible, an algorithm similar to k-means clustering is used to compute the nine anchor boxes needed for the three bounding boxes at each of the three scales. Unlike standard k-means, our algorithm defines distance as in formula (5).

$$ dist(a,b)=1-\frac{\min(w_{a},w_{b})\cdot\min(h_{a},h_{b})}{w_{a}\cdot h_{a}+w_{b}\cdot h_{b}-\min(w_{a},w_{b})\cdot\min(h_{a},h_{b})} $$
(5)

Here, a and b represent two anchor box vectors. Formula (5) expresses the degree of fit between two anchor boxes: the better a and b fit each other, the smaller dist(a,b). Apart from this definition of distance, our algorithm follows the usual k-means clustering procedure. After the algorithm completes, we allocate the computed anchor boxes so that the large-scale grid cells are preset with the large anchor boxes and the small-scale grid cells with the small anchor boxes. Figure 6 shows the distribution of all anchor boxes and the positions of the nine central anchor boxes calculated by the algorithm.
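A minimal NumPy sketch of this clustering is given below; the median-based centroid update and the fixed iteration count are our assumptions, since the paper only states that the algorithm resembles k-means with the distance of formula (5):

```python
import numpy as np

def iou_dist(box, centroids):
    """dist(a, b) of formula (5) between one (w, h) box and each centroid."""
    inter = np.minimum(box[0], centroids[:, 0]) * np.minimum(box[1], centroids[:, 1])
    union = box[0] * box[1] + centroids[:, 0] * centroids[:, 1] - inter
    return 1.0 - inter / union

def cluster_anchors(boxes, k=9, iters=100, seed=0):
    """Cluster an (n, 2) array of label-box widths/heights into k anchors."""
    rng = np.random.default_rng(seed)
    centroids = boxes[rng.choice(len(boxes), size=k, replace=False)]
    for _ in range(iters):
        # assign every box to its nearest centroid under formula (5)
        labels = np.array([np.argmin(iou_dist(b, centroids)) for b in boxes])
        # update each centroid as the median box of its cluster
        centroids = np.array([
            np.median(boxes[labels == j], axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
    return centroids[np.argsort(centroids[:, 0] * centroids[:, 1])]  # small to large
```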

Fig. 6
figure 6

Clustering scatterplot of anchor boxes. The blue dots are the anchor boxes of input; the red dots are the nine central anchor boxes of output

3.4 Structure and implementation of YOLOv3 based network

The YOLOv3 network is a unified structure. To simplify the discussion of the network structure, we illustrate with a single image rather than a batch of m images. The input shape of the network is therefore (416,416,3), and the output is a list of tensors with shapes (13,13,3,len(p)), (26,26,3,len(p)), and (52,52,3,len(p)). Figure 7 shows our network architecture based on YOLOv3. The network is divided into two parts. The first part is the feature extraction network, a Darknet-53 without the fully connected layer, composed mainly of two structures: the Conv2D Block and the Residual Block n ×. The second part is the feature utilization network, which uses multiple feature layers from the feature extraction network. We extract a total of three feature maps, with shapes (52,52,256), (26,26,512), and (13,13,1024). The feature utilization network consists mainly of Conv2D Blocks, up-sampling layers, and convolution layers. Through upsampling, the large-scale feature maps also incorporate convolutional features from the small-scale feature maps.

Fig. 7
figure 7

Convolutional neural network structure based on YOLOv3

Finally, the network output is consolidated into predicted tensors with shapes (13,13,3,len(p)), (26,26,3,len(p)), and (52,52,3,len(p)). Each predicted bounding box vector is p = (x,y,w,h,conf,class), where (x,y,w,h) describes the position of the lesion and conf combines confidences from two sources: the network's confidence that a lesion is present at the predicted position, and its confidence in how well the predicted position fits the label position. As formula (6) shows, we multiply these two confidences to obtain the final confidence conf.

$$ {conf}=P(Lesion)*{IOU}_{pred}^{truth} $$
(6)

The class entry of the predicted bounding box vector p is a conditional probability vector over the categories, as shown in formula (7). It expresses the conditional probability that the lesion belongs to each category Classi, given that the object is a lesion.

$$ {class}_{i}=P({Class}_{i}|{Lesion}) $$
(7)

The loss of each batch is the average loss over all images in the batch, as shown in formula (8).

$$ Loss=\frac{1}{m}\sum\limits_{i=1}^{m}\sum\limits_{j=1}^{3}{Loss}_{ij} $$
(8)

Here, m is the number of images in each batch, and Lossij is the loss value of the j-th feature map of the i-th image in the batch, calculated by formula (9). Note that different feature maps j correspond to different numbers of grid cells N × N.

$$ \begin{array}{@{}rcl@{}} Loss_{ij}&=&\lambda_{coord}\sum\limits_{i=1}^{N\times N}\sum\limits_{j=1}^{3}1_{ij}^{obj}*[BCE(x_{ij},\hat{x}_{ij})+BCE(y_{ij},\hat{y}_{ij})]\\ &&+\lambda_{coord}\sum\limits_{i=1}^{N\times N}\sum\limits_{j=1}^{3}{1}_{ij}^{obj}*[MSE(w_{ij},\hat{w}_{ij})+MSE(h_{ij},\hat{h}_{ij})]\\ &&+\sum\limits_{i=1}^{N\times N}\sum\limits_{j=1}^{3}1_{ij}^{obj}*BCE({conf}_{ij},\hat{conf}_{ij})\\ &&+\lambda_{noobj}\sum\limits_{i=1}^{N\times N}\sum\limits_{j=1}^{3}1_{ij}^{noobj}*BCE({conf}_{ij},\hat{conf}_{ij})\\ &&+\sum\limits_{i=1}^{N\times N}\sum\limits_{j=1}^{3}\sum\limits_{k=1}^{n_{class}}1_{ij}^{obj}*BCE({class}_{ijk},\hat{class}_{ijk}) \end{array} $$
(9)

Here, the function BCE, defined in formula (10), is the binary cross-entropy loss for a single sample, and the function MSE, defined in formula (11), is the squared loss for a single sample.

$$ BCE(x,\hat{x})=(-1)*[x\cdot\log\hat{x}+(1-x)\log(1-\hat{x})] $$
(10)
$$ MSE(x,\hat{x})=\frac{1}{2}(x-\hat{x})^{2} $$
(11)

In formula (9), \({1}_{ij}^{obj}\) indicates that the j-th predicted bounding box vector of the i-th grid cell is a positive sample, and \({1}_{ij}^{noobj}\) indicates that it is a negative sample. Positive and negative samples are selected by the following rules:

  • The predicted bounding box with the largest IOU with a certain label bounding box is a positive sample.

  • The predicted bounding box whose IOU with a certain label bounding box exceeds the ignoring threshold (we set it as 0.5 in experiments) is a positive sample.

  • The predicted bounding box that does not exceed the ignoring threshold and does not have the largest IOU with a certain label bounding box is a negative sample.

In formula (9), λcoord is the coefficient used to balance the coordinate loss. It is defined by formula (12), which reflects the size of the lesion, where \({w}_{ij}^{uncoded}\) and \({h}_{ij}^{uncoded}\) are the width and height of the label bounding box as proportions of the entire image. The smaller the lesion, the larger λcoord, which improves the detection of small lesions. The coefficient λnoobj balances the loss of negative samples. Because negative samples are numerous in practice, we set λnoobj = 1 in our application.

$$ \lambda_{coord}=2-{w}_{ij}^{uncoded}\cdot h_{ij}^{uncoded} $$
(12)
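To make formulas (9) to (12) concrete, the following NumPy sketch evaluates the loss for a single feature map. The field layout (x, y, w, h, conf, classes) and the assumption that the truth tensor stores the uncoded width/height proportions are ours; this illustrates the loss, it is not the training code itself.

```python
import numpy as np

def bce(t, p, eps=1e-7):
    """Binary cross-entropy per element, formula (10)."""
    p = np.clip(p, eps, 1 - eps)
    return -(t * np.log(p) + (1 - t) * np.log(1 - p))

def mse(t, p):
    """Squared loss per element, formula (11)."""
    return 0.5 * (t - p) ** 2

def loss_one_map(pred, truth, obj_mask, noobj_mask, lambda_noobj=1.0):
    """Formula (9) for one (N, N, 3, 5 + n_class) feature map.

    truth[..., 2:4] is assumed to hold the uncoded width/height
    proportions used by formula (12); obj_mask/noobj_mask are
    (N, N, 3) booleans built from the selection rules above.
    """
    # lambda_coord, formula (12): smaller lesions get larger weights
    lam = 2.0 - truth[..., 2] * truth[..., 3]
    coord = lam * (bce(truth[..., 0], pred[..., 0]) + bce(truth[..., 1], pred[..., 1])
                   + mse(truth[..., 2], pred[..., 2]) + mse(truth[..., 3], pred[..., 3]))
    conf = bce(truth[..., 4], pred[..., 4])
    cls = bce(truth[..., 5:], pred[..., 5:]).sum(axis=-1)
    return ((obj_mask * (coord + conf + cls)).sum()
            + lambda_noobj * (noobj_mask * conf).sum())
```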

3.5 Network output processing method

Since the final output tensors still contain many predicted bounding box vectors with low or even zero confidence, we must process them to obtain the real bounding boxes and the score of each category.

First, we calculate the confidence scorei of each category in each predicted bounding box vector according to formula (13).

$$ \begin{array}{@{}rcl@{}} {score}_{i}&=&{conf}*{class}_{i}\\ &=&P(Lesion)*{IOU}_{pred}^{truth}* P({Class}_{i}|Lesion)\\ &=&P({Class}_{i})* {IOU}_{pred}^{truth} \end{array} $$
(13)

Then we replicate the position information of each predicted bounding box vector len(class) times as box vectors, so that each box vector carries a unique score and class. Next we filter out boxes with low scores: we set a score threshold, and all boxes that neither reach the threshold nor hold the maximum score are discarded.
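Formula (13) and this filtering step can be sketched as follows; the (n_boxes, n_class) array layout is an illustrative assumption:

```python
import numpy as np

def score_and_filter(conf, class_probs, score_threshold=0.1):
    """score_i = conf * class_i (formula (13)), then low-score filtering.

    conf: (n_boxes,) confidences; class_probs: (n_boxes, n_class).
    Returns indices of surviving boxes with their best class and score.
    """
    scores = conf[:, None] * class_probs   # one score per box and class
    best_class = scores.argmax(axis=1)
    best_score = scores.max(axis=1)
    keep = best_score >= score_threshold   # drop boxes below the threshold
    return np.where(keep)[0], best_class[keep], best_score[keep]
```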

In practice, because the grid cells near the geometric center of a label bounding box may all generate high-scoring boxes for the same lesion, a large number of boxes often remain after threshold filtering, overlapping and interweaving in the core area of the lesion. To eliminate these redundant boxes, we use the non-maximum suppression algorithm to find the best box for each lesion. Its specific steps are as follows (a sketch follows the list):

  1) Sort all boxes according to their scores.

  2) Select the box with the highest score, add it to the output list, and delete it from the box list.

  3) Calculate the IOUs between the highest-scoring box and all remaining boxes, and delete the boxes whose IOU exceeds the IOU threshold.

  4) Repeat steps 2) and 3) until the box list is empty, then exit the algorithm.
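A minimal NumPy sketch of these four steps, assuming boxes are given as (x1, y1, x2, y2) corner coordinates:

```python
import numpy as np

def iou(box, boxes):
    """IOU between one box and an (n, 4) array of boxes."""
    x1 = np.maximum(box[0], boxes[:, 0]); y1 = np.maximum(box[1], boxes[:, 1])
    x2 = np.minimum(box[2], boxes[:, 2]); y2 = np.minimum(box[3], boxes[:, 3])
    inter = np.clip(x2 - x1, 0, None) * np.clip(y2 - y1, 0, None)
    area = (box[2] - box[0]) * (box[3] - box[1])
    areas = (boxes[:, 2] - boxes[:, 0]) * (boxes[:, 3] - boxes[:, 1])
    return inter / (area + areas - inter)

def non_max_suppression(boxes, scores, iou_threshold=0.4):
    """Steps 1) to 4) above; returns the indices of the kept boxes."""
    order = np.argsort(scores)[::-1]               # 1) sort by score
    keep = []
    while order.size > 0:
        best = order[0]                            # 2) take the highest-scoring box
        keep.append(best)
        rest = order[1:]
        overlaps = iou(boxes[best], boxes[rest])   # 3) IOUs with the remaining boxes
        order = rest[overlaps <= iou_threshold]    #    delete heavy overlaps
    return keep                                    # 4) loop ends when the list is empty
```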

Figure 8 shows the effect of the non-maximum suppression algorithm. The box list remaining after the final step is used to draw the output mammogram.

Fig. 8
figure 8

The effect of non-maximum suppression. a is the image before applying the non-maximum suppression algorithm; b is the image after applying it

4 Experiment and analysis

4.1 Experiment setup

4.1.1 Operating environment and training strategy

Our dataset contains two types of breast lesions: mass and microcalcification. Each lesion is also either benign or malignant, so we combine these attributes into four categories: mass-benign (MB), mass-malignant (MM), microcalcification-benign (CB), and microcalcification-malignant (CM). When the lesion in a mammogram can be determined in advance to be mass or microcalcification, our computer-aided diagnosis system can show better performance; we therefore train three models under this system: a general model, a mass model, and a microcalcification model. Since our dataset is medium-sized, we shuffle it and divide it into 10 equal subsets, each maintaining the category proportions of the original dataset. We then take nine subsets as the training set and the remaining subset as the test set (which also serves as the validation set).
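This split can be reproduced with scikit-learn's StratifiedKFold; `image_paths` and `labels` below are hypothetical arrays of file names and four-way lesion categories:

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold

# 10 equal folds that preserve the category proportions; nine folds
# train the model and the held-out fold tests (and validates) it.
# image_paths and labels are assumed to be NumPy arrays.
skf = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
train_idx, test_idx = next(skf.split(image_paths, labels))
train_set, test_set = image_paths[train_idx], image_paths[test_idx]
```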

Our training process is completed on an NVIDIA GeForce GTX 1080Ti graphics card on a cloud server, with a core frequency of 1480 MHz and 11 GB of video memory. The test process is completed on a local laptop whose graphics card is an NVIDIA GeForce GTX 960m, with a core frequency of 1176 MHz and 2 GB of video memory. It is worth noting that although we spend several days training the model on a high-performance graphics card, the overall system flow from input to output runs at an average of 0.144 s per image on our low-performance mobile graphics card.

We mainly use the following training tricks in the training process.

  • Transfer learning. Transfer learning has proven effective in training convolutional neural networks for mammograms [32]. The first few layers of a convolutional neural network typically extract shallow features such as edges and boundaries, which also apply to mammograms. The original YOLOv3 weights were trained on the ImageNet [10] dataset, which contains a large number of images, and only need slight adjustment for our data, so we first load the original YOLOv3 weights.

  • Adam optimizer. In the training process, we use the Adam optimizer [14] instead of stochastic gradient descent. We also use an adaptive learning rate adjustment strategy: when the validation loss has not decreased after multiple epochs of training, the learning rate is dropped to one tenth of its current value to obtain finer optimization, as sketched below.
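As an illustration of this strategy, here is a minimal sketch assuming a Keras-style training loop (the paper does not name its framework); `model`, `yolo_loss`, `train_data`, `val_data`, the initial learning rate, and `patience=3` are placeholders:

```python
from tensorflow.keras.callbacks import ReduceLROnPlateau
from tensorflow.keras.optimizers import Adam

# Watch the validation loss; when it stops improving for `patience`
# epochs, multiply the learning rate by 0.1 (drop to one tenth).
reduce_lr = ReduceLROnPlateau(monitor="val_loss", factor=0.1, patience=3)

model.compile(optimizer=Adam(learning_rate=1e-3), loss=yolo_loss)
model.fit(train_data, validation_data=val_data, epochs=100,
          callbacks=[reduce_lr])
```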

4.1.2 General, mass and microcalcification model

According to the type of breast lesions, we train three models in the computer-aided diagnosis system based on YOLOv3: a general model, a mass model, and a microcalcification model. For the training and evaluation of the general model, we use all mammograms of the expanded dataset. As shown in Table 3, we use a total of 10,838 mammograms to train the general model and, in the test phase, 1,202 mammograms to evaluate it. Four categories are involved in the general model: mass-benign (MB), mass-malignant (MM), microcalcification-benign (CB), and microcalcification-malignant (CM). Therefore, the number of classes is len(class) = 4, the length of each prediction box vector is len(p) = 9, and the network output shapes of the general model are (m,13,13,3,9), (m,26,26,3,9), and (m,52,52,3,9).

Table 3 Number of mammograms used by general model

Although the ratio of these image types is not strictly 1:1, their numbers do not differ greatly. To let the models learn as many image features as possible, as shown in Tables 4 and 5, we use all the mass images and all the microcalcification images in the expanded dataset to train the mass model and the microcalcification model, respectively. Only the two classes benign and malignant are involved in the mass model and the microcalcification model. Therefore, the number of classes is len(class) = 2, the length of each prediction box vector is len(p) = 7, and the network output shapes of each model are (m,13,13,3,7), (m,26,26,3,7), and (m,52,52,3,7).

Table 4 Number of mammograms used by mass model
Table 5 Number of mammograms used by microcalcification model

4.2 Evaluation system

4.2.1 Evaluation logic

We evaluate the performance of our computer-aided diagnosis system with objective quantitative methods. Figure 9 shows our evaluation logic during the testing phase. We feed the test images into the trained model to obtain output tensors. Each output tensor is first decoded and filtered by the score threshold, then processed by the non-maximum suppression algorithm to obtain the final box list. Our evaluation is based on this output box list. Note that if no box's score exceeds the score threshold, the box with the highest score is retained.

Fig. 9
figure 9

Evaluation logic of the evaluation system

Figure 10a is the output of the final evaluation system with a score threshold of 0; the predicted boxes appear cluttered because there are so many of them. Figure 10b uses a score threshold of 0.01; although only two predicted boxes remain after filtering, the low-score box is still wrong and unusable. Figure 10c uses a score threshold of 0.1, and Fig. 10d shows the label bounding box; the two boxes are clearly very close. After comparing the predicted boxes with the label boxes, we set the score threshold to 0.1 and the IOU threshold used in the non-maximum suppression algorithm to 0.4. The thresholds are thus chosen on the basis of experimental results.

Fig. 10
figure 10

The effect of different score thresholds on the system results

4.2.2 Evaluation methods and metrics of lesion detection

To evaluate the system's lesion detection performance, we again use the IOU between the predicted box and the label box as the reference. If the IOU of a predicted box and a label box reaches 0.4, we consider the predicted box to describe the label box correctly. The specific procedure is as follows: traverse the label boxes of each lesion in each image, and for each label box traverse all the predicted boxes. If some predicted box reaches the IOU threshold with the label box, that label box is considered correctly described. If all label boxes in an image are correctly described, the model is judged to have predicted the lesion positions of that image correctly; if any label box is not correctly described, the model's position prediction for that image is judged wrong. We use the accuracy rate as the metric for the lesion detection task.
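A sketch of this image-level criterion, assuming boxes are (x1, y1, x2, y2) corner coordinates:

```python
def pair_iou(a, b):
    """IOU of two (x1, y1, x2, y2) boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union

def image_detected_correctly(label_boxes, pred_boxes, iou_threshold=0.4):
    # every label box must be matched by at least one predicted box
    return all(any(pair_iou(lb, pb) >= iou_threshold for pb in pred_boxes)
               for lb in label_boxes)
```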

4.2.3 Evaluation methods and metrics of lesion classification

To evaluate the system's lesion classification performance, we directly use the category of the predicted box with the highest score as the prediction. Because each image in the CBIS-DDSM dataset contains only one type of lesion, the predicted box with the highest score has the highest confidence. For all models, we use accuracy to assess overall classification performance.

Our dataset can be divided along two semantically opposite axes: benign vs. malignant, and mass vs. microcalcification. To show the model performance from different aspects, we also calculate the classification results for each opposing pair. Since our main task is to find as many malignant lesions as possible, we define malignant lesions as positive and benign lesions as negative. Likewise, since microcalcification lesions are more difficult to detect than mass lesions, we define microcalcification lesions as positive and mass lesions as negative. The standard binary classification metrics, including accuracy, precision, recall, and f1 score, serve as the reference standards for the classification performance of our models. Each metric is defined in formula (14):

$$ \left\{ \begin{array}{l} {accuracy}=\frac{TP+TN}{TP+TN+FP+FN}\\ {precision}=\frac{TP}{TP+FP}\\ {recall}=\frac{TP}{TP+FN}\\ {f1}=\frac{2*{precision}*{recall}}{{precision}+{recall}} \end{array} \right. $$
(14)

In formula (14), TP and TN denote the numbers of positive and negative samples correctly predicted in the test set, while FP and FN denote the numbers of samples incorrectly predicted as positive and as negative, respectively.
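A direct transcription of formula (14) into Python, for computing the metrics from raw counts:

```python
def binary_metrics(tp, tn, fp, fn):
    """Accuracy, precision, recall, and f1 as defined in formula (14)."""
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, precision, recall, f1
```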

The accuracy rate measures overall prediction correctness. The precision rate is the proportion of samples predicted positive that are actually positive; the recall rate is the proportion of actually positive samples that are predicted positive; the f1 score combines precision and recall. We also adopt the area under the ROC curve (AUC) as an evaluation metric, where the ROC curve plots the true positive rate against the false positive rate.

The inference time in the test phase is also an important metric. We set the start time to the moment the image is read from the hard disk and the end time to the moment the model has inferred the position and type of the lesion; the interval between the two is the inference time of our model.

4.3 The detection performance of our models

Figure 11 shows the detection ability of a model trained by the proposed computer-aided diagnosis system. It can be seen that our system predicts the positions and categories of lesions quite accurately.

Fig. 11
figure 11

Model test results. a and b show the prediction and label boxes of a microcalcification image respectively; c and d show the prediction and label boxes of a mass image respectively

Table 6 records the per-category performance of the general model on the lesion detection task. In the table, an image whose predicted box has an IOU with the label box lower than 0.4 is considered to describe the lesion position incorrectly. The results show that the general model trained by our computer-aided diagnosis system is highly robust: 93.677 % of the mammogram lesions are correctly described when the IOU threshold is 0.4. In addition, 95.890 % of the mass-malignant mammogram lesions are correctly described.

Table 6 Table of the number of detections of each category of the general model

Table 7 lists the general model's detection performance for the two pairs of semantically opposite categories (benign vs. malignant, mass vs. microcalcification). Here, the benign category includes mass-benign and microcalcification-benign; the malignant category includes mass-malignant and microcalcification-malignant; the mass category includes mass-benign and mass-malignant; and the microcalcification category includes microcalcification-benign and microcalcification-malignant. In the detection task, the general model shows little difference between benign and malignant lesions, but its detection performance on mass lesions is significantly better than on microcalcification lesions. This may be because mass lesions are usually small and dense while microcalcification lesions are comparatively large and band-shaped.

Table 7 Table of the number of detections of each opposite category of the general model

Given the large difference between mass and microcalcification lesions, if the type of lesion can be determined in advance, the detection ability of the mass model and the microcalcification model clearly improves over the general model. As shown in Tables 8 and 9, the lesion detection accuracy of the mass model reaches 97.767 %, an improvement of 2.410 % over the 95.375 % detection accuracy of the general model on mass lesions. The detection ability of the microcalcification model also improves significantly once the detection task is limited to microcalcification lesions: it achieves an accuracy of 96.870 % in the position detection task, an increase of 5.044 % over the 91.826 % detection accuracy of the general model on the microcalcification category. Therefore, if the lesion type can be determined to be microcalcification or mass, using the corresponding model for position detection yields better performance.

Table 8 Table of the number of detections of each category of the mass model
Table 9 Table of the number of detections of each category of the microcalcification model

4.4 The classification performance of our models

For the general model, we use all 1,202 images in the test set for evaluation. Table 10 shows the per-category classification results of the general model on the test set. From the data in the table, the classification accuracy of the general model reaches 93.927 %. Table 11 records the metrics of the general model on each classification task. We observe the following results:

  • The general model achieves the best performance on the task of classifying lesion types as mass or microcalcification.

  • The general model's accuracy in detecting the position of mass lesions is significantly higher than in detecting the position of microcalcification lesions, while its accuracy in classifying mass lesions is slightly lower than in classifying microcalcification lesions.

  • In each classification task, the metrics of the general model are between 94 % and 98 %, which shows the relatively robust performance of our general model.

Table 10 Classification result of general model
Table 11 Classification metrics of general model

We test the mass model and the microcalcification model with the 627 mass images and 575 microcalcification images in the test set, respectively. Table 12 shows the classification performance of the mass model and the microcalcification model; the mass model occupies the upper left of the table and the microcalcification model the lower right. Since the two models are independent and do not interfere with each other, the lower-left and upper-right cells are 0. From Table 12, we can see directly that the numbers of correct predictions of the mass model and the microcalcification model are significantly higher than those of the general model. Table 13 compares the mass model and the microcalcification model with the general model on the classification of mass and microcalcification lesions. From the listed data, we can summarize the following results:

  • The result metrics of the mass model and the microcalcification model are higher than those of the general model, while the metrics of the mass model increase most significantly.

  • In the general model, the classification result for mass lesions is weaker than for microcalcification lesions, but the classification performance of the mass model is significantly better than that of the microcalcification model.

  • Performances of the mass model and the microcalcification model vary between 96 % and 99 %, showing better performance than the general model.

Table 12 Classification results of mass and microcalcification models
Table 13 Classification metrics of mass and microcalcification models

4.5 The test speed of our models

Test speed is also an important aspect of model performance. Our experiment is completed on a laptop with an NVIDIA GeForce GTX 960m. The average test times of the three models are shown in Table 14, which conveys two important messages. First, our models can run even on low-performance hardware with very short running times, which makes their use in hospitals feasible. Second, the overall running speed does not increase significantly as the number of prediction categories grows, showing that our computer-aided diagnosis system maintains a high identification speed across a variety of complex breast lesion categories.

Table 14 The results of model test speed

4.6 Comparison of our system with other methods

In this study, we have developed a computer-aided diagnosis system for mammograms. We use the system to train three models that detect mammogram lesions under the corresponding conditions and classify them as mass, microcalcification, benign, or malignant. Figure 11 shows the ability of our system to correctly detect and classify various types of potential breast lesions, and Tables 6 to 14 show the overall performance of our system on various quantitative metrics. These results demonstrate that our YOLOv3-based computer-aided diagnosis system performs well on mammogram lesions.

To show the robustness of our computer-aided diagnosis system based on YOLOv3, we also compare it with several state-of-the-art methods. Among them, Arevalo et al. [4] performed CNN parameter exploration by training 25 models with random hyperparameter initializations and choosing the best according to validation performance. Sun et al. [31] used a batch size of 100, a subsampling rate of 2, and a learning rate of 0.1 for 100 epochs. Sun et al. [30] randomly initialized the network weights and set the learning rate to 0.0001 with a batch size of 128. Suzuki et al. [32] utilized an 8-layer DCNN with 5 convolutional layers followed by 3 fully connected layers. Arfan et al. [5] set the batch size for stochastic gradient descent to 64 with a momentum of 0.8 and a decay parameter of 1e-5. Mordang et al. [20] initially set the learning rate to 0.01 and linearly decreased it to 0.0001 over the maximum number of epochs. Bria et al. [8] set the momentum and weight decay to 0.9 and 0.0005, respectively, and the dropout probability to 0.5. Ben-ari et al. [6] used 0.4 as the threshold on the overlap ratio. Sarath et al. [27] trained the localization network for 300 epochs with a batch size of 8 and 36 batch updates per epoch, and trained the MIL network for 100 epochs with a learning rate of 0.001 and a weight decay of 0.0005. Platania et al. [23] initialized the CNN weights from pretraining and used stochastic gradient descent for the minimization.

Table 15 compares performance on the mass classification task. Other state-of-the-art methods tend to crop images to improve accuracy. In particular, Arfan et al. [5] achieved 93 % AUC, but this is still lower than the 98.121 % AUC of our mass model. Fewer methods have addressed the classification of microcalcification lesions. As shown in Table 16, the performance of other researchers' methods on this task is relatively poor. Although the image preprocessing of Bria et al. [8] adopted a defogging algorithm that effectively improved the recall rate to 76.26 %, this is still lower than the 96.137 % recall achieved by our microcalcification model. Because our model adds a detection step for the position of lesions, it further improves classification accuracy. Table 17 compares our system's mass model with other researchers' methods on the detection and classification of mass lesions. On the position detection task, Al-Masni et al. [1] achieved the highest detection accuracy of 99.7 % at an IOU threshold of 0.5, but on the classification task our model performs better, reaching an accuracy of 98.086 %.

Table 15 Comparison of our system with other methods in mass lesion classification
Table 16 Comparison of our system with other methods in microcalcification lesion classification
Table 17 Comparison of our system with other methods in mass lesion detection and classification

The comparison shows that the models trained with the computer-aided diagnosis system proposed in this study achieve relatively high metrics and can accomplish more tasks, and are generally better than the methods proposed by other researchers.

5 Conclusion

In this article, we introduce a YOLOv3-based computer-aided diagnosis system for the detection and classification of mammogram lesions. The system overcomes most of the medical image noise and accomplishes the two tasks of lesion detection and classification simultaneously in the same neural network. We train three models under this system: a general model, a mass model, and a microcalcification model, all of which perform well in detecting and classifying lesions. Moreover, the test speed of our system reaches 0.144 s per image on a low-performance laptop, which makes it feasible to apply the system in hospitals.