Keywords

1 Introduction

Mammography is the first major means of breast cancer screening and plays an important role in reducing the mortality of female breast cancer. As early as 1976, Wolfe [1, 2] proposed that Mammographic breast density (MBD) can be used as an independent risk factor for breast cancer. Later, studies by scholars Park and Eriksson [3, 4] showed that gland density was a major risk factor for new and recurrent breast cancer. Gland density is described and recorded as an important sign in ACRBI-RADS (Breast imaging reporting and data system) classification. It is obvious that it is very important to accurately evaluate the density of female breast glands.

In the 2017 China Cancer report, China revealed that the incidence of female breast cancer is also the first among malignant tumors, and the risk of breast cancer among women in big cities is nearly twice as high as that in small cities [5], and the mortality rate is much higher than that in the United States. Although the relationship between gland density and the molecular and biological mechanism of breast cancer is still being studied, breast gland density is an important factor affecting breast cancer screening sensitivity and prevention risk prediction. Literature [6] reported that the sensitivity of breast X-ray to detect lesions in fatty and dense glands decreased from 87 to 62.9%. Women with dense glands have a 4.64 times higher risk of breast cancer during their lifetime than women with low gland density due to the increased density of glands and due to their rich glandular matrix [7].

The clinical evaluation of the density of female glands has always been observed by doctors with the naked eye, and the four grades of ABCD of glands have been quantitatively evaluated according to the BI-RADS standard based on experience. In the past, it was difficult to evaluate accurately and quantitatively on the two-dimensional image of mammography. Lehman and Fieselmann [8, 9] have shown that the quantitative evaluation of gland content by computer on breast X-ray images is highly consistent with that by doctors’ naked eyes, which indicates that it is reliable to use a computer to quantify glands in a clinic. Quantitative calculation of glands is more accurate. Studies by Ng and Lau et al. [10] confirmed that no matter what equipment was used to examine the glands of the same woman in one year, the physical quantity of the glands was the same. Bonmat et al. [11] used the same female breast X-ray and breast MRI examination to register 2D and 3D gland images obtained by the two examination modes, so as to obtain accurate quantification of glands on conventional breast X-ray images, which makes it possible to quantitatively evaluate glands on breast X-ray images. The advantage of measuring breast density by breast X-ray image lies in its low cost and wide use. Lu [12] comparing breast MRI with breast X-ray images stated that the quantitative evaluation of glands in the same woman has a high consistency within a certain period of time. In order to obtain accurate gland content, segmentation is very important. This study attempts to find the mapping relationship between mammography and breast MRI and to establish a mapping model between mammography and breast MRI. At present, there is little similar work in the literature; firstly, the breast x-ray image is mapped to the pseudo breast MRI image through the mapping model.

2 Material and Methods

The proposed method for calculating mammographic PD% consists of the following core steps:

  1. a.

    Preprocessing of mammographic images about the breast;

  2. b.

    Reconstruction of mammographic image based on real breast MRI.

  3. c.

    ROI segmentation and PD% calculation.

MRI data were taken from more than 500 patients at different times in a year, with an average of more than two times per patient. The mammography data used in this experiment were synthesized by breast MRI images mentioned above. Real breast MRI data in this study were all obtained from open source. Breast MRI was performed on 1.5-T scanner (signa, GE Healthcare, Milwaukee, WI) with a bilateral phased array breast coil. Approximately 120 + cases were sorted out from the data collected, each of which contained 60 slices.

2.1 Preprocessing of Images About the Breast

During preprocessing of medical images, low-quality images were manually eliminated, so that the adverse impact brought by the original image on the mapping model could be evaded. The breast MRI images were denoised by CBDNet, designed by Kai Zhang’s team for real photograph denoising in 2019 [13,14,15]. It is mainly composed of two parts, one is the noise generation network, which generates an estimated noise image from the input; and the other is a U-Net denoising network (Fig. 13.1). In CBDNet, the original image and noise image predicted by the noise generation network were sent into the second network for denoising.

Fig. 13.1
figure 1

The structure of a CBDNet

In order to achieve CBDnet denoising, it was necessary to denoise the original breast MRI manually to create a dataset. In our study, a professional software to view and process medical images (Radiant DICOM Viewer) was used to denoise the original breast MRI images. The prepared dataset was then substituted into CBDNet inputting, and the denoising outcome was shown in Fig. 13.2.

Fig. 13.2
figure 2

Denoising outcome of CBDNet: a original breast MRI image and b the same MRI images denoised by CBDNet

There were 60 slices in the breast MRI dataset, containing information about the entire breast. The mapping model between mammography and breast MRI was implemented by Generative Adversarial Networks (GAN). Only 12 breast MRI slices were selected for one MRI sample, and they must be physically continuous and as close to the breast center as possible, which was set to exhibit a linear decrease along with its slice number, so as to imitate the attenuation of X-ray energy inside human body during medical X-ray imaging. Breast MRI slices and a sample mammography image are shown in Fig. 13.3.

Fig. 13.3
figure 3

An example of mammography image constructed with breast MRI: a 12 slices of breast MRI and b the constructed mammography image

2.2 Construction of the Mapping Model from Mammography to MRI

In this study, we try to generate several breast MRI slices from one mammography image by a mapping model, and GAN was a good tool to realize that [16]. In our model, we planned to set mammography images as inputs, and the output (target) was designed to be 12 breast MRI slices. For GAN, pix-to-pix was a typical one-to-one correlation in previous researches [17], which successfully tackles the problem of realized one-to-one image transformations.

Parameters to generate our GAN went as follows: input dimension was one (for mammography), the output dimension was 12 (for 12 breast MRI slices). When generating DCGan [18], the input was a small dimension of the noise data, and the target image was output by sampling and increasing dimensions on the convolution layer. Procedures of dimensional reduction and increase correspond to encoding and decoding in the encoder [19], and the generator network is demonstrated in Fig. 13.4.

Fig. 13.4
figure 4

Generator network

For the discriminator network, its parameters went as follows: input dimension was 24 (12 real or pseudo breast MRI plus 12 real MRI images), and the output dimension was one, which represented the probability of all the current input. The structure of our discriminator network is shown in Fig. 13.5.

Fig. 13.5
figure 5

Discriminator network

Based on the GAN structures, binary cross-entropy loss function was selected to describe the generator and discriminator networks. The prepared mammography and breast MRI images were put into the GAN network for training, and Fig. 13.6 illustrates the outcome after 20,000 times of alternating training between the two networks.

Fig. 13.6
figure 6

Outcome of applying mapping model on breast MRI: a real breast MRI and b pseudo breast MRI generated by mapping model

2.3 ROI Segmentation and PD% Calculation

In this study, U-Net was used to segment breast regions and fibro-glandular tissue from breast MRI for further PD% calculations. Mask data was needed for training U-Net to automatically achieve such operations, and we used a software called Labelme to create mask datasets (Fig. 13.7). The K-means algorithm was used to realize binarization for the fibro-glandular region, which were implemented as follows [20]:

Fig. 13.7
figure 7

Structure of the U-Net

  1. a.

    K cluster centers were randomly initialized.

  2. b.

    While the centers of K clusters moved:

    1. I.

      Calculate their distances to reach all samples.

    2. II.

      Separate the samples by their nearest clusters.

    3. III.

      Recalculate cluster centers.

Repeat this process until centers didn’t move anymore.

  1. c.

    And the results were output.

The region reflecting the fibro-glandular tissue was separated from the background with K-means algorithm, and K = 2. The segmentation outcome is shown in Fig. 13.8.

Fig. 13.8
figure 8

Outcome of U-net segmentation: a original breast MRI, b breast region, c fibro-glandular tissue region, and d binarization result from

FGT% is defined as the relative volume percentage of fibro-glandular tissue within the breast, and it is calculated as:

$${\text{PD\% }} = FGT\% = \frac{|FGT|}{{|{\text{Breast}}|}} \times 100$$
(13.1)

2.4 Validation

The effect of U-Net segmentation on breast and fibro-glandular tissue was evaluated by the DSC difference between regions obtained by manual definitions and U-Net, which was a common parameter for this application. It will directly affect the PD% and the evaluation results of our mapping model, taking U-Net segmentation results on breast region as an example.

$${\text{s}} = \frac{2|A \cap B|}{{|A| + |B|}}$$
(13.2)

where A was the manually defined breast region and B was the breast region after U-Net segmentation. The closer the DSC is to 1, the better effect U-Net segmentation produces.

PD% values were calculated for both pseudo and real breast MRI images to evaluate the pseudo’s effectiveness. Linear regression and Pearson’s correlation coefficients were used to evaluate the correlation of PD%, and the calculation was carried out by the following equation:

$${\text{r}} = \frac{{N\sum {x_{i} y_{i} - \sum {x_{i} \sum {y_{i} } } } }}{{\sqrt {N\sum {x_{i}^{2} - (\sum {x_{i} } )^{2} } } \sqrt {N\sum {y_{i}^{2} - (\sum {y_{i} } )^{2} } } }}$$
(13.3)

where x is the PD content of real breast and Y is the gland content calculated by the algorithm.

In order to evaluate the stability of our method of calculation, identical procedures were carried out on the same patient at different times in a year. The smaller the numerical changes of breast PD% obtained were, the more stable the calculation method was.

3 Experiments and Results

3.1 Evaluation of the Mapping Model from Mammography to Breast MRI Images

DSC was used to evaluate the spatial consistency between real and pseudo breast MRI images as a reflection of the mapping model’s performance. In one experiment, breast and fibro-glandular tissue regions were segmented from 30 random real images of their corresponding pseudo ones, followed by subsequent calculations for mean DSC. A total of ten experiments were conducted, and the results are shown in Table 13.1. The overall mean DSC was 0.937 with a mean standard deviation of 0.0069 (mean standard deviation) for the breast region in real and pseudo MRI images. For real and pseudo fibro-glandular tissue regions, the overall mean DSC was 0.8509, and the mean standard deviation of 0.039. Therefore, we could conclude that our mapping model is significantly less effective for the fibro-glandular tissue.

Table 13.1 DSC calculations of breast and fibro-glandular tissue in real and pseudo MRI images

3.2 Evaluation of the Breast PD%

Table 13.2 showed the breast PD% of ten data in the test dataset, and the maximum, minimum, and average errors of PD% were 5.97%, 0.28%, and 2.576% between real and pseudo MRI images, respectively. In order to further explore the stability for PD% calculation, 20 patients were randomly selected from the data set, and their breast PD% values were calculated based on the method mentioned with linear regression and the results are shown in Fig. 13.9.

Table 13.2 PD% calculations of breast and fibro-glandular tissue in real and pseudo MRI
Fig. 13.9
figure 9

Linear regression results for PD% between real and pseudo MRI

4 Discussion

In this study, a GAN-based mapping model between mammography and breast MRI was constructed, providing a new way to calculate PD%. It can directly transform mammography images into pseudo breast MRI, and then segment the breast and fibro-landular tissue regions, enabling easy calculations for the breast PD%.

Calculation results for DSC verify the effectiveness of our model. In terms of breast PD% calculation, the breast density and error analysis show that real and pseudo breast MRI images are highly consistent and correlated, suggesting actual feasibility for 3D reconstruction results from mammography to be applied as MRI images in actual practice.

Further efforts could focus on building a more promising mapping model to generate pseudo breast MRI images with higher quality, so that a more accurate fibro-glandular tissue segmentation could be achieved.