Abstract
Objectives
To compare technical and clinical screening performance parameters between computed radiography (CR) and direct digital radiography (DR) systems.
Methods
The number of women screened with CR was 73,008 and with DR 116,945. Technical and patient dose survey data of 25 CR and 37 DR systems were available. Technical performance was expressed by threshold thickness values at the mean glandular dose (MGD) level of routine practice. Clinical indicators included recall rate (RR), cancer detection rate (CDR), percentage of ductal carcinoma in situ (DCIS), percentage of cancers with T-scores smaller than 1 cm and positive predictive value (PPV).
Results
Contrast threshold values for the 0.1-mm gold disk were 1.44 μm (SD 0.13 μm) for CR and 1.20 μm (SD 0.13 μm for DR). MGD was 2.16 mGy (SD 0.36 mGy) and 1.35 mGy (SD 0.32 mGy) for CR and DR respectively. We obtained for CR, respectively DR, the following results: RR in the first round of 5.48 % versus 5.61 %; RR in subsequent rounds of 2.52 % versus 2.65 %; CDR of 0.52 % versus 0.53 %; DCIS of 0.08 % versus 0.11 %; a rate of cancers with T-scores smaller than 1 cm of 0.11 % versus 0.11 %; PPV of 18.45 % versus 18.64 %; none of them was significantly different.
Conclusion
Our screening indicators are reassuring for the use of CR and DR, with CR operating at 60 % higher MGD.
Key Points
• Breast cancer screening can employ both computed (CR) and direct digital radiography (DR).
• Screening performance parameters for CR and DR technology are not significantly different.
• Screening parameters are in accordance with European Guidelines.
• Radiation doses employed for CR are generally 60 % greater than for DR.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
Introduction
In 2003, the European Council recommended mammography screening for women aged 50 to 69 years in accordance with European Guidelines [1]. These guidelines include the whole process, from the invitation of the women to epidemiological evaluation of screening data. In 2006, the European Guidelines were updated to cope with newer insights and technical evolutions [2]. A new chapter was dedicated to digital mammography systems. The main idea of the additional physical tests was to guarantee that at least the same quality could be obtained with the new technology as obtained with film-screen mammography (FSM).
Digital mammography encompasses a group of technologies. In this study we distinguish between ‘computed radiography’ (CR) and ‘direct digital radiology’ (DR). Computed radiography uses phosphor plates and a separate reader. At the time of data collection, powder-based CR systems were in use, with the majority using a double-sided reader as opposed to single sided. Direct digital radiology systems have an integrated X-ray system and detector. The image is available in the computer immediately after the X-ray exposure [3, 4].
The technical image quality of a digital mammogram depends on many factors, including the quality of the detector, the X-ray beam quality and the dose at which the system is operated [5]. In the European Guidelines [2], the technical image quality of the complete imaging chain is expressed via contrast thresholds. In practice, for a selection of disk diameters, the maximal gold thickness of the just visible disks is set (threshold thickness limit). This criterion is evaluated with the CDMAM (Contrast Detail for MAMmography) test object (Artinis, The Netherlands). The results of a performance analysis with the CDMAM test object are threshold levels expressed as X-ray contrast or in terms of the thickness of gold disks. This is evaluated for various diameters and characterised by the dose at which these threshold values have been obtained. Usually the threshold values of the 0.1-mm disk are evaluated in detail, as this is the most difficult criterion to meet in practice. A number of studies showed the superiority of DR technology when compared to CR [6–8].
The evaluation of the impact of digital mammography on screening indicators is reported in a selected set of prospective and retrospective studies. Whereas the first large multicentre study used both DR and CR technology [9], most studies reported about screening programmes in which only DR technology was used [10–18]. Lipasti et al. [19] reported the use of CR mammography in screening and compared this technology via the diagnostic workup of the patients. In the CR arm, 1.71 % of those screened and in the film-screen arm 1.59 % of screened women were recalled for further workup. The PPV was significantly higher for CR (36 %) than in FSM (26 %). Skaane et al. concluded in their review study [20] that ten studies that compared digital and FSM in breast cancer screening showed divergent and rather conflicting results.
The literature is very scarce in terms of data specifically for CR systems used in a screening setting, notwithstanding the widespread distribution of these systems in Europe. To our knowledge, the only investigation with large-scale use of CR in screening has been performed in France (http://www.invs.sante.fr/publications/2011/programme_depistage_cancer_sein/plaquette_depistage_cancer_sein.pdf) and has shown a significantly reduced cancer detection rate for the group of centres using CR versus DR technology. These facts were the direct trigger for a more detailed investigation of the screening data in Flanders, as there is mixed use of DR and CR technology in our mammographic units. This article reports an analysis of the technical and clinical screening performance indicators with CR versus DR for the 2008–2010 time period.
Material and methods
Breast cancer screening organisation in Flanders
Breast cancer screening was started in our region in 2001. Five (regional) cancer screening centres are responsible for the implementation of the Flemish breast cancer screening programme according to the European Guidelines.
Screening mammograms are taken in 171 mammographic units. The units participate with their CR or DR system, with the only requirements being that the systems have passed the type test [21] and the subsequent acceptance test and yearly tests in the mammographic unit and that daily QC procedures are performed for the X-ray system and monitor. At the end of 2010, there were 96 digital systems, distributed as follows: for DR: 7 Fuji Amulet MS 1000 systems, 10 GE Senographe Essential systems, 2 GE Senographe DS systems, 7 Hologic Selenia systems, 2 IMS Giotto Image systems, 5 Sectra MicroDose systems, 21 Siemens Mammomat Inspiration systems and 3 Siemens Mammomat Novation systems. The following CR technology was available: 3 Agfa MM3.0 systems, 4 Carestream EHR-M2 systems, 31 Fuji Profect systems and 1 Konica CP/Regius 190 system.
Screening population
This study is a retrospective analysis of data from the population-based Flemish breast cancer screening programme. The target population for bi-annual invitation was 715,106 women in 2006 and reached 760,633 women in 2010. Invitation coverage is complete. Over the last 5 years, between 163,793 (in 2006) and 183,384 (in 2010) women were screened each year.
The period of investigation was fixed from January 2008 until December 2010.
Between 2008 and 2010, 73,008 women were screened with CR [17,855 women at their first screening (first round) and 55,153 in subsequent rounds]. In parallel to this, 116,945 women were screened with DR (25,032 in the first round and 91,913 in subsequent rounds).
Mammography systems
As soon as digital mammography was conditionally accepted, a gradual introduction of digital mammography was observed. Whereas in 2006 less than 2 % of the screening mammograms were digital, this percentage reached 60 % in 2010 (Fig. 1).
Physico-technical data for present study were collected in a subset of mammographic units controlled by the medical physics experts of either the University of Ghent or the University of Leuven. This made a total of 62 digital mammography units involved, with 25 CR systems and 37 DR systems. The Ghent group performed the QA activities in 24 of these units and the group of Leuven in 38 units (Table 1).
Patient dose data were available from the 3-yearly legally required patient dosimetry surveys. We took care to select the physico-technical data closest in time to the patient dose survey. In addition, it was verified from daily QC data that there were no substantial changes in the systems between the technical tests and the patient dose survey.
Patient dosimetry
In our mammographic dose surveys, the mean glandular doses (MGDs) were calculated using Dance’s approach, from the following equation [22]:
where K is the incident air kerma at the upper surface of the breast, measured without backscatter and under the compression paddle; factor g is the incident air kerma to MGD conversion factor for a glandularity of 50 %; the factor c corrects for the difference in breast composition from 50 % glandularity and was assumed here as in the UK screened population; factor s corrects for the x-ray spectrum used.
Following our legislation, MGDs have to be calculated for at least 50 successive patients (4 mammograms per patient). Radiographers either note down all relevant exposure-related data in a data file and transmit this file to the medical physicist or images are collected in an automated way and the header information is then scrutinised for exposure data automatically under supervision of a medical physicist as in [23, 24]. In the latter case, data are usually collected in more than 50 patients per mammographic system. An overview of the patient dose data sample is shown in Table 1. We first calculated the mean and median values of the MGDs of all the mammography systems. We then calculated for CR respectively DR the mean values of the mean and median data of the individual systems. Next, the same calculations were performed for the systems tested by the Ghent and Leuven teams separately. T-test analysis was performed to compare whether the group averages for CR and DR were significantly different.
Mean glandular dose as a function of compressed breast thickness for CR and DR is visualised via the best exponential curve fit through all the individual data points.
Technical screening performance indicators
According to the European Guidelines, systems should operate below an MGD of 3 mGy for the 6-cm breast that is simulated by 5 cm of polymethyl methacrylate (PMMA) and below a threshold gold thickness of 1.68 μm for the 0.1-mm disk (acceptable level). The best systems operate at low dose and low threshold gold thickness. Mean glandular doses describing the dose level of the contrast detail test were calculated from an acquisition of a 5-cm PMMA slab that was imaged with automatic exposure control.
Threshold thickness values were obtained from acquisitions of the CDMAM test object (Artinis, Zetten, The Netherlands) using a manual exposure identical to the automatic 5-cm acquisition for the MGD. Manual reading of the CDMAM was performed at acceptance along with an automated approach that was then continued afterwards if the results of the first automated reading were in line with the manual reading. Automatic evaluation was performed with the software cdcom1.5 (downloadable from the Euref website) and Erica2 (available from www.qaelum.com). This type of data processing was worked out by Young et al. [25] and has been proven to predict human readings from the computerised evaluation method successfully.
We then calculated for CR, respectively DR, the mean values of threshold thickness values and corresponding doses. Next, in analogy to the patient dose data, the same calculations were performed for the systems tested by the Ghent and Leuven teams separately.
Clinical screening performance indicators
The following screening indicators were calculated for CR and DR technologies and for both the initial and the subsequent screening rounds: recall rate (RR), cancer detection rate (CDR), the percentage of DCIS, the percentage of tumours with size <1 cm and positive predictive value (PPV). The RR is defined as the proportion of screened women for whom further workup was recommended. The CDR is the number of pathologically proven malignant lesions of the breast (both in situ and invasive) detected in a screening round per 1,000 women screened in that round. It is expressed as a multiple of the underlying, expected breast cancer incidence rate in the absence of screening. It should be higher than three times this incidence rate for initial screening rounds and 1.5 times higher than the incidence rate for subsequent rounds: the background incidence of breast cancer in the absence of screening is 1.25 per 1,000 women. The T-staging represents the size of the tumours. The PPV is the percentage of cancers found in the recalled women. In some cases, the results of the follow-up exams are not known in the screening programme, as these investigations are performed outside the screening setting. We did not correct our numbers for the missing information. For the total group of patients scanned in the period investigated, 12.5 % of the follow-up results of the recalled cases was not available. As missing follow-up results are homogeneously spread over the cohort, our CDR may therefore be underestimated by up to 12.5 %.
The results of CR and DR technologies for the investigated period were compared using Fisher’s exact test and a P-value less than 0.05 was regarded as statistically significantly different.
Results
Patient dosimetry
The MGDs were calculated for a total of 27,745 mammograms, of which 5,623 were acquired with CR and 22,122 with DR. The larger number of DR mammograms is due to the easier dose data acquisition. As we report only mean and median values, this situation does not bias the results. An overview of the dose data, differentiated according to the different X-ray modalities, is presented in Table 2. The mean and median MGDs of CR and DR systems are significantly different (P < 0.0001). The mean and median MGDs and their standard deviations (SD) were 2.16 mGy (SD 0.36 mGy) and 2.04 mGy (SD 0.36 mGy) for CR and 1.35 mGy (SD 0.32 mGy) and 1.26 mGy (SD 0.30 mGy) for DR. The mean and median MGDs were 60 % higher for CR than for DR.
Patient dose data for the units tested by Ghent and Leuven showed similar differences. For Ghent, the mean and median MGDs were 2.22 mGy (SD 0.33 mGy) and 2.10 mGy (SD 0.30 mGy) for CR and 1.57 mGy (SD 0.31 mGy) and 1.47 mGy (SD 0.31 mGy) for DR. Data for Leuven were 2.00 mGy (SD 0.41 mGy) and 1.88 mGy (SD 0.46 mGy) for CR and 1.30 mGy (SD 0.31 mGy) and 1.21 mGy (SD 0.28 mGy) for DR. The doses for CR showed no differences between data as obtained by Ghent and Leuven. The mean MGDs for DR were almost significantly different (P-value 0.05), whereas the median MGDs were different, with a P-value of 0.03. These observations confirm the higher dose settings for CR than for DR in the units controlled by both groups.
Figure 2 visualizes the dose data, allowing an easy comparison of the data according to the limiting values of the European Guidelines. The averaged values remain below the acceptable dose levels. It must be remarked that the limiting values in the Guidelines refer to MGDs estimated from the PMMA acquisitions and are not, strictly speaking, intended for use in patient dose surveys.
Physico-technical data
Physico-technical characteristics in terms of threshold gold thickness at the associated clinical dose setting are shown in Fig. 3.
The CR and DR systems are shown as two groups in the graph, with DR presenting clearly better performance parameters.
Threshold thickness values for the 0.1-mm disk of the DR systems had an average value of 1.20 μm (SD 0.13 μm). The same values for the CR systems were on average 1.43 μm (SD 0.13 μm). These values were obtained at the following doses: 2.26 mGy (mean) and 2.31 mGy (median) with SD 0.44 mGy for CR and 1.26 mGy (mean) and 1.25 mGy (median) with SD 0.27 mGy for DR. These dose values are very similar to the results from the patient dose survey. Both threshold thickness values and MGD were significantly different between CR and DR (P-values < 10-9).
Screening indicators
The screening performance parameters are summarised in Table 3. None of the parameters showed statistically significant differences between CR and DR technology.
The RR for the first round was 5.48 % and 5.61 % for CR and DR technology respectively and well below the acceptable level of 7 % set by the European Guidelines. In subsequent rounds, the recall rate was 2.52 % respectively 2.65 %, which is lower than the achievable level of 3 %.
The CDR for the first round was 0.64 % and 0.63 % for CR and DR technology respectively and above the required three times the incidence rate, estimated to be 0.375 %. In subsequent rounds, the CDR is 0.48 % respectively 0.51 %, which is above 1.5 times the incidence rate.
In the initial rounds we achieved 0.03 % of DCIS for CR and 0.07 % for DR, and in subsequent rounds 0.05 % for CR and 0.08 % for DR. These values were not statistically significantly different. The percentage of DCIS in the detected cancers ranged from 15.6 % in CR to 19.8 % in DR.
The detection rates of cancers with T-scores <1 cm were 0.09 % for CR in initial screening versus 0.12 % in subsequent rounds and for DR 0.13 % and 0.10 % for initial respectively subsequent rounds.
The positive predictive values were also higher than mentioned in the European Guidelines. Computed radiography was 13.9 % for initial rounds and 21.4 % for subsequent rounds; DR was 13.3 % in initial rounds and 21.5 % in subsequent rounds.
Discussion
A unique feature of several breast cancer screening programmes is that the performance is measured continuously as recommended in the European Guidelines. This allows for a follow-up of the effect of new technologies in terms of parameters that are used throughout Europe. The present study collected the data of all patients that had been screened with digital mammography since the introduction of digital mammography and until 2010. In our region both CR and DR had been used.
There are a few study limitations in the screening performance data: (1) while it is known that some of the screening indicators, such as CDR and PPV, may depend on the age of the women, an age-specific analysis could not be performed here as the cohorts with CR and DR were already rather small [26]; (2) we did not document the experience of the second readers. Experience can be judged as acceptable or substantial as the groups have remained largely unchanged since the introduction of screening in 2001; (3) our cohorts were too small to perform a statistical analysis on the numbers of low-grade DCIS lesions versus high-grade lesions as detected with CR versus DR technology. Although the breast imaging community would be eager to learn about possible overdiagnosis, we could not perform the analysis carried out in Bluekens et al.’s study [15] in our cohorts; the groups were too small.
Our main result was that we could not observe any significant difference in the RR, CDR, detection rate of DCIS, percentage of cancers with T-stage <1 cm and PPV in cohorts screened with CR versus DR. This is different from the results in France and could be due to some differences between the French screening programme and our implementation. Our programme has generalised the second reading of all mammograms, whereas in France the first reader can immediately perform add-on exams if deemed necessary. We have soft copy viewing for the first and second reading, whereas in France hard copy reading was allowed. Our programme is unique in having generalised daily and centrally supervised quality control of X-ray systems and monitors, and the age of our screened population is limited to 50–69 years, whereas in France also older women are screened. A limitation of our data sample is of course that it is much smaller than the French sample, and therefore we plan continued evaluation of possible differences in performance. Aside from the differences in the screening setup, differences could also be due to different operational points of the mammography systems. The present study confirmed that in Flanders, CR and DR systems have been set at different MGD and quality levels. The average thickness of gold to detect the 0.1-mm disk was 1.49 μm for CR and 1.20 μm for DR. This was achieved at a mean MGD of 2.26 mGy respectively 1.26 mGy. From daily experience (results not shown), it follows that powder-based CR systems cannot achieve the threshold thickness values seen with DR within the given dose limits of the European Guidelines. Recent experience with needle-based CR systems is promising as shown in a recent comparative technical evaluation of powder-based CR, needle-based CR and an a-Se detector [27]. These CR systems can achieve lower threshold thicknesses, close to the values observed with DR.
In the present study, we showed similar screening performance parameters for CR and DR, but with both types of digital mammography systems operated at different dose and quality levels. This does not prove that these dose levels are optimal or that they could be reduced. Our study is only an indirect approval of the technical requirements put forward in the European Guidelines. Differences in detection of clusters of subtle microcalcification between CR and DR have been quantified recently [7]. The study showed differences between CR and DR and between different dose levels for detection of clusters of microcalcification. Their data are in line with our data in which we observed similar clinical screening parameters with CR and DR but with CR operated at a significantly higher dose level than DR. The present study and the quantitative study do not support lowering the dose settings for either our CR or DR systems. It would be interesting to investigate whether our screening performance parameters could be improved with higher dose DR.
The higher doses used with CR technology (2 mammograms at 2.16 mGy) versus DR (2 mammograms at 1.35 mGy ) for a screening examination at the age of 50 years could lead to an extra 1.6 radiation induced cancers per 100,000 women screened. However, the expected benefit of reduction in premature mortality afforded by routine mammographic screening in terms of either lives saved or years of life saved greatly exceeds this risk [28].
The ultimate test for the performance of the screening programme is reflected in the amount and time distribution of interval cancers. These data are available up to 2007, before the introduction of digital mammography. During the film-screen period, our programme complied with all European Guidelines in this respect. The data are suggestive for compliance in the digital era too.
In summary, all quality assurance efforts in the frame of breast cancer screening offer a unique opportunity to control the effects of the introduction of new imaging techniques. From the physico-technical point of view, DR performs better than powder-based CR in terms of dose and image quality. In the Flemish screening programme we did not see a difference in clinical screening indicators obtained with CR and DR.
References
The official journal of the European Union, ‘Council Recommendation of 2 December 2003 on Cancer Screening’ (2003/878/EC), L327/34 – L327/38, Dec 16, 2003
Perry N, Broeders M, de Wolf C, Törnberg S, Holland R, von Karsa L (2006) European Guidelines for breast cancer screening and diagnosis. Office for the Official Publications of the European Communities, Luxemburg
Bick U, Diekmann F (2009) Digital mammography. Springer Verlag, Berlin
IAEA Human Health Series, No 17. Quality assurance programme for digital mammography. Downloadable from http://www.iaea.org/Publications/index.html
Marshall NW, Monnin P, Bosmans H, Bochud FO, Verdun FR (2011) Image quality assessment in digital mammography: part I. Technical characterization of the systems’. Phys Med Biol 56:4201–4220
Rivetti S, Lanconelli N, Campanini R et al (2006) Comparison of different commercial FFDM units by means of physical characterization and contrast-detail analysis. Med Phys 33:4198–4209
Warren LM, Mackenzie A, Cooke J et al (2012) Effect of image quality on calcification detection in digital mammography. Med Phys 39:3202–3213
Monnin P, Marshall NW, Bosmans H, Bochud FO, Verdun FR (2011) Image quality assessment in digital mammography: part II. NPWE as a validated alternative for contrast detail analysis. Phys Med Biol 56:4221–4238
Pisano ED, Gatsonis C, Hendrick E et al (2005) Diagnostic performance of digital versus film mammography for breast-cancer screening; Digital Mammographic Imaging Screening Trial (DMIST) Investigators Group. N Engl J Med 353:1773–1783
Skaane P, Hofvind S, Skjennald A (2007) Randomized trial of screen-film versus full-field digital mammography with soft-copy reading in population-based screening program: follow-up and final results of Oslo II study’. Radiology 244:708–717
Skaane P, Balleyguier C, Diekmann F et al (2005) Breast lesion detection and classification: comparison of screen-film mammography and full-field digital mammography with soft-copy reading–observer performance study. Radiology 237:37–44
Lewin JM, Hendrick RE, D’Orsi CJ et al (2001) Comparison of full-field digital mammography with screen-film mammography for cancer detection: results of 4,945 paired examinations. Radiology 218:873–880
Hambly NM, McNicholas MM, Phelan N, Hargaden GC, O'Doherty A, Flanagan FL (2009) Comparison of digital mammography and screen-film mammography in breast cancer screening: a review in the Irish breast screening program’. AJR Am J Roentgenol 193:1010–1018
Bluekens AM, Karssemeijer N, Beijerinck D et al (2010) Consequences of digital mammography in population-based breast cancer screening: initial changes and long-term impact on referral rates. Eur Radiol 20:2067–2073
Bluekens AM, Holland R, Karssemeijer N, Broeders MJ, den Heeten GJ (2012) Comparison of digital screening mammography and screen-film mammography in the early detection of clinically relevant cancers: a multicenter study. Radiology 265:707–714
Sala M, Comas M, Macià F, Martinez J, Casamitjana M, Castells X (2009) Implementation of digital mammography in a population-based breast cancer screening program: effect of screening round on recall rate and cancer detection. Radiology 252:31–39
Van Ongeval C, Van Steen A, Vande Putte G et al (2010) Does digital mammography in a decentralized breast cancer screening program lead to screening performance parameters comparable with film-screen mammography? Eur Radiol 20:2307–2314
Vinnicombe S, Pinto Pereira SM, McCormack VA, Shiel S, Perry N, Dos Santos Silva IM (2009) Full-field digital versus screen-film mammography: comparison within the UK breast screening program and systematic review of published data. Radiology 251:347–358
Lipasti S, Anttila A, Pamilo M (2010) Mammographic findings of women recalled for diagnostic work-up in digital versus screen-film mammography in a population-based screening program. Acta Radiol 51:491–497
Skaane P (2009) Studies comparing screen-film mammography and full-field digital mammography in breast cancer screening: updated review. Acta Radiol 50:3–14
Thierens H, Bosmans H, Buls N et al (2009) Type testing of physical characteristics of digital mammography systems for screening within the Flemish breast cancer screening programme. Eur J Radiol 70:539–548
Dance DR, Skinner CL, Young KC, Beckett JR, Kotre CJ (2000) Additional factors for the estimation of mean glandular breast dose using the UK mammography dosimetry protocol. Phys Med Biol 45:3225–3240
Michielsen K, Jacobs J, Bosmans H (2007) Patient dosimetry for mammography. AIP Conference Proceedings of the Fourth International Summer School on Nuclear Physics Methods and Accelerators in Biology and Medicine 958:292–293
Chevalier M, Morán P, Ten JI, Fernández Soto JM, Cepeda T, Vañó E (2004) Patient dose in digital mammography. Med Phys 31:2471–2479
Young KC, Cook JJH, Okudo JM, Bosmans H (2006) Comparison of software and human observers in reading images of the CDMAM test object to assess digital mammography systems. Proc SPIE 6142, 614206-1 – 614206-13
Bennett RL, Blanks RG, Moss SM (2009) Evaluation of extension of breast screening to women aged 65–70 in England using screening performance measures. Br J Cancer 100:1043–1047
Marshall NW, Lemmens K, Bosmans H (2012) Physical evaluation of a needle photostimulable phosphor based CR mammography system. Med Phys 39:811–824
Yaffe MJ, Mainprize JG (2011) Risk of radiation-induced breast cancer from mammographic screening. Radiology 258:98–105
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bosmans, H., De Hauwere, A., Lemmens, K. et al. Technical and clinical breast cancer screening performance indicators for computed radiography versus direct digital radiography. Eur Radiol 23, 2891–2898 (2013). https://doi.org/10.1007/s00330-013-2876-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00330-013-2876-0