Introduction

The National Lung Screening Trial (NLST) recently reported clinical evidence for lung cancer screening using low-dose helical CT for high-risk examinees. The NLST enrolled approximately 50,000 participants at more than 30 trial sites nationwide over a 20-month period. Participants were required to have a smoking history of at least 30 pack-years and were either current or former smokers without signs, symptoms, or a history of lung cancer. Participants were randomly assigned to be screened annually, three times, with either low-dose helical CT (often referred to as spiral CT) or standard chest X-ray [1] However, the risk–benefit balance for cancer screening tests by multi-detector computed tomography (MDCT) or chest X-ray must be considered, because radiation exposure is a risk factor that increases the probability of the future development of malignant tumors. The discovery that over-dose radiation exposure as a result of perfusion MDCT resulted in alopecia initiated development of new technology to reduce radiation exposure in MDCT [2, 3].

Image reconstruction by the iterative reconstruction (IR) method is one of the most effective techniques for reducing radiation exposure in MDCT [47].

To the best of our knowledge, even though the effectiveness of low-dose MDCT has been proved by using a chest phantom with pure ground-glass nodules (GGNs), detectability by use of low-dose MDCT has not been evaluated quantitatively.

To evaluate the feasibility of the IR method with low-dose MDCT for lung cancer screening, we performed a comparative phantom study with conventional image reconstruction by use of the filtered back-projection (FBP) method.

Materials and methods

Chest CT phantom

A chest CT phantom (LSCT001; Kyoto Kagaku, Kyoto, Japan) containing simulated nodules in simulated lungs was used. The phantom was a composite of Styrofoam and hard urethane powder in urethane resin adhesive. The simulated tumors were spheres of acrylic foam and urethane in the simulated lung (Fig. 1) [8].

Fig. 1
figure 1

Chest CT phantom containing simulated nodules in simulated lungs

Details of this device have been reported elsewhere [9]. Simulated GGNs were placed in the right upper portion (lung apex), right middle portion (tracheal bifurcation), and right lower portion (lung base). Simulated solid nodules were placed in the left upper portion (lung apex), left middle portion (tracheal bifurcation), and left lower portion (lung base). The diameters of simulated GGNs placed in the right lung were 4, 6, 8, 10, and 12 mm (Fig. 2).

Fig. 2
figure 2

Control images. Simulated GGNs with diameters of 4, 6, 8, 10, and 12 mm are seen in the right lung

The CT numbers for the simulated lung, the simulated GGNs, and simulated solid nodules were −900, −800 and −630 Hounsfield units (HU), respectively.

CT scanning

A 16-row MDCT scanner (Alexion; Toshiba Medical Systems, Nasu, Japan) and a 64-row MDCT scanner (Aquilion CXL; Toshiba Medical Systems) were used.

As control images, high-resolution 5-mm images were reconstructed at 5-mm intervals by use of 64-row MDCT scanners operating with the acquisition conditions: tube voltage 120 kVp, scan speed 1.5 s/r, pitch factor 0.641, tube current 350 mA, and detector collimation 0.5 × 64 mm. These conditions resulted in radiation exposure of 99.0 mGy for the volume CT dose index (CTDIvol) and 2,870 mGy/cm for the dose-length product (DLP). The diameters of simulated GGNs placed in the right lung were 4, 6, 8, 10, and 12 mm (Fig. 2).

As subject images, nine different spatial resolution images were used for 16 and 64-row MDCT. The CT acquisition conditions are listed in Tables 1 and 2. In the scan protocol for simulated GGNs, total effective dose (mSv) in 16 and 64-row MDCT ranged from 0.4 to 2.5 and from 0.25 to 1.6, respectively (Tables 1 and 2).

Table 1 Acquisition conditions for 16-row MDCT
Table 2 Acquisition conditions for 64-row MDCT

The nine images of 16 and 64-row MDCT were reconstructed by use of the FBP and IR methods, respectively. In this study, adaptive iterative dose reduction 3D (AIDR 3D) was used as the IR method for MDCT image reconstruction [47]. Finally, 36 images were used for the radiologist-performance test for the detectability of simulated GGNs.

Radiologist-performance test

A radiologist-performance test was performed to investigate the detectability of the simulated GGNs with different radiation exposure using the two different image-reconstruction methods (FBP and AIDR 3D). In total, eight radiologists in five institutes were asked to participate in this radiologist-performance test using the chest phantom with simulated GGNs in the right lung. The radiologists’ experience ranged from 4 to 28 years (mean ± SD, 13.75 ± 9.94 years); five radiologists had <10 years of experience, and three radiologists had >20 years of experience. The eight radiologists usually read whole body imaging, did not specialize in one region, and read chest CT on a regular basis.

The extracted images were displayed on a black-background and the radiologists interpreted the extracted images on a color monitor with a spatial resolution of 1536 × 2048 (RadiForce GS320; Eizo, Ishikawa, Japan) by use of a Digital Imaging and Communications in Medicine Viewer (RapideyeCore; Toshiba Medical Systems). The attenuation was −900 HU for lung parenchyma and −800 HU for the simulated GGNs.

At the start of the test a control image was presented, and the eight radiologists interpreted the image and confirmed the location, shape, and size of the simulated GGNs, which were clearly visible (Fig. 2).

To assess the detectability of pure GGNs, the visual scores of the eight radiologists who independently interpreted the reconstructed CT images by use of conventional FBP and AIDR 3D methods were evaluated. The radiologists were unaware of the CT acquisition conditions, for example the milliampere and pitch factor settings, used for acquisition of the image of the phantom. The time for reading the images was not limited. The window width was set at a constant 1,600 HU, and the window level was set at a constant −600 HU for the visual evaluations.

Confidence level was defined by use of a five-point scale:

  1. 1.

    not visualized;

  2. 2.

    1–49 % of margin visualized;

  3. 3.

    50 % of margin visualized;

  4. 4.

    51–99 % of margin visualized; and

  5. 5.

    100 % of margin visualized (Fig. 3).

To indicate whether or not an object was present, each radiologist placed a score on the recording forms. The images were displayed in a sequence determined by use of a random number table.

Fig. 3
figure 3

Visual scores defined by use of a 5-point scale

Statistical analysis

For the eight radiologists, mean visual scores were calculated as the confidence level for detection of simulated GGNs. A paired t test was used to evaluate the significance of differences between mean visual scores for detectability by use of the FBP and AIDR 3D methods for reconstructed CT images of simulated GGNs with diameters of 6, 8, and 10 mm in the phantom. Statistical analysis was performed with IBM SPSS Statistics 21 (IBM, Chicago, IL, USA). A value of P < 0.05 was regarded as indicative of a significant difference.

Before the radiologist-performance test, the location, shape, and size of the simulated GGNs clearly visible on the control images were shown. i.e. true negative and false-positive results were not designed to be part of the experimental procedure, so receiver operating characteristic (ROC) curve analysis could not be conducted.

Results

Mean visual scores for simulated pure GGNs were compared between the FBP and AIDR 3D methods for reconstructing CT images at the lowest total effective dose (mSv) when the mean visual score for the AIDR 3D method reached more than 2 (Tables 3 and 4).

Table 3 16-row MDCT: mean visual scores for simulated pure GGNs by the FBP and AIDR 3D methods at the lowest total effective dose (mSv) when the mean visual score for the AIDR 3D method reached more than 2
Table 4 64-row MDCT: mean visual scores for simulated pure GGNs by the FBP and AIDR 3D methods at the lowest total effective dose (mSv) when the mean visual score for the AIDR 3D method reached more than 2

With 16-row MDCT, significant differences between mean visual scores for the FBP and AIDR 3D methods were observed for pure GGNs with a diameter of 6 mm in the middle and lower lung fields only and for pure GGNs with a diameter of 10 mm in the upper lung field only (Table 3).

With 64-row MDCT, significant differences between mean visual scores for the FBP and AIDR 3D methods were observed for any location and size, except for pure GGNs with a diameter of 6 mm in the lower lung field only (Table 4).

Representative phantom images in this study are shown in Figs. 4 and 5. Figure 4 shows the 16-row MDCT images reconstructed by use of the FBP and AIDR 3D methods at the lowest total effective dose for visualizing a pure GGN with a diameter of 8 mm. Figure 5 shows the 64-row MDCT images reconstructed by use of the FBP and AIDR 3D methods at the lowest total effective dose for visualizing a pure GGN with a diameter of 6 mm. The lowest total effective dose (mSv) when the mean visual score reached more than 2 is shown in Tables 5, 6.

Fig. 4
figure 4

16-row MDCT images (10 mA, HP15) reconstructed by use of the FBP and AIDR 3D methods at the lowest total effective dose at which a pure GGN of diameter 8 mm was visualized. The upper images were reconstructed by use of the FBP method and the lower images by use of the AIDR 3D method

Fig. 5
figure 5

64-row MDCT images (10 mA, HP27) reconstructed by use of the FBP and AIDR 3D methods at the lowest total effective dose at which a pure GGN of diameter 6 mm was visualized. The upper images were reconstructed by use of the FBP method and the lower images by use of the AIDR 3D method

Table 5 16-row MDCT: the lowest total effective dose (mSv) when the mean visual score reached more than 2
Table 6 64-row MDCT: the lowest total effective dose (mSv) when the mean visual score reached more than 2

A pure GGN with a diameter of 6 mm in the upper lung field was not visualized under any of the 16-row MDCT acquisition conditions in this study, and a pure GGN with a diameter of 6 mm in the upper lung field was not visualized by the FBP method for reconstructed images for any of the 64-row MDCT acquisition conditions used in this study (these results are presented as blanks in Tables 3, 5, and 6). However, a pure GGN with a diameter of 6 mm in the upper lung field was visualized by use of the AIDR 3D method for reconstructed images with 64-row MDCT.

For a pure GGN with a diameter of 6 mm, the lowest total effective dose (mSv) was lower with the AIDR 3D method than with the FBP method irrespective of location (Tables 5 and 6). For pure GGNs with diameters of 8 and 10 mm, the lowest total effective dose (mSv) was lower for the AIDR 3D method than for the FBP method in the upper lung field only (Tables 5 and 6).

The mean visual scores for all simulated GGNs with diameters of 6, 8, and 10 mm were compared between the FBP and AIDR 3D methods for reconstructing CT images at the minimum, medium, and maximum total effective doses (mSv) (Table 7). For both 16 and 64-row MDCT, the mean visual scores for simulated GGNs were higher with the AIDR 3D method than with the FBP method. At the minimum total effective dose, the mean visual scores with the FBP method were less than 2, but with the AIDR 3D method they were more than 2. At the medium and maximum total effective dose, the mean visual scores with both FBP and AIDR 3D methods were more than 2.

Table 7 Mean visual scores for all simulated GGNs with diameters of 6, 8, and 10 mm

With 16-row MDCT, significant differences between the mean visual scores for the FBP and AIDR 3D methods were observed for the minimum total effective dose only. However, with 64-row MDCT, significant differences between the mean visual scores for the FBP and AIDR 3D methods were observed for any total effective dose.

Discussion

The National Cancer Institute (NCI) has released initial results from a large-scale test of screening methods performed to reduce deaths from lung cancer. The NLST, a randomized national trial involving more than 50,000 heavy smokers, compared the effects on lung cancer mortality of two screening procedures for lung cancer, low-dose helical computed tomography (CT) and chest X-ray, and found 20 % fewer lung cancer deaths among trial participants screened by low-dose helical CT [1]. The results of the NLST may facilitate lung cancer screening by MDCT. The finding that over-dose radiation exposure by perfusion MDCT resulted in alopecia, however, initiated the development of new technology to reduce radiation exposure by MDCT [2, 3], and several technical trials with auto exposure control (AEC), multiple rows of detectors, and image-based filter techniques have been performed [7, 10, 11]. Recently, a new technique for reducing radiation exposure, iterative reconstruction, has been used for MDCT image reconstruction. AIDR 3D is one method of iterative reconstruction [47]. Considering the risk/benefit balance, especially for health check-ups, the amount of radiation exposure is a significant factor determining its feasibility [1214].

The extent to which exposure to radiation could be reduced was evaluated by making reconstructed images by use of the AIDR 3D method, a new method with low-dose MDCT. By use of the chest phantom containing simulated GGNs, detectability in reconstructed images was compared between the FBP method and the AIDR 3D method. The nine conditions used for CT acquisition by 16-row MDCT were combinations of tube current 10, 20, or 30 mA and helical pitch, 23, 15, and 11. These resulted in the lowest possible dose for the 16 MDCT scanner to reduce radiation exposure.

In 2000, when MDCTs became available in clinical practice, the 4-row MDCT was a common scanner. After rapid technological innovation over the last 10 years, the row number of MDCT increased to 16. Given this background, 64-row MDCT is likely to become common, which is why 64-row MDCT was investigated with 16-row MDCT in this study.

To investigate the extent to which exposure to radiation could be reduced for lung cancer screening by 64-row MDCT, 9 CT acquisition conditions were chosen: combinations of tube current, 10, 20, or 30 mA; and helical pitch 45, 27, or 21. Again, these resulted in the lowest dose possible for the 64 MDCT scanner to reduce exposure to radiation. Detectability in reconstructed images was compared between the FBP and AIDR 3D methods in the same way as for 16-row MDCT.

A chest phantom has been used in previous experimental studies [810]. In this study, simulated GGNs with a diameter of 6 mm were used, because nodules with diameters more than 5 mm are the earliest lung cancers that can be detected by screening, and the smallest of the simulated GGNs in the phantom with a diameter greater than 5 mm were of diameter 6 mm.

A simulated nodule with a diameter of 10 mm simulates another important lesion, because a nodule of diameter more than 5 mm that grows or reaches a diameter of 10 mm must be confirmed pathologically, either surgically or by obtaining a biopsy specimen [15]. The 8-mm-diameter nodule was selected as a lesion intermediate between the 6 and 10-mm-diameter lesions. This phantom had GGNs in the right lung and solid nodules in the left lung. The GGNs in the right lung were the focus of this study.

Swenson et al. [16] found that nodules were missed in 26 % of patients undergoing their first annual CT examination. GGNs are more difficult to detect than solid nodules; approximately 70 % of lung cancers missed by radiologists during low-dose CT screening were GGNs [17].

Although a GGN is a nonspecific finding that may occur in benign disease, for example focal inflammation, it may be attributable to such malignant lesions as adenocarcinoma in situ (AIS) or minimally invasive adenocarcinoma (MIA). Therefore, reliable detection of GGNs is a critical issue in lung cancer screening with low-dose CT [8].

To increase the objectivity of this study, eight radiologists from five institutes were asked to participate in a radiologist-performance test to investigate the detectability of the simulated GGNs; the test was performed at the same time and under the same conditions and entailed interpretation of images with the same viewer (RapideyeCore; Toshiba Medical Systems). The time allowed for interpretation of the images was not limited.

The images scanned under the 18 different acquisition conditions were reconstructed by use of the FBP and AIDR 3D methods and displayed in a sequence determined by use of a random number table. For quantitative analysis, the visual scores of the eight radiologists who independently interpreted the reconstructed CT images by the conventional FBP method and by the AIDR 3D method were evaluated. The visual scores were defined by use of a five-point scale.

With either 16-row or 64-row MDCT, CT images reconstructed by use of the AIDR 3D method resulted greater detectability of simulated GGNs at the same total effective dose. The simulated GGNs in the upper lung field seemed difficult to identify on reconstructed CT images by the FBP and AIDR 3D methods. It was believed that a bone artefact caused by the ribs near the lung apex interfered with detection. The difference between detectability of GGNs by the FBP and AIDR 3D methods became larger with smaller size and in the upper lung field. When 64-row MDCT was used, simulated GGNs had significantly higher visual scores with the AIDR 3D method than with the FBP method, irrespective of size and location (Tables 3, 4, 5, and 6). To evaluate overall detectability, mean visual scores for all the simulated GGNs with diameters of 6, 8, and 10 mm were compared between the FBP and AIDR 3D methods for reconstructing CT images at the minimum, medium, and maximum total effective dose (mSv) (Table 7). With both 16 and 64-row MDCT, the mean visual scores for simulated GGNs were higher with the AIDR 3D method than with the FBP method. At the minimum total effective dose, the mean visual scores with the FBP method were less than 2, which means that the radiologists could not detect the GGNs. However, with the AIDR 3D method, the scores were more than 2, which means that the GGNs could be detected. For the medium and maximum total effective dose, the mean visual scores with both FBP and AIDR 3D methods were higher than 2.

With 16-row MDCT, significant differences in the mean visual scores between the FBP and AIDR 3D methods were observed at some conditions. However, with 64-row MDCT, they were observed at any conditions.

These results suggested that when 64-row MDCT is used detectability is better for the AIDR 3D method than for the FBP method, under any conditions; this was also true for 16-row MDCT under some conditions.

One limitation of this study is that it was an experimental phantom study. In clinical lung cancer screening, features that cannot be simulated in a phantom study include:

  1. 1.

    the variety of nodular shape and density;

  2. 2.

    the condition of the background lung parenchyma;

  3. 3.

    breath holding; and

  4. 4.

    motion artefact arising as a result of heart beats, etc.

Further clinical studies are needed to determine CT acquisition conditions suitable for AIDR 3D image reconstruction for lung cancer screening [18].

Conclusion

The results of this experimental CT study using a chest phantom showed that the IR method contributed to improving the detectability of simulated pure GGNs.