Introduction

Intratumoral heterogeneity plays a crucial role in the proliferation, vascular supply, and metabolism of various solid malignancies [1]. Such biological characteristics are vital because they are often associated with aggressiveness or sensitivity to a specific therapy. Currently, 18F-fluorodeoxyglucose (FDG) positron emission tomography/computed tomography (PET/CT) is widely used for staging and monitoring various cancers. The uptake heterogeneity observed within tumors assayed through FDG PET is increasingly believed to be of prognostic value [2, 3]. In addition, the measurement of textural indices from tumor PET images has recently been proposed as an adjunct to predict tumor responses to a therapy [413]. Compared with the maximum standard uptake value (SUVmax), metabolic tumor volume (MTV), and total lesion glycolysis (TLG) alone, the uptake distribution within a tumor can theoretically provide more insight into the underlying tumor biology. On the basis of this assumption, researchers have studied 18F-FDG uptake heterogeneity in various tumors by computing textural indices in baseline PET images.

Clinical stage and lymph node (LN) metastasis are major determinants of both disease-free and overall survival in patients with cervical carcinoma [14, 15]. Studies have indicated that the uptake heterogeneity observed within tumors on FDG PET may be of prognostic value [911]; however, to date, few studies have reported an association between intratumoral FDG uptake heterogeneity and clinical parameters in this cancer setting. Theoretically, textural features associated with adverse clinical parameters may play a role in treatment outcomes. We hypothesized that textural features derived from the FDG uptake heterogeneity of cervical tumors may correlate with LN involvement, clinical stage, and histological types. Particularly, despite PET/CT having a high sensitivity and specificity for detecting LN metastasis [1517], such an association has not been reported. To provide a new avenue for studying the underlying mechanisms of textural features, more information is required on the association of underlying biological characteristics and clinical parameters with comprehensive textural features.

Materials and Methods

Study population

In this retrospective cohort study, we enrolled 170 patients (median age, 55 years) newly diagnosed as having cervical cancer at China Medical University Hospital between July 2009 and December 2015. All the patients had undergone pretreatment (18F-FDG PET/CT) for radiotherapy planning or pretreatment staging. Patients with a history of diabetes were excluded, and all the patients had normal serum glucose levels before undergoing PET/CT. The minimum size of the eligible criteria for the primary tumors was maximal diameter ≧2 cm in computed tomogram. Only primary tumors were considered because a reliable textural analysis of small lesions is difficult owing to the small number of voxels involved. This study was approved by a local institutional review board [DMR99-IRB-010(CR6)]. We performed tumor staging according to the International Federation of Gynecology and Obstetrics (FIGO) and observed that 41, 91, 35, and 3 patients had stage I, II, III, and IVA cancers, respectively. PET/CT has a high sensitivity and specificity for detecting the nodal status in cervical cancer [1517]; therefore, it was performed for diagnosing pelvic LNs (PLNs) or para-aortic LN (PALN) metastasis [16]. Briefly, experienced nuclear medicine physicians performed a visual analysis by using a five-point scoring system (0, normal; 1, probably normal; 2, equivocal; 3, probably abnormal; and 4, definitely abnormal), which had been routinely used in interpreting FDG PET/CT images in our institution. We used dichotomous variables in this study: a score 3 or 4 was considered to have LN metastasis, and score 0, 1, or 2 was regarded as absence of metastasis. Accordingly, we identified ≥1 PLNs and ≥1 PALNs in 84 and 25 patients, respectively. Table 1 lists the patient characteristics.

Table 1. Patient characteristics (n = 170)

Training and validation cohorts

Because the large number of patients had PLN metastasis, we divided the 170 patients into two cohorts (85 for training, 85 for validation) to confirm the reliability of predictors for PLN metastasis. The first cohort came from patient lists between July 2009 and June 2013, whereas the validation cohort consisted of patients with pretreatment PET/CT after June 2013. Basically, the two cohorts received the same protocol of PET/CT imaging and interpretation which would be mentioned below. The presence of ≥1 PLN metastasis was identified in 44 and 40 patients for the training and validation cohorts, respectively.

On the other hand, owing to the limited number of patients with PALN metastasis or nonsquamous histology, the split of the patients into two cohorts was not employed in the same way when analyzing PALN metastasis or histological types.

PET/CT imaging

All patients were scanned using a PET/CT scanner (PET/CT: 16 slices; Discovery STE; GE Medical System, Milwaukee, WI, USA). The patients were requested to fast for at least 4 hours before undergoing FDG PET/CT, and the technique was conducted approximately 60 min after the administration of 370 MBq of 18F-FDG. Thus, FDG uptake was determined by calculating the SUV. The procedure was previously described [18].

Delineation of the metabolic tumor volume of cervical tumors

The MTV of a cervical tumor was delineated by using the region growing and excluding the bladder if it was connected. In the PET volume, the voxel was defined as a local maximum if its SUV was not less than those of all neighbors. The SUVmax identified by the user indicated the largest uptake among all local maxima within a cervical tumor, and was used as the initial seed point for a region growing procedure. The region growing procedure delineates the MTV as all voxels connected to SUVmax that have an SUV not less than a specified fraction of SUVmax. The threshold used in this study was 50% of SUVmax. Although it is debatable to limit the inclusion of nonpathological adjacent organs in the heterogeneity analysis, the decision of the spatial extent of a MTV delineated by a fixed threshold at 50% of SUVmax was a trade-off between the adequate separation and the sufficient intratumoral uptakes. If the bladder was not included, the formed region was defined as the MTV of the cervical tumor. Otherwise, the SUVmax of the bladder was identified and created a new region by using the same aforementioned criteria. The new region was partitioned into several spherical objects by employing watershed transform technique [19]. The objects belonging to the bladder were manually identified and excluded from the region, whereas the remaining areas were integrated and defined as the MTV of the cervical tumor. The TLGmean was calculated by multiplying the SUVmean of the cervical tumor by the MTV.

Calculation of textural indices

Discretization of the SUVs within the previously delineated MTV was the fundamental for the calculation of textural indices. Two types of the discretization method were employed to divide the SUV range within the MTV into a fixed number of bins or into a fixed bin size. The discretization method using a fixed number of bins divided the SUV range into 4, 8, 16, 32, 48, 64, 80, 96, 112, or 128 bins, respectively. The SUV range was resampled by a width of 0.01, 0.025, 0.05, 0.075, 0.1, 0.25, 0.5, 0.75, or 1 g/ml when the discretization method using a fixed bin size. For all quantization results acquired from each parameter of a discretization method, we calculated four matrices to evaluate the heterogeneity, namely the gray-level co-occurrence matrix (GLCM) [20], neighboring gray-level dependence matrix (NGLDM) [21], gray-level run length matrix (GLRLM) [22], and gray-level size zone matrix (GLSZM) [23]. That is, a total of 76 matrices (10*4 + 9*4) were calculated. Moreover, 18-connectivity is used to describe the relationship between any two adjacent voxels. In the GLCM and GLRLM, the neighbor of each voxel (x, y, and z) within a cervical tumor is defined to be the located within the offset (0, 1, and 0). For every parameter of a discretization method, a total of 33 texture indices listed in Appendix 1 were derived from the four matrices [2023].

Statistical analysis

In this study, PET/CT was performed to correlate various textural indices with clinical parameters to determine the optimal indices for predicting LN metastasis and differentiating clinical stages and histological types. Receiver operating characteristic (ROC) curves were constructed to evaluate the optimal predictive performance among the SUVmax, MTV, and TLGs, and four groups of textural indices. In addition, serum tumor markers, namely squamous cell carcinoma antigen (SCC-Ag) and carcinoembryonic antigen (CEA), were analyzed. Quantitative differences in all variables between LN groups and histological types were determined using the Mann–Whitney U test. Kruskal–Wallis and post-hoc tests were used to examine the correlation between tumor stages and textural indices. In addition, we used multivariate logistic regression analysis to determine the independent factors among all the variables for predicting LN metastasis. Two-tailed tests were used, and P < 0.05 was considered statistically significant. All calculations were performed using SPSS 13.0 for Windows (SPSS Inc., Chicago, IL, USA).

Results

Selection of the discretization method and parameter setting

The selection of the discretization method and its parameter setting was determined by the performance of the indices in predicting PLN metastasis. In the process of the discretization, the SUV range within the MTV was quantized by a specific parameter and then used to derive textural features. Then, the advantage of a discretization method was evaluated by the performances of the derived textural indices by the ROC curves in terms of PLN metastasis. For the textural indices obtained from all parameters per discretization method, all acquired areas were averaged as listed in Appendix 1. In this study, the area under the ROC curve was regarded as significant if the value was greater than 0.7 or less than 0.3. There were 4, 4, 5, and 2 textural indices with significant performance when a fixed number of bins was used in the discretization method. In contrast, 3, 1, 1, and 1 indices were significant if a fixed bin size was adopted in the discretization method. Obviously, the indices derived from the discretization method using a fixed number of bins demonstrated a better performance than those using a fixed bin size. Moreover, the areas of the ROC curves originated from the discretization method using four bins were generally greater than those using the other number of bins. As a result, all textural indices presented in this study were derived from the same quantized results of the discretization method.

Predictive value of textural indices for PLN metastasis

We retrieved the four groups of textural indices and various thresholds of classical MTVs and TLGs. As summarized in Table 2, the ROC curves indicated that among several indices in the four groups of textural features, conventional PET/CT parameters and serum SCC-Ag levels were predictors of PLN metastasis in both cohorts. Numerous indices across the groups were positively associated with PLN metastasis, whereas some showed a negative correlation. All the statistically significant indices were subjected to multivariate logistic regression analysis. In the training cohort, the homogeneity index derived from the GLCM [P < 0.00001, odds ratio (OR) = 648938.3, 95% confidence interval (CI) = 634.7–6635245.2] was the sole independent predictor of PLN metastasis. As listed in Table 2, the predictive parameters identified through ROC curves and logistic regression analysis in the training cohort were tested in the validation cohort. The results revealed that the homogeneity index from the GLCM (P = 0.001, OR = 636572.4, 95% CI = 178.3–2.27E+009) remained the single predictor of PLN metastasis, whereas increased SUVmax showed a marginal impact (P = 0.05, OR = 1.09, 95% CI = 0.999–1.19) in the validation cohort. Although the SUVmax did not serve as a consistent predictor between the cohorts, this parameter was an independent factor of PLN metastasis when lumping the whole data together (P = 0.01, OR = 1.08, 95% CI = 1.02–1.15). Figure 1 presents the quantitative differences in the SUVmax and homogeneity values between the PLN-positive and PLN-negative groups. The mean SUVmax for PLN-positive and PLN-negative patients was 12.3 ± 5.7 and 10.4 ± 5.8, respectively, whereas the mean homogeneity values for the two groups were 0.64 ± 0.08 and 0.57 ± 0.07, respectively. Figure 2 shows the scatterplot between these groups. The homogeneity index seems to be a more accurate predictor of PLN metastasis than does the SUVmax.

Table 2. Predictive textural indices for pelvic lymph node metastasis and the area under the ROC curve in training and validation cohorts
Fig. 1
figure 1

Quantitative differences in the SUVmax values in all patients (1A), homogeneity in training cohort (1B), and homogeneity in validation cohort (1C) of the primary tumors between the pelvic lymph node (PLN)-positive and PLN-negative patients.

Fig. 2
figure 2

Scatterplot of the SUVmax and homogeneity of the primary tumors according to pelvic lymph node (PLN) metastasis status.

Risk stratification of PLN metastasis according to adverse imaging features

Because the use of SUVmax has become a standard component of diagnosis and staging in oncology, we postulated a risk stratification of PLN metastasis according to the median values of SUVmax and homogeneity index. The patients were classified into three risk groups. Group A comprised 43 patients with less than 50% of both SUVmax and homogeneity (median SUVmax and homogeneity: 10.4 and 0.61, respectively). Group B comprised 84 patients with one of the 2 factors, and group C included 43 patients with ≥50% of both risk factors. Accordingly, the estimated risk of PLN metastasis was 23%, 50%, 74% in groups A, B, and C, respectively.

Predictive value of textural indices for PALN metastasis

Table 3 shows that several textural features among the four groups, MTV, TLG, and serum SCC-Ag levels were related to PALN metastasis. Multivariate logistic regression analysis revealed the TLGmean (P = 0.001, OR = 1.004, 95% CI = 1.001–1.006) as the sole independent feature of PALN metastasis. Figure 3 shows the quantitative differences in the TLGmean values between the PALN-positive and PALN-negative groups. The TLGmean for PALN-positive and PALN-negative patients was 531.1 ± 12.8 and 199.4 ± 4.7, respectively. The risk of PALN in patients with <50 and ≥50 percentile of TLG values was 6% and 24%, respectively.

Table 3. Predictive textural indices for para-aortic lymph node metastasis and the area under the ROC curve in whole patients (n = 170).
Fig. 3
figure 3

Quantitative differences in the TLGmean values of the primary tumors between the para-aortic lymph node (PALN)-positive and PALN-negative patients.

Correlation of textural indices with FIGO stage

When the patients were classified into three groups according to the FIGO stage (I, II, and III–IVA), serum SCC-Ag, conventional PET-related parameters, and numerous indices revealed the differences between stage I and III–IVA as well as between stage II and III–IVA (Table 4). However, the imaging variables did not differ significantly between stages I and II. On dichotomizing the patients into stage I, II, and III–IVA, most indices showed a diverging trend (Appendix 2).

Table 4. Kruskal–Wallis and Mann–Whitney tests comparing the association between FIGO stages and textural features

Association of textural indices for histological types

The tumor histology was dichotomized into squamous cell carcinoma (SCC; n = 138) and nonsquamous cell carcinoma (NSCC; n = 32). NSCCs are a histological type of adenocarcinoma or adenosquamous carcinoma. The extracted MTV did not significantly vary between the two groups. The MTVmean was 31.9 ± 32.2 and 39.8 ± 51.6 mL for the SCC and NSCC groups, respectively (P = 0.67 using a Mann–Whitney U test). As summarized in Table 5, short-zone emphasis (SZE) derived from the GLSZM was the sole textural index associated with quantitative differences between the two groups. The mean SZE for SCC and NSCC was 0.30 ± 0.15 and 0.38 ± 0.15, respectively (Fig. 4).

Table 5. Predictive textural indices in differentiating histological types and the area under the ROC curve
Fig. 4
figure 4

Quantitative differences in the values of indices between squamous cell carcinoma (SCC; n = 138) and nonsquamous cell carcinoma (NSCC; n = 32)

Discussion

An advanced FIGO stage and LN metastases have prognostic value in cervical cancer; however, the association between these clinical parameters and intratumoral FDG PET uptake or underlying textural features remains unclear. In this study, textural indices were used to provide a novel insight on the effects of FDG uptake heterogeneity on the outcomes in some patients with cancer who exhibit nodal involvement or advanced stage. In addition, our study is the first to indicate that SZE from the GLSZM are associated with tumors of NSCC. Furthermore, we observed that advanced-stage tumors are associated with quantitative differences in several textural features. Although our findings must be validated, the novel discovery of this study can be applied to and tested for other cancers.

In this study, comprehensive matrix indices [13] were defined to minimize disparities among studies and avoid neglecting useful indices. In addition, bin number and interval setting were varied to quantize the SUVs within the MTV of cervical tumors. Subsequently, the texture indices were calculated under different quantization methods and parameters. The final selection of the quantization method and parameter is determined by the performance in differentiating LN metastasis. Although we found that the best predictive values could be achieved when the SUVs within the MTV of a cervical tumor were quantized into four bins, an additional approach is required to clarify the effects of the different setting of quantization methods and parameters to maximize the clinical importance of texture analysis.

Textural quantification may provide valuable complementary information compared with directly visualizing the retroperitoneal LN through FDG PET alone. Theoretically, a combination of textural parameters can be assumed have a higher correlation with a specific underlying physiological process compared to a single feature [4]. On the basis of our study, with a combination of SUVmax and homogeneity index, the pelvic lymphatics of patients with high-risk features, such as group B and C patients, should be more aggressively treated. For example, the irradiation field or dose can be adjusted or optimized.

In a retrospective study conducted by Brooks et al. [24], FDG PET images of 47 PLN-negative and 38 PLN-positive patients with FIGO stage IIb SCCs were analyzed. They observed no statistically significant differences between the two groups for any metric or plausible combination and explained the results with two main reasons. First, the precise uptake heterogeneity observed in other tumor types may not be applicable to cervical carcinomas. Second, the patient population lacked variation. Consequently, the apparent FDG uptake heterogeneity is probably a statistical effect of the partial volume attenuation of markedly different tumor types, shapes, and sizes.

The clinical feasibility of textural indices depends on their repeatability and reproducibility. The major determinants included the delineation of the MTV [25], reconstruction protocols of PET images [25], discretization methods [2628], and PET images acquired at different times [29, 30]. In this study, the MTV delineation and the discretization methods played a major role. If the separation of the MTVs between the cervical tumor and the connected bladder was conducted manually by the user, the acquired MTV might be susceptible to a lack of reproducibility. Therefore, it is imperative to develop an automatic and standardized separation method to minimize this limitation. Besides, the selection of discretization method would affect the quantization results and transform the performance of the textures [2628]. Because this study aimed to find the ability of textural indices in predicting PLN metastasis, we compared the performance of the two discretization methods to circumvent the limitation. In addition, the standard deviation for the areas under the ROC curves could be used to assess the reproducibility. For example, the standard deviation of the homogeneity in predicting PLN metastasis was 0.017. The low values might imply a high reproducibility of the index.

In this study, when the SUVs of the MTV were quantized into 4, 8, 16, 32, 48, 64, 80, 96, 112, and 128 bins, the areas under the ROC curves for the homogeneity in detecting PLN metastasis were 0.751, 0.747, 0.739, 0.725, 0.721, 0.714, 0.722, 0.715, 0.704, and 0.703, respectively, which evidenced that the homogeneity is a reliable parameter for predicting PLN metastasis. It would be interesting that an increased homogeneity value within tumors is linked to higher risk of PLN metastasis. Pearson’s correlation coefficient analysis revealed that homogeneity is not correlated with the SUVmax (P = 0.45), but it is highly correlated with the MTV and TLGmean (P < 0.0001, R = 0.57 and P < 0.0001, R = 0.56, respectively). Therefore, we assumed that an increased tumor burden is directly associated with an elevated LN metastasis risk. Appendix 3 shows that tumors with either a large MTV and high homogeneity or a high TLGmean and homogeneity increased the risk of PLN metastasis. MTVs were presumed to be a key factor affecting the homogeneity because they were also used for calculating the TLGmean. Notably, the homogeneity should be meticulously measured because the MTV would be overestimated if the MTVs are associated with the bladder. Because the uterus is cylindrical with a bottleneck, the cavity within the uterus may simultaneously increase the MTV and homogeneity. This is a plausible reason for the positive correlation between the two factors. Contrastingly, the TLGmean (MTV × SUVmean) may invalidate this assumption. In case of a cavity within the uterus, a decreased SUVmean would lead to a reduced TLGmean. Altogether, this suggests that the calculation of the homogeneity is highly dependent on the extracted MTV, regardless of the existence of cavities.

Tixer et al. [4] conducted a textural study investigating the useful indices for predicting responses to chemoradiotherapy in esophageal cancer; local homogeneity and entropy yielded the most favorable results in the GLCM features to differentiate the treatment responses. In their study, increased homogeneity values were associated with poor responders. Altogether, the biological mechanisms underlying the homogeneity must be examined in future studies, in which treatment outcomes are a study endpoint. In addition, correlation studies between gene expression profiling and the homogeneity index are required.

Although most cervical cancers are related to a latent infection of human papilloma virus, studies have suggested significant changes in the differential expression between SCC and NSCC [31, 32]. To date, few studies have reported the imaging diversities caused by different pathological types. This study is the first to evidence the differences in SZE values between SCC and NSCC, probably because of the disparities in tissue type and density. Although a quantitative association was observed, the molecular mechanisms underlying the change in textural features between the two factors must be investigated. Future studies should integrate gene expression profiling into textural analysis to elucidate the essential biological implications for SZE in tumor tissue engineering. In addition, sequential dynamic changes of SZE after treatment might play a role in determining the outcome and should be more extensively studied.

The findings of this study should be interpreted cautiously because they represent a retrospective study design in a single institute. External validation studies that use a multi-institutional data set with imaging studies and a range of scanners, resolution settings, and reconstruction algorithms are necessary to confirm our findings. Conducting such studies is crucial because textural features can highly depend on reconstruction schemes and imaging parameters. In this setting, disparities between the performance of different PET scanners will become evident and must be discussed. In addition, regardless of the higher accuracy when visualizing the retroperitoneal LN through FDG PET, a correlation study using pathological specimens of patients with cervical cancer receiving PLN or PALN sampling is required to confirm the actual predictive role of imaging analysis. Furthermore, the use of a fixed threshold at 50% of SUVmax to define MTV can potentially lead to underestimation of functional volumes especially in cases of high heterogeneity of intratumoral uptake [33]. Although we found this approach could reduce the probability for the connection between the extracted MTV and the bladder, it will be imperative to compare the superiority and weakness between our method and advanced segmentation algorithms using a contour-based or clustered-based approach for different biological endpoints [30]. Finally, integrating gene expression profiling into the current texture analysis would be more explanatory because it would reveal the molecular pathway underlying intratumoral heterogeneity.

Conclusion

The findings of this study suggest that in patients with cervical cancer, PLN or PALN metastasis can be predicted by using textural feature of homogeneity derived from the GLCM and TLGmean, respectively. In addition, SZE from the GLSZM was the sole textural index associated with quantitative differences between SCC and NSCC. Additional external validation studies are required to confirm these findings.