Introduction

Accurate prediction of lateral pelvic lymph node (LPN) metastasis has been regarded as a critical issue for a long time with regard to rectal cancer treatment. The treatment strategy for rectal cancer significantly depends on the presence of LPN metastasis [1, 2]. Although total mesorectal excision (TME) plays a central role in the treatment strategy for locally advanced low rectal cancer, TME alone might be insufficient to prevent local recurrence [3,4,5,6]. In Japan and some other countries, lateral pelvic lymph node dissection (LPND) is often performed in addition to TME for locally advanced low rectal cancer to reduce local recurrence, according to the Japanese Society for Cancer of the Colon and Rectum guidelines [7, 8]. In contrast, neoadjuvant therapies, including radiotherapy and chemotherapy, are considered essential for the treatment of advanced low rectal cancer in Western countries [9,10,11]. However, in both Western and Asian countries, it is beneficial to predict LPN metastasis before initiating rectal cancer treatment. In addition, several risk factors for LPN metastasis, such as the short axis and shape of LPN metastasis, have been reported [11,12,13,14]. Although there are some predictive models for LPN metastasis [12,13,14,15,16], they have not been fully validated. There is currently no verified tool for predicting LPN metastasis in rectal cancer that can be easily used in daily practice. A nomogram is a graphical model, and its effectiveness in the treatment of colorectal cancer has been widely reported [17,18,19]. Therefore, this study aimed to construct a nomogram to predict LPN metastasis based on clinical and magnetic resonance imaging (MRI) findings.

Patients and methods

We conducted and described this report in accordance with the TRIPOD reporting guidelines [20].

Study setting and patients

The Japan Society of Laparoscopic Colorectal Surgery collected the clinical data of 1500 patients with clinical stage II–III low rectal cancer. These patients underwent open or laparoscopic surgery at 69 institutes participating in the Japan Society of Laparoscopic Colorectal Surgery between January 2010 and December 2011. The primary study reported short-term and mid-term outcomes [21]. The indications for neoadjuvant treatment were determined at each institution’s discretion. In addition to the data of the primary study, we collected long-term prognosis data and MRI findings. We used these data as training data after excluding the data from Kyoto University Hospital.

Eligibility

The inclusion criteria for this study were as follows: (1) clinical stage II–III low rectal cancer (tumor located below the peritoneal reflection), (2) preoperative MRI conducted (MRI after neoadjuvant therapies in patients undergoing neoadjuvant therapies), and (3) surgery with curative intent. In contrast, the exclusion criteria were as follows: (1) multiple primary cancers, (2) a history of treatment of other pelvic malignancies, and (3) patients who did not experience LPN metastasis but whose observation periods were insufficient (< 3 years). The peritoneal reflection was identified by barium or gastrografin enema and/or MRI.

We used the patients’ data from the Kyoto University Hospital from June 2005 to December 2016 as validation datasets. Eligibility criteria were the same as those in the training dataset.

Definition of potential lateral lymph node metastasis

We set potential LPN metastasis as an outcome, which was defined as a composite event of pathological LPN metastasis in patients with LPND and postoperative recurrence in lateral regions diagnosed by image inspections regardless of LPND. There are several cases of lateral recurrence due to omission of LPND, and treating only positive pathology as an outcome for patients with LPND might be an underestimation. Therefore, we adopted a composite outcome by including patients who did not undergo LPND.

Risk factors for potential lateral pelvic lymph node metastasis

We analyzed the following risk factors: age, sex, carcinoembryonic antigen (CEA), tumor location (distance from the anal verge), and preoperative MRI findings. The MRI findings included tumor depth (clinical T stage), tumor size, circumferential resection margin (CRM), extramural vascular invasion (EMVI), and the short and long axes of the LPN and perirectal lymph node (PRLN). Since this was a retrospective study, MRI imaging conditions varied by institution and time period. MRI for the training dataset was centrally assessed by six radiologists who established a consensus about MRI assessment before the start of the study. MRI for the validation dataset was assessed by surgeons of Kyoto University Hospital. CRM and EMVI were categorized into two groups according to the Mercury study [22] and the study by Smith et al. [23], respectively. Continuous variables were categorized for practicality. For example, short axis of LPN was categorized as < 3 mm, 3–5 mm, 5–7 mm, 7–9 mm, and ≥ 9 mm. CRM was defined as positive if the distance from the tumor to the rectal fascia or puborectalis was < 1 mm or the tumor extended beyond the internal anal sphincter [22]. EMVI was defined as positive if the MRI-EMVI score was 3 or 4 and negative if the MRI-EMVI score was 0–2 [23].

Statistical analysis

A logistic regression model was used to identify the risk factors for potential LPN metastasis. Risk factors with p < 0.20 in the univariable analysis were included in the multivariable analysis. We sought to identify the best combination of risk factors based on the clinical significance, and results of the stepwise backward elimination method referred to information criteria, such as the Akaike Information Criterion and Bayesian information criterion [24]. The Akaike Information Criterion penalizes the complexity of the model by decreasing the p value threshold at which variables are dropped from the model in proportion to the number of variables selected [24]. The Bayesian information criterion confers a larger penalty for models with several parameters. A nomogram was constructed using the results of the multivariable logistic regression model.

Assessment of nomogram performance

The nomogram performance was evaluated in terms of its discrimination and calibration [20]. Discrimination was the ability to distinguish between patients with events from those without events and was evaluated by area under the curve (AUC) after plotting the receiver operating characteristic curve (ROC). Calibration referred to how well the predicted probabilities from the model were in agreement with the observed outcome frequencies in the data, and it was assessed based on the calibration plot [24].

Statistical analysis was performed using R software, version 3.6.1 [25]. Nomogram construction and validation were performed using the “rms” package. The ROC curve was plotted using “pROC” packages, and the calibration plot was plotted using “ggplot2” packages.

Results

Patients’ characteristics

Of the 1500 patients enrolled in the primary study, preoperative MRI was available for 752 patients. Forty-seven of these patients were excluded (14 were at Kyoto University Hospital, 18 had insufficient observation periods, and 15 had missing data). Subsequently, 705 patients from 49 institutes were included in this study. The flow diagram of patient selection in the training dataset is shown in Fig. 1.

Fig. 1
figure 1

Diagram of patient enrollment in the training dataset

Clinical characteristics of the patients enrolled in the training dataset are shown in Table 1. Among 409 patients who underwent LPND, 84 patients were diagnosed with potential LPN metastasis, including 64 patients with LPN metastasis in the pathological assessment after surgery and 20 patients experiencing long-term lateral recurrence. In addition, of 296 patients without LPND, 15 laterally recurrent cases were diagnosed as potential LPN metastasis; thus, 99 patients had a diagnosis of potential LPN metastasis. A summary of the patients’ characteristics that were excluded from the analyses is shown in Table S1.

Table 1 Patient characteristics in the training and validation datasets

In the validation dataset, 108 patients with rectal cancer were identified. Of the 108 patients, 4 without preoperative MRI data and 4 other non-recurrent cases without sufficient treatment period were excluded. Subsequently, 100 patients were enrolled in the validation dataset. The flow diagram of patient selection in the validation dataset is shown in Fig. S1. The characteristics of patients enrolled in the validation dataset are shown in Table 1 and Table S1.

Nomogram development

In the univariable logistic regression analysis, age, sex, CEA, tumor location, clinical T stage, tumor size, CRM, EMVI, and short and long axes of LPN and PRLN were found to be risk factors for potential LPN metastasis. We identified that a combination of the short axis of LPN, tumor location, EMVI, and short axis of PRLN was the most effective in predicting potential LPN metastasis based on the results of multivariable logistic regression analysis with a backward stepwise method (Table 2). We constructed a nomogram using these factors (Fig. 2).

Table 2 Risk factors for potential lateral pelvic lymph node metastasis in low rectal cancer
Fig. 2
figure 2

Nomogram for the preoperative prediction of potential lateral pelvic lymph node metastasis

The ROC curve is shown in Fig. 3. The AUC of this nomogram was 0.74, positive predictive value was 25.4%, and negative predictive value was 93.4% (Table 3). Calibration plots are shown in Fig. 4, and the nomogram was moderately calibrated.

Fig. 3
figure 3

Receiver operating characteristic curve of the nomogram in the training dataset

Table 3 Performance of the prediction model
Fig. 4
figure 4

Calibration plot in the training dataset

Nomogram validation

The ROC curve in the validation dataset is shown in Fig. S2. The AUC of the nomogram was 0.71, positive predictive value was 25.0%, and negative predictive value was 94.7% (Table 3). Calibration plots in the validation dataset are shown in Fig. S3, and the nomogram was moderately calibrated.

Discussion

In this study, the combination of the short axis of LPN, tumor location, EMVI, and the short axis of PRLN was identified to be most effective in predicting potential LPN metastasis in rectal cancer, and a nomogram was constructed using these factors. The nomogram had a high discrimination ability (AUC, 0.74) and was moderately calibrated. In addition, the nomogram was well validated in an external cohort (AUC, 0.71, moderate calibration). Evidently, neoadjuvant treatment might have had a significant impact on the diagnosis of potential LPN. We analyzed neoadjuvant treatment as a risk factor; however, it was not significant. Similarly, we created separate nomograms for those with and without neoadjuvant treatment using the same items; however, a minimal significant difference was observed (data not shown). Using this nomogram, we can determine the indications for LPND based on detailed risks rather than relying solely on size. Even if we determined not to perform LPND, screening examination intervals after surgery could be shortened based on the risk.

For an adequate treatment strategy, LPN metastasis should be diagnosed before initiating treatment. Several risk factors for LPN metastasis have been reported, such as LPN diameter, shape, and internal structures [12,13,14, 26, 27]. In particular, the short axis of LPN in MRI was reported to be the most crucial factor for predicting LPN metastasis, regardless of neoadjuvant therapies [13, 14, 26, 27]. However, the optimal cutoff value of the short axis of LPN for LPN metastasis was not sufficient to reach definitive conclusions. In addition, the diagnostic performance of MRI alone was reported to be limited [28]. Moreover, the predictive ability based on a single risk factor similarly had a limited utility. In this study, we used four risk factors, namely, the short axis of LPN, tumor location, EMVI, and short axis of PRLN, in the nomogram; in addition, the AUC of these risk factors was 0.67, 0.61, 0.57, and 0.55, respectively, all of which were lower than the AUC of the nomogram (0.74) that was constructed by the combination of these factors.

Matsuoka et al. [12] have used multiple LPN features, such as long axis, shape, and internal structure of LPN in MRI, to better predict LPN metastasis, with an AUC of 0.71. Nakanishi et al. [16] have reported that a radiomics-based prediction model was useful for predicting LPN metastasis (AUC, 0.83). Although the performance of the models was similarly high, they used data of a small number of patients from a single institution and mainly focused on imaging findings. In addition, subjective measures or special mechanics, such as LPN appearance and radiomics, were used [15, 16]. In contrast, we considered that clinicopathological factors were associated with LPN metastasis and LPN imaging. In addition, it was necessary to construct a predictive model using a large cohort for high generalizability. Thus, we considered it critical to create a predictive model using both clinicopathological factors and imaging findings by collecting data from nationwide multi-institutions.

To the best of our knowledge, this is the first study to develop a nomogram that predicts potential LPN metastasis in patients with low rectal cancer based on both clinicopathological factors and imaging findings. Moreover, this nomogram was internally and externally validated using different datasets. The accurate prediction of potential LPN metastasis is beneficial in developing an adequate treatment strategy and avoid unnecessary indications for invasive treatments, such as LPND and neoadjuvant chemoradiotherapy. Both treatments may reduce local recurrence; however, they could worsen short-term and long-term outcomes such as prolonged treatment time and decline in sexual and urinary functions [29,30,31].

The strength of this study lies in its use of data from institutions nationwide specialized in the treatment of rectal cancer in Japan. We included only cases with sufficient observation periods, and the only common variables across institutions where rectal cancer surgery was performed were used in this nomogram. The nomogram was validated using external data from the training dataset. Thus, we considered that this nomogram could be used widely and stably in daily clinical practice. However, this study had several limitations. Not all MRI scans were assessed by multiple radiologists. Furthermore, several patients were excluded because of missing data or insufficient observation periods. Additionally, the present study was a retrospective study, wherein characteristics of selection bias and incomplete data are inherent. Therefore, further research on prospective validation and controlled trial studies are required. These limitations could influence the prediction performance of the nomogram.

Conclusions

We constructed a nomogram to accurately predict potential LPN metastasis using the short axis of LPN, tumor location, EMVI, and the short axis of PRLN. The performance of this nomogram was high, and it was well validated.