Introduction

Approximately 10-40% of patients with solid tumors will develop brain metastases (BM) over their clinical course [1]. The most common primary tumors that metastasize to the brain are lung cancer, breast cancer, and melanoma, which account for 67–80% of BM [2]. Synchronous brain metastases (S-BM) and metachronous brain metastases(M-BM) are two distinct clinical conditions. M-BM refer to those that occur during a known underlying cancer disease and have undergone multimodal treatment such as radiotherapy, chemotherapy, immunotherapy, and targeted therapy. Compared with this, S-BM refer to the initial symptoms of previously unknown systemic spreading cancer diseases [3]. S-BM show lower overall survival [4, 5]. Previous studies on the BM of lung adenocarcinoma have also elucidated the differences in tumor microenvironment and mutation profiles between S-BM and M-BM [6]. In this project, we focus on the identification of risk factors in S-BM and the prediction of survival harboring these lesions.

The five-year survival rate is a clinical indicator used to reflect the severity and progression of a certain cancer and also commonly used to evaluate the treatment effect or certain risk factors for malignant tumors [7, 8]. However, the malignancy and progression rate of different tumors vary. This means that the evaluation indicators for different malignant tumors are distinct. Previous studies have also used a three-month survival rate to evaluate the risk factors for malignant tumors and provide some clinical evidence for determining short-term treatment plans for BM [9]. Currently, little is known about the early mortality rate and related risk factors in patients with S-BM. Therefore, identifying the risk factors for early death is of great significance for the prognosis evaluation and clinical treatment guidance of patients with S-BM.

This study focused on patients with S-BM of solid tumors in the SEER database to evaluate the incidence of overall survival rate and explore risk factors for 1-year early mortality. In addition, we developed a nomogram that includes some prognostic factors for predicting 1-year early mortality in S-BM patients.

Materials and methods

Data and study population

We retrospectively analyzed the public data registered in the SEER database from 2000 to 2019, covering 17 registration centers in different regions of the USA. In our study, data from malignant solid tumor patients with S-BM were extracted from the SEER database (2010–2017). The variables included sex, age at diagnosis, race, marital status at diagnosis, primary site, histological type, tumor laterality, T-stage, N-stage, whether or not bone metastasis, whether or not liver metastasis, whether or not lung metastasis, surgery records, chemotherapy records and radiation records. The primary endpoint was all-cause mortality within 12 months.

All patients with S-BM of solid tumor were selected according to the International Classification of Diseases for Oncology, Third Edition (ICD-O-3) standard: head and neck cancer: C0-C14,C76; thyroid: C73,9; lung: C34; breast: C50; colorectal: C18-C21; liver: C22; kidney: C64.9, C65.9; ovary: C56; uterus: C54; prostate: C61.9; melanoma: 872, 873, 874, 875, 876, 877, 878, 879; Other solid tumors. The exclusion criteria were as follows: (a) patients with missing or incomplete information about survival time, survival status, or other important characteristics; (b) preinvasive carcinoma and primary intracranial tumor; and (c) patients with a follow-up time less than 12 months.

Construction and evaluation of the nomogram

Pathological and clinical characteristics such as sex, age at diagnosis, race, marital status at diagnosis, primary site, histological type, tumor laterality, T-stage, N-stage, whether or not bone metastasis, whether or not liver metastasis, whether or not lung metastasis, surgery records, chemotherapy records and radiation records were used for analysis. We randomly selected 70% of the patients included from 2010 to 2017 as the training set and the remaining 30% as the validation set [10]. For the training set, we used univariate and multivariate logistic regression analyses to determine independent risk factors and calculate odds ratio (OR), and 95% confidence intervals (CI). Then, we established a nomogram for predicting the 1-year early mortality of S-BM using these identified independent risk factors.

We evaluated the discrimination of the nomogram by drawing the receiver operating characteristic (ROC) curve [11], drawing a calibration plot to evaluate the calibration of the nomogram, and drawing a decision curve analysis (DCA) curve [12] to evaluate the effectiveness of the nomogram. The value of the area under the ROC curve (AUC) greater than 0.7 indicates a reasonable evaluation. The calibration plot connects the value points of different groups with line segments to form a calibration line in turn and compares it with the standard line of y = x; the closer the calibration curve of the model to the standard line, the better the calibration ability of the model. The nomogram was evaluated by measuring the above curves both internally (training set) and externally (validation set) [13].

Statistical analysis

All statistical analyses were conducted by the R software version 4.3.2 (Institute for Statistics and Mathematics, Vienna, Austria; www.r-project.org). Classified data are described using frequency and percentage (N, %), and continuous data are represented by the mean ± standard deviation (MD ± SD). Early death, defined as death within 1 year after diagnosis, is the endpoint of interest in this study. A two-tailed P value less than 0.05 was considered statistically significant. We used the Kaplan-Meier (KM) method to draw survival curves.

Results

Patient characteristics

We primarily collected 52,665 S-BM of solid tumor patients from the SEER database from 2010 to 2017 (for longer follow-up time), excluding patients with missing or incomplete important characteristics, leaving 47,572 patients. Then, the KM curve was used to evaluate the overall survival rate of patients, and it was found that the overall survival curve of patients rapidly decreased within 1 year, while the survival rate of patients showed a slow downward trend after 1 year (Fig. 1). Therefore, predicting the 1-year survival rate of patients with S-BM has explicit clinical significance for patient management. Then, patients with a follow-up time of less than 12 months were excluded. Finally, 47,284 patients were included in the research (Fig. 2).

Fig. 1
figure 1

The overall survival curve of 47,572 S-BM patients from the SEER database (2010–2017). The red line is the deadline for one year

Fig. 2
figure 2

Flow diagram of the data selection process

The patients were divided into the training (N = 33,098) and validation (N = 14,186) sets at a ratio of 7:3 using the R function “createDataPartition” to ensure that outcome events were distributed randomly between the two sets [11]. There was no statistically significant difference in baseline data between the training and validation sets. The demographics and tumor characteristics of the patients are summarized in Table 1.

Table 1 Demographics and tumor characteristics of the patients with S-BM of solid tumors

Identifying independent factors

We used univariate and multivariate logistic regression analyses to identify risk factors associated with early mortality in patients with S-BM of the training set. The results of univariate and multivariate factor analyses are shown in Table 2.

In univariate analysis, all clinical and pathological features including age, sex, marital status, race, primary tumor location, primary tumor side, T-stage, N-stage, whether or not bone metastasis, whether or not liver metastasis, whether or not lung metastasis, whether or not surgery, whether or not radiotherapy and whether or not chemotherapy were associated with the likelihood of early death. The results of multivariate logistic regression analysis reveal that age (for each year of increase, OR = 1.029, 95% CI: 1.026–1.032, P < 0.001), other marital status (vs. married, OR = 1.272, 95% CI: 1.200-1.348, P < 0.001), primary tumor in the uterus (vs. head&neck, OR = 2.328, 95% CI: 1.082–4.914, P = 0.028), T-2 (vs. T-1, OR = 1.420, 95% CI: 1.292–1.560, P < 0.001), T-3 (vs. T-1, OR = 1.656, 95% CI: 1.499–1.829, P < 0.001), T-4 (vs. T-1, OR = 1.758, 95% CI: 1.596–1.937, P < 0.001), T Other (vs. T-1, OR = 1.543, 95% CI: 1.376–1.731, P < 0.001), N-1 (vs. N-0, OR = 1.272, 95% CI: 1.153–1.403, P < 0.001), N-2 (vs. N-0, OR = 1.542, 95% CI: 1.428–1.665, P < 0.001), N-3 (vs. N-0, OR = 1.654, 95% CI: 1.509–1.814, P < 0.001), N Other (vs. T-1, OR = 1.250, 95% CI: 1.104–1.415, P < 0.001), patients with synchronous bone metastasis (vs. none, OR = 1.229, 95% CI: 1.153–1.310, P < 0.001), synchronous liver metastasis (vs. none, OR = 2.008, 95% CI: 1.854–2.177, P < 0.001) and synchronous lung metastasis (vs. none, OR = 1.101, 95% CI: 1.027–1.181, P = 0.007) are all associated with a higher risk of early mortality. Contrary to above, the results including female (vs. male, OR = 0.691, 95% CI: 0.651–0.733, P < 0.001), API race (vs. NHW, OR = 0.441, 95% CI: 0.401–0.486, P < 0.001), other race (vs. NHW, OR = 0.775, 95% CI: 0.699–0.859, P < 0.001), primary tumor located in the prostate (vs. head&neck, OR = 0.079, 95% CI: 0.040–0.151, P < 0.001), primary tumor located in the breast (vs. head&neck, OR = 0.305, 95% CI: 0.163–0.544, P < 0.001), the primary tumor was melanoma (vs. head&neck, OR = 0.368, 95% CI: 0.195–0.661, P < 0.001), surgery (vs. none, OR = 0.365, 95% CI: 0.324–0.410, P < 0.001), radiation therapy (vs. none, OR = 0.695, 95% CI: 0.647–0.747, p < 0.001) and chemotherapy (vs. none, OR = 0.215, 95% CI: 0.201–0.231, P < 0.001) are associated with a lower risk of early mortality.

Table 2 The results of univariate and multivariate factor analysis of the training set

Nomogram construction and validation

We successfully constructed a nomogram (Fig. 3) to predict the probability of 1-year early mortality of S-BM patients based on the independent risk factors screened by the multivariate logistic regression in the training set. Each indicator has a specific value. By projecting the values onto the points scale in the first row, the corresponding scores for each indicator can be obtained. The total score is obtained by adding up the scores of all indicators. By calculating the sum of the scores assigned to each variable, the sum can correspond to the probability of early death.

Fig. 3
figure 3

The nomogram for predicting the probability of early death in patients with S-BM in the SEER-17 database from 2010 to 2017

In the training and validation sets, the AUC values for predicting S-BM in patients with solid tumors were 0.764 (95% CI: 0.758–0.769) and 0.761 (95% CI: 0.752–0.770), respectively, indicating that the nomogram has exact discriminative ability (Fig. 4a and b). The calibration curve of the nomogram shows that the predicted and observed early mortality rates are highly consistent (Fig. 5a and b). In addition, DCA analysis indicates that the model has excellent clinical application value (Fig. 6a and b).

Fig. 4
figure 4

ROC curves for the nomogram. (a) The ROC curve for the overall early death nomogram in the training set; (b) The ROC curve for the overall early death nomogram in the validation set

Fig. 5
figure 5

Calibration plots for the nomogram. (a) Calibration plot for the overall early death nomogram in the training set; (b) Calibration plot curve for the overall early death nomogram in the validation set

Fig. 6
figure 6

DCA curve for the nomogram. (a) DCA curve for the overall early death nomogram in the training set; (b) DCA curve for the overall early death nomogram in the validation set

Discussion

Brain metastases are the most common cause of malignant brain tumors in adults.

Of the nearly 1.5 million patients in the USA who received a primary diagnosis of cancer in 2007, approximately 70 000 of these primary diagnoses are estimated to eventually relapse in the brain [14, 15]. Patients with solid tumors present with either synchronous, latent, or metachronous brain metastases that exhibit different manifestations [16]. For the large and retrospective cohorts, undertaken in 17 registries in the USA, we found a 12-month early mortality rate of 76.4% in patients with S-BM. To our knowledge, this study represents the largest cohort of patients with a total of 47,284 patients, and it is the first study to use the SEER database to evaluate risk factors associated with early death in the entire population of patients with S-BM.

Most studies focus on the long-term survival of patients with brain metastases [17, 18], and there is a lack of research exploring the early mortality of these patients. Shen et al. investigated the risk factors for early mortality in patients with synchronous brain metastases from lung cancer. They found that the definition of early death varies from study to study and could be defined as 30 days to 3 months after diagnosis typically. In their study, death that occurred within 3 months was chosen as early death [9]. However, in our view, their setting lacks scientific validity. Cagney et al. studied 26,430 patients diagnosed with invasive solid malignant tumors originating outside of the central nervous system (CNS). Patients with prostate cancer (median survival 12.0 months) displayed the longest survival while that of pancreatic cancer patients (median survival period of 2 months) was the shortest. The median survival time for the most patients was between 4 and 10 months [19]. Therefore, setting early death as three months is not reasonable enough. Our K-M curve based on the population of brain tumor patients suggests that the overall survival curve of patients rapidly decreases near 1 year and tends to flatten after 1 year. Thus, setting early death at 12 months is more reasonable for patients with brain metastases, which is also convenient for comparison with other similar studies [20, 21].

The study of early death has been applied to advanced cancers in other systems and has shown important clinical significance. Kouka et al. found that patients who were male, were older, had a larger tumor diameter, had distant metastasis, and had oral, oropharyngeal, and hypopharyngeal tumors had a significantly higher 180-day mortality rate [22]. Li et al. analyzed data from 1140 patients in the SEER database to evaluate the overall survival and early mortality rates of patients with pheochromocytoma and paraganglioma. Simultaneously, logistic regression analysis was used to identify risk factors and construct a nomogram for early mortality [23]. Similarly, our study also employed logistic regression for risk factor analysis and constructed a nomogram for predicting the 1-year early mortality of synchronous brain metastases, which can be helpful for clinical practitioners in patient management. The Cox proportional risk model focuses on the survival time of patients, but when analyzing highly heterogeneous tumors such as brain metastases, there may be significant biases, which would lead to inaccurate predictions and other issues. Therefore, we adopted logistic regression analysis to identify the factors associated with early 1-year mortality, and that makes our analysis results more accurate and practical.

At present, the treatment of brain metastases usually includes a combination of surgery, radiation therapy, and/or systemic drug therapy, with the overall goal of selecting the best treatment plan for individual patients to maximize their quality of life and overall survival [24]. The SEER database does not provide information related to hormone therapy, targeted therapy, and immunotherapy for patients, which is an inherent limitation of the SEER database. However, the SEER database provides an overview of whether patients undergo surgery, radiation therapy, and chemotherapy after their initial diagnosis, and that had been confirmed in the research of Shen. Nevertheless, Shen et al. overlooked the impact of surgery, radiotherapy, and chemotherapy on prognosis and did not provide a reasonable explanation when conducting risk factor analysis and establishing predictive models. We have fully considered the impact of treatment on the prognosis of S-BM and establish a nomogram for predicting early death, which can better assist clinical management decisions and generate high clinical value. That provides a widely recognized method by researchers [25, 26]. The nomogram of our research showed good predictive ability and clinical applicability. Meanwhile internal validation of the nomogram showed good agreement between the predicted early deaths and the actual deaths. DCA curves showed that our nomogram has good clinical value and practicability in predicting the survival rate.

Apart from the limitations mentioned in the above chapters, this study has certain limitations. Due to the lack of data on recurrence, different treatment methods, and hormone status in the SEER database, we were unable to conduct research and calculate progression-free survival. In addition, the SEER database does not provide information on medical complications that are important for evaluating quality of life, mental health, and disease burden. Additionally, our preliminary results and predictive model should be externally validated.

Conclusion

We used the SEER database to evaluate the risk factors for 1-year early mortality in patients with S-BM, and found that the 1-year early mortality rate was 76.4%. Our data show a total of 13 covariates associated with 1-year early mortality. We constructed a validated nomogram using these covariates to effectively predict 1-year early mortality in patients with S-BM. This nomogram can help clinical workers screen high-risk patients to develop more reasonable treatment plans.