Background

Gastric cancer is highly prevalent worldwide [1], with South Korea reporting the highest number of incidences [2, 3]. In 2020, gastric cancer was ranked as the fourth most common cancer in South Korea, after thyroid, lung, and colorectal cancers [2]. Data from the Korean Statistical Information Service (KOSIS) revealed that gastric cancer accounted for approximately 10.8% of all cancers in South Korea, ranking second in incidence among men, with an age-standardized incidence rate of 76.2 per 100,000 individuals [2]. This rate was twice as high in men than in women (31.6%). Despite a gradual decline in the incidence rate of gastric cancer between 2010 and 2020, South Korea still has the highest incidence rate of gastric cancer among mens compared to 185 other countries [2, 3].

The increasing prevalence of cancer imposes a substantial economic burden on patients, families, health systems, and society as a whole [4,5,6]. A previous study conducted in 2014 on the economic burden of gastric cancer in patients reported a total cost of US $557,571 considering both direct and indirect expenses [7]. Other studies on cancer treatments have shown that the cost of cancer treatment is higher owing to repeated hospitalizations, numerous outpatient visits, laboratory services, advanced diagnostic tests, chemotherapy, expensive medications, surgeries, and radiotherapy, among other essential services [8, 9]. Consequently, there is a growing demand for more proactive approaches to effectively treat cancer with fewer side effects, highlighting the significance of accurate and rapid gastric cancer screening [10, 11].

Gastric cancer is a multifactorial disease influenced by various demographic characteristics, diet, lifestyle, genetic predisposition, family history, medical conditions, and infections [12, 13]. Age, sex, socioeconomic status, smoking status, diet, obesity, physical activity, and Helicobacter pylori infection have been associated with gastric cancer. Smoking status, high salt intake, and H. pylori infection are considered modifiable factors [12,13,14,15,16,17,18,19]. Although upper gastrointestinal endoscopy is necessary to assess the risk of gastric cancer, it has potential risks and entails high screening costs [11, 20]. For example, one study found that individuals who underwent gastrointestinal endoscopy reported a 0.13% adverse complication rate and a 0.004% mortality rate [21]. Therefore, gastrointestinal endoscopy has been proposed for high-risk subgroups [22].

In 1999, South Korea implemented a national screening program for gastric cancer, which recommends biennial upper endoscopy or upper gastrointestinal series for individuals aged ≥ 40 years [23]. The “high-risk group” is defined based on age (≥ 40 years), family history of gastric cancer, a previous diagnosis of precancerous conditions, such as atrophic gastritis and intestinal metaplasia, Helicobacter pylori (H. pylori) infection, and chronic conditions, such as pernicious anemia and gastric ulcers.

The observed decrease in the incidence of gastric cancer in South Korea between 2010 and 2020 can be attributed to several factors. For instance, the widespread implementation of a national endoscopic screening program [23], which has led to the early detection and treatment of precancerous lesions. Additionally, efforts to eradicate H. pylori [24], a known risk factor for gastric cancer, have contributed to this decline. Widespread H. pylori screening and eradication programs have been in place in South Korea since the early 2000s. Studies have shown that H. pylori eradication can reduce the risk of gastric cancer by approximately 50%. Other factors that may have influenced this trend include improvements in lifestyle and dietary habits [25], and enhanced sanitation and living conditions [26].

Several studies have primarily focused on health expenditures related to end-of-life care and those incurred after a cancer diagnosis [27,28,29]. However, these studies have mainly focused on the financial burden on patients, families, health systems, and society following cancer occurrence. Given the availability of accurate and rapid screening methods for gastric cancer, such as gastrointestinal endoscopy [30], the most effective strategy is to develop a prediction model that can identify high-risk subgroups by analyzing the patterns prior to the onset of gastric cancer.

This study focused on health expenditure patterns. Using Andersen’s behavioral model of health services utilization, health utilization, and expenditures were determined based on predisposing, enabling, and need factors [31]. Although personal preferences and other predisposing and enabling factors may over- or underestimate health expenditure, previous research has shown that health expenditure is associated with health conditions [32,33,34]. Previous research has also examined the association between health expenditures and cancer incidence and found that higher health expenditures are associated with increased cancer incidence [35]. However, this approach may overlook the patterns of longitudinal health expenditure changes before the onset of gastric cancer. Because the gastrointestinal carcinogenic process typically spans 5–15 years [36], it is imperative to examine the potential impact of longitudinal health expenditure patterns. Whereas a single event can lead to a sudden surge in health expenditures in a particular year, it may not be indicative of the patterns associated with gastric cancer incidence. This underscores the importance of examining health expenditure patterns over time before gastric cancer occurrence. This study utilized data from the National Health Insurance Service (NHIS) Senior Cohort in South Korea and identified specific health expenditure trajectories preceding the onset of gastric cancer, and examined the associations of these trajectories with gastric cancer incidence using a nested case-control study design to address this gap.

Methods

Data and study population

This study used data from the NHIS Senior Cohort of South Korea. The NHIS covers approximately 97% of all Korean citizens and the remaining 3% are covered by Medical Aid [37]. Consequently, the NHIS collects data on the use of medical services by all citizens and stores them anonymously. The NHIS senior cohort data comprised a randomly selected sample stratified based on various criteria, such as sex, age, and insurance premium quantile, representing approximately 8% of South Korean individuals aged ≥ 60 years in 2008 (approximately 510,000 individuals). Moreover, since 2009, the NHIS senior cohort has added 8% older adults who turn 60 each year, resulting in approximately 540,000 older adults between 2009 and 2019. Overall, approximately 1,057,784 older adults participated in the cohort study between 2002 and 2019 [38]. The dataset included information on each individual’s health status, healthcare utilization, and sociodemographic characteristics such as sex, age, and region of residence [38].

This study used data from 2008 to 2019 and included 761,917 individuals aged 60 years or older at baseline between 2008 and 2014. Study participants with a history of cancer (N = 94,260), those on Medical Aid (N = 33,697), and those who died from non-incident gastric cancer during the study period (N = 132,342) were excluded. After excluding 442,126 participants who did not meet the inclusion criteria, 59,492 were included in the analysis (Fig. 1).

Fig. 1
figure 1

Selection process of the study participants

Variables

Outcome measure

The primary outcome measure in this study was gastric cancer incidence. Gastric cancer was classified based on the principal diagnosis between 2008 and 2019, using the International Statistical Classification of Diseases, 10th revision (ICD-10; code C16).

Independent variables

The independent variable used in this study was health expenditure. Health expenditures included all expenses related to medical services incurred before the diagnosis of gastric cancer. This included outpatient and inpatient medical services, except for oriental medicine, dental services, and out-of-hospital prescriptions To account for the annually increasing inflation, health expenditures were calculated using a conversion index applied to health expense calculations [39]. In South Korea, the annual rate of increase is determined by negotiations between the NHIS and provider associations [39]. All health expenditures at different timepoints were calculated using an annual conversion index based on the last data point of the study (2019). The annual conversion index rates used to calculate health expenditures are presented in Supplementary Table 1. Individual health expenditures from 1 to 5 years prior to the diagnosis of gastric cancer were aggregated in units of six months and divided into 20 groups according to the percentiles (Supplementary Table 2). For participants with no incidence of gastric cancer, health expenditures were calculated based on the date of diagnosis of the matched patient with gastric cancer. Seven health expenditure trajectories were identified based on the health expenditure patterns of the segmented health expenditure groups 1–5 years gastric cancer diagnosis. The seven types of health expenditure trajectories over 5 years were categorized based on the following patterns: “consistently lowest,” “rapidly increasing,” “gradually increasing,” “consistently second-highest,” “middle-low,” “gradually decreasing,” and “consistently highest’’ (Fig. 2).

Fig. 2
figure 2

Types of health expenditure trajectories

Covariates

The covariates included sociodemographic factors, underlying health conditions, health behaviors, and pre-existing conditions. Sociodemographic factors included sex (men, women), age (60–64, 65–69, 70–74, 75–79, and > 80 years), income (quartile), and region of residence (Seoul, Gyeonggi, metropolitan, rural). Underlying health conditions included disability and the Charlson comorbidity index (CCI). Disability was categorized as non-disabled and registered under Korea’s Disability Registration system [40]. The CCI was calculated according to Quan’s updated criteria and categorized as 0, 1, 2, > 3 [41]. Sociodemographic factors and underlying health conditions were assessed at baseline. Health behaviors included smoking status (no answer (N/A), never, former, current) and alcohol consumption status (N/A, no, yes). Preexisting conditions included eight symptoms associated with gastric cancer, which were identified based on the Korean Standard Classification of Symptoms (KCD). These symptoms included abnormal weight loss (R10.49), dysphagia (R13), dyspepsia/gastro-esophageal reflux disease (K21, K30), hematemesis/melaena (K92.1), H. pylori infection (B98,0), diabetes mellitus (E10-E14), and pneumonia (J12-J18) [42,43,44,45,46,47,48]. The presence of these symptoms was confirmed from baseline until cancer diagnosis.

Statistical analysis

The study was conducted in four steps. First, the gastric cancer diagnosed (cases) and non-diagnosed (controls) groups were matched in a 1:3 ratio according to the nested case-control (NCC) method, which involved the selection of participants (controls) with similar characteristics upon the event incidence [49]. Matching was performed according to sex, age, and income level upon gastric cancer diagnosis. The study design is shown in Supplementary Fig. 1. Second, a latent class trajectory model (LCTM) analysis was conducted to identify health expenditure patterns among the matched participants. LCTM is an analytical method that clusters latent classes based on changes in the dependent variable over time and is suitable for categorical variables [50, 51]. The number of clusters in the latent class was determined by the goodness of fit and pattern proportion of the model. For model fit, we considered the lowest value based on the Akaike information criterion (AIC) and Bayesian information criterion (BIC). At the same time, we examined whether any cluster group had a proportion of < 5%. Third, conditional logistic regression was performed for matched data analysis to investigate the relationship between healthcare expenditure trajectories and gastric cancer incidence. Fourth, a sensitivity analysis was conducted to enhance the robustness of the study’s results. The sensitivity analysis comprised two parts: (a) interesting variable change and (b) subgroup analysis. For the interesting variable change, the data was analyzed by dividing the health expenditures one year before gastric cancer diagnosis, five years before diagnosis, and the average health expenditures from one to five years into quintiles. In the subgroup analysis, participants were categorized into below-median and above-median income groups. The statistical significance of the regression analysis was determined using a 95% confidence interval (CI) of the odds ratio (OR), where a lower bound > 1 was considered significant. All analyses were performed in 2023 using SAS (version 9.4, SAS Institute, Cary, NC, USA) and R (version 3.3.1, R Foundation for Statistical Computing).

Results

LCTM analysis was performed to identify the health expenditure trajectory before the incidence of gastric cancer. The LCTM analysis confirmed the model fit and proportion of each pattern (Table 1). Although class 8 showed the best model fit, its proportion was < 5%. Therefore, class 7, which satisfied both the model fit and proportion criteria, was determined as the final model.

Table 1 Number of classes of latent class trajectory models based on health expenditure before gastric cancer incidence

Table 2 shows the general characteristics of the study sample. A total of 59,492 older Korean individuals (14,873 cases and 44,619 controls) were included in this study. The study sample displayed the following general characteristics; most had high health expenditure (54.3%), were men (68.4%), were under 65 years of age (42.2%), belonged to the highest income group (31.8%), resided in rural areas (36.9%), were non-disabled (88.3%), had a CCI score of 0 (63.1%), had never smoked (51.2%), and did not drink alcohol (55.6%). Additionally, 49.9% of the older adults had diabetes mellitus, 18.7% had pneumonia, and 83.3% experienced dyspepsia/gastro-esophageal reflux disease. Other pre-existing conditions were reported by < 5% of the study participants.

Table 2 General characteristics of the study population

The frequency of gastric cancer incidence for each variable is presented in Table 2. Based on the type of health expenditure trajectory, the incidence of gastric cancer was noted in the following order: rapidly increasing (N = 1,251; 35.5%), consistently highest (N = 2,341; 29.8%), gradually increasing (N = 2,264; 27.6%), and consistently second-highest (N = 3,339; 26.2%). The incidence of gastric cancer was higher in groups with consistently higher or increasing health expenditure before the incidence of gastric cancer than in other groups (p < 0.001). With regard to sex, age, and income, the diagnosed and non-diagnosed gastric cancer groups were matched at a ratio of 1:3 using the NCC method. In addition, the incidence of gastric cancer was higher in rural areas (p < 0.001), among those with disabilities (p < 0.001), those with a higher CCI score (p < 0.001), and those who currently smoked (p < 0.001). Furthermore, individuals with pre-existing conditions had a higher incidence of gastric cancer than those without (p < 0.001).

We performed a conditional logistic regression analysis to investigate the association between health expenditure trajectory type and gastric cancer incidence (Table 3). The results indicated that compared to the middle-low group, individuals in the consistently lowest group had an increased OR of 1.40 (95% CI; 1.30–1.51), whereas those in the rapidly increasing and gradually increasing groups had ORs of 2.11 (95% CI; 1.94–2.30) and 1.26 (95% CI; 1.17–1.35), respectively. However, the consistently second highest, gradually decreasing, and consistently highest groups showed no significant difference. Furthermore, we found that certain regions had higher ORs for gastric cancer incidence, with metropolitan and rural areas showing ORs of 1.09 (95% CI; 1.03–1.16) and 1.24 (95% CI; 1.18–1.31), respectively, compared to Seoul. Additionally, the presence of a disability, higher CCI score, former or current smoker status, and pre-existing conditions were associated with a higher OR for gastric cancer incidence. Particularly, pre-existing conditions had higher ORs for gastric cancer incidence; abnormal weight loss (OR = 1.42, 95% CI; 1.21–1.68), dysphagia (OR = 2.04, 95% CI; 1.76–2.36), dyspepsia/gastro-esophageal reflux disease (OR = 2.53, 95% CI; 2.36–2.70), hematemesis (OR = 5.16, 95% CI; 3.91–6.80), melena (OR = 5.13, 95% CI; 4.66–5.65), H. pylori infection (OR = 3.28, 95% CI; 2.90–3.71), diabetes mellitus (OR = 1.38, 95% CI; 1.33–1.44), and pneumonia (OR = 1.30, 95% CI; 1.24–1.37), compared to that in individuals without pre-existing conditions.

Table 3 Results of the conditional logistic regression analysis on the association between types of trajectory about health expenditure and gastric cancer incidence

The results of the sensitivity analysis are presented in Table 4. In the analysis of interesting variable change, participants were divided into groups based on their health expenditure one year and five years before gastric cancer diagnosis. For the one-year health expenditure group analysis, compared to the middle group, individuals in the lowest group had increased ORs of 1.42 (95% CI; 1.33–1.51), while those in the middle-low groups had ORs of 1.11 (95% CI; 1.04–1.18). In the analysis of the five-year health expenditure group, compared to the middle group, individuals in the lowest group had increased ORs of 1.10 (95% CI; 1.04–1.17), whereas those in the highest groups had ORs of 1.44 (95% CI; 1.35–1.53). In the analysis of the one–five-year average health expenditure group, compared to the middle group, individuals in the lowest group had increased ORs of 1.15 (95% CI; 1.07–1.23), while those in the highest groups had ORs of 1.22 (95% CI; 1.15–1.30). For each subgroup, health expenditure trajectory groups were determined based on health expenditures from one to five years before gastric cancer diagnosis. The classification of health expenditure trajectory groups was based on criteria such as AIC, BIC, and ensuring that no cluster group had a proportion of less than 5% (Supplementary Tables 3 and Supplementary Fig. 2). In the conditional logistic analysis of the below-median income group, compared to the middle-low group, individuals in the consistently lowest group had increased ORs of 1.38 (95% CI; 1.23–1.53), while those in the rapidly and gradually increasing groups had ORs of 2.06 (95% CI; 1.82–2.34) and 1.18 (95% CI, 1.06–1.31), respectively. For the conditional logistic analysis of the above-median income group, the results showed that individuals in the consistently lowest income group had increased ORs of 1.30 (95% CI; 1.16–1.45) compared to the middle-low group. Additionally, those in the rapidly and gradually increasing groups had ORs of 2.18 (95% CI; 1.92–2.47) and 1.31 (95% CI, 1.19–1.45), respectively. The gradually increasing group had an OR of 1.15 (95% CI; 1.03–1.28).

Table 4 Results of the sensitivity analysis

Discussion

To the best of our knowledge, this is the first study to predict an association between health expenditure trajectories and the incidence of gastric cancer. In South Korea, there is only one mandatory social health insurance [52], so it is a good environment to conduct research on health expenditure and health utilization. The primary objectives of this study were to identify specific health expenditure trajectories preceding the onset of gastric cancer and to examine the associations between these trajectories and gastric cancer incidence using an NCC study design. The findings revealed that individuals in the rapidly increasing (OR = 2.11, 95% CI; 1.94–2.30), consistently lowest (OR = 1.40, 95% CI; 1.30–1.51), and gradually increasing (OR = 1.26, 95% CI; 1.17–1.35) groups had a higher risk of developing gastric cancer.

The sensitivity analysis results confirmed the short and long-term association between gastric cancer diagnosis and average health expenditures before diagnosis. The result of the lowest group was the same as that of the consistently lowest trajectory group. The highest group in five-year average health expenditure was significant because some increasing trajectory groups were included in the highest group. This indicates that the increasing trajectories were not able to be captured in the sensitivity analysis from the traditional perspective. It demonstrated the importance of considering the trajectory context. Furthermore, subgroup analyses based on median income levels also demonstrated that consistently low or increasing health expenditures before gastric cancer diagnosis were significantly associated with the diagnosis of gastric cancer. In the health equity and accessibility perspectives, the health expenditures can differ from the income group, but the sensitivity results showed the robustness of our hypothesis. These results provide valuable information for identifying high-risk patients most likely to benefit from gastrointestinal endoscopy.

Health expenditure represents the utilization of healthcare services to meet health demands, categorized as need-based, unnecessary, avoidable, or supplier-induced demands [53]. This study’s results demonstrated various combinations of demand aspects. The consistently highest group may reflect need-based utilization or overuse (such as unnecessary or supplier-induced demands) [54]. The consistently lowest group represents health conditions or the underused and unmet needs that may be attributed to awareness, choice, or access [55]. Both overuse and underuse can be detrimental to patients [54, 56], but underuse can lead to serious health problems, including increased cancer incidence. Thus, the consistently lowest group presents a higher risk than the consistently highest group. The rapidly and gradually increasing groups also showed a high risk of developing gastric cancer. These increasing trends indicate a change that can be attributed to needs-based utilization. Initially, it may involve unnecessary or supplier-induced demands; however, owing to its association with cancer incidence, it ultimately becomes need-based and avoidable.

Gastric cancer prediction models are understudied regarding their ability to identify high-risk populations that can benefit from gastrointestinal endoscopy. Although a few studies have been conducted to predict cancer incidence using claims data, most have overlooked health expenditure patterns. For example, Yeh et al. utilized epidemiological factors, clinical history, and the results of clinical examinations (blood tests, genetic analysis, and imaging results) to predict lung cancer incidence [57]. Similarly, Taninaga et al. used longitudinal medical check-up data (biological characteristics, H. pylori infection status, endoscopic findings, and blood test results) to predict gastric cancer incidence [11]. Our findings suggest that monitoring health expenditure can serve as an indicator for identifying specific risk groups that can be targeted with interventions. However, further research is required to thoroughly explore and validate the predictive potential of health expenditure patterns for gastric cancer.

The incidence of gastric cancer was higher in rural areas and among individuals with disabilities than in other areas and among those without disabilities, which is consistent with the results of previous studies [58, 59]. Song et al. reported that rural areas have a relatively higher incidence of gastric cancer compared with urban areas [58]. They also found a high incidence of gastric cancer in coastal cities and southern provinces, possibly because of higher consumption of highly salted seafood. Evidence suggests that high salt intake may directly stimulate the gastric mucosa or have a synergistic effect with H. pylori infection, leading to gastric cancer. Similar to our findings, Kim et al. (2020) reported that individuals with disabilities tend to be diagnosed at a later stage, particularly those with severe disabilities [60]. Individuals with disabilities are diagnosed with gastric cancer at a later stage, receive inadequate treatment, and have a higher risk of mortality than those without disabilities [59, 60]. Previous research has shown that individuals with disabilities fall within the national free cancer-screening service policy in South Korea [61]. Although this study focused on the incidence of gastric cancer, disability status appeared to be related to the entire cancer control continuum.

This study found that participants with pre-existing conditions and higher CCI scores had a greater incidence of gastric cancer, which is consistent with previous studies indicating that participants with comorbidities or pre-existing conditions have a greater risk of cancer incidence [62, 63]. Hoang et al. (2020) examined the association between comorbidity risk score strata and cancer incidence and observed a dose-dependent relationship between them [63]. In a large population-based prospective study of 405,878 Taiwanese participants, Tu et al. (2018) found a significantly increased risk of incident cancer for eight diseases and markers individually, with adjusted hazard ratios ranging from 1.07 to 1.44 [62]. Although many studies have investigated the effect of comorbidity scores on overall cancer incidence, to the best of our knowledge, this study is the first to examine the association between CCI scores and gastric cancer incidence in South Korea.

Despite its significance, this study had several limitations that should be acknowledged when interpreting the findings. First, although LCTM offers the advantage of grouping by patterns of change in health expenditure over time, a major limitation is the inability to account for factors that influence these patterns. Therefore, future studies should consider a methodology that accounts for the factors affecting the change patterns. Second, there is a possibility of a mismatch between the recorded diagnostic codes and the actual hospital diagnoses. However, it is important to note that a policy has been implemented since September 2005 in South Korea to reduce the copayment for cancer patients, resulting in a 5% decrease in out-of-pocket expenditure [60]. Therefore, all cancer patients are assigned special incidence codes when they receive healthcare services, minimizing the likelihood of code mismatches. Hence, we believe that the code mismatch rate is low. Third, we could not include information related to health status or screening history owing to the characteristics of the claims data. We incorporated the CCI score as a variable reflecting the patient’s underlying health status to compensate for this limitation. Although this design was useful for investigating the associations in a population sample, it has limitations regarding its broader applicability. Since the categorization of health expenditure is data-driven, it is important to recognize that our classifications may not fully capture the health expenditure patterns of the general population. To address concerns regarding generalizability, we conducted sensitivity analyses comparing the characteristics of our nested case-control sample with the overall cohort and general population data.

Conclusion

This study suggests that individuals in the rapidly increasing, consistently lowest, and gradually increasing groups had a higher risk of developing gastric cancer. Health expenditure patterns can help identify high-risk populations; therefore, monitoring these trajectories is important.

This results can be used to guide the high-risk populations to take cancer screenings based on the analysis of health utilization records by NHIS. Alternatively, individuals can assess their cancer risk and take secondary preventive action by using the My Data platform, which allows them to download their health utilization records, including health expenditures in Korea [64].