Introduction

Lower back pain is among the most prevalent causes of long-term disability worldwide [29]. Degenerative disc disease (DDD), the progressive deterioration of the intervertebral disc, is a common etiology of lower back pain [21]. Although there is a wide selection of available pharmacological treatments and physical therapy, patients with lumbar DDD refractory to conservative measures must often resort to discectomy, fusion, or lumbar disc arthroplasty for a definitive solution. Currently, lumbar fusion procedures are the most commonly utilized technique, well known for their stability and time-tested effectiveness on pain relief [12]. However, lumbar fusions are not benign procedures— the high frequency of complications such as adjacent segment disease (ASD) and pseudoarthrosis has continued to stimulate research into more optimal techniques for surgical management [10].

Total disc replacement (TDR), also known as arthroplasty or artificial disc replacement, is a newer surgical approach that may offer benefits over traditional arthrodesis, including preserved range of motion, restoration of disc height, and reduced disc subsidence [2, 4, 19]. Additionally, several studies have reported lower pain and disability scores and higher patient satisfaction rates with TDR versus lumbar fusion. However, some of these studies reflect outcomes for posterolateral fusion instead of anterior fusion [18, 28, 34]. Recently, Auerbach et al. directly compared the lumbar motion profiles in patients undergoing fusion versus TDR and concluded that fusions have steeper lumbar motion gradients at the adjacent segment during flexion and extension compared with TDR and control [1]. However, as previous studies have not adequately evaluated the incidence of surgical complications and the need for additional surgery, controversy remains regarding whether TDR is superior to ALIF [11, 13, 14, 23, 32]. To this end, we assess postoperative outcomes, including reoperation rates following single-level arthroplasty compared to single-level ALIFs in patients with DDD.

Methods

Data source

This retrospective cohort study utilized longitudinal data from the MARINER database (PearlDiver Technologies, Colorado Springs, USA), an all-payer claims database that includes records for over 120 million patients across all 50 US states. This database includes all healthcare encounters and prescription claims billed to all payers from 2010 to 2020. Independent third parties audit all claims. Diagnoses, procedures, and patient demographics can be identified in the database with International Classification of Diseases (ICD) 9 and 10 codes and Current Procedural Terminology (CPT) codes. Our institution’s Institutional Review Board provided exemption status for this study due to the minimal risk to participating individuals. This study followed the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guidelines.

Study population

Patients undergoing single-level ALIF or arthroplasty procedures between January 2010 and October 2020 were identified using ICD-9, ICD-10, and CPT codes. To be included in the study, patients had to be 18–84 years old, have complete records with demographic information, and have a continuous active insurance record in the database for at least 5 years following their index procedure. Additionally, patients with a history of previous spine surgery or those diagnoses of spine fractures, spondylolysis, spondylolisthesis, lumbar stenosis, and osteoporosis were excluded.

The ALIF and arthroplasty cohorts were matched based on patient age, sex, and comorbidities. Patient comorbidities within 1 year before their index procedure date were identified using ICD-9 and ICD-10 codes. These comorbidities included the Charlson Comorbidity Index, type 2 diabetes mellitus, hypertension, hyperlipidemia, obesity, history of smoking, and prescribed opioid use within 30 days before surgery. The patient selection procedure is detailed in Fig. 1. Patient demographics and comorbidities are tabulated in Table 1.

Fig. 1
figure 1

Patient selection flow chart

Table 1 Descriptive characteristics of matched patients undergoing 1-level arthroplasty versus ALIF

Outcome definitions

The primary outcome of this retrospective cohort study was the need for additional lumbar surgery within 5 years of the index procedure. This was identified using ICD-9, ICD-10, and CPT codes. Other outcomes that were measured similarly included length of stay, the number of patients taking opioids at 30 days following surgery, morphine milligram equivalents per day among patients taking opioids at 30 days following surgery, all-cause readmissions within 30 days of surgery, urinary tract infections, pneumonia, hematoma, wound dehiscence, and deep vein thrombosis within 30 days following surgery.

Statistical analysis

Descriptive characteristics calculated were all patients. Binomial and logistic regression models were utilized to match the two cohorts (ALIF vs. arthroplasty) based on age, sex, and comorbidities (Supplemental Tables 2 and 3). Categorical variables such as sex, age ranges, and comorbidities were compared using χ2 tests. Quantitative variables such as mean length of stay and morphine milligram equivalents were compared using Student’s t tests. Additionally, odds ratios with 95% confidence intervals were calculated for 30-day postoperative complication rates and 5-year additional lumbar surgery rates. Statistical analysis was performed using R version 4.1 (R Foundation, Vienna, Austria).

Table 2 Postoperative outcomes at 30-days following 1-level arthroplasty versus ALIF
Table 3 Odds of additional surgery and 30-day postoperative complications following lumbar arthroplasty versus ALIF

Results

Descriptive characteristics

Our database query yielded 51,410 patients who underwent single-level arthroplasty or ALIF procedures between 2010 and 2020. Of these, 3287 were excluded who were missing demographic information, did not have active insurance for 5 years following their index procedure, or did not meet our age criteria. An additional 36,756 patients were excluded due to diagnoses of spondylolisthesis, spondylolysis, lumbar stenosis, lumbar spine fractures, osteoporosis, or previous spine surgery (Fig. 1). Descriptive characteristics for the remaining 11,367 eligible patients are tabulated in Supplemental Table 1.

Eight hundred forty-six patients were matched 1:1 and distributed equally between the arthroplasty and ALIF cohorts. Age range distributions were identical between groups (p > 0.99). The most prevalent comorbidities in the study population were hypertension (21.28%), history of smoking (17.73%), and hyperlipidemia (10.64%). Between ALIF and arthroplasty patients, there was no statistically significant difference in rates of diabetes mellitus type II, hypertension, hyperlipidemia, chronic kidney disease, obesity, smoking, and prior opioid use. Demographic and comorbidity data for the matched patient population are detailed in Table 1.

Outcomes

The rate of all-cause readmissions at 30 days was significantly higher in the ALIF cohort compared to the arthroplasty cohort (2.60% vs. 0.71%, p = 0.02). On the other hand, the mean length of stay was significantly shorter in the ALIF cohort compared to the arthroplasty cohort (1.043 days vs. 2.174 days, p < 0.001). There were no significant differences in the other outcomes measured, including the number of patients still taking opioids, morphine milligram equivalents per day, urinary tract infections, pneumonia, hematoma, wound dehiscence, deep vein thrombosis, and composite complications. Outcomes following single-level arthroplasty or ALIF are summarized in Table 2.

The odds of 30-day postoperative complications were no different for patients in the arthroplasty cohort compared to the ALIF cohort (OR 0.72, 95% CI 0.29 to 1.81). Likewise, the odds of additional lumbar surgery within 5 years of the index operation were not significantly different among patients who underwent arthroplasty versus ALIF (OR 0.83, 95% CI 0.51 to 1.35) (Table 3). The 5-year free probability of additional lumbar surgery among patients who underwent ALIF versus arthroplasty is plotted in Fig. 2.

Fig. 2
figure 2

Five-year additional surgery-free probability among patients undergoing ALIF versus arthroplasty

Discussion

Arthroplasty devices are a well-established surgical option for patients with DDD [26, 34]. Nevertheless, arthroplasty has yet to be widely adopted in the USA as an alternative to ALIF, even though arthroplasty has been shown to better preserve the biomechanics of the spine compared to ALIF [2, 4, 19, 25]. Proponents of arthroplasty suggest that by mimicking the intervertebral disc structure, artificial discs attenuate the progression of ASD and reduce rates of further surgical intervention. However, opponents contend that the degeneration that eventually occurs is simply inherent to the natural progression of the underlying disease and that the literature has not borne out the claim of arthroplasty superiority over ALIF [13, 22, 24].

This is the largest retrospective cohort study to compare the rates of additional lumbar surgery at adjacent segments following ALIF versus arthroplasty. Our study utilized 91 million patient records from the MARINER database and ultimately analyzed 846 patients in 1:1 exact matched cohorts with no significant difference in patient characteristics. Our investigation did not identify an increased risk of further surgery in patients who underwent fusion compared to arthroplasty, as additional surgery rates nor 30-day postoperative complications were statistically different between cohorts.

These results align with several other studies published in the literature, although these are primarily industry-supported trials whose results should be taken with a grain of salt [15, 16, 22]. For example, Kitzen et al. followed up with 141 patients who had undergone TDR for DDD, with a mean follow-up of 16.7 years. They found long-term motion preservation in only 38.0% of patients; nonetheless, there was no association between residual mobility and the occurrence of ASD [15]. Indeed, the literature suggests that arthroplasty may not provide long-term preservation of motion in most patients and may not be associated with a reduction in ASD or reoperation regardless of motion preservation. Instead, the natural progression of DDD may predispose patients to further procedures on adjacent segments in the lumbar spine [6].

Although arthroplasty may not provide significant benefit over ALIF regarding reoperation on adjacent segments, it may still be a reasonable alternative. Several studies on arthroplasty versus lumbar fusion have established the noninferiority of arthroplasty in terms of both short-term and long-term outcomes for the treatment of DDD [7, 9, 30]. Our matched cohort analysis corroborated these findings compared to ALIF; both approaches demonstrated similar risk of complications at 30 days post-op or additional surgery at 5 years.

On the other hand, we found a shorter stay with ALIF at 1.043 days versus 2.174 days in contrast to published literature [16, 27]. Previous literature on this topic has been mixed. Shultz et al. in 2018 found a slightly increased length of stay by 0.28 days for ALIF patients. However, they also discussed the unclear clinical significance of this minor difference, as there was no translation to perioperative outcomes [27]. The length of stay difference uncovered in this study is much greater and warrants consideration by surgeons considering either ALIF or arthroplasty, especially given the costs associated with prolonged hospital stays.

Further research on this topic will be needed to better understand the cause of this finding which is especially surprising when considering the similarities between the ALIF and arthroplasty procedures. We also identified high readmission rates with ALIF compared to arthroplasty, similar to prior studies, which found higher or no different rates of readmissions with ALIF compared to arthroplasty [7, 27]. One possible explanation for the lower readmissions in arthroplasty patients is that the operation is performed in a limited selection of institutions given its novelty and thus only by a few surgeons now experienced with the technique. This could minimize the length of the operation, extraneous tissue manipulation, and invasiveness of the operation, leading to a decrease in postoperative problems causing readmission. It is also worth considering that the cohort of patients undergoing arthroplasty has been cherry-picked to some extent again, given the novelty of the technology and the industry’s desire to demonstrate positive results. No difference in 30-day postoperative opioid use was found between procedures, indicating that arthroplasty may comparably improve pain. A previous retrospective analysis by Mattei et al. suggested that arthroplasty patients may experience less back pain [18].

There are several limitations to this investigation that should be considered. Classification of patient demographics, comorbidities, and outcomes relies on accurate and reliable ICD-9, ICD-10, and CPT coding. Coding errors, including underreporting and miscoding, may have affected the results of our analysis. Additionally, while this investigation focused on reoperation rates at adjacent segments, the database used in this study lacks sufficient detail to assess the extent of adjacent segment degeneration in patients who underwent additional surgery within the 5-year follow-up period. Likewise, the assessment of motion preservation among patients who underwent ALIF or arthroplasty fell outside the scope of this study due to insufficient available data. In addition, arthroplasty is performed at fewer centers than ALIF, and thus the analyzed arthroplasty patients may be from a few overrepresented centers in the USA. However, the impact of this possibility is somewhat mitigated given the large sample size of 846.

Conclusion

ALIF and lumbar arthroplasty procedures are equally safe and effective in treating DDD. Our findings suggest that lumbar arthroplasty should take a larger role in DDD management in the USA as it did not increase reoperation rates. Interestingly, we found that ALIF was associated with a lower LOS. However, further prospective, retrospective, and biomechanical studies will need to be undertaken to better characterize this finding and considerations for ASD in the surgical management of DDD.