Introduction

Magnetic resonance imaging (MRI) is the best noninvasive examination for the diagnosis of lumbar spinal stenosis (LSS) because it provides excellent visualization of anatomical structures and soft tissue contrast [1, 2]. Previous studies have demonstrated that axial loading MRI could provide more diagnostic information, similar to that of upright lumbar spine MRI [3,4,5,6]. The axial loading device currently applied in clinical, is the “DynaWell L-Spine” (Dynawell Diagnostics, NY, USA). This device works by applying pressure through the shoulders and feet of the patient while the patient maintains a straight-leg position with a cushion placed beneath the low-back for stabilization [7, 8].

During the examination the patient's lumbar spine was actually in extended position which may also aggravate LSS [9, 10]. The design of this device, with a pillow beneath the low-back, requires the patient to be imaged in extension, which is a potential confounder when determining if it is axial-loading or extension eliciting the structural changes. We used a new lumbar spine axial loading device that applies pressure through the shoulders and hips. The new device could be used in two positions: lumbar spine extended position or psoas-relaxed position with knees flexion. In this study, we investigated a new axial-loading device that can be used in both psoas-relaxed and extension positions to determine the effects of extension, axial-loading, and a combination of these on the spinal morphology.

Materials and methods

Participants

This study was approved by the Institutional Review Board of the 305 Hospital of PLA and conducted in accordance with the Declaration of Helsinki (as revised in 2013). Written informed consent was obtained from all subjects. A total of 40 patients were included between January 2023 and July 2023. The inclusion criteria for the patients were: neurogenic intermittent claudication and/or radicular leg pain and/or numbness with a duration of at least 3 months. The exclusion criteria were: previous spinal surgery, severe osteoporosis, severe cardiopulmonary dysfunction, history of spinal fracture, spinal malignant tumor, lower limb disease, and claustrophobia.

Axial loading device

The new lumbar spine alMRI (axial loading MRI) device developed by our team consists of wearable components and pressure components, which can achieve axial loading of lumbar spine by exerting pressure on the shoulders and hips (Fig. 1A) [11, 12]. The pressure is applied in the pneumatic mode. The control system of the pressure component pneumatically pressurizes the cylinder connected to the wearable components through a long ventilation tube and tightens the piston to generate axial loading of the lumbar spine. The pressure could be steadily applied and accurately measured through the control system remotely. The axial loading pressure is controlled to be equal to 40-50% of the patient's own body weight, simulating the load on lumbar spine when the body is upright, and the compression time was 5 minutes [13,14,15].

Fig. 1
figure 1

The new axial loading MRI device (A) consists of wearable components (red box) and pressure components (black box). The control system pneumatically pressurizes the cylinder connected to the wearable components through the ventilation tube (arrow) and tightens the piston to generate axial loading of the lumbar spine. Patient underwent 4 MRI scanning protocols in sequence: rMRI (B), eMRI (C), r+alMRI (D) and e+alMRI (E). rMRI, psoas-relaxed position MRI; eMRI, extended position MRI; r+alMRI, the psoas-relaxed position axial loading MRI; e+alMRI, the extended position axial loading MRI

MRI protocol

All MRI examinations were performed on a 1.5-T system (Signa Optima, GE Medical Systems, Milwaukee, WI, USA) using a surface coil. Sagittal T2-weighted fast recovery fast spin-echo (FRFSE), T1-weighted fast spin-echo (FSE), short tau inversion recovery (STIR) and axial T2-weighted FRFSE were performed using a standard protocol (Table 1).

Table 1 Details of sequences of the lumbar spine MRI

Imaging acquisition

Each patient underwent 4 MRI scanning protocols in sequence (Fig. 1B-E): (1) the psoas-relaxed position MRI (rMRI) with a cushion under the legs, (2) the extended position MRI (eMRI) with extended legs and a cushion under the lumbar spine, (3) the psoas-relaxed position axial loading MRI (r+alMRI) with a cushion under the legs, (4) the extended position axial loading MRI (e+alMRI) with extended legs and a cushion under the lumbar spine. The cushion under the lumbar spine was a thick memory foam of 60*30*15cm. The cushion under the legs was a thick hard foam of 80*50*30cm.

Patients underwent rMRI and eMRI on the first day, followed by r+alMRI and e+alMRI on the second day, and the two examinations each day were separated by 1 hour. The purpose of these arrangements was to exclude the effect of the previous scanning and to prevent patients from becoming fatigued and thus influencing the examinee comfort assessment. All scans were performed between 8:00 am and 10:00 am to minimize the effect if timepoint variations on the lumbar spine as much as possible [16].

Image quantitative measurements

The dural sac cross-sectional area (DSCA), sagittal vertebral canal diameter (SVCD), disc height (DH) and ligamentum flavum thickness (LFT) were measured at L3-L4, L4-L5, and L5-S1. Meanwhile, the lumbar lordosis angle (LA) was measured as L1-S1 angle on the mid-sagittal image (Fig. 2). The DSCA, SVCD and LFT were measured on the axial image. The DSCA was measured at each intervertebral space with the smallest area. The thickest part of the ligamentum flavum was selected for measurement. The SVCD, DH and LFT were measured in mm, DSCA in mm2, respectively, while LA in degree. All measurements were kept to one decimal place.

Fig. 2
figure 2

Measurements of (a) DSCA, (b) SVCD and (c) LFT in an axial T2-weighted FRFSE image, (d) DH and (e) LA in T2-weighted FRFSE image. DSCA, dural-sac cross-sectional area; SVCD, sagittal vertebral canal diameter; DH, disc height; LFT, ligamentum flavum thickness; LA, lordosis angle; FRFSE, fast recovery fast spin-echo

Image quality assessment

All image quality was assessed by two readers using a Likert 5-point scale: score 5, the main structures were perfectly displayed, no artifacts or imaging distortion, no effect on diagnosis; score 4, well displayed structures, few artifacts, slight image distortion, little effect on diagnosis; score 3, relatively well displayed, a few artifacts, slight image distortion, a certain effect on diagnosis; score 2, moderate displayed structures, moderate artifacts and image distortion, significant effect on diagnosis; score 1, severe artifacts, image distortion, or poor signal intensity, unavailable for diagnosis. The image quality score is performed by the reading physician at the same time as each image measurement.

Examinee comfort assessment

After every MRI scanning, patients were asked to assess the level of comfort of the examinations on a Likert 5-point scale: score 5, no discomfort during the examination; score 4, only mild discomfort during the examination; score 3, certain discomfort, but examination can be done easily; score 2, obvious discomfort, but examination can be done difficultly; score 1, intolerable discomfort, and examination could not be done (examinations that were not completed due to patient intolerance were counted as score 1). After all examinations were completed, patients were asked to revise their scores for each examination. We also interviewed patients with score 3 and below about the reasons for their discomfort.

Image interpretation

All images were read by two radiologists with 14 and 7 years of experience in reading lumbar spine MRI imaging respectively. All parameters were measured using a workstation (AW, version 4.6, GE Medical Systems). The MRI images were sent to the radiologists with the types of examinations and details of the participants’ information withheld.

When measuring quantitative parameters, the images are magnified to the appropriate extent to ensure the accuracy. All quantitative parameters were measured three times by each reader and averaged, with each measurement taken 3 days apart. The final quantitative data used was the average of the values measured by the two readers. If the quantitative values difference were more than 10% or the results of the qualitative indicators are inconsistent, an agreement will be reached after consultation by two readers.

Statistical analysis

The statistical analyses were performed using the software SPSS 25.0 (IBM Corp., Armonk, NY, USA). The cohort was characterized using means and standard deviations to describe continuous variables and proportions to describe categorical variables. Differences in LA, DSCA, DH and LFT in 4 examinations were compared using repeated measures analysis of variance (ANOVA), and Bonferroni post hoc correction was used to analyze which of the pairways comparisons from an ANOVA are significant. Image quality was compared using the Friedman test followed by the Dunn test for multiple comparisons. Inter-observer and intra-observer reliability for the quantitative parameters assessment were calculated by the intraclass correlation coefficient (ICC). Absolute agreement, two-way random effects, and single measure models were adopted. 95% CIs were calculated with bootstrapping. Statistical significance was defined as p < 0.05. The ICC values <0.4, between 0.4 and 0.54, between 0.55 and 0.69, between 0.70 and 0.84, and exceeding 0.85 represented poor, weak, moderate, good, and excellent agreement, respectively [17].

Result

Study population

A total of 40 symptomatic patients were included in the study. The baseline characteristics and clinical symptoms are summarized in Table 2. All 40 patients successfully completed rMRI. 3 patients were unable to tolerate eMRI, but they all completed r+alMRI. 1 patient was unable to tolerate r+alMRI as well as e+alMRI, and 4 patients were unable to tolerate e+alMRI. 35 patients finally completed all 4 examinations (Fig. 3).

Table 2 Baseline characteristics and clinical symptoms of the patients
Fig. 3
figure 3

Study flowchart for patients undergoing 4 examinations. rMRI, psoas-relaxed position MRI; eMRI, extended position MRI; r+alMRI, the psoas-relaxed position axial loading MRI; e+alMRI, the extended position axial loading MRI

Comparison of quantitative parameters

The mean value of DSCA, SVCD, DH, LFT and LA for 40 patients are listed in Table 3.

Table 3 qualitative parameters among 4 examinations

The DSCA values among the 4 examinations are shown in Fig. 4A. Compared to rMRI, a statistically significant reduction in DSCA was found only at the L4-5 level (p = 0.033) in eMRI, but not at the L3-4 and L5-S1. However, a statistically significant decrease was observed at all levels (L3-4, p = 0.041; L5-S1, p = 0.005; L4-5, p = 0.002) in r+alMRI, with a further decrease at all levels in e+alMRI (all p < 0.001) (Fig. 5).

Fig. 4
figure 4

Comparison of DSCA(A), SVCD (B), DH (C), LFT (D) at L3-L4, L4-L5, L5-S1 and LA (E) of L1-S1. Data are the means ± SD. *p < 0.05, ** p < 0.01. DSCA, dural-sac cross-sectional area; SVCD, sagittal vertebral canal diameter; DH, disc height; LFT, ligamentum flavum thickness; LA, lordosis angle; FRFSE, fast recovery fast spin-echo. rMRI, psoas-relaxed position MRI; eMRI, extended position MRI; r+alMRI, the psoas-relaxed position axial loading MRI; e+alMRI, the extended position axial loading MRI

Fig. 5
figure 5

A 34-year-old man. Sagittal T2-weighted fast recovery fast spin-echo (FRFSE) (A, B, C, D) and axial T2-weighted FRFSE of (E, F, G, H) of rMRL (A, E), eMRI (B, F), r+alMRI (C, G) and e+alMRI (D, H). Relative to rMRI, spinal stenosis in L4-5 aggravated progressively in the other three examinations. DSCA was 115.3mm2, 86.3 mm2, 73.2 mm2 and 63.1 mm2 in rMRI, eMRI, r+alMRI and e+alMRI, respectively. rMRI, psoas-relaxed position MRI; eMRI, extended position MRI; r+alMRI, the psoas-relaxed position axial loading MRI; e+alMRI, the extended position axial loading MRI; DSCA, dural-sac cross-sectional area

The SVCD and DH values among the 4 examinations are shown in Fig. 4B and C. Compared to rMRI, eMRI did not show statistically significant difference at all levels, r+alMRI showed statistically significant reductions at the L3-4 (p = 0.018) and L4-5 for SVCD (p = 0.011) and all levels for DH (L3-4, p = 0.027; L5-S1, p = 0.001; L4-5, p < 0.001), while e+alMRI showed statistically significant reductions at all levels for SVCD and DH (p < 0.001).

The LFT values among the 4 examinations are shown in Fig. 4D and had a statistically significant increase in the extension position compared to the psoas-relaxed position (p < 0.001). The LFT was thicker in eMRI compared to r+alMRI at all levels (L3-4, p = 0.038; L5-S1, p = 0.012; L4-5, p = 0.002).

The LA in extended position had a statistically significant increase compared to the psoas-relaxed position with or without axial loading (Fig. 4E, p < 0.001).

The inter-observer ICCs for DSCA, SVCD, DH, LFT and LA were 0.950 (95% CI 0.894 to 0.969), 0.911 (95% CI 0.822 to 0.958), 0.898 (95% CI 0.831 to 0.964), 0.739 (95% CI 0.567 to 0.819) and 0.884 (95% CI 0.819 to 0.959), respectively, suggesting good to excellent reliability.

The intra-observer ICCs for DSCA, SVCD, DH, LFT and LA were 0.987 (95% CI 0.935 to 0.994), 0.960 (95% CI 0.909 to 0.986), 0.921 (95% CI 0.853 to 0.962), 0.833 (95% CI 0.682 to 0.895) and 0.915 (95% CI 0.841 to 0.948), suggesting good to excellent reliability. The full reliability analysis are presented in Table 4.

Table 4 Inter- and intra-observer ICCs for quantitative parameters between examinations

Image quality assessment

The scores of image quality assessment for the 4 examinations are listed in Table 5, and the mean scores for the 4 examinations were 4.88±0.00,4.76±0.01, 4.79±0.01 and 4.69±0.17, respectively. There were with no statistical difference between the image quality of the 4 examinations (p = 0.124).

Table 5 Image quality assessment among 4 examinations, no. (%)

Examinee comfort assessment

The scores of examinee comfort assessment for the 4 examinations are listed in Table 6, and the mean scores for the 4 examinations were 4.33±0.69,3.90±1.26, 3.98±1.05 and 3.65±1.39, respectively. Compared to rMRI, examinee comfort scores were decreased in both eMRI and r+alMRI, and with a statistically significant decrease in e+alMRI (p = 0.004).

Table 6 Examinee comfort assessment among 4 examinations, no. (%)

All patients with scores of 1or 2 were due to increased pain or numbness in the low back and legs after extension or axial loading of the lumbar spine. Patients with scores of 3 had the above reasons besides the complaints of MRI noise during the examination or/and mild claustrophobia in 3 patients. No patients complained of discomfort from the axial loading device.

Disscusion

The mechanism of the previous axial loading device, which applies pressure through the shoulders and feet, required that the lumbar spine must be in the extended position during the examination to ensure stability. Therefore, it is not possible to distinguish the effects between axial loading and extension on LSS. In previous studies, we have demonstrated that the new device provides more diagnostic information for patients with spinal stenosis and has good stability and repeatability [11, 12]. An advantage of the new device is to allow for a kinetic examination in different positions of lumbar spine such as lumbar spine extended or psoas-relaxed position. In this study, we found that both axial loading and positions affected LSS, however, the pattern of influence was different.

The DSCA was the most accurate parameter for evaluating LSS and reflected the severity of LSS directly [18, 19]. A significant reduction in DSCA after axial loading was considered a meaningful change, and previous studies have found that patients with this sign had more severe clinical symptoms [20, 21]. Compared to rMRI, both eMRI and r+alMRI showed a decrease in DSCA, while r+alMRI showed a greater decrease than eMRI. During eMRI, only L4-5 showed a statistically significant decrease in DSCA, which was found at all 3 levels during r+alMRI. It suggests that axial loading has a greater effect on LSS compared to extended position. Meanwhile, e+alMRI further aggravated the severity of LSS.

In the DH and SVCD, only alMRI showed a statistically significant decrease compared to rMRI. These two parameters represented flattening and posterior herniation or bulging of the lumbar disc after compression [22,23,24]. The decrease in DH and SVCD explained that alMRI affected the morphology of the lumbar intervertebral disc by axial compression.

This study showed a statistically significant increase in LFT only in the extended position rather than in axial loading. We speculated that the main cause of the increased LFT was the buckling of the ligamentum flavum due to extension. This speculation was consistent with other studies, which suggested that the increase in LFT was mainly due to in-folding and bending of the ligamentum flavum into the spinal canal [25, 26].

Of the 4 examinations, e+alMRI showed the greatest variation in all quantitative parameters. Compared with rMRI, the DSCA, SVCD, DH and LFT showed statistically significant changes at all three intervertebral levels. These results were consistent with studies using Dynawell, and these changes in quantitative parameters may led to measurable advancement in the diagnosis of LSS [5, 27]. These changes may potentially change the strategy for the next step in treatment. In a study, after three neurosurgeons read the alMRI images and comprehensively analyzed the patients' condition, treatment was changed from conservative treatment to decompression surgery for five of the 20 patients [28].

However, an important issue with e+alMRI is the comfort of the patient with the examination. There was a statistically significant reduction in patient comfort scores with e+alMRI compared to rMRI. In this study, of the consecutive 40 patients, 5 could not tolerate e+alMRI. Even for eMRI, 3/40 will patients could not tolerate the examination, while for r+alMRI, only 1/40 could not tolerate it. All three patients who could not tolerate eMRI complained of significantly increased low back pain during the examination. We hypothesized that the reason for the increased low back pain might be related to the increased pressure on the posterior lumbar spine. In eMRI, the LFTs at all three intervertebral levels were thicker than those in alMRI, suggesting that the posterior lumbar compression was more pronounced. Some studies have demonstrated that degeneration and inflammation of the posterior lumbar joints are an important reason for low back pain [29, 30]. In a study of lumbar spine extended position MRI, 2/44 patients were unable to tolerate the lumbar spine extended position [10]. Some studies using Dynawell have mentioned that patients might not tolerate the examination, but no studies have yet evaluated examinee comfort [5, 31]. Interestingly, some patients who felt discomfort were more concerned with the discomfort of the MRI itself rather than the axial loading device or extension, such as noise and claustrophobia [32, 33].

There might be safety and efficiency issues during the examination using Dynawell. If patients could not tolerate, the technician need to take time to return to the examination bed and adjust a manual rotating knob to release or adjust the pressure [16, 34,35,36,37,38]. These problems are resolved with our new device which adopts a pneumatic mode that allows pressure release and regulation remotely.

This study had some limitations. First, some patients may not achieve maximum extension of the lumbar spine during the examination. Although we used a thick cushion with memory foam to ensure that it adapts as much as possible to the different patients' lumbar spine. However, the conclusions drawn from the study were still persuasive. Second, for patients with thick back fat, the cushion could cause a distinct pressure mark recognizable resulting in the readers not being fully blinded to the examinations. Third, we only compared some basic quantitative parameters but lacked some semi-quantitative and qualitative data such as foraminal stenosis, disc herniation.

In conclusion, both axial loading and extension of lumbar spine could exacerbate LSS, but the mechanisms may be different. In patients with suspected LSS, e+alMRI can provide more information for diagnosis, and in patients who cannot tolerate e+alMRI, r+alMRI may be a good choice.