Introduction

Immunoglobulin light-chain (AL) amyloidosis is a plasma cell dyscrasia that is characterized by the production of monoclonal free light chains (FLCs) and misfolded amyloidogenic proteins [1]. The estimated incidence rate was 10 cases per million population with an increase in disease prevalence of 12% per year [2, 3]. All organs except for the central nervous system may be affected and up to 57%–70% of AL amyloidosis patients had renal involvement [4, 5]. Nephrotic range proteinuria is the major clinical presentation of renal AL amyloidosis patients, which may lead to end-stage renal disease (ESRD) eventually. According to previous studies, the mortality of AL amyloidosis patients with advanced cardiac involvement remained high (around 40%) in 3 months from treatment [6]. But the decreased estimated glomerular filtration rate (eGFR) only appeared in the stage of irreversible damage of amyloid protein to the kidney. Therefore, patients often face a great risk of death in the early stages of AL amyloidosis, leading to insufficient understanding of renal adverse outcomes. With the progress of treatment regimens, the outcome of AL amyloidosis has improved. However, there are still some patients who experience renal adverse outcomes despite achieving hematologic very good partial response (VGPR) or better [7, 8]. The reason for this phenomenon may be related to the deposition status of amyloid in renal tissue [9]. Nonetheless, the renal pathological features of AL amyloidosis patients were often overlooked.

In 2017, a novel scoring tool that quantifies the degree of amyloid deposition in the kidney was proposed by Rubinstein et al. and the amyloid score (AS) was found to be associated with progression to ESRD by a 12-month landmark analysis in a case–control study of 39 patients [10], and the correlation between AS and adverse renal outcomes was validated in another independent cohort [11]. Nevertheless, the definitive effects of AS when taking into other clinical factors and its suitability needed to be tested. In addition, the current staging systems for renal outcome in renal AL amyloidosis patients were developed without including renal pathological features [12, 13], which may ignore the value of renal pathology for guiding treatment and offering prognostic information.

Therefore, the objective of this study was to elucidate the precise significance of that pathological scoring tool in conjunction with other clinical factors and further explore a new renal prognostic staging system including pathological indicators for renal AL amyloidosis patients, thereby providing references for avoiding bad outcomes.

Methods

Patients

This study included AL amyloidosis patients with renal involvement who were newly diagnosed by renal biopsy at Xijing Hospital from January 1, 2017, to December 31, 2022. The involvement of organs was assessed according to the 10th International Symposium on Amyloid and Amyloidosis, and renal involvement was defined as 24-h urine total protein (24UTP) excretion more than 500 mg/day [14]. Patients were recruited if they had (1) symptoms of renal involvement; (2) evidence of monoclonal protein; (3) Congo red-positive fibril deposition on renal biopsy or confirmed as AL amyloidosis by mass spectrometry; (4) not been treated with anti-plasma regimens. Exclusion criteria were as follows: (1) secondary to multiple myeloma according to the criteria by the International Myeloma Working Group [15], (2) lack of renal pathological information or evidence of renal pathology indicating the presence of alternative forms of nephropathy, (3) eGFR < 15 ml/min/1.73m2 at the confirmation of AL amyloidosis, (4) follow-up time less than 3 months unless they met the endpoint. This study was authorized by Xijing Hospital's ethics committees and review board, and informed consent was not required because of the nature of the retrospective study.

Data collection

Patients’ clinical characteristics, renal pathological features, and treatment responses were extracted from the amyloidosis-specialized database. Clinical characteristics, such as demographic and laboratory examination data, were based on the results at the time of renal biopsy. All patients were regularly followed up in the amyloidosis specialist outpatient clinic at Xijing Hospital. Two experienced pathologists independently reviewed and evaluated the original renal biopsy slides of patients. Discrepancies in assessment results were resolved by discussion. If they cannot reach a unified conclusion on the pathological results, we will seek another senior pathologist to make a final judgment. To assess the effect of early treatment on the prognosis of patients, we also collected the change of the difference between involved and uninvolved free light chain (dFLC) and 24UTP at the time of two cycles of treatment.

Definition

The endpoint of this study was the occurrence of kidney events, which were ESRD or renal progression. eGFR less than 15 ml/min/1.73m2, requiring long-term dialysis, or kidney transplantation were defined as ESRD. Progression in renal function is defined as eGFR decreased by more than 25% [16]. Time to kidney events was defined from the date of renal biopsy until ESRD or renal progression, whichever occurred first. Patients were censored if they died without kidney events at the date of death or the last follow-up until May 31, 2023. The renal pathological characters of AL amyloidosis patients was evaluated according to the AS proposed by Rubinstein et al. AS is the sum of AL amyloid deposition scores at mesangial, capillary, interstitial and vascular, which was scored on a semiquantitative scale of 0 to 3 (0 = absent, 1 ≤ 25%, 2 = 25–50% and 3 ≥ 50% of glomerular tuft) [10]. Hematologic and organ response was evaluated according to the criteria of 2023 National Comprehensive Cancer Network guidelines [16], in which the renal response (RR) is 24UTP drop more than 30% or below 0.5 g/24 h without renal progression. Patients who had kidney events within 3 months were classified as no RR. The hematologic response was defined as no response if the patient died within 3 months.

Statistical analysis

The continuous variables were presented by mean (standard deviation) or median (interquartile range), and the categorical variables were expressed as number (frequencies). The t test or the Mann–Whitney U test was performed to compare continuous variables, and the χ2 test or the Fisher exact test was applied for categorical variables. Death due to all-causes was set as competing events for adverse kidney events. The variables had statistically significant in univariate and multivariate competing risk models (Fine-Gray subdistribution hazard model) were identified as the independent risk factors. The optimal cut-point of variables was generated according to maximally selected rank statistics (MSRS). MSRS can be used for survival analysis where the sample data does not follow a normal distribution. It also has strong robustness in studies with a small sample size. The cumulative hazard curve was made to observe the probability of kidney events in renal AL amyloidosis patients over time, and the Log-rank test was utilized to assess the differences across groups. The Concordance index (C-index) and five-fold internal cross-validation were calculated to evaluate the model’s performance. Besides, the Decision Curve Analysis (DCA) was also performed to confirm the clinical benefits of different models. All statistical analyses were finished by R version 4.3.0 (R Project for Statistical Computing) and SPSS software package version 25.0 (IBM, Armonk, New York). Two-sided P values less than 0.05 were considered statistically significant. Specific R packages used were “cmprsk 2.2–11”, “survminer 0.4.9”, and “rms 6.7–0”.

Results

Baseline characteristics

A total of 74 patients were included in this study among 131 AL amyloidosis patients (Fig. 1). The baseline characteristics of enrolled patients are presented in Table 1. The mean age was 57.9 ± 9.7 years. 43 (58.1%) patients were male. 58 (78.4%) patients had lambda amyloidogenic light chains. All patients had renal involvement and 47 (63.5%) patients had cardiac involvement. At diagnosis, the median eGFR and 24UTP were 87 (66.9, 101) mL/min/1.73m2 and 3030 (1448, 4784) mg/24 h, respectively. According to Mayo 2012 staging system, 27 (36.5%), 20 (27.0%), 18 (24.3%) and 9 (12.2%) patients were in the I, II, III and IV stage. By comparison, patients who experienced kidney events had worse eGFR with higher cystatin c (CysC), n-terminal pro-brain natriuretic peptide (NT-proBNP), 24UTP and the ratio of 24UTP to eGFR. Meanwhile, they also had a greater proportion of cardiac involvement.

Fig. 1
figure 1

Flowchart of this research. ND AL amyloidosis newly diagnosed immunoglobulin light-chain amyloidosis; eGFR estimated glomerular filtration rate; 24UTP 24-h urine total protein; ESRD end-stage renal disease

Table 1 Baseline characteristics of enrolled patients

Pathological features

The varied degrees of renal pathology are also presented in Fig. 2. In our cohort, the median AS was 3 (2, 4) and patients who experienced kidney events had higher AS (P = 0.021). Mesangial was the area where amyloid deposition is most commonly found, with just 5 (6.8%) patients having no amyloid deposition there, and 33 (44.6%) patients had more than 2 + amyloid deposition. Besides, both vascular and interstitial were the common areas with mild amyloid deposition.

Fig. 2
figure 2

Pathological results of enrolled patients. Black arrows showed Congo Red depositions. Representative slides showing the different amyloid scores. A = 7 (200 x), B = 6 (400 x), C = 3 (Mesangial and interstitial, 200 x), D = 3 (Mesangial and vascular, 200 x), E = 1 (Mesangial, 400 x), F = 1 (No depositions was found in Mesangial, 400 x)

Clinical outcomes

At 3 months, 32 (43.2%) patients achieved hematologic VGPR or better. There was no difference among the patients with or without kidney events in terms of that. Meanwhile, RR was also achieved in thirty-two patients. Patients who did not experience kidney events were more likely to obtain RR (P = 0.025). Moreover, RR and hematologic VGPR or better were achieved simultaneously in 18 (24.3%) patients, and no difference observed between the two groups (Table 1). Following a median follow-up of 24.7 (11.9, 50.7) months, 23 (31.1) patients died due to disease progression and ten patients had kidney events (P < 0.01) (Table 1). Of the 16 (21.6%) patients who had kidney events, 11 patients progressed to ESRD and 5 patients had progression in renal function. According to the competing risk model, the rate of kidney events in 12-, 36- and 60-months were 17.9%, 21.3% and 24.1%, while the incidence of death were 11.5%, 18.3% and 21.2%, respectively (Supplemental Fig. 1).

Risk factors affecting kidney events

In order to simplify the competing risk model, dFLC, NT-proBNP and cardiac troponin t (cTnT) were dichotomized according to Mayo 2012 staging system, respectively. The Spearman analysis was also conducted to remove variables with correlation coefficients greater than 0.75. Then, AS, total bilirubin, CysC, eGFR, 24UTP, Cardiac involvement and RR were selected by univariate competing risk model. Finally, AS (HR = 1.63, 95%CI 1.03–2.59, P = 0.039) and eGFR (HR = 0.96, 95% CI 0.93–0.99, P = 0.012) were identified as independent risk factors affecting kidney events in multivariate competing risk model (Supplemental Table 1).

The optimal thresholds estimated for AS and eGFR were 2.0 and 61.93875 mL/min/1.73m2 according to MSRS, respectively. Then, all patients were divided into high-risk group or low-risk group based on those values, and the Kaplan–Meier curve was made. By log-rank test, we can find higher AS (Supplemental Fig. 2A), and lower eGFR (Supplemental Fig. 2B) had more chances of kidney events.

The development and validation of the novel renal staging system

A key eGFR threshold of 60 mL/min/1.73m2 and a key threshold of 2 for AS were set to establish the renal staging system, leading to the categorization of all patients into three groups. Additionally, patients were stratified into three groups based on previous research findings [11, 12], with the outcomes summarized in Table 2. As we can see, all renal staging systems were effective in distinguishing patients with different risks for adverse kidney events. The renal staging systems formulated by Palladini et al. and our study had more power in identifying the patients with the greater probability of kidney events, while the renal staging systems proposed by Kastritis et al. and our study had a better performance in identifying patients with stable renal function.

Table 2 Rate of kidney outcomes according to different staging systems

Meanwhile, the cumulative hazard curves based on different renal staging systems are presented (Fig. 3A–C) and indicated the great performance of them. The C-index of renal staging systems proposed by Palladini et al. and Kastritis et al. were 0.60 (95% CI, 0.47–0.74) and 0.757 (95% CI, 0.67–0.84), respectively. The renal staging system formulated by our study had a greater C-index of 0.81 (95% CI, 0.73–0.89), which meant the greater performance than both of them (P < 0.01). In addition, the five-fold internal cross-validation also indicated the stable C-index of renal staging system formulated by our study at 12-, 36- and 60-months (Supplemental Fig. 3). Finally, the DCA was performed to confirm the great clinical decision benefits of our renal staging system (Fig. 3D–F). At 12-months, the renal staging system proposed by Palladini et al. was most limited, while the renal staging system formulated by our study had the greatest benefit over other staging systems at 60-months.

Fig. 3
figure 3

The cumulative hazard curves and clinical decision curves based on different staging systems. AS amyloid score; eGFR estimated glomerular filtration rate; 24UTP 24-h urine total protein. A, B and C are cumulative hazard curves of different staging systems; D, E and F are clinical decision curves of different staging systems. Staging system A I was defined as eGFR ≥ 50 mL/min/1.73m2 and 24UTP < 5000 mg; Staging system A II was defined as eGFR < 50 mL/min/1.73m2 or 24UTP ≥ 5000 mg; Staging system A III was defined as eGFR < 50 mL/min/1.73m2 and 24UTP ≥ 5000 mg; Staging system B I was defined as 24UTP/eGFR < 30 mg/mL/min/1.73m2; Staging system B II was defined as 24UTP/eGFR among 30-99 mg/mL/min/1.73m2; Staging system B III was defined as 24UTP/eGFR ≥ 100 mg/mL/min/1.73m2; Staging system C I was defined as AS ≤ 2 and eGFR ≥ 60 mL/min/1.73m2; Staging system C II was defined as AS > 2 or eGFR < 60 mL/min/1.73m2; Staging system C III was defined as AS > 2 and eGFR < 60 mL/min/1.73 m2

Discussion

To the best of our knowledge, this study is the first to explore the key risk factors for adverse kidney events from renal pathological characteristics, baseline clinical indicators and early treatment responses in renal AL amyloidosis patients. By competing risk models, eGFR and AS were identified as independent risk factors for adverse kidney events. And the AS was validated in Asian renal AL amyloidosis patients. Furthermore, a novel renal staging system was developed by incorporating renal pathological characteristics, which was proven its great performance.

As the gold standard for assessing renal function, the eGFR was also identified as an independent risk factor for adverse kidney events. In individuals with renal involvement, the decreased eGFR appeared in the advanced phase of irreversible damage of FLCs to kidney and was associated with higher AS [11]. Besides, the decreased eGFR seems to be related to prerenal kidney injury brought on by cardiac involvement [17]. In our study, patients who experienced kidney events had a relatively longer delay in diagnosis, more cardiac involvement, and higher NT-proBNP levels, which was consistent with the above ideas. Hence, identification of renal AL amyloidosis as soon as possible and mitigation of the renal burden resulting from cardiac involvement may offer insights into preventing unfavorable renal outcomes.

As a rare plasma cell disorder, prognostic studies of AL amyloidosis patients with renal involvement are lacking and the specialized pathological scoring tool for these patients had not been proposed until 2017. The scoring tool proposed by Rubinstein et al. indicated that AS is useful indicators for judging the progression to ESRD in AL amyloidosis within 12 months [10]. Furthermore, AS was associated with poor renal outcomes in another study [11]. However, the effectiveness of AS in Asian patients remained to be tested. In our research, AS was also firstly validated its prognostic effect on the occurrence of renal adverse events in Chinese patients. The values of AS in our research were lower than those in previous studies, and the optimal threshold of AS for predicting adverse kidney events was only 2. According to previous research, 37% of AL amyloidosis patients were diagnosed 1 year after the onset of symptoms and 69% of patients visited 3 or more physicians before the confirmation of AL amyloidosis [18]. Therefore, correctly identifying AL amyloidosis is frequently delayed. Actually, the median delay from symptom onset to kidney biopsy was only approximately six months in our cohort, which is considerably shorter than the time required in other international studies [19], suggesting a relatively timely diagnosis and less renal damage from amyloid protein. This was also validated by the fact that patients in our cohort had better renal function than in the original study which the median eGFR was only 61 mL/min/1.73 m2 [10]. Besides, the kidney being the most commonly affected organ rather than the heart in China [20, 21], the popularization of renal biopsy and referral center bias may be reasonable for this discrepancy too.

Most importantly, a novel renal staging system was formulated by incorporating AS and eGFR. These two indicators used have been recognized by international peers. The eGFR was calculated according to the Chronic Kidney Disease Epidemiology Collaboration equation, which was recommended in the KDIGO 2024 Clinical Practice Guideline for the Evaluation and Management of Chronic Kidney Disease [22]. As a pathological assessment tool for AL amyloidosis patients with renal involvement, AS has also been validated [11]. Therefore, we think our conclusions could be extrapolated to diverse ethnic or geographic groups. Compared with previous renal staging system based on 24UTP [12, 13], the staging system proposed by our study may prevent staging inaccuracies caused by fluctuations in 24UTP due to different test methods. By comparing C-index and DCA of our staging system and previous systems, our renal staging system proved its better performance in predicting the occurrence of ESRD or progression in renal function, helping clinical doctors make early medical decisions and taking intervention measures in advance. Besides, the threshold of eGFR in our staging system was consistent with the current criteria for the evaluation of chronic kidney disease, which further enhanced the clinical practice of our staging system.

Hematological VGPR or better has long been the therapeutic goal for patients with AL amyloidosis, but its applicability to patients with specific organ involvement has been challenged. In our cohort, hematologic VGPR or better and RR even to simultaneous RR and hematologic VGPR or better were not identified as significant factors in preventing adverse kidney events in renal AL amyloidosis patients, which underscored the damage due to amyloid deposition and scarring injury may be hard to reverse by current therapy. Nevertheless, there is currently no recommended regimen for AL patients with renal involvement. This phenomenon could potentially be attributed to the pre-existing multi-organ involvement in the majority of patients upon diagnosis. Furthermore, the organ response exhibits a considerably slower pace compared to the hematologic response, thereby prompting previous studies to refrain from designating it as the primary observation endpoint. While it is true that a more favorable hematologic response is associated with a more profound organ response [23, 24], it is imperative to acknowledge the existence of a subgroup of patients who do not derive any advantages from such hematologic response [7, 8, 25, 26]. Furthermore, renal pathology changes may persist despite hematologic improvement [9, 25, 26], implying ongoing renal damage. Consequently, it is imperative to promptly verify the effectiveness of novel therapies aimed at eliminating amyloid deposits in affected organs, such as CAEL-101 or NEOD001 [27, 28].

Meanwhile, our study also has some limitations. At first, due to the biases of single center retrospective studies and rarity of renal AL amyloidosis, the number of patients included in our study was relatively modest with no external validation. However, the reliability of our study was ensured by regular follow-up, strict data management, and internal cross-validation. In addition, as we mentioned above, our cohort had a relatively low degree of renal burden and the threshold for pathology score was derived to be lower than that of other studies. To obtain the most accurate threshold, a prospective study with a large cohort of patients from multi-center was taken into the agenda. Eventually, our study shifted its focus towards ESRD or renal progression, rather than solely on the occurrence of ESRD. As a result, our staging system may exhibit limited predictive performance compared to the previous staging system in forecasting ESRD.

Conclusion

Our study identified risk factors from renal pathological characteristics, baseline clinical indicators, and early treatment responses for the occurrence of adverse kidney events in renal AL amyloidosis patients. Specifically, we found that only AS and eGFR were independent risk factors. Furthermore, a novel renal staging system based on AS and eGFR was formulated, which may provide prognostic guidance for these patients.