Introduction

Kawasaki disease (KD) is an acute self-limiting vasculitis mostly seen in children younger than 5 years of age. It predominantly affects medium-sized vessels with particular possible involvement of the coronary arteries. In high-income countries, KD is the leading cause of pediatric-acquired heart disease [1]. If untreated, coronary artery lesions (CALs) develop in about 15–25% of patients and may lead to severe complications such as ischemic heart disease and sudden death during either the acute or chronic phases of the disease [2]. The state-of-the-art treatment is high-dose intravenous immunoglobulin (IVIG) together with acetylsalicylic acid (ASA). Intravenous immunoglobulin should be instituted as early as possible within the first 10 days of illness onset, characterized by a high-grade fever. However, 10–15% of patients experience persistent or recurrent fever after completion of the initial IVIG administration and are thus classified as unresponsive to IVIG treatment [3]. Many studies have shown that this subgroup of patients is at increased risk of developing coronary artery abnormalities [4, 5].

Since the incidence of KD is 10 to 20 times higher in North-East Asian countries (especially Japan, Korea, and Taiwan) than in the USA and Europe [6], the largest cohort of patients and the longest surveillance reports come from Japan. Genetic susceptibility to KD is known to affect incidence, responsiveness to conventional treatment with IVIG, and coronary involvement. Indeed, the incidence among Japanese descendants living in Hawaii is similar to that reported for children from Japan, while the incidence among Caucasian Hawaiian children is comparable with that reported for white children in the continental USA [7]. However, findings regarding the associations between genetic background and distinct aspects of the disease found in Asian children have not been demonstrated in patients of other races [8,9,10].

The identification of patients at high risk for IVIG resistance at the time of presentation can allow us to identify those who may benefit from more intensive monitoring of their condition and who may require treatment modulation during the acute phase, with the potential addition of other anti-inflammatory agents to the conventional IVIG treatment. With this intention, risk-scoring algorithms have been developed by Asian authors, which take into account demographic (age at presentation), clinical (days of illness, presence of lymphadenopathy), and laboratoristic variables (% of neutrophils, platelet count (PLT), hemoglobin (Hb), aspartate aminotransferase (AST), alanine aminotransferase (ALT), albumin, sodium (Na), C-reactive protein (CRP), lactate dehydrogenase (LDH), and total bilirubin as risk factors in their populations. The Kobayashi (KS) [11], Egami (ES) [12], and Sano [13] scores are, to date, the most widely known and used. More recently, another score has been developed for a population of Taiwanese children [14].

Interestingly, a number of studies aimed at evaluating the use of Japanese scoring systems in populations of mixed ethnicity failed to demonstrate real effectiveness in predicting IVIG resistance and coronary involvement in these non-Japanese populations. Tremoulet et al. evaluated the ES with 362 children in San Diego County diagnosed with KD in 2006. When applied to their population, the ES missed more than 60% of IVIG-resistant patients. The authors developed their own scoring system, based on days of illness ≤ 4, age-adjusted hemoglobin, % neutrophil bands, and gamma glutamyl-transferase, which appeared to have higher sensitivity (73% vs 38% using the ES) for detecting IVIG resistance and was more sensitive among Caucasian children, compared to other ethnicities [15].

Similar findings were reported by Sleeper et al. who evaluated the KS, ES, and Sano scores on North American children of mixed ethnicity using the dataset from the Pediatric Heart Network’s randomized, placebo-blind trial of pulsed corticosteroids for primary therapy for KD. The sensitivity of the abovementioned scores in predicting IVIG resistance was 33%, 42%, and 40%, respectively, with, at most, modest associations with CA z-scores as well [16]. Moreover, no differential treatment effect of steroids was found for subjects from this population classified as high versus low risk for IVIG resistance on the basis of these 3 scores, as opposed to the results of the RAISE study performed in a Japanese population which showed significant reduction in the development of CALs in patients with KD considered to be at risk for IVIG resistance according to the KS [17].

Previous studies aimed at applying different risk scores in non-Japanese populations failed to show a good effectiveness in predicting IVIG resistance and coronary involvement [18,19,20].

The purpose of this study was to evaluate the effectiveness of three different risk scoring systems from Japan and Formosa in order to identify subjects at risk for IVIG resistance and thus for the development of CALs in our cohort of Italian children with KD.

Materials and methods

The present multicenter retrospective study included all children diagnosed with KD between 2000 and 2015 in 13 Pediatric Units in Emilia-Romagna, a northern region of Italy. All KD diagnoses were made in accordance with the 2004 American Heart Association (AHA) Guidelines [21], distinguishing complete and incomplete/atypical forms of clinical presentation. The onset of illness was defined as the first day of fever. All patients were treated, in accordance with the 2004 AHA Guidelines, with IVIG at 2 g/kg in a single infusion before the tenth day of fever, together with aspirin at 80–100 mg/kg/day, subsequently switched to 3–5 mg/kg/day once the patient had been afebrile for at least 48 h. Intravenous immunoglobulin resistance was defined as persistent/recrudescent fever for at least 36 h but for no longer than 7 days after the completion of the first IVIG infusion. Treatment was defined as late when the first dose of IVIG was given after the 10th day of fever.

Echocardiography was performed in all children. Diagnosis of CALs was based on the clinical judgment of the reporting physician; the left anterior descending artery and right coronary artery were measured. Two criteria were applied depending on the definition used by each participating center. The first follows the definition of the Japanese Ministry of Health which classifies coronary arteries as abnormal if the internal lumen diameter is > 3 mm in children < 5 years old or > 4 mm in children ≥ 5 years old and if the internal diameter of a segment measures ≥ 1.5 times that of an adjacent segment. The second criteria defines CALs as a coronary z-score ≥ 2.5 [21]. Coronary artery lesions were defined if an artery exceeded the Japanese criteria or the z-scores ≥ 2.5.

A database was prospectively created and then retrospectively reviewed. It included demographic features and clinical characteristics, total days of fever, IVIG responsiveness, laboratory values (white blood cells (WBC), neutrophil and lymphocyte percentage, RBC, Hb, PLT, CRP, erythrocyte sedimentation rate (ESR), serum albumin, AST, ALT, Na) of the acute (from onset to the 10th day) stage of the disease. Patients who received IVIG after the 10th day of fever and those who were not treated with IVIG were excluded from the data analysis.

We applied the KS, ES, and Formosa (FS) risk score systems [11, 12, 14] (Table 1) to our cohort. The KS assesses the parameters days of fever, age ≤ 12 months, Na, AST, neutrophil percentage, CRP, and PLT; the ES assesses the parameters age < 6 months, days of fever, PLT, ALT, and CRP; and the FS assesses lymphadenopathy, neutrophil percentage, and albumin levels. A cut-off point of ≥ 4, ≥ 3, and ≥ 3, respectively, identifies patients at high risk of IVIG resistance in the abovementioned scoring systems. Table 1 shows published KS, ES, and FS data for IVIG resistance.

Table 1 Published risk scoring systems for IVIG resistance for Asian subjects Kobayashi risk score

Statistical analysis

Data were expressed as the mean ± standard deviation (SD) for continuous variables, and median with range or number with percentage, as appropriate, for categorical variables. Student’s t test and Mann-Whitney U tests for continuous data and χ2 test for categorical data were used to compare variables between groups.

We divided KD patients into two groups (IVIG responders and non-responders) according to their response to IVIG. We subsequently compared demographic and clinical characteristics, presence of CALs, and laboratory findings from the acute and subacute phases between the two groups.

Sensitivity and specificity of the predictor cut-off for the positive score in KS, ES, and FS were calculated to predict IVIG unresponsiveness and the development of CALs. A ROC curve was developed to see the effectiveness of the overall score. The test was considered to be effective when the area under the ROC curve was higher than 0.5.

P < 0.05 was considered to be statistically significant.

The study analysis was performed using SPSS for Windows.

Results

Diagnosis and patient characteristics

A total of 304 consecutive children diagnosed with KD were enrolled in this study. Ten children were excluded due to incomplete data, 26 due to late treatment, and 11 because they were not treated. Of the 257 patients included, 214 (83.3%) were IVIG responders and 43 (16.7%) were IVIG non-responders.

Baseline characteristics of IVIG responders and non-responders are shown and compared in Table 2. Race was not significantly different between the two groups: Caucasian was the most common ethnicity (91.1% of IVIG responders and 83.7% of non-responders), followed by Asian (4.2% and 9.3%, respectively). There was a higher incidence of CALs in IVIG non-responders than in responders (p = 0.01). Seasonality and total days of fever were significantly different between the two groups (p = 0.03 and p < 0.0001, respectively), while age at diagnosis, gender, and clinical presentation were not.

Table 2 Comparison of demographic and clinical characteristics between IVIG-resistant and IVIG-responsive KD patients

Comparison of laboratory values at the time of diagnosis showed that IVIG non-responders had higher neutrophil percentages (p = 0.008) and AST (p = 0.02), and lower lymphocyte percentages (p = 0.008). No other significant differences were found among the laboratory parameters (Table 3).

Table 3 Comparison of laboratory values if the acute phase between IVIG-resistant and IVIG-responsive KD patients

Complete data were available to calculate KS, ES, and FS risk scores for 145, 188, and 113 subjects, respectively.

Kobayashi, Egami, and Formosa scoring systems and IVIG resistance

Sensitivity and specificity of the KS, ES, and FS are shown in Table 4. ROC curves are shown in Fig. 1.

Table 4 Sensitivity and specificity of Egami, Kobayashi, and Formosa scoring systems
Fig. 1
figure 1

ROC curve for KS, ES, and FS in an Italian population. AUC stands for area under the curve

Sixty-one children/145 (42%) had a positive KS (score ≥ 4), and 25/145 (41%) were IVIG non-responders. Sixteen children with a positive KS (26.2%) were IVIG non-responders. The sensitivity of the score was 64.0% and the specificity 62.5%. The negative predictive value was 89.3%, and the positive predictive value 26.2%. The area under the ROC curve was 0.4.

When applied to the subgroup of seven Asian patients, the KS had a sensitivity of 100% and a specificity of 75%.

Forty-eight patients out of 188 (25.5%) had a positive ES (score ≥ 3) and 29 out of 188 (15.4%) were IVIG non-responders. Twelve children with a positive ES were IVIG non-responders. Sensitivity was 41.4% and specificity was 77.4%. The area under the ROC curve was 0.32.

Sixty-six out of 113 patients (58.4%) had a positive FS (score ≥ 3) and 89 out of 113 patients (78.7%) were IVIG non-responders. Seventeen patients with a positive FS were IVIG non-responders. The sensitivity of the score was 70.8%, and the specificity was 44.9%. The area under the ROC curve was 0.32.

Kobayashi, Egami, and Formosa scoring systems and coronary anomalies

The predictive value of the KS, ES, and FS for CALs was poor. Results are shown in Table 4.

The KS and ES had a sensitivity of 41.9% and 38.5% and a specificity of 57.9% and 77.9%, respectively. Negative and positive predictive values were even lower than those calculated to identify IVIG non-responders. The FS showed a sensitivity of 72.0% and a specificity of 45.5%, a positive predictive value of 27.3%, and a negative predictive value of 85.1%.

Discussion

Our study evaluated the effectiveness of three different Asian scoring systems in predicting IVIG resistance and coronary lesions in the same Mediterranean population.

Since IVIG unresponsiveness is associated with increased risk for coronary involvement, it is mandatory to identify children at risk as soon after diagnosis as possible in order to adapt adjunctive anti-inflammatory treatment. Widely used risk scoring systems for predicting IVIG non-responsiveness come from studies in Asian populations, and they are based on demographic, clinical, and laboratoristic parameters. In these populations, these scores were effective for identifying KD patients who could benefit from additional anti-inflammatory therapy to reduce CALs.

The data presented show that the KS, ES, and FS systems were poor predictors of IVIG resistance in our cohort of Italian children with KD; thus, they represent unhelpful tools for identifying those patients who could benefit from adjunctive anti-inflammatory therapy during the acute phase. Moreover, our results also demonstrate that none of the considered scores were predictive for the development of CALs. As previously demonstrated, our findings show that IVIG non-responders have a higher incidence of CALs and longer fever duration. Gender, age at diagnosis, and clinical presentation were not significantly different between the two groups in our cohort, while the percentage of neutrophils and AST were significantly higher and the percentage of lymphocytes lower among the IVIG responders.

In Japan, the RAISE study demonstrated that in patients with a positive KS, adjunctive treatment with steroids is effective in reducing coronary lesions, and therefore cardio-vascular morbidity [17]. More recently, in patients at risk according to the ES, first-line treatment with intravenous steroids plus IVIG was shown to be effective in preventing coronary involvement [22].

Very recently, a study in Germany failed to prove a good prognostic value of the KS, ES, and Sano scores in a primarily Caucasian population because of poor sensitivity, and only the Sano score had a positive likelihood ratio for CALs 4 weeks after the onset of the disease [23]. In 2011, the KS, ES, and Sano scores were shown to have good specificity, but low sensitivity, for refractory KD and only the KS could significantly predict coronary artery involvement 5 weeks after the onset of KD in a Northern American Cohort [16]. Previous studies in non-Japanese populations from North America [15, 16], the US Midwest [19], Spain [18], and The UK [20] applied a single risk score and showed low sensitivity and specificity in predicting unresponsiveness to conventional treatment. Thus, so far, Asian scores have been demonstrated to be neither sufficiently sensitive nor specific in identifying high-risk patients in multi-ethnic non-Asian populations. Highlighting the crucial role played by ethnicity in terms of response to conventional and adjunctive treatments, a recent meta-analysis failed to demonstrate, in non-Asian patients, a beneficial effect of corticosteroids as rescue therapy in IVIG non-responders in reducing the incidence of coronary abnormalities [24], while it was shown to be effective in Asian children.

In our study, we were able to apply 3 score systems in the same population. Our findings show that in Italian children, the FS had the best sensitivity (slightly under 80%) and the lowest specificity, while the ES had the best specificity but the lowest sensitivity for IVIG resistance. However, ROC curves for the 3 scores were less than 0.5 demonstrating that all 3 scores were not useful for identifying IVIG resistance among the KD patients in our mostly Caucasian population. Furthermore, the considered scores were also ineffective for predicting coronary involvement.

The ethnicity of our cohort is similar to the German population described by Jakob et al. [23] and the Spanish cohort described by Sanchez-Manubens et al. [18], in which Caucasian children were the most highly represented (91.1% in our cohort, 95% in the German cohort, and 81.1% in the Spanish cohort). Both the German and Spanish authors also reported the failure of the different Asian scores in their cohorts.

The development of clinical and laboratory scores to predict IVIG resistance and coronary lesions in European populations is mandatory in order to permit early recognition and treatment with additional anti-inflammatory agents in children at increased risk. This would potentially prevent or, at least, limit coronary involvement and subsequent cardiovascular morbidity.

Limitations of this study include its retrospective nature and its sample size. In contrast, we were able to apply three different Asian Scores in the same Cohort of Italian children.

In conclusion, the KS, ES, and FS systems are not effective screening tools for predicting IVIG unresponsiveness or CALs among Italian patients with KD. This may be due the fact that our patients are mostly Caucasian, thus presenting a different genetic predisposition to the disease and, possibly, clinical and laboratoristic characteristics. Further multicenter studies are necessary in order to find a specific score for non-Asian populations which will enable the early identification of patients at increased risk for coronary involvement who could benefit from adjunctive anti-inflammatory treatment in addition to IVIG.