Introduction

The optimal treatment for infant hydrocephalus is yet to be definitively determined [18]. Two main surgical options exist: endoscopic third ventriculostomy (ETV) and CSF shunt. Direct comparisons of the outcome of ETV and shunt are very few and limited, almost exclusively, to retrospective studies [913]. Based on the current, albeit limited, literature, a few conclusions can be drawn. Young age, especially under 1 year, is known to be a prognostic factor for ETV failure, but is likely also a prognostic factor for shunt failure [13]. From a patient and family perspective, failure of either procedure requires repeat surgery and is, perhaps, equally detrimental, although, in the long-term, parental worry about complications seems to be less after ETV [14, 15]. Neither treatment appears to show a clearly superior long-term health-related quality of life outcome [10, 1619]. Therefore, the debate regarding ETV versus shunt remains unresolved.

To help provide clarity, the International Infant Hydrocephalus Study (IIHS) was started in 2004, under the aegis of the International Federation of Neuroendoscopy (previously known as the International Study Group for Neuroendoscopy) [20]. The IIHS is an international, prospective, multicenter study that aimed to answer the question: in infants (<24 months old) with symptomatic triventricular hydrocephalus from aqueductal stenosis, does initial treatment with ETV result in superior or no worse outcome at 5 years of age compared to shunt? The primary outcome was the Health Utilities Index Score [17, 21] at 5 years of age. The population of infants (<24 months) with aqueductal stenosis is a unique group. Their young age makes them less-than-ideal candidates for ETV, but their etiology (pure obstructive hydrocephalus) is among the most favorable for ETV [22]. They are, therefore, an ideal population in which to compare ETV and shunt, since community equipoise appears to exist.

Recruitment for the IIHS ended in December 2013, and herein, we present the first set of analyses relating to treatment failure. The accrual of the 5-year outcome, which is the overall primary outcome of the IIHS, is continuing and that analyses will be presented in a future publication.

Methods

The IIHS was designed as a prospective comprehensive cohort study, which contained both a randomized and non-randomized arm [20, 23] See Appendix for further details regarding IIHS structure and personnel. Participating neurosurgical centers were all experienced in pediatric neuroendoscopy (≥10 neuroendoscopic procedures per year per surgeon and ≥2 ETV operations in infants per surgeon in total). Patients who met the following criteria were considered eligible for the study: <24 months of age at time of operation; symptomatic triventricular hydrocephalus (TVH) requiring first treatment; born at >36 weeks gestation; preoperative MRI showing aqueductal stenosis with no other major brain anomalies. Patients with a history of intraventricular hemorrhage (intrauterine or post-natal) or intracranial infection were included, unless this related to prematurity. Patients were excluded if they had the following: open spina bifida; Dandy Walker syndrome with vermian agenesis/dysgenesis; perinatal asphyxia; severe brain dysmorphic anatomical features; known chromosomal abnormality; or intracranial tumor. Eligibility criteria were independently adjudicated for all patients.

Intervention

Patients were allocated to intervention using a comprehensive cohort design. The primary mode of treatment allocation was randomization. If families consented to randomization, the intervention was determined by 1:1 randomization, stratified by center. If families did not consent to randomization, they were enrolled in the treatment arm of their choice. The ETV intervention consisted of a standard frontal burr hole and use of an endoscopic camera to visualize the floor of the third ventricle. A ventriculostomy was created using the surgeon’s own preferred method of perforation. At the surgeon’s discretion, a post-operative temporary external ventricular drain or reservoir was inserted. The ventriculoperitoneal shunt intervention involved creating a burr hole in the frontal or occipital regions and cannulating the ventricle with a silastic catheter. This was then attached to a valve mechanism of the surgeon’s choice and distal silastic tubing which ran subcutaneously to the peritoneal cavity. Prophylactic antibiotics were used.

Follow-up

Following the initial intervention (ETV or shunt), patients were regularly followed as per departmental and surgeon’s routine, but with scheduled visits at 1, 2, 3, and 5 years after surgery. Adverse events were documented. All data were collected prospectively. At enrolment, baseline clinical data were collected. Post-operative data were collected, including assessments of complications and treatment failures. As part of IIHS, patients underwent neurocognitive and imaging studies, but these data are still being collected and are not presented in this paper.

Treatment failure was defined as the need for any repeat intervention for definitive CSF diversion (including repeat ETV or shunt insertion/revision), as determined by the treating surgeon, following standard clinical practice, or death related to hydrocephalus. At treatment failure, the treating surgeon decided whether to repeat the original treatment (i.e., a repeat ETV for a patient in the ETV group or a shunt revision for a patient in the shunt group), or whether to “cross-over” (i.e., attempt an ETV for a shunted patient or insert a shunt for an ETV patient).

Sample size

The study was initially expected to enroll 182 randomized patients and was powered to detect a 0.10 difference in 5-year health status using the Health Utilities Index [17]. Study recruitment began as early as 2005 in some centers, with staggered entry of new participating centers thereafter. However, enrolment in the randomized arm was slower than anticipated. Therefore, recruitment was stopped in December 2013, at the recommendation of the Data Safety Monitoring Committee (DSMC) on the basis of futility of reaching the targeted randomized cohort sample size. The DSMC suggested that analysis be focused on treatment failure data, pooling together the randomized and non-randomized arms, while continuing to follow previously enrolled patients for their 5-year primary outcome. In this paper, we present the data on surgical treatment failure.

Analysis

For these analyses, the randomized and non-randomized arms were pooled to compare those who underwent ETV versus shunt as their first surgical intervention. Baseline data between these two groups were compared to determine imbalances in preoperative characteristics, using chi-squared or Mann-Whitney, as appropriate. Survival curves were constructed using the Kaplan-Meier method for time-to-first treatment failure and compared using log-rank test. The primary analysis of this paper was performed using Cox proportional hazards regression to compare time-to-first treatment failure of ETV versus shunt, adjusting for patient age (months), history of infection/hemorrhage (yes/no), geographical continent (since there were too few patients in each center to adjust by individual center), and randomization status (i.e., whether the patient entered the study in the randomized or non-randomized arm). Geographical continent was categorized as the Americas (since there only a few patients from North America alone), Europe, and Asia. Preoperative head circumference, analyzed as an age-corrected Z score, was not associated with treatment failure and, therefore, was not included for further analysis. Proportional hazards assumption was checked by assessing the significance of each variable as an interaction with time. Randomization status did not meet the proportional hazards assumption. Therefore, analysis was stratified by randomization status. Proportional hazards assumption was confirmed for all other variables.

The ETV Success Score (ETVSS) was calculated for each patient. The ETVSS provides the predicted chance of ETV success at 6 months (ranging from 0 to 90 %), based primarily on patient age and etiology of hydrocephalus [22]. We compared the ETVSS (predicted chance of ETV success) to the observed 6-month ETV success rate in our cohort.

The IIHS was publically registered (NCT00652470) and received ethics approval from all participating institutions. Participating investigators and other trial personnel are listed in the “Appendix” section.

Results

A total of 182 patients were initially enrolled in the study (Fig. 1). However, of these, 24 were eliminated after independent adjudication and were removed from further analyses (11 did not meet eligibility criteria, and 13 were missing important preoperative data). This left 158 patients from 27 centers in 4 continents (64 from Europe, 61 from Asia, 22 from South America, and 11 from North America), of whom 115 had ETV as first intervention (33 randomized, 82 non-randomized) and 43 had shunt as first intervention (19 randomized, 24 non-randomized). Baseline data for these patients are shown in Table 1. Baseline differences between the two were noted only for age, with shunt patients being younger at surgery.

Fig. 1
figure 1

Diagram showing the flow of patients through the study and their outcome results

Table 1 Patient characteristics

Peri-operative complications included the following: 1 early CSF infection in the shunt group (2.3 %), 7 CSF leaks in the ETV group (6.1 %), and 1 perioperative seizure in the ETV group (0.9 %). There was no perioperative mortality. Overall shunt infection rate was 4.7 % (2 infections).

Median length of available follow-up was 739 days (IQR 69–1553), during which time, 38 ETV patients and 9 shunt patients demonstrated treatment failure. Among these failures, there was 1 hydrocephalus-related death (in the non-randomized shunt arm) due to presumed shunt failure in a child who died before being able to be transferred to the treating neurosurgical center. There were 3 other mortalities, but these were deemed to be not related to hydrocephalus after independent review and were not counted as treatment failures.

Table 2 shows the Kaplan-Meier estimates of ETV and shunt success at various time points. The ETV success was 66.3 % at 6 months and 64.1 % at 36 months, which was relatively stable. The shunt success was 87.8 % at 6 months and 79.1 % at 36 months.

Table 2 Kaplan-Meier estimates of treatment success

Unadjusted survival curves comparing time-to-first treatment failure for ETV and shunt are shown in Fig. 2. There was no significant difference in these curves (p = 0.07, log-rank). However, after adjusting for age, continent, history of infection/hemorrhage, and stratifying by randomization status, the adjusted hazard ratio for ETV was 3.17 (95 % confidence interval 1.45–6.96, p = 0.004), suggesting a significantly higher risk of treatment failure compared to shunt. Full results of the Cox regression are shown in Table 3. This model revealed that younger age was also associated with a greater chance of treatment failure (hazard ratio 0.89 (0.82–0.98), p = 0.015), regardless of type of treatment. Addition of an interaction term between type of treatment and age was not significant (p = 0.94). Continent and history of previous infection/hemorrhage were not significantly associated with treatment failure. The results were similar within both the randomized patient sample and the non-randomized patient sample.

Fig. 2
figure 2

Unadjusted Kaplan-Meier survival curves comparing time-to-first treatment failure for ETV and shunt for the entire cohort

Table 3 Results of Cox regression analysis for time-to-first treatment failure, stratified by randomization status

In order to further investigate the association between age and treatment effect, separate survival curves were constructed for patients <6 months of age (N = 75 ETV and 37 shunt, Fig. 3a) and those ≥6 months (N = 40 ETV and 6 shunt, Fig. 3b). These revealed that the most dramatic relative difference in ETV versus shunt failure occurred in the <6-month-old group, where shunt was substantially better. However, the number of patients in the >6-month-group was very small, greatly limiting its interpretation.

Fig. 3
figure 3

Unadjusted Kaplan-Meier survival curves comparing time-to-first treatment failure for ETV and shunt for patients <6 months of age (Fig. 3a) and those ≥6 months (Fig. 3b)

The mean ETVSS, i.e., the predicted 6-month success rate, for the ETV patients was 57.0 %, which is slightly lower than the actual ETV 6-month success of 66.3 %. For those operated at <6 months, the 6-month ETV success rate was 58.6 % (compared to an ETVSS of 48.0 %). For those ≥6 months, the 6-month ETV success rate was 79.5 % (compared to ETVSS of 74.0 %).

Discussion

We have presented the largest intervention study of infant aqueductal stenosis ever reported. Although modern ETV has been popular since the early 1990s, [24] the IIHS is the first ever multicenter, prospective comparison of ETV versus shunt in the literature. The participating centers, all experienced in neuroendoscopy, spanned 4 continents, which strengthens the external validity of our work. Data was collected prospectively, and importantly, we were able to adjust for baseline differences in age and history of infection or hemorrhage. The results of IIHS suggest that infants presenting with symptomatic triventricular hydrocephalus from aqueductal stenosis have higher rates of treatment failure with ETV compared to shunt. Despite this, the overall success rate of treatment was relatively high for both groups (64.1 % for ETV and 82.5 % for shunt at 1 year). In particular, the ETV group had a success rate at 6 months that was approximately 9 % higher than what would have been predicted by the ETV Success Score (66 versus 57 %). This difference was most obvious in the <6-month age group (actual success was 10.6 % higher than ETVSS) and less so in those >6 months old (actual success was only 5.5 % higher than ETVSS). This might be partially explained by the fact that the sample upon which the ETVSS was developed contained relatively few infants (especially those <6 months old) with aqueductal stenosis, so its ability to predict accurately within such a sample is more limited [22]. The shunt group, as well, had notably lower rates of failure than both older [25] and more recent [26] prospective historical comparisons.

Neither procedure was associated with significant perioperative morbidity, although, notably, the rate of CSF leak in the ETV group was 6.1 %. A further finding in our study is that even within this infant group, younger age appeared to be associated with higher rate of treatment failure for both ETV and shunt. Although there was no significant statistical interaction between age and procedure performed, survival curves stratified by age (Fig. 3) showed that ETV appeared to have relatively higher failure compared to shunt for the youngest patients under 6 months of age. However, these curves, especially in the older age group, were limited by smaller sample size.

Our study has several limitations. Recruitment for our study ended prematurely, falling short of our intended sample size target. Regardless, this still represents the largest prospective study of its type, and the sample size was sufficient to show statistically significant differences. More worrisome, however, is that overall study enrollment was strongly biased towards ETV, with few patients in the randomized arm. This highlights the great difficulty in performing a randomized trial when comparing two such different surgical interventions, which often come with strong parental preferences. There was no independent adjudication of treatment failure in our study. There are, however, several features that serve to strengthen confidence in the validity of our data and, in particular, how treatment failure was determined. First, the observed temporal pattern of treatment failure of ETV and shunt was very similar to previous works [12, 27]. Second, the effect of age on treatment failure is also consistent with previous work [2835]. Third, previous post hoc analyses have shown that independent adjudication in hydrocephalus trials have little impact on the final results compared to surgeon-determined treatment failure [36]. There were violations in study enrolment, with 24 patients needing to be removed after initial enrolment, following independent adjudication (Fig. 1). These patients were excluded from the final analysis, but resulted in an imbalance in the number of randomized patients in the ETV and shunt arm.

We did not include choroid plexus cauterization (CPC) with our ETV intervention. CPC, as an adjunct to ETV, has been suggested to improve the overall chances of ETV success, particularly in infants [3739]. CPC was popularized only after our study began recruitment and currently is still not widely practiced in most settings [40, 41]. While our overall observed success with ETV was already quite high in this sample, it would be interesting to determine if there would be added beneficial effect from CPC. Further studies will need to compare the combined treatment of ETV with CPC against shunt, especially for the youngest infants and those with etiologies of hydrocephalus that are less favorable for ETV alone, such as intraventricular hemorrhage of prematurity or myelomeningocele.

In interpreting our results, it is important to note that the sample defined by our eligibility criteria is a relatively rare subset within the spectrum of pediatric hydrocephalus. Therefore, the applicability of our results to the wider hydrocephalus population remains to be determined. This group of infants with aqueductal stenosis was chosen because it was a relatively clean and homogenous population set that offered a focused controversial etiology with respect to treatment options, with community equipoise.

In summary, we interpret our results to suggest that, for at least this unique group of patients, initial treatment with ETV might be considered a reasonable alternative to shunt, with some important caveats. First, treatment failure was much higher in the youngest patients, especially under 6 months. In these patients, the use of ETV must be exercised with care, keeping in mind the greater risk of failure, which should be conveyed in discussions with families. This group is also one in whom the role of CPC might be beneficial and should be investigated. Second, the results presented in this analysis represent only a small part of the overall picture of outcome. Specifically, the overall primary outcome of the IIHS is health status at 5 years of age. Those important results, which are still pending, could alter the balance in favor of one treatment over the other.