Introduction

Cerebral vasospasm is defined as vasoconstriction of at least one major blood vessel in the brain causing a reduction in distal blood flow. Either radiographic or symptomatic vasospasm affects as many as 50–90% of patients after an aneurysmal subarachnoid hemorrhage (aSAH) [1,2,3,4,5]. It is a contributing risk factor of delayed cerebral ischemia (DCI), which is a feared complication of aSAH resulting in a high rate of morbidity and mortality in almost half of the patients who develop it [6]. As many as 20–40% of patients with vasospasm will develop DCI [7]. DCI is usually diagnosed clinically or radiographically based on standardized definitions [8]. While our understanding of the pathophysiology of DCI has significantly evolved, the discovery of radiographic vasospasm either at onset of or leading up to DCI remains a crucial frontier of early endovascular intervention, particularly if medical interventions such as prophylactic nimodipine and situationally adjusted blood pressure management fail [9,10,11]. Therefore, monitoring for, and diagnosis of, radiographic vasospasm that leads to neurologic deterioration is critical in the management of DCI, and the greatest value lies in its diagnosis before it causes clinically significant symptoms. The current gold standard used to diagnose cerebral vasospasm is digital subtraction angiography (DSA), which is an invasive fluoroscopic technique that may require general anesthesia and is associated with a real risk of thromboembolism and stroke [12]. However, transcranial Dopplers (TCD) have emerged as a valuable non-invasive bedside imaging tool to evaluate cerebral vasospasm, especially serially to monitor trends in change of flow velocities [13, 14]. However, it is best studied for middle cerebral artery (MCA), does not measure vessel size, and does not provide any therapeutic benefit. In contrast, computed tomography angiography (CTA) provides direct vessel size information, with overall less radiation administered than the gold standard DSA. Nevertheless, as TCD, it does not have any therapeutic value either.

Both CTA and TCD have been implemented (to different degrees based on institutional preference) to aid in the diagnosis of vasospasm following aSAH. While CTA has good correlation with angiographic vasospasm, the evidence is less clear with TCDs [7]. In addition, given the variability in their methodologic quality and results in previously published research such as different sensitivity and specificity provided in recent meta-analyses [15, 16], it is difficult to draw definitive conclusions about the utility of these two modalities from the existing literature. Currently, the Neurocritical Care Society (NCS) and the American Heart Association guidelines provide a moderate-strong recommendation for use of TCD, whereas only the NCS provides a weak recommendation for CTA [17, 18]. Other meta-analyses were published investigating TCDs or CTAs [15, 16, 19], but none of these meta-analyses compared the two modalities to one another. We included a uniform set of stringent inclusion and exclusion criteria investigating both TCDs and CTAs to provide valid comparison with DSA encompassing all vessels in diagnosing radiographic vasospasm, in contrast to other meta-analyses [15], while interpreting the findings against a backdrop of current guidelines and recent practices. By evaluating these methods, we hope to provide high-quality data regarding sensitivity, specificity, and pooled likelihood ratios that may be used in the utilization of these techniques in diagnosing vasospasm, improve upon the quality of evidence, and provide a recommendation based on GRADE criteria [20].

Materials and methods

Study selection

The literature search strategy was developed using the PRISMA checklist [21]. PubMed, EMBASE, and Cochrane databases were used to find articles that were published from inception to September 2020. Relevant medical subject heading (MeSH) terms, text words (tw), and Emtree terms were used in the search strategy (Appendix 1). The study protocol was not pre-registered. We included comparative studies (cohorts/case controls/RCTs) examining the sensitivity and specificity of CTA or TCD in comparison to the gold standard DSA for detecting vasospasm in adults aged ≥ 18 years with aSAH. We examined studies in which DSA was performed within 48 h of the TCD or CTA study being completed. Because these imaging modalities focus on specific segments within patients, studies that considered patients as the unit of observation (instead of segments) to derive the sensitivity and specificity of the imaging modality being used were excluded. Studies in which two different readers graded vasospasm were also excluded. Non-English and non-human studies were excluded.

Because TCD provides only indirect information about vessel diameter through blood flow velocity, there is no threshold TCD measurement that definitively identifies spasm, although certain cutoffs such as a mean flow velocity (MFV) of > 200 cm/s in MCA are considered to be highly probable of vasospasm. Moreover, due to differences in baseline vessel diameter, the mean blood flow velocity through the MCA differs significantly from that through the vertebrobasilar system, even under normal conditions [22, 23]. As such, blood flow velocity cutoffs for vasospasm must be tailored to the specific artery being assessed. When monitoring a patient with regular TCDs, further testing is generally indicated in the following three scenarios: (1) an MCA velocity > 120 cm/s, (2) an increase of MFV of > 50% from baseline through any vessel [24], and (3) the sudden appearance of unexplained neurological symptoms [25]. For this meta-analysis, to ensure uniformity, we excluded any paper that did not use a 120 cm/s flow velocity threshold for the MCA, as this is the most commonly utilized cutoff in the literature and, as stated above, has become the clinical convention [16, 26, 27]. We used the same cutoff for ACA. There is less existing research on vessels within vertebrobasilar vasospasm; however, there have been multiple studies that have shown enhanced sensitivity and reasonable specificity when using a cutoff of at least 85 cm/s or greater [28, 29]. Thus, we chose to include papers that employed a cutoff of at least 85 cm/s when analyzing the vertebral or basilar arteries. Using a similar rationale, we adopted a cutoff of least 25% luminal narrowing for CTA [30, 31]. Admittedly, these threshold values for TCD and CTA were discretionary choices. However, they were based on evidence from literature and clinical standard.

A two-stage screening of articles including (1) title and abstract screening and (2) full-text screening was done individually and then reviewed by four authors (SC, SD, JM, UT). Discrepancies were resolved by the senior authors (RM and WG).

Data extraction

The following variables were extracted from the articles that were selected through full-text screening: general characteristics (publication year, country of origin), study characteristics (study design and timing), and outcome characteristics (total number of arterial segments analyzed, total number of patients, sensitivity, specificity, intervention type). For the studies that did not report the count data for the number of segments that were true positive, true negative, false positive, false negative, and only reported either sensitivity or specificity, we performed our own calculations from the data we extracted [32]. Data extraction was conducted by four independent authors (SC, SD, JM, UT), and any discrepancies were resolved by discussions among the extracting authors.

Data analysis

A random-effects model using the DerSimonian–Laird method was used for the meta-analysis [33]. Unlike our previous work on comparing CTA and DSA for post-clipping aneurysm obliteration detection where a bivariate model (which takes into account the relationship between the observed pairs of sensitivity and specificity) was used [34], we did not have enough studies in this current analysis; therefore, the univariate meta-analytic approach was more appropriate to perform [35]. We analyzed the sensitivity and specificity of both CTA and TCD in the detection of vasospasm after aSAH compared to the gold standard DSA, based on unit of arterial segments. Individual studies were pooled to give an overall estimate of the primary outcome using a forest plot. The associated confidence intervals and exact point estimates were reported. Posterior test probabilities such as likelihood ratios for the CTA and TCD tests were calculated from the multiple test results (sensitivity and specificity) for each of the arterial segments’ and patients’ analysis. Likelihood ratio positive (LR +) reflects how much the odds of a disease increases when the test is positive. A LR + above 10 is considered a strong evidence to rule in a diagnosis. Likelihood ratio negative (LR −) reflects how much the odds of a disease decreases when the test is negative. A LR − below 0.1 is considered a strong evidence to rule out a diagnosis. The analysis was performed using Comprehensive Meta-Analysis (CMA) version 3 (Copyright 1998–2018, Biostat Inc.).

Heterogeneity assessment

The heterogeneity of the research was assessed between the studies by using I2 to measure the proportion of total variation due to this heterogeneity. The I2 value of greater than 40% is considered to be high. [36]

Evaluation of bias, study quality, and strength of recommendation

Study quality assessment was performed using the QUADAS-2 tool (a revised tool for Quality Assessment of Diagnostic Accuracy Studies) [37]. It comprises four major domains: patient selection, reference standard (DSA), index tests (CTA or TCD), and flow and timing (interval between the tests) of patients in the study. While the four domains were assessed in terms of risk of bias, the first three domains listed were assessed in terms of applicability as well, which led to seven domains per study. The domains were assessed by answering the signaling questions for each of them and judging them as having a low risk, high risk, or unclear risk of bias. For example, in the flow and time domain, the signaling question asked, “Was there an appropriate interval between the index test and reference standard?” A time span of 24–48 h maximum was established based on previous literature that showed vasospasm as a complex process that differed in disease severity over a time of 48 to 72 h. [38, 39] Overall, studies were categorized as “low risk” if they were judged as “low” on all seven domains or “at risk” of bias if any of the seven domains was judged as “high” or at “unclear” risk of bias. Potential selection bias was minimized through strict adherence to study protocol. Small study bias assessment was not feasible owing to the limited number of studies per specific outcome (< 10 studies) [40]. The Grading of Recommendations, Assessment, Development, and Evaluations (GRADE) scale was used to evaluate the overall quality and strength of the evidence provided for each outcome included in the meta-analysis. [20]

Results

After removing duplicates, we identified 2070 articles from the comprehensive search on PubMed, Cochrane, and Embase. Once articles failing to meet our inclusion criteria after title and abstract screening were excluded, 191 remained for full-text review. Of these, 145 articles were excluded with reasons shown in Fig. 1. Overall, seven studies [12, 41,42,43,44,45,46] that evaluated and reported on various arterial segments were included in the final analysis, with the categories specified in the first column of Table 1. More specifically, Lennihan (1993) reported sensitivity and specificity results for middle cerebral artery and anterior cerebral artery segments individually; Burch C (1996) reported for internal carotid artery and middle cerebral artery. Both studies were divided in two sub-studies each, while Sloan (1994) reported for separate vertebral artery imaging on TCD. Out of the final studies included, six studies and sub-studies used TCD on different segments [12, 43,44,45], and four studies used CTA [12, 41, 42, 46] to detect vasospasm. Overall, 1646 arterial segments (from 283 patients) were analyzed (Table 1).

Fig. 1
figure 1

Study selection process of the identified articles

Table 1 Characteristics of studies that analyzed segments using TCD or CTA compared to DSA

Quantitative assessment

The pooled estimate for sensitivity was found to be 82% (95% CI, 67%, 91%; I2, 77.2%) for CTA (Fig. 2a) and 38% (95% CI, 19%, 62%; I2, 87.7%) for TCD with reference to DSA (Fig. 2b). The pooled specificity was 97% (95%CI, 93%, 98%; I2, 73.6%) for CTA (Fig. 3a) and 91% (95%C,: 87%, 94%; I2, 22.9%) for TCD (Fig. 3b). This resulted in a pooled positive likelihood ratio (LR +) that favored (> 10) CTA (27.3) but not TCD (4.22) to rule in diagnosis. The pooled negative likelihood ratio (LR −) for CTA (0.19) and TCD (0.68) was approaching clinical significance (< 0.1) to rule out diagnosis, when compared to the gold standard (DSA) (Table 2).

Fig. 2
figure 2

Forest plot for pooled sensitivity comparing diagnostic techniques CTA vs DSA (A) and TCD vs DSA (B) to detect vasospasm. VA, vertebral artery; BA, basilar artery; a, internal carotid artery (ICA); and b, middle cerebral artery (MCA)

Fig. 3
figure 3

Forest plot for pooled specificity comparing diagnostic techniques CTA vs DSA (A) and TCD vs DSA (B). VA, vertebral artery; BA, basilar artery; a, internal carotid artery (ICA); and b, middle cerebral artery (MCA)

Table 2 Pooled sensitivity (SE), specificity (SP), positive likelihood ratio (LR +), and negative likelihood ratio (LR −)

Bias evaluation and qualitative assessment

One study [44] was at risk of bias, whereas the remaining six [12, 41,42,43, 45, 46] were at low risk. One study was judged at high risk of bias due to patient selection; if the comparison was not conducted on the “correct” series of participants, there is a possibility that the estimate of comparative accuracy will be skewed [44]. Based on the grade criteria, the quality of the included evidence was determined to be high for CTA sensitivity (+ 4) and specificity (+ 5), while it was determined to be very low (+ 1) for TCD sensitivity and moderate (+ 3) for TCD specificity. As for the LRs, which is directly calculated from sensitivity and specificity, we provided a strong recommendation for using CTA in ruling in or out vasospasm based on high quality of evidence. However, our recommendation against using TCD to rule in diagnosis was weak, and that towards using TCD in ruling out vasospasm was also weak, based on an overall low quality of evidence.

Discussion

This was the first meta-analysis that compared CTA and TCD with reference to the gold standard (DSA) used for early detection of vasospasm among aSAH patients. Our study showed that when compared to DSA, CTA has higher sensitivity and specificity at detecting vasospasm in patients with aSAH than TCD. Moreover, CTA had more favorable positive and negative LR to both rule in and rule out vasospasm than TCD. TCDs were shown to have high specificity indicating that increasing mean flow velocities are concerning enough to prompt further investigation to confirm angiographic vasospasm and a reasonably low LR − to increase confidence in ruling out vasospasm. However, based on LR calculations and overall low quality of evidence, we provide a weak recommendation against using TCD to rule in vasospasm and a weak recommendation for using TCD in ruling it out. Overall, CTA was very comparable to DSA in its performance, which has been demonstrated previously [19]. Our study is different from other meta-analyses done on this topic as our inclusion criteria were distinct from what was previously published, as one did not include CTA in their analysis [15, 19]. Moreover, the necessity of our study was the need to provide quantifiable data on sensitivity and specificity of TCD and CTA in diagnosing radiographic vasospasm in comparison to DSA, by using a uniform set of inclusion and exclusion criteria, thereby ensuring validity of comparison throughout the entirety of the study.

The notion that TCD’s reliability in detecting vasospasm is less than that of CTA is not entirely surprising, as TCD relies on using mean flow velocities (MFV) as a surrogate for vessel diameter. Additional inaccuracies may result from inability to get good insonation windows, inter-observer variability, factors influencing velocities such as hyperdynamic and hypervolemic therapies, and small vessel embolic occlusions [7]. Our study demonstrated that TCD likely missed one out of every two cases of angiographic vasospasm, leading to a high rate of false negative results, which may be detrimental due to the potential for missing DCI that is attributable to radiographic vasospasm. This is supported by other studies demonstrating that as many as 40% of cases with DCI may never achieve a MFV > 120 cm/s in the MCA [47]. For this reason, it is not advisable to rely solely on TCD velocities in diagnosing vasospasm. This is not to suggest that TCD monitoring does not have utility. It is a safe, non-invasive, and cost-effective way of trending velocities, thus allowing for more rapid detection of deviations from baseline. Moreover, because we did not stratify our results by aSAH severity or clinical symptoms, it is possible that TCD may have better sensitivity for clinically symptomatic vasospasm [48]. A MFV threshold of 200 cm/s also aids in improving both specificity and positive predictive value for angiographically confirmed vasospasm, as opposed to the 120 cm/s threshold used in our study [48, 49]. TCDs have been labeled as an extension of neurologic exam, which is especially useful when sedation is required to control intracranial pressure, and incorporating its use makes intuitive sense given that vasospasm is a dynamic phenomenon and can change with time and treatment factors, as opposed to CTA or DSA, which provide only a single time point of evaluation [50]. In addition, despite having substantial intra-rater reliability, the inter-rater reliability has been moderate at best even when results were stratified according to specialty and experience, which limits reliable use of CTA for diagnosis of radiographic vasospasm. This may be from lack of consensus on diagnostic criteria, and further work may focus on defining specific criteria for various segments of different vessels and directly compare to the same segments from the DSA. Other factors that may adversely influence CTA’s diagnostic utility are low cardiac output and the route of contrast administration. [51]

These favorable factors for TCDs are supported further by the Neurocritical Care Society’s (NCS) recommendation for daily TCD monitoring and clinical exams as monitoring tools, with more detailed work-up being indicated if there is a deviation from the patient’s baseline MFV or neurological functioning [25]. While CTA’s use is limited by cost, radiation, requirement for intravenous contrast, patient transport, and moderate inter-rater reliability, its high sensitivity and specificity may facilitate its role in diagnostic confirmation especially from 4 to 14 days post-aneurysm rupture when the vasospasm risk is highest [25]. This approach may be even more pertinent in situations where neurologic exams are limited (e.g., comatose patients) or there remains clinical suspicion of vasospasm despite normal TCD velocities. Various institutions, including ours (Brigham and Women’s Hospital), already employ a similar strategy while supplementing with continuous electroencephalography and multi-modality monitoring [11]. However, we do advise caution in this approach given that the positive LR for TCD is poor, and therefore, its ability to rule in vasospasm is limited. A reasonably low negative LR does lend some confidence in TCD’s ability to rule out vasospasm, but our recommendation for its use is weak due to an overall low quality of evidence. The NCS guidelines also provide a weak recommendation for CTA due to low quality of evidence. However, our data strengthens this recommendation significantly, as we found a high quality of evidence supporting its use. [18]

There were some limitations and strengths in our study. First, we were unable to address the severity of vasospasm because the included studies did not classify vasospasm according to vessel caliber or MFV. Not only does prevalence of vasospasm increase when all severities of vasospasm are included, but lower agreement may be seen between TCDs and CTAs [52, 53]. Second, due to the limited number of studies, our analysis may not fully account for different heterogeneity sources such as inconsistencies in acquiring images and study protocols, possible discordance in various portions of arterial segments between CTA and DSA, and different aneurysm locations. We also could not conduct bivariate random-effects model in our meta-analysis due to limited studies, which takes into account the correlation between sensitivity and specificity, as done in our previous work [34]. Nevertheless, the likelihood ratios enabled us to assess pooled sensitivity and specificity together to provide a holistic diagnostic assessment. Third, the diagnostic cutoffs used for vasospasm detection on TCD were variable. To address this heterogeneity, we excluded studies that did not utilize the MFV threshold of 120 cm/s for MCA and ACA and 90 cm/s for VA/BA. Fourth, we based our recommendation for TCD using a cutoff of 120 cm/s, whereas using a separate cutoff of 200 cm/s may have possibly improved our recommendation for using TCD to rule in vasospasm. Aside from the rigorous statistical methods we adopted, another strength worth mentioning is that we also integrated study quality using the QUADAS-2 tool in the study design, which is the most suitable method for assessing the quality of diagnostic accuracy studies. [37] Lastly, our understanding of the pathogenesis of DCI has continually evolved to unravel multiple mechanisms including but not limited to neuroinflammation, microcirculatory dysfunction, and glymphatic impairment [10]. However, detection of radiographic vasospasm with clinical deterioration still remains an important treatment target for endovascular intervention, and early detection is crucial in implementing this rescue therapy. [11]

Conclusion

Although CTAs offer greater sensitivity and specificity in the detection of cerebral vasospasm in comparison to TCDs, TCDs do have high specificity indicating that increasing mean flow velocities approaching vasospasm threshold should be investigated. Based on likelihood ratio calculations, we assign a strong recommendation for use of CTA in ruling in or out vasospasm based on high quality of evidence. However, we provide a weak recommendation against using TCD to rule in and a weak recommendation for using TCD to rule out vasospasm, based on an overall low quality of evidence. Future studies should evaluate TCD and CTA based on severity of vasospasm, aSAH grades, and various TCD velocity cutoffs as these factors may influence the diagnostic accuracies of either modality or the agreement between them.