Introduction

Fundamentally, the only reason to give a patient a fluid challenge is to increase cardiac output (CO) [1]. Patients who increase their CO following a fluid challenge are known as fluid responders. Studies in diverse patient populations have demonstrated that only about 50 % of haemodynamically unstable patients are fluid responders [2]. In patients with acute circulatory failure, predicting whether volume expansion will actually induce a significant increase in CO has become a common practice, is recommended by International guidelines [3, 4] and has generated considerable research [2].

Among the tests and indices that have been developed to predict fluid responsiveness, the passive leg raising (PLR) test has gained considerable attention [4, 5]. The PLR is performed by raising the legs of the patient to 45° [6]. This results in a hydrostatic increase in the mean systemic pressure, thus mimicking the effects of volume expansion. In case of preload dependence of both ventricles, the resultant increase in central venous return will result in an increase in CO and venous return.

The substantial increase in cardiac preload induced by PLR [7] challenges preload dependence of both ventricles. One of the advantages of the PLR test is that it can be used in instances in which the fluid responsiveness indices that are based on the respiratory variation of stroke volume are not reliable, such as spontaneous breathing, cardiac arrhythmias, low lung compliance and low tidal volume [8]. Furthermore, no fluid is actually administered and the PLR can thereby be considered a reversible or “virtual” fluid challenge.

Following the first study demonstrating the accuracy of the PLR test [9], this test has been the subject of a number of publications. In 2010, a meta-analysis pooling the nine studies that had been published at that time confirmed its accuracy [10]. Since this meta-analysis, some original articles dealing with the PLR test have been published [1123], some of them casting doubt on its reliability.

We conducted a systematic review of all these studies and performed a meta-analysis in order to assess the ability of PLR to predict fluid responsiveness, to determine the pooled threshold of the PLR-induced increase in CO (or surrogates) that should be considered as significant, and to compare the diagnostic accuracy of PLR when assessed by changes in CO (or surrogates) or by changes in arterial pulse pressure (PP). Our hypothesis was that the diagnostic ability of the PLR test is good when its effects are assessed on CO and less when its effects are assessed on PP.

Methods

Clinical research question

The clinical research question was: What is the sensitivity and specificity of the passive leg raising test to predict fluid responsiveness when assessing its effects on cardiac output and arterial pressure?

PICO statement

The PICO statement was the following:

  • P-patient, problem or population: Patients with acute circulatory failure in whom the effect of volume expansion on cardiac output is unknown and needs to be predicted.

  • I-intervention: Passive leg raising (PLR), induced by moving the patient from the semi-recumbent or horizontal position

  • C-comparison, control or comparator: Fluid responsiveness was defined as a significant increase of cardiac output during a volume expansion.

  • O-outcomes: Ability of the PLR test to predict fluid responsiveness.

Identification of records

Our aim was to identify all studies evaluating the ability of the PLR test to predict a significant increase in CO, cardiac index, stroke volume or a surrogate (aortic flow velocity, aortic velocity time integral) compared to that induced by a subsequent fluid infusion. We only included studies that provided the sensitivity and specificity of the PLR test. Furthermore, only studies that were published in full text or accepted for publication in indexed journals were included in this analysis. We excluded studies in which fluid was not administered to test fluid responsiveness. We included only studies performed in adult patients. Figure 1 illustrates the search strategy.

Fig. 1
figure 1

Flowchart of study selection

We searched the US National Library of Medicine’s MEDLINE database, the EMBASE database and the Cochrane Database of Systematic Reviews for relevant studies published in English from 1966 to April 2015. We used the following medical subjects headings and keywords: “passive leg raising or raise”, “fluid or preload or volume responsiveness”, “fluid or preload challenge”, “preload dependence or independence or dependency or independency”, “functional haemodynamic monitoring” and “fluid therapy or management”. We also looked for relevant articles cited in review articles, commentaries and editorials. The search was performed iteratively until no new records could be found. The quality of the included studies was evaluated by using the QUADAS-2 scale [24]. The meta-analysis was performed according to the PRISMA statement (http://www.prisma-statement.org ).

Data extraction

Using a standardised data form, we extracted several data elements from the included studies including characteristics of the investigated population, the methods used to perform the PLR test and to assess its haemodynamic effects, the modalities of fluid administration and definition of fluid responsiveness, the correlation observed between the changes in CO or surrogates induced by volume expansion and by PLR, the number of true positives, true negatives, false positives and false negatives, the area under the receiver operation characteristics (ROC) curve and the best threshold of PLR-induced increase in CO or surrogates that was found to predict fluid responsiveness. If the ability of the PLR-induced changes in arterial PP to predict fluid responsiveness were also investigated, these results were also extracted. For studies in which more than one technique was used to estimate CO, we only included the technique which is considered to be the most reliable. When both calibrated pulse contour analysis and end-tidal carbon dioxide or bioreactance were used to estimate CO, only the results provided by the calibrated pulse contour analysis were evaluated. Indeed, end-tidal carbon dioxide has been suggested to reflect cardiac output changes in very few studies [25, 26] and the reliability of bioreactance has been questioned by some studies [13, 27, 28]. When the effects of PLR were assessed by echocardiography and by uncalibrated pulse contour analysis or end-tidal carbon dioxide or Doppler femoral peak velocity, only the results provided by echocardiography were included in the analysis. The reliability of uncalibrated pulse contour analysis in patients receiving vasopressors has been questioned [4, 29] and the reliability of Doppler femoral peak velocity to estimate CO changes has not been evaluated. When the effects of PLR were assessed by bioreactance and carotid Doppler, only the results provided by bioreactance were included in the analysis. No other publications support the use of carotid Doppler as a reflection of change in CO.

Assessment of risk of bias in included studies

Two authors (X.M. and P.E.M.) assessed the risk of bias of the included studies by following the criteria specified in the QUADAS-2 scale [24]. For the “patient selection” domain, they examined whether patients were consecutively included and whether inappropriate exclusions were avoided. For the “index test” domain, they examined whether the threshold used to define volume responsiveness was pre-specified. For the “reference standard” domain, they examined whether the result of volume expansion on CO was assessed without knowledge of the PLR test result and whether the reliability of the device used to estimate changes in CO induced by PLR is widely validated. Finally, for the “flow and timing” domain, the authors examined whether there was an appropriate interval between PLR and volume expansion, whether patients received the same volume expansion and whether all patients were included in the analysis. For each criterion, the risk was judged as high, low or unclear.

Statistical analysis

We performed a meta-analysis in order to determine the pooled area under the ROC curve, the pooled sensitivity and specificity and pooled threshold for PLR as a predictor of fluid responsiveness when its effects are assessed on CO (or surrogate) and when they are assessed on PP. The pooled correlation between the PLR-induced and the fluid-induced changes in CO (or surrogate) was also determined.

Homogeneity between studies was tested by the Chi-squared test (likelihood ratios test for sensitivity and specificity and Cochran’s Q test based upon inverse variance weights for likelihood ratios and diagnostic odds ratios) and by the I 2 index. To investigate a threshold effect, we also calculated the Spearman correlation coefficient between sensitivity and specificity. According to heterogeneity, a random effect model was used to perform meta-analysis.

In addition, in order to investigate the influence of study heterogeneity, we conducted subgroup analyses. We compared studies with ICU setting versus non-ICU setting, making the hypothesis that the PLR test could be more reliable in ICU patients. We compared studies with pulse contour analysis or echocardiography versus studies with other methods for measuring CO, making the hypothesis that the reliability of the PLR test is better when assessed by pulse contour analysis-derived cardiac output or by echocardiography than by other techniques. We compared studies with suprasternal Doppler or bioreactance versus studies with other methods for measuring CO, making the hypothesis that the reliability of the PLR test is poorer when assessed by suprasternal Doppler or bioreactance than by other devices. We compared studies where fluid responsiveness was defined by an increase in CO or surrogate ≥15 % versus studies with other definitions of fluid responsiveness, testing the hypothesis that the reliability of the PLR test is better when fluid responsiveness is defined by a larger increase in cardiac output. We compared studies where volume expansion was performed with crystalloids versus studies where it was performed by colloids, testing the hypothesis that the reliability of the PLR test is better when volume expansion is performed with colloids. Finally, we compared studies with the lowest risk of bias versus highest risk of bias, testing the hypothesis that the reliability of the PLR test is the best in studies with a highest risk of bias. To distinguish studies with the lowest and highest risks of bias, we attributed points to the risk of bias for each item of the QUADAS evaluation (one point for low risk, three points for high risk, two points for unclear risk). For each study, we calculated the sum of these points, which was used as an estimate of the global risk of bias. If the sum was below the median of the sums of all studies, the study was classified in the lowest risk of bias group, and the opposite if the sum was higher than the median. A subgroup analysis was also performed with the method of PLR (starting with the trunk at 0° or at 30°–45°) as moderating variables. The hypothesis was that the PLR test was more reliable to detect fluid responsiveness when started from the 45° semi-recumbent position than from the horizontal position. Causes of heterogeneity were also investigated by meta-regression based on the Littenberg and Moses linear model and including the same criteria as the subgroup analyses.

Results are expressed as mean (95 % confidence interval) or as mean ± standard deviation. The meta-analysis was performed with Meta-Disc v.1.4 (Universidad Complutense, Madrid, Spain) [30]. The additional statistical analysis was performed with MedCalc 15.2.2 (MedCalc Software, Mariakerke, Belgium).

Results

Characteristics of included studies

A flow diagram describing the study selection is provided in Fig. 1. We identified 21 studies (991 adult patients, 995 fluid challenges) that reported the ability of the PLR to predict fluid responsiveness [9, 1123, 3137]. The characteristics of the included studies are presented in Table 1. Thirteen studies [1123] had not been included in the meta-analysis of Cavallaro et al. [10]. The vast majority of the studies were conducted in patients with acute circulatory failure. The absence of intra-abdominal hypertension was mentioned in four studies [14, 17, 18, 35]. The PLR was started from the supine position at 0° in four studies [21, 23, 31, 37] and from the semi-recumbent position in the remaining studies (Table 1).

Table 1 Main characteristics of included studies

Fluid responsiveness was defined by an increase in CO (or surrogate) by more than 15 % in 16 studies [9, 1119, 3136], 10 % in 2 studies [20, 21], 12 % in 1 study [37], 9 % in 1 study [22] and 7 % in 1 study [23] (Table 1).

CO was measured by echocardiography in 6 studies [16, 19, 32, 3537], calibrated pulse contour analysis in 6 studies [1214, 17, 18, 34], bioreactance in 4 studies [11, 20, 22, 23], oesophageal Doppler in 3 studies [9, 15, 31], either transpulmonary thermodilution or pulmonary artery catheter in 1 study [21] and suprasternal Doppler in 1 study [33] (Table 1). For the 2 studies in which the effects of PLR were assessed by echocardiography using both cardiac output and aortic velocity time integral [32, 37], we included only the cardiac output data.

For 2 studies in which the effects of PLR were assessed by calibrated pulse contour analysis and by end-tidal carbon dioxide [17] or bioreactance [13], only the results provided by the calibrated pulse contour analysis were evaluated. For the 3 studies in which the effects of PLR were assessed by echocardiography and by uncalibrated pulse contour analysis [35] or end-tidal carbon dioxide [15] or Doppler femoral peak velocity [36], only the results provided by echocardiography were included in the analysis. For the study in which the effects of PLR were assessed by bioreactance and carotid Doppler [20], we included only the results provided by bioreactance. The diagnostic accuracy of the PLR-induced changes in PP was investigated in 8 studies (392 fluid challenges) [9, 14, 15, 17, 18, 21, 34, 36].

The results of the QUADAS-2 evaluation are provided in Table 2 and Fig. 2. Among the sources of potential biases, 19 studies did not include consecutive patients (or no information was provided on this item).

Table 2 QUADAS-2 results
Fig. 2
figure 2

QUADAS-2 results. Light grey low risk, middle grey unclear risk, dark grey high risk

Correlation between the PLR-induced and the fluid-induced changes in CO

The correlation between the changes in CO (or surrogate) induced by the PLR and volume expansion was reported in 14 studies (541 fluid challenges) [9, 1319, 22, 31, 32, 34, 36, 37]. The pooled correlation coefficient was 0.76 (0.73–0.80) (p < 0.001) (Fig. 3).

Fig. 3
figure 3

Forest plot for the correlation coefficients (with 95 % confidence intervals) between the changes in cardiac output or surrogates induced by passive leg raising and those induced by volume expansion

Prediction of fluid responsiveness by the PLR-induced changes in CO

Among the 995 fluid challenges, 54 ± 9 % led to a positive response. All studies but four [11, 19, 21, 22] reported the value of percent changes in CO (or surrogate) induced by PLR in case of response to volume expansion. On average, the increase in CO in such cases was 20 ± 9 %.

For the PLR-induced changes in CO (or surrogate), the pooled sensitivity was 0.85 (0.81–0.88) and the pooled specificity was 0.91 (0.88–0.93) (Table 3). All studies but two [11, 20] reported the area under the corresponding ROC curve. The pooled AUC was 0.95 ± 0.01 (Table 3; Fig. 4). All the studies but one [23] reported the best threshold of the PLR-induced increase in CO (or surrogate) that predicted fluid responsiveness; this averaged 10 ± 2 %, ranging from 7 % [21] to 15 % [33] (Table 3).

Table 3 Main results of included studies with pooled and mean values
Fig. 4
figure 4

Receiver operating characteristics curve (with 95 % confidence interval) for the prediction of fluid responsiveness by the effects of passive leg raising on cardiac output or surrogates. Circles represent each study included in the meta-analysis. The size of each solid circle indicates the size of each study

When including only studies in which PLR was started from the semi-recumbent position (n = 17, 783 fluid challenges), the pooled sensitivity for the PLR-induced changes in CO (or surrogate) was 0.85 (0.81–0.88) and the pooled specificity was 0.91 (0.88–0.94).

Prediction of fluid responsiveness by the PLR-induced changes in arterial PP

For the PLR-induced changes in PP (8 studies, 432 fluid challenges), the pooled sensitivity was 0.56 (0.49–0.53) and the pooled specificity was 0.83 (0.77–0.88). The pooled area under the ROC curve was 0.77 ± 0.05 (Fig. ESM2). The best threshold of PLR-induced increase in PP that predicted fluid responsiveness was 12 ± 4 % on average, ranging from 6 [17] to 18 % [14] (Table 3).

Investigation of heterogeneity and of causes of heterogeneity

The Spearman correlation coefficient between sensitivity and specificity was −0.178 (p = 0.441), indicating no threshold effect. The heterogeneity Chi-squared was 58 % for sensitivity and 47 % for specificity. The I 2 statistics was 67 % for sensitivity, 58 % for specificity, 34 % for the pooled ROC curve analysis and 10 % for the coefficient of correlation analysis.

Subgroup analyses are presented in Fig. ESM1. None of the covariates included in the meta-regression were found to be the significant source of heterogeneity (Table ESM1). In particular, the comparison between studies with pulse contour analysis or echocardiography versus studies with other devices, and between studies with bioreactance or suprasternal Doppler versus other devices had no influence on sensitivity and specificity. In studies with pulse contour analysis or echocardiography, the average threshold indicating a positive PLR was 10 ± 2 %. In the studies with bioreactance or suprasternal Doppler, it was 11 ± 3 %. Sensitivity and subgroup analyses were consistent with the primary analysis

Discussion

This meta-analysis which summarised the results of 21 studies conducted in 991 adult patients and including 995 fluid challenges demonstrated that the changes in CO (or surrogate) during a PLR test predicted fluid responsiveness with excellent pooled sensitivity and specificity. The pooled area under the ROC curve was 0.95 ± 0.01. The mean of the best threshold was a PLR-induced increase in CO of more than 10 ± 2 %. The PLR test less accurately predicted fluid responsiveness when its effects were assessed by the arterial PP.

The PLR test has been developed to predict whether volume expansion will increase CO during the resuscitation of patients with acute circulatory failure. Moving the patient to a position in which the trunk is horizontal and the lower limbs are lifted at 45° increases the stressed blood volume due to the gravitational transfer of some venous blood from the lower limbs and the abdomen toward the cardiac cavities. PLR significantly increases the mean systemic pressure [38], what promotes venous return. It increases the right and left cardiac preload to an extent that is sufficient to challenge preload responsiveness of both ventricles [7], although this has been challenged [21, 39]. The PLR is now accepted as a test for predicting fluid responsiveness in clinical practice. It is recommended by a recent consensus conference of the European Society of Intensive Care Medicine [4] and has been implemented in the bundles of the surviving sepsis campaign [5].

In 2010, Cavallaro and co-workers performed a systematic review and meta-analysis that included nine studies evaluating the diagnostic accuracy of the PLR test in predicting fluid responsiveness [10]. These authors concluded that the PLR-induced changes in CO reliably predicted fluid responsiveness regardless of ventilation mode, underlying cardiac rhythm and technique of CO measurement. Since this meta-analysis, additional studies have been published, some of them reporting less positive results [21, 22]. In addition, some studies and one editorial [40] have suggested that the PLR would not be reliable under specific conditions, such as insufficient increase in central venous pressure [21] or intra-abdominal hypertension [39]. Furthermore, recent publications have offered the opportunity to better establish the threshold of PLR-induced increase in CO that must be considered to detect volume responsiveness. Finally, the meta-analysis of Cavallaro et al. included only three studies investigating the prediction of fluid responsiveness provided by PLR when assessed by changes in PP [10]. All these arguments justified an updated meta-analysis.

Our meta-analysis confirms the meta-analysis performed by Cavallaro et al. [10] and reinforces the message that the PLR test is valid to test fluid responsiveness. Based on the results obtained in a large number of patients, we found that the area under the pooled ROC curve was large and close to one. In addition, the studies reported threshold values of PLR-induced increases in CO that were close to each other, which further reinforce the consistency of our positive conclusion.

Some authors have claimed that, in order to be a valid test of preload responsiveness, the PLR should increase the central venous pressure by more than 2 mmHg, reflecting an adequate increase in cardiac preload [21]. The present meta-analysis does not support this concept, as we demonstrated that the PLR was highly predictive of fluid responsiveness without taking into account how the PLR changed the central venous pressure. It has already been suggested that the results obtained by Lakhal et al. were explained by the fact that PLR was started from the supine position and not from the semi-recumbent position [41]. This may have resulted in a lesser increase in the mean systemic pressure, which is the upstream pressure of venous return, and thus of central venous pressure.

Mahjoub et al. have suggested that the PLR test produces false negative results in patients with intra-abdominal pressure [40]. The results of this study were not included in the present meta-analysis as it only included patients who were fluid responsive. This study has a number of limitations, including the fact that the investigators did not measure the intra-abdominal pressure during the test [40]. We postulate that the PLR may decrease intra-abdominal pressure by reducing the weight of the diaphragm and thorax on the abdominal cavity. Nevertheless, since the intra-abdominal pressure was not assessed in the studies included in the present meta-analysis, the question remains unresolved as to whether intra-abdominal hypertension reduces the diagnostic accuracy of the PLR test in predicting fluid responsiveness.

The diagnostic accuracy of the PLR test was poorer when its haemodynamic effects were assessed on the arterial PP than on CO, confirming what had been already observed by Cavallaro et al. [10]. This is likely due to the fact that changes in PP and CO are not well correlated and that changes in PP are a poor indicator of the changes in CO [43, 44]. When assessed through changes in PP, the specificity of the PLR test was 0.83. The sensitivity was much poorer. This suggests that, when assessed by changes in PP, the PLR should be taken into account only if it increases PP, and that even in such a case, this should be done cautiously. This finding illustrates an important shortcoming of the PLR test, that only the CO (or surrogate) should be used when performing the PLR test.

The importance of correct positioning of the patient when performing the PLR test has been previously emphasised [45]. Compared to starting the test from the horizontal lying position, starting it from the semi-recumbent position augments the effects of the test on cardiac preload [45], likely by mobilising the venous blood from the abdominal compartment in addition to that of the lower limbs [6]. This meta-analysis could not confirm this finding. However, this result should be interpreted with caution since PLR was started from the horizontal position in only four studies.

An important point to keep in mind is that a positive PLR test should not systematically lead to administering fluid. It is only in cases of circulatory failure, as assessed by low arterial pressure and/or signs of poor tissue perfusion, when the need for increasing CO is required, that fluid administration should be considered. In addition, the beneficial increase of CO induced by volume expansion in patients who are fluid responsive must always be balanced with the risk of excessive fluid administration, which may be harmful, especially in patients with septic shock [46], acute renal failure [47] and acute respiratory distress syndrome [48]. Of note, using the PLR test has never been demonstrated to improve survival. Fluid responsiveness is not associated with improved outcome. This is not surprising since it is a diagnostic tool. Nevertheless, as many diagnosis tools, the test is not required to change outcome to be useful.

Various methods of measuring CO were used in the studies included in this meta-analysis. This must be regarded as a limitation, since the reliability of the PLR test likely depends on the accuracy of the device used to measure CO [49]. In particular, the threshold used to define fluid responsiveness should be higher than the least significant change of the CO measurement method. While this least significant change has been reported to be lower than 5 % for echocardiography [50], pulse contour analysis [34, 51] or oesophageal Doppler [15], it has been reported to be higher for bioreactance [22, 52, 53], while, as far as we know, it has not been investigated for suprasternal Doppler. Subgroup analysis does not support that the reliability of the PLR test was influenced by the type of cardiac output monitoring device, including bioreactance and suprasternal Doppler. Nevertheless, some of the subgroups were small. Thus, it is reasonable to consider that the reliability of the PLR test may be less solid if assessed with less reliable measures of cardiac output.

The heterogeneity of the included studies represents a limitation of the meta-analysis. Nevertheless, no threshold effect was detected. Furthermore, both the subgroup analyses and the meta-regression argued against the influence of heterogeneity on the results. A further limitation of this study is that the included studies were mainly conducted in the ICU, the sample size was generally small and that several different methods of measuring of CO were used. Therefore, this systematic review is only applicable to the specific patient populations studied and the devices used to measure CO. In addition, this systematic review was not prospectively registered in Prospero, an international database of prospectively registered systematic reviews in health and social care, what may have increased the risk of reporting bias. Additional limitations include the fact that we did not include studies not reporting sensitivity/specificity, non-full-text studies, studies not in English, and unpublished studies. This may have also increased the risk of reporting bias. Since the reliability of the PLR test in cases of intra-abdominal hypertension and in cases of insufficient increase in central venous pressure was investigated in one study for each condition, the meta-analysis does not definitely come to a conclusion about this reliability of PLR in these circumstances. Finally, two of the authors (X.M. and J.L.T.) were the authors of a large number of the studies included in the meta-analysis. Nevertheless, this was not the case for the third author (P.E.M.), so that the risk of bias was minimized.

In conclusion, this meta-analysis shows that the PLR test is accurate to predict fluid responsiveness in the intensive care unit. Its accuracy is better when its haemodynamic effects are assessed by a direct measurement of CO than by the arterial PP.