Introduction

Lymphoma consists of over 50 histologically and biologically distinct lymphoid malignancies, classified into Hodgkin lymphoma (HL) and non-Hodgkin lymphoma (NHL). Each subtype has a different prognosis and requires a specific treatment approach, depending mainly on the stage of the disease. Remarkable advances have been made in diagnosis and treatment, with an improvement in 5-year survival rate documented in Europa, particularly for NHL [1, 2]. The introduction of FDG PET/CT as a staging and response assessment tool in FDG-avid lymphoma has contributed to better clinical management in lymphoma patients and it has been included among the standard criteria of evaluation [35]. When used for remission assessment in HL and aggressive NHL, PET/CT shows a high predictive value; in indolent lymphomas, too, PET/CT appears to be a good predictor of outcome, especially in high tumour burden follicular lymphomas (FL) [68].

At the end of treatment, residual masses are frequently detected by CT scan, but PET/CT is able to differentiate between viable pathological cells and fibrotic scar tissue with greater accuracy [9]. It is known that FDG uptake revealed by PET may reflect several phenomena and does not unequivocally indicate the presence or absence of malignancy. In particular, inflammatory changes related to treatment or to residual disease are not infrequent at the end of treatment and may be associated with higher FDG uptake. Nevertheless, the likelihood of malignancy increases with increasing FDG uptake. The main challenge is to identify an accurate threshold for discriminating between these two conditions.

In 2007, the Imaging Subcommittee of the International Harmonization Project (IHP) in Lymphoma proposed a standardization of performance and interpretation of PET examinations [10]. According to the IHP criteria (IHPC), response to therapy is assessed on the basis of both PET tracer uptake and size of residual lesions measurable on CT images. Although the IHPC have been validated only in small cohorts of HL and NHL patients and the interobserver variability has not been assessed [11, 12], these criteria have been strongly recommended for use in clinical practice and research trials [13]. Alternative criteria, known as the Deauville Criteria (DC), have also been proposed for evaluation of PET scans based on the application of a five-point scale using the mediastinum and liver activity as the reference standard. The prognostic value of PET using the DC after one to three cycles of chemotherapy (interim PET) in patients with HL and aggressive NHL and the potential benefits for clinical management have been investigated [1416]. Furthermore, assessment of the DC in multicentre studies has revealed good diagnostic accuracy and intercentre concordance [17, 18]. More recently, the five-point scale has been recommended for reporting both interim PET and post-therapy PET (ptPET) [4, 5]. Nevertheless, the true effectiveness of the IHPC and DC remains unclear and there is also uncertainty over whether PET/CT performed at the end of therapy for HL and NHL is of prognostic value when using these criteria.

Therefore, the aims of this study were:

  • To evaluate the accuracy of PET/CT performed at the end of therapy for HL and NHL when adopting the standardized criteria (IHPC and DC) in defining status of disease

  • To assess which of the two models (IHPC or DC) for the interpretation of ptPET provides better prediction of the risk of progression and survival.

Materials and methods

This retrospective analysis included consecutive patients with lymphoma obtained from the central PET databases of the departments of nuclear medicine of University College London Hospital (UCLH) and San Raffaele Scientific Institute of Milan (OSR) for the period April 2008 to October 2013.

The inclusion criteria was as follows:

  1. 1.

    Diagnosis of lymphoma confirmed by histology (fine-needle or open biopsy) available at the respective centres

  2. 2.

    Availability of conventional imaging and non-imaging data for staging disease

  3. 3.

    FDG PET/CT scan performed at staging, with evidence of FDG-avid disease

  4. 4.

    FDG PET/CT scan performed at the end of conventional treatment, at least 3 weeks after the end of chemotherapy or 6 weeks following radiotherapy, in accordance with the IHPC

  5. 5.

    Available clinical information in the form of multidisciplinary team case notes and/or imaging information for validation of patient PET findings

  6. 6.

    Posttherapy surveillance, until death or for at least 12 months after the PET/CT scan to evaluate the risk of progression and the survival rate

  7. 7.

    Written informed consent for the FDG PET/CT scan and the anonymous publication of disease-related information

Patients with CNS lymphoma, patients with concomitant malignancy and retroviral positivity, and patients on other research protocols were excluded from the analysis. Finally, 101 patients (60 from UCLH; 41 from OSR) were evaluated: 55 men and 46 women with a median age of 46 years (mean 47 years, range 6 – 87 years). Of the 101 patients, 35 had HL and 66 NHL. All were treated according to the each institution’s standard protocol.

Disease status was followed by reference to the available clinical and imaging information, identifying patients negative for the presence of disease and patients with relapse and/or progression of disease (PD) at follow-up (median duration of follow-up in the overall cohort 18 months; 95 % confidence interval, CI, 15 – 20 months). From these data we also calculated time of PD, progression-free survival (PFS) and overall survival (OS). The study was performed in compliance with the principles of the Declaration of Helsinki. Patient characteristics are summarized in Table 1.

Table 1 Characteristics of the patient population

FDG PET/CT imaging

Both centres involved in the study use PET/CT tomographs to perform clinical studies and both comply with international guidelines concerning patient preparation, radiopharmaceutical administration and dose, radiation protection, imaging acquisition and quality control of scanners.

Studies were acquired using a Discovery™ PET/CT tomograph (GE Healthcare, Milwaukee, WI) or a Gemini GXL16 PET/CT tomograph (Philips Medical Systems, Veenpluis Best, The Netherlands). PET images were acquired for each bed position from the base of the skull to the mid-thigh and CT (90 – 120 mA, 140 kV, 0.8 s per tube rotation) was used for nonuniform attenuation correction. FDG PET/CT images were interpreted by two experienced nuclear medicine physicians who had access to the results of previous imaging and clinical information, but were unaware of follow-up results at the time of the ptPET evaluation in order to avoid any significant influence on image evaluation. The whole-body FDG PET/CT scans in each patient were evaluated qualitatively using either the IHPC as described by Juweid et al. [10] or the DC (five-point scale, see below) [16].

FDG PET/CT analysis

Any previously involved nodal or extranodal site identified on pretherapy staging was evaluated. According to the IHPC, ptPET was considered positive when a lymph node lesion greater than 2 cm on CT images showed residual uptake more intense than the mediastinal blood pool activity; if the lesion was smaller than 2 cm, the surrounding background activity was considered as the reference [19]. Additional rules were considered for lung nodules, residual uptake in the bone marrow and hepatic and splenic parenchyma, in accordance with the established IHPC [10]. Using the DC, the following five-point scale was applied: 1 no residual uptake, 2 residual uptake less than or equal to that in the mediastinum, 3 residual uptake greater than that in the mediastinum but less than that in the liver, 4 residual uptake moderately higher than that in the liver, and 5 residual uptake markedly higher than that in the liver or the presence of new sites of uptake. If a new site of increased uptake was observed, all potential causes other than PD, such as inflammation/infection, were considered; for example, diffusely increased bone marrow or spleen uptake, even if more intense than the liver uptake, was often due to a postchemotherapy reaction [18]. Two different thresholds of positivity for the DC were used: a score of at least 3 (DC3, i.e. scores 3 – 5) and a score of at least 4 (DC4, i.e. a score of 4 or 5).

The ptPET results were considered true-positive (TP) or false-positive (FP) and true-negative (TN) or false-negative (FN) according to the corresponding disease state at the time of the PET examination, as recorded in the clinician’s or multidisciplinary team’s case notes.

In order to estimate the risk of progression and the survival rate, calculating PFS and OS, available posttherapy surveillance data were considered. Assessment of disease state comprised routine conventional diagnostic procedures such as ultrasonography, contrast-enhanced CT, MRI, and biopsy of suspicious residual disease (when performed). Also cases judged doubtful were included in the analysis if the suspected disease state was confirmed by a short-term controlled follow-up. In the accuracy analysis, PET/CT findings were classified as positive or negative using each of the three models of evaluation mentioned above (DC3, DC4 and IHPC). For each model, negative FDG PET/CT results were classified as TN when assessment of disease state confirmed negative results or findings due to inflammatory changes. Positive FDG PET/CT findings were considered TP when assessment of disease state indicated the presence of PD. FDG PET/CT findings were considered FP or FN when the reference yielded discordant evidence.

Statistical analysis

The D’Agostino normality test was used to assess the normality of the distributions of the variables under study. The Wilcoxon-Mann-Whitney test (nonparametric) and Student’s t test (parametric) were used as appropriate. To evaluate the usefulness of FDG PET/CT in restaging, we calculated sensitivity, specificity, negative predictive value (NPV), positive predictive value (PPV), accuracy and likelihood ratios (LH+ likelihood ratio for positive results, LH− likelihood ratio for negative results) using standard methods [2022]. ROC curve analysis was performed to confirm sensitivity and specificity and to calculate the area under the curve (AUC). Simple linear regression was used to evaluate the reproducibility of the IHPC and DC. OS was defined as the time from the PET assessment performed at the end of therapy to the date of death or last clinical/imaging information. PFS was obtained by calculating the time from the PET assessment performed at the end of therapy to the appearance of clinical or radiological progression. PFS and OS were calculated using the Kaplan-Meier method. Multivariate Cox proportional hazards models were fitted in all samples to determine whether the IHPC, DC3 and DC4 were significantly associated with survival. A Cox regression model was built using a step-wise selection procedure, with the p value set to 0.05 for a feature to be entered into or left out of the model. The relationships between outcome and the included variables are summarized in terms of hazard ratios (HR) and 95 % CI. All statistical analyses were performed using IBM SPSS Statistics software, version 20.0 (IBM Corp., Armonk, NY).

Results

Of the 101 patients studied, 31 showed therapy failure at the end of treatment (group 1). Of these 31 patients, 21 had experienced further PD at the time of study observation. Despite salvage therapy, 3 of the 21 died of their disease, and 13 were still ill (6 with PD, 6 with stable disease, and 1 with partial remission) at the end of the time of observation; the remaining 5 achieved complete remission and were free of disease until the end of the study. The mean PFS in group 1 was 9.7 months (95 % CI 6.6 – 13 months), with a median of 11 months (95 % CI 3 – 12 months).

The remaining 70 of the 101 patients were judged free of disease after the end of treatment (group 2). Of these 70 patients, 9 experienced relapse in the first year after the end of treatment (mean 5 months, median 5 months, range 3 – 8 months), and 5 of these 9 experienced further subsequent PD, including 1 with lymphoblastic T lymphoma, who died. The other 4 patients achieved complete remission after salvage therapy. One of the 70 patients, who was cured of lymphoma, died of a haemorrhagic stroke (Fig. 1). The mean PFS in group 2 was 20 months (95 % CI 17.5 – 23.3 months) with a median of 15 months (95 % CI 14 – 18 months). The mean OS in group 1 was 23.1 months (95 % CI 19.6 – 27 months) with a median of 18 months (95 % CI 16 – 24 months), and mean OS in group 2 was 23 months (95 % CI 20 – 26 months) with a median of 17 months (95 % CI 15 – 21 months).

Fig. 1
figure 1

Outcome in patients (CR complete remission, PD progression of disease, SD stable disease, PR partial remission, CCR continued complete remission, dagger symbol died)

The performance of ptPET was evaluated according to the corresponding disease state at the time of the PET examination.

IHPC evaluation

In group 1, 30 of the 31 patients were positive on PET (TP) using the IHPC. In one patient who received chemotherapy for stage III scleronodular HL (cHL), residual disease was suspected on the basis of the clinical examination 1 month after the end of chemotherapy, although ptPET was negative using both the IHPC and DC. Two months later, a further PET/CT scan and a contrast-enhanced CT scan confirmed lymph node disease in the right laterocervical chain, as assessed by previous clinical evaluations. In this patient the ptPET result was judged FN. In group 2, 47 of the 70 patients were negative on ptPET (TN). The sensitivity, specificity, PPV, NPV and accuracy of ptPET were 97 %, 67 %, 57 %, 98 % and 76 %, respectively (Table 2). In the likelihood ratio analysis, LH+ was 2.9 and LH− was 0.04. The AUC was 0.82 (95 % CI 0.73 – 0.90).

Table 2 Diagnostic performance of FDG PET/CT performed at the end of treatment using different evaluation criteria

In the Kaplan-Meier analysis, the 2-year cumulative PFS and OS rates in patients positive on FDG PET/CT were 49 % and 53 %, respectively. In contrast, the 2-year cumulative PFS and OS rates in patients negative on FDG PET/CT were 75 % and 74 %, respectively (Fig. 2). The differences in PFS and OS rates between patients positive and negative on FDG PET/CT were significant (p < 0.05). The multivariate Cox regression analysis for the risk of PD in patients positive on FDG PET/CT using the IHPC showed HR of 1.57 (CI 95 % 0.37 – 6.4, p = 0.03; Table 3).

Fig. 2
figure 2

Kaplan-Meier analysis of progression-free survival in relation to the PET/CT results using the IHPC

Table 3 Risk of disease progression and 2-year cumulative PFS and OS rates in patients undergoing FDG PET/CT at the end of treatment, using the IHPC, DC3 and DC4

DC evaluation

In group 1, 30 of the 31 patients were positive on PET using DC3, and 26 of the 31 using DC4 (TP). In group 2, 53 of the 70 patients were negative on ptPET using DC3 and 61 of the 70 using DC4. The sensitivity, specificity, PPV, NPV and accuracy of ptPET were 97 %, 76 %, 64 %, 98 % and 84 %, respectively, using DC3, and 92 %, 87 %, 74 %, 92 % and 86 %, respectively, using DC4 (Table 2). In the likelihood ratio analysis, LH+ was 4.04 for DC3 and 7.07 for DC4, and LH− was 0.03 for DC3 and 0.09 for DC4. The AUC was 0.86 for DC3 (95 % CI 0.78 – 0.93) and 0.90 for DC4 (95 % CI 0.83 – 0.97).

In the Kaplan-Meier analysis, the 2-year cumulative PFS rates in patients positive on FDG PET/CT using DC3 and DC4 were 47 % and 20 %, respectively. In contrast, the 2-year cumulative PFS rates in patients negative on FDG PET/CT using DC3 and DC4 were was 74 % and 73 %, respectively (Figs. 3 and 4). The differences in PFS rates observed between patients positive and negative on FDG PET/CT were significant (p < 0.05) using both DC3 and DC4.

Fig. 3
figure 3

Kaplan-Meier analysis of progression-free survival in relation to the PET/CT results using DC3

Fig. 4
figure 4

Kaplan-Meier analysis of progression-free survival in relation to the PET/CT results using DC4

The 2-year cumulative OS rates in patients positive on FDG PET/CT were 48 % and 36 % using DC3 and DC4, respectively, and in patients negative on FDG PET/CT were 74 % and 75 % using DC3 and DC4, respectively (p < 0.05 for both DC3 and DC4). The multivariate Cox regression analysis for the risk of PD in patients positive on FDG PET/CT using DC3 showed HR 0.7 (CI 95 % 0.1 – 3.7, p = 0.07), and using DC4 showed HR 3.2 (CI 95 % 0.9 – 9.1, p = 0.006); Table 3). Simple linear regression between DC4 and the IHPC scores (highest values) showed a partial but not complete correlation (R 2 = 0.5, f ratio 74; p < 0.0001).

Discussion

The diagnostic role of PET assessment at the end of treatment in patients with lymphoma has been widely addressed in the literature, with PET being considered of particular value in patients with radiological evidence of residual disease [4]. Determination of residual disease is important in both HL and aggressive NHL, where the objective of therapy is the complete eradication of disease. In the case of FL and other NHL subtypes characterized by lower clinical aggressiveness, guidelines do not recommend the routine use of FDG PET/CT for response assessment at the end of therapy. However, there is a consensus among clinicians that wider clinical use of FDG PET/CT is appropriate at least in FL, because emerging evidence indicates that metabolic evaluation at the end of treatment permits better prognostic stratification of patients [7]. In order to obtain a scenario very similar to everyday reality, consecutive patients treated for HL and NHL were included in our series, which included patients with FL and other NHL histotypes with the proviso that evidence of FDG-avid disease was present at staging and either chemotherapy or radiotherapy was implemented.

Among our patient cohort, significant variability in the diagnostic performance of FDG PET/CT was observed depending on the interpretation criteria adopted (i.e. IHPC, DC3 or DC4). The IHPC and DC3 gave a very high sensitivity of 97 %, and their specificities were 67 % and 76 %, respectively. Due to high FP rates, limited PPVs (57 % for the IHPC and 64 % for DC3) were observed; on the other hand, NPV was 98 % for both criteria. Using DC4, sensitivity was lower (92 %), specificity was higher (87 %), PPV was higher (74 %), and NPV was adequate (92 %). Previous FDG PET/CT studies in HL patients have shown NPVs of 95 – 100 % and PPVs of more than 95 % [23, 24]. In recent studies, mostly involving interim PET in aggressive NHL, NPV for ptPET was found to range between 80 % and 100 %, while PPV was lower and more variable from 50 % to 100 % [2528].

As indicated above, using the different FDG PET/CT evaluation criteria, we obtained a high NPV (92 – 98 %) and a more variable PPV (57 – 74 %). Taking into account the fact that about two-thirds of patients enrolled in our study were affected by NHL, this variability in PPV and NPV appears similar to the range of results from evaluation of the cumulative data from previous studies. This observation could support the hypothesis that the principal cause of the variation in results obtained in different studies was the use of different criteria for image interpretation, even if heterogeneity with respect to instrumentation and acquisition protocols may also explain some of the variability [29].

We found that DC, and in particular DC4, provided better diagnostic accuracy than the IHPC. Although the IHPC were based mainly on the authors’ expertise, given the limited available literature, they represent a milestone on the road toward standardization in the use of PET for response assessment in lymphoma. A qualitative evaluation of residual uptake has been judged adequate for determining whether a PET scan is positive or negative. The feasibility of using semiquantitative data is in any case severely limited by variability with regard to scanner and image reconstruction, and such data do not appear essential for ptPET interpretation, even if some authors have hypothesized that evaluation of relative reductions in standardized uptake values (SUV) may be beneficial for response assessment [10, 30].

The diagnostic accuracy of the DC for interim PET reporting has previously been tested in other retrospective studies (e.g. Gallamini et al. [15]) and the use of the DC is currently under evaluation in prospective studies [31]. Based on the positive experiences with the use of the DC in reporting interim PET, it has been proposed that these criteria might also be employed for the evaluation of ptPET [4, 32], particularly in clinical trials. It is to be noted in this context that discordant interpretations among reviewers can occur in more challenging cases, e.g. in discriminating physiological from pathological uptake in scans with prominent brown fat uptake, in distinguishing the healing process from residual disease in patients with pathological fractures, in differentiating misregistered physiological gut uptake from liver activity, and in interpreting uptake when different arm positioning has been used in baseline and interim studies [18]. In our analysis, instances of discordant scoring among the observers using different criteria were resolved by consensus. Analysis of concordance among the observers was not performed because good agreement in the application of the DC has already been demonstrated in previous studies [17, 18].

While the use of DC4 yielded the best accuracy in our study, the rate of FN results was markedly higher than when using DC3 and the IHPC (18 % vs. 3 %). One patient who experienced recurrence after therapy for stage III cHL was negative with the use of all three criteria. The available literature data indicate that at the end of treatment Deauville scores of 1 and 2 reliably represent a complete metabolic response, while scores of 4 and 5 indicate residual disease. Experts suggest a cautious approach in evaluating a score of 3, and our data support this recommendation, even if current evidence shows that the majority of patients with this score have a good prognosis. Generally, such a score warrants prompt further clinical investigations to ensure that there is no delay in initiating salvage therapy, should it be required. Especially in response-adapted trials exploring treatment de-escalation, a score of 3 should be considered to represent an inadequate response to therapy, to avoid under-treatment [4]. These considerations also suggest that the threshold selected for PET positivity may vary depending on the clinical context, the histological subtype and the therapies used.

As shown by Chiaravalloti et al., in contrast to mediastinal blood pool activity, the intensity of liver uptake can vary during chemotherapy. However, while statistically significant differences in liver SUV were found between PET at staging and interim PET, no relevant changes in liver SUV were found between PET at staging and ptPET. Therefore, changes in hepatic metabolism should not affect ptPET scoring in any way [33].

To date, few reported studies have focused on the prognostic value of PET when performed at the end of therapy in patients with HL or NHL using current standard of care therapy outside clinical trials, and the results of these studies have been heterogeneous. One study, by Barnes et al. [34], showed a lack of value of interim PET in predicting progression of HL or NHL, whereas scans carried out at completion of therapy were highly predictive of PFS and OS. Similarly, Trotman et al. demonstrated that FDG PET/CT status at the end of immunochemotherapy induction in patients with FL is strongly predictive of outcome [35]. Nevertheless, there have been no previous comparisons of the use of the DC and IHPC for PET assessment at the end of therapy in a patient population mimicking a standard clinical scenario.

Our survival results obtained using Cox regression and Kaplan-Meier analysis revealed significant differences when using the IHPC, DC3 and DC4. A positive PET/CT using DC4 was strongly associated with a higher risk of PD and increased mortality, and this model of evaluation proved to be a better predictor than DC3 or the IHPC. Thus, evaluation of PFS and OS at 2 years of follow-up showed a clear association between a positive examination and worse outcome, which was most pronounced when using DC4: PFS, OS and HR were 20 %, 36 % and 3.2 using DC4; 47 %, 48 % and 0.7 using DC3; and 49 %, 53 % and 1.57 using the IHPC, respectively; Figs. 2, 3 and 4). Conversely, a negative scans predicted favourable outcomes, but with similar results among all three models (Table 3).

It is expected that ongoing prospective studies on PET-tailored therapy will yield further data that will assist in defining the best strategy for the use of the DC in particular situations. In future, it would also be useful to evaluate whether the introduction of a semiquantitative approach, such as SUV analysis comparing a baseline and ptPET/CT studies, could improve accuracy. Another issue warranting study is whether the integration of ptPET/CT results with other prognostic indices, such as the Follicular Lymphoma International Prognostic Index (FLIPI) or lactate dehydrogenase levels, might improve prediction of outcome in lymphoma patients.

The main limitation of this study was its retrospective nature. Consecutive patients treated for different histological subtypes of lymphoma were included in our series. Different therapeutic approaches were used depending on the patients’ characteristics, with a consequent impact on overall outcome. In addition, the instrumentation and acquisition protocols used varied between the centres, and in some cases also over the observation period, preventing reliable semiquantitative analysis. On the other hand, these very heterogeneous characteristics simulated a scenario very similar to everyday reality and permitted a comparison among the different PET interpretation criteria. The use of the IHPC and DC systems in clinical reality is illustrated in Figs. 5 and 6 that show PET/CT images and their interpretation in two example patients.

Fig. 5
figure 5

A 39-year-old female patient with stage II diffuse large B-cell lymphoma. a Coronal PET/CT images at staging: there is intense tracer uptake in the upper mediastinum corresponding to a bulky mass. b Corresponding coronal ptPET images: at the end of chemotherapy, residual uptake equal to liver uptake is apparent (red arrow). This result corresponds to a Deauville score of 3 (DC3). c Axial ptPET images: residual mediastinal tissue greater than 2 cm on CT images shows residual uptake more intense than the mediastinal blood pool activity. This result corresponds to a positive IHPC score. This patient showed a complete response and was still free of disease after a follow-up of 4 years

Fig. 6
figure 6

A 28-year-old female patient with stage IV Burkitt lymphoma. a Coronal PET/CT images at staging: there is evidence of disease at the right pulmonary hilar (red arrow) and mediastinal lymph nodes, with involvement of the subcarinal region (black arrow). b Corresponding coronal ptPET images: at the end of chemotherapy, residual uptake slightly greater than liver uptake is apparent in the subcarinal (black arrow) and right pulmonary hilar (red arrow) lymph nodes. This result corresponds to a Deauville score of 4 (DC4) and a positive IHPC score. This patient showed a partial response and reached a complete response after salvage therapy 2 months later

Another important limitation of this study was that the disease state was histologically confirmed only when this was required by the clinical management. Such an investigation to confirm disease recurrence was therefore available in only four patients; in most patients the presence of disease was confirmed by instrumental investigations (41 of the 101 patients) or on the basis of follow-up (101 patients). The results of our study in a heterogeneous population simulating a routine clinical situation indicate that the the DC are simple to apply and certainly more effective than the IHPC. Nevertheless, the experience so far derived using the DC in research protocols cannot be immediately translated into clinical ptPET/CT evaluation, although these criteria may be a valuable tool capable of assisting in image interpretation. Future large prospective studies in homogeneous populations are required to define the best strategy for DC use in particular clinical situations.

Conclusion

This retrospective study indicates that, in lymphoma patients, the value of FDG PET/CT for response assessment after completion of treatment is influenced by the interpretation criteria adopted. Differences in the risk of PD and survival rate were shown in patients with positive PET using the IHPC, DC3 and DC4. The best performance and survival prediction results were obtained using DC4, but further studies are needed to definitively confirm the reliability of this approach in the clinical setting.