Keywords

1 Introduction

Magnetic resonance imaging (MRI) is an established imaging modality integral to many modern cancer management pathways. Amongst the many strengths of MRI are its superior soft tissue contrast resolution, multi-planar capability and the lack of ionising radiation. MRI is also widely available in many hospitals.

With MRI, the anatomical arrangement of normal and pathological structures, and their tissue properties, are probed by echoes returned from radiofrequency pulse sequences targeted at regions of interest in the body, within magnetic fields in the scanner [1]. Exogenous contrast agents are often used in conjunction [2]. Current clinical MRI examinations utilise multiple pulse sequences to generate images with different weighting to tissue contrast. These images can provide important details of oncological disease status, such as nature and structure of tumours, tumour size, tissue planes, adjacent neuromuscular bundle or organ invasion.

Anatomical coverage of clinical MRI examinations is traditionally limited to individual body parts (such as brain, liver, pelvis). This is as a trade-off for meticulous spatial and contrast resolution and for scan times to be reasonably tolerable for patients. Clinical MRI examinations are, therefore, adept to loco-regional disease assessment and response evaluation.

Functional MRI parameters have recently been enabled by advances in MRI hardware and sequence developments. Diffusion-weighted imaging (DWI) sequences are now widely adopted. Tumour vascularity and perfusion can be assessed with different methods, such as with dynamic contrast enhanced sequences using exogenous contrast agents [3], or arterial spin labelling [4], with MR tagging of blood as endogenous tracer. Tissue hypoxia can be probed with sequences such as blood oxygen level-dependent (BOLD) MRI [5]. Chemical composition within tumours and its microenvironment may be glimpsed by techniques such as MRI spectroscopy or chemical exchange saturation transfer [6, 7]. Functional MRI sequence development continues to be an expanding field of research.

Another area of development has been the expansion to whole body coverage. While this may not be a novel concept, improvement in hardware, such as improved gradient systems, interconnected phase array coils and protocol and sequence designs, such as parallel imaging strategies, have enabled adequate quality images with full body coverage to be attainable within an acceptable scan duration. These have paved ways for the technique to be entering routine clinical use in malignancy with tendency for disseminated lesions, such as in metastatic prostate, breast cancer and in multiple myeloma [8,9,10].

MRI has been shown to be of high accuracy in primary disease assessment in a multitude of cancer type and location, for example, in the brain, head and neck, hepatobiliary, rectum, gynaecological tract and in the prostate, bones and soft tissues [11,12,13,14,15,16,17]. It is intuitive then that MRI would be the natural candidate of choice for response evaluation for these tumours. Depending on the treatment options for the particular clinical context, this may be after the use of ablation, chemotherapy or radiation for loco-regional disease (in neoadjuvant, primary treatment or adjuvant setting), or after systemic treatment in more widespread disease.

It is important to note that while many MRI techniques are now feasible and available for response evaluation, there is much variation in the extent to which these have been validated or adopted into clinical practice. For many indications (e.g. in post-neoadjuvant rectal tumour), use of MRI has been well studied [12, 18, 19]. In other areas (e.g. whole body MRI in multiple myeloma [8]), robust validation of MRI evaluation of treatment response remains work in progress. Some advanced techniques may show potential, but clinical translation may be further away on the horizon.

The rest of the chapter would focus on some of the general concepts underpinning response evaluation with MRI, with particular emphasis on established criteria and DWI.

2 Response Evaluation by Tumour Burden or Anatomical Parameters

Change in tumour burden, as visualised on radiological images, is widely thought of as useful for response evaluation in cancer management. Its acceptance is built on the premise that changes in tumour burden (such as size and number of lesions) can act as surrogate predictors for patients’ survival and quality of life, the ultimate end points of cancer therapeutics.

2.1 WHO and RECIST Criteria

In 1981, the World Health Organization (WHO) first published tumour response criteria [20]. Over the next decades, Response Evaluation Criteria in Solid Tumours (RECIST) was published in 2000 [21], attempting to further standardise measurement of tumour burden on anatomical imaging. This was further validated in cohorts of patients with metastatic cancer undergoing cytotoxic therapy treatment, resulting in the refined criteria published as RECIST 1.1 in 2009 [22].

By providing internationally agreed principles and common languages, they provide standardised frameworks for reporting tumour response, thereby increasing robustness and comparability of clinical trial results from different research centres. This is important for when an ‘objective response’ may be an important end point in determining the success of a phase II clinical trial, or for when a clear definition of disease progression is needed for trials using progression-free survival as an end point, such as in a phase III trial [21].

While the main use of these criteria may be for clinical trials reporting, especially in the metastatic disease context, it is undoubted that many key features have influence on day-to-day clinical oncological imaging practice. Common to the evolving WHO, RECIST 1.0 and 1.1 criteria are:

  1. 1.

    Definitions of measurable lesions

  2. 2.

    Specification of how dimensions should be measured and

  3. 3.

    How changes of these measurements over time should be computed to generate response categories

In RECIST 1.1, multiple strategies are adopted to increase practicality, precision, reproducibility and robustness of response evaluation. For example, defining a minimum size for individual measurable lesion (at least 10 mm for longest dimension of a tumour, 15 mm for short axis measurement of nodes) and summing together of the diameters of target lesions as a single parameter (SPD) to define response category may help to limit impact of individual measurement errors. Limiting the number of lesions, which needs to be longitudinally measured to up to five ‘target lesions’ (selected based on size, perceived reproducibility on repeated measurements and representativeness of all involved organs), reduces the burden to reporters. Clarity of the different response categories with appropriate thresholds increases robustness and reproducibility at the response category level (e.g. Partial response is defined when there has been at least 30% in the SPD of target lesions. Progressive disease can be defined when there is new lesion, or at least 20% increase (and at least 5 mm absolute increase) in the SPD of target lesions compared to baseline, or the smallest SPD, achieved on the study).

MRI may be used under RECIST1.1. It is acknowledged that there are many acquisition variables in MRI, which may impact greatly on image quality, lesion conspicuity and measurement. Rather than specifying exact sequences to be used in any particular MRI study, RECIST1.1 stipulated that scanning sequences should be optimised, and the same scanner and protocol should be matched as closely as possible across imaging time points. Unless for isotropic acquisition, maximal diameter of each target lesion should always be measured on the planes in which a particular MRI sequence is acquired in (not re-formatted).

2.2 iRECIST

With increasing use of immune modulators as anticancer therapy, it has become recognised that some patients may demonstrate unusual pattern of response as per anatomical parameters. These may include a delayed response or even apparent initial increases in tumour burden, which is followed by late, deep and durable responses (pseudoprogression). In light of this, investigators began to apply different modifications to the conventional RECIST, culminating in the RIECIST steering group to propose an additional response evaluation algorithm for use in studies evaluating efficacy of immunotherapies in 2017 (iRECIST) [23, 24].

Many key features, particularly surrounding lesion measurements and summations remain similar to RECIST1.1. A main new change is the requirement of a confirmatory imaging study in 4–8 weeks if there are features of progressive disease on initial response evaluation studies. Termed iUPD (unconfirmed PD) at this initial time point, if there are features reaching threshold of further growth on the following confirmatory study, then disease progression is confirmed (confirmed PD, iCPD). Assuming the patient remains clinically stable, it has been recommended that treatment be continued after iUPD. It is acknowledged that it is challenging to differentiate between pseudo and true progression. The proposed strategy hoped to strike the balance between the undesirable early discontinuation of an effective treatment and equally undesirable continuation of a non-effective therapy. Robust validation of this approach is an on-going effort.

2.3 Other Response Evaluation by Anatomical MRI in Specific Disease Contexts

Comprehensive coverage and detail critique of all the individual scenarios, where MRI can be used for response evaluation, are beyond the scope of this chapter, but some of the more established areas in use in clinical practice are described.

2.3.1 Brain Tumour

Specific challenges in measuring brain tumour burden for response evaluation had led to the community adopting specific response criteria (e.g. that proposed by the Response Assessment in Neuro-Oncology (RANO) working group) [25], such as for high-grade and low-grade gliomas, leptomeningeal metastases and brain metastasis. MRI is an indispensable tool in these.

With irregular shapes of gliomas, bi-dimensional measures are used for measurements of gliomas in RANO criteria, unlike single-dimension measures used in RECIST. Furthermore, how response evaluation of gliomas may be described on different specific MRI sequences has been specified. This is because tumoural changes in different MR sequences may carry different biological significance.

For example, anti-angiogenic agent, such as bevacizumab, is believed to reduce vascular permeability, even if a glioma is biologically progressing. Definition of PD has, therefore, been expanded to include changes such as significant increase of T2/FLAIR signal areas and emergence of new lesion (beyond the more intuitive increased of enhancing tumour area). These are in order to capture patients with non-enhancing but progressing disease. In addition, post-contrast images are not used in response criteria for low-grade glioma, as these tumours do not often show contrast enhancement.

Other key features of the RANO criteria are the incorporation of clinical status and the patient’s reliance on corticosteroid use as part of overall response evaluation criteria (Fig. 3.1). Pseudoprogression can also confound response evaluation of brain tumours, leading to some authors to propose additional immune-related response criteria (irRC) for immunotherapy trials. These adopt similar concepts of iRECIST by specifying need for a confirmatory scan at a later time point to confirm PD.

Fig. 3.1
figure 1

Glioblastoma multiforme response evaluation. Top row T2 weighted axial images of the brain; bottom row corresponding T1 post-contrast axial images; from right, (c, f)—baseline post debulking surgery; (b, e) 2 months post-radiotherapy and commencement of steroid and temozolomide; (a, d) 5 months post-radiotherapy and commencement of steroid and temozolomide. (c, f) show residual tumour manifesting as enhancing cavitating tumour with intermediate to high surrounding T2 tumour signal. (b, e) show increase in size of enhancing cavitating lesion with new adjacent nodules, and some increase in T2 signal change, in keeping with radiological progression; however, the patient remained stable and well at this stage. (a, d) similar further progressive radiological change, and by this time point, there was parallel clinical deterioration, further supporting progressive disease despite treatment

2.3.2 Hepatocellular Carcinoma (HCC)

HCC is the third most common cause of cancer related death. Liver-directed, loco-regional therapies are commonly used as a stand-alone palliative modality or as a bridge to selected patients for whom liver transplant may be a curative option. These may include thermal ablations, or transarterial embolization with varying agents to include chemotherapy, drug-eluting beads or radioactive spheres. These treatments induce local disease control through tumour necrosis, and it is recognised that there can be poor correlation between the clinical benefit and conventional size-based methods of response assessment. For instance, following adequate radiofrequency ablation, it would be anticipated that the focal area of the ablation zone would be greater than the pre-treatment tumour size.

Modified RECIST (mRECIST) was proposed as an improved tool, which has also been adopted by the European Association for the Study of the Liver (EASL) [26, 27]. Hepatic carcinogenesis is associated with a peculiar vascular derangement, manifesting as arterial enhancement, followed by washout on portal venous and/or delayed phases on imaging including MRI. For a lesion >1 cm and in a high pre-test population cohort (cirrhotic patient), this typical imaging appearance is considered diagnostic of HCC, which in turn also provides a handle for MRI evaluation of response evaluation. One of the key modifications in mRECIST is of taking measurements of only the viable (arterial enhancing) component of tumour for assessment of tumour volume and for summation to compute overall response at a patient level. Necrotic areas are ignored. mRECIST has been validated by studies showing that responders had better survival compared to non-responders to these loco-regional treatments.

Furthermore, the Liver Imaging Reporting and Data System (LI-RADS) lexicon [28], originally proposed by the American College of Radiology, has recently expanded with addition of a treatment response algorithm. This has some similarities and overlaps with mRECIST, including placing emphasis on the arterially enhancing tumour residuum. On the other hand, LI-RADS provides a more detail lexicon of MRI features and is a lesion-by-lesion response assessment tool.

2.3.3 Rectal Cancer

Neoadjuvant chemoradiation (CRT) has become standard treatment for patients with locally advanced rectal cancer. Neoadjuvant treatment aims to downsize and downstage tumours, with an aim to increasing resectability and long-term local disease control. In a minority of patients showing complete response, it is becoming apparent that they may be spared the morbid surgical resection without significantly comprising long-term outcome. Standardised and accurate MRI response evaluation is thus of paramount importance [29].

Change in tumour length is advocated as a useful measure for response evaluation, as a compromise between the measurements on any one imaging plane, which may not be reproducible, given the often complex non-orthogonal planes on which tumour grows, and ultimate 3D volumetry [29] (Fig. 3.2). Treatment effect can also be inferred by reduction of often intermediate T2 signal representing viable tumour and emergence of low-signal fibrotic tissues. This had been used in a semi-quantitative scoring system (MRI tumour regression grade), with a five-point scale of relative proportion of tumour signal and fibrotic signal, which has shown moderate success when validated with pathological tumour regression [30]. In addition, some primary non-mucinous tumour undergo mucinous change with CRT, manifesting as increasing high T2 signal, which is also a form of treatment response [31].

Fig. 3.2
figure 2

Rectal tumour response evaluation post-neoadjuvant treatment. Top row: post-neoadjuvant treatment MRI; bottom row: pre-treatment scan for comparison; from left to right: sagittal T2 TSE, coronal oblique T2 TSE, b-1000 DWI images, ADC maps. (a, b, e, f) show the primary rectal tumour in different planes pre- and post-treatment, highlighting complex orientation of the tumour. These also show the potential limitation of using single measurement of tumour length as a parameter for response evaluation (significant reduction in tumour bulk on both imaging planes: red arrows in (b, f) and orange arrow in (a, e), but only comparatively modest change in tumour length demonstrated by the orange arrows in (a, e)). Images (g, h) show high DWI signal on b-1000 images and corresponding low ADC value (yellow arrow in h), inferring restricted water diffusion due to increased cellularity pre-treatment. These has improved post treatment (c, d). A persistent low ADC band in (d) may represent a residual fibrotic band when correlated to the same region in image (b)

Furthermore, specific to the neoadjuvant, pre-surgical setting is the need to precisely describe residual tumour extent for assessment of resectability and planning surgical approach. MRI has been shown to be able to predict feasibility of successful sphincter preserving low-rectal cancer surgery, and with lesser success also at predicting outcome of clearance of the mesorectal fascia, an important surgical landmark [30].

3 Response Evaluation with Diffusion-Weighted Imaging (DWI)

Diffusion of water could be probed in vivo by DWI MRI. A basic DWI sequence involves a pair of de-phasing and refocusing pulses. Water molecules, which had moved away of its voxel between these pulses, would lead to incomplete rephasing, and hence signal loss. The degree of signal loss is related to the degree of water motion [32].

The sensitivity of the diffusion sequence to water motion can be varied by changing the parameter known as the b-value (a higher b value may be more sensitive, but at the expense of signal to noise ratio and/or acquisition time) [32]. In clinical practice, DWI MRI is usually performed with at least two b-values (e.g. b = 50 or 0 s/mm2 and other b-values from 0 to 1000 s/mm2), which enable calculation of an apparent diffusion coefficient (ADC), latter often presented in the form of a parametric map.

In biological tissues, diffusion of water molecules is thought to be ‘restricted’ by increased cell membranes and macromolecules [33]. DWI-derived parameter of ‘diffusivity’ can, therefore, act as a surrogate imaging marker of cellularity. This may be exploited for assessment of tumour response to treatment.

Some authors had proposed that DWI may act as a predictor for response to treatment, postulating that hypoxic tumours, which may have areas of necrosis (manifesting as tumours with higher ADC on pre-treatment MRI) would have a poorer response. This has been observed in some studies showing an inverse relationship between pre-treatment ADC values and eventual treatment outcome, for example, in primary colorectal tumour and liver metastases. However, such relationship is not observed consistently across tumour types and contexts [32, 34].

Use of DWI MRI in early and interim response evaluation had shown success in some studies (e.g. in breast cancer, melanoma, head and neck), although more widespread validation remains lacking.

In conventional post-treatment response evaluation settings, many studies have confirmed that responded tumours have an increased ADC, supporting the general notion that cellularity decreases with treatment response [35,36,37,38]. There are, however, important caveats. Some tumour response may be accompanied by reduced ADC, reflecting treatment response with fibrosis and scarring (Fig. 3.2). Response evaluation in lymph nodes is difficult with inherent ‘physiological’ low ADC (high cellularity) (Figs. 3.3, 3.4, and 3.5). Increases in T2 signal of tumour as a result of treatment may result in apparent increase in DWI signal (T2 shine through) though this should be readily appreciable by reviewing the ADC parametric map. Interpretation of bone marrow lesion DWI response may sometimes be difficult because of potential for development of reactive marrow hyperplasia, fatty replacement or fibrosis as a result of post-treatment change, which also influence DWI signal return [39]. Standardisation of techniques remains a challenge, from variations of acquisition techniques to models for ADC calculations. Quantitative measures of DWI (changes in absolute ADC values, volumetry) had been studied with success, but qualitative assessment of change remains the routine mainstay assessment method. Despite advances, it remains the case that DWI is an adjunctive MRI response evaluation tool, and it should be assessed in conjunction with other MRI, clinical and laboratory findings (Fig. 3.3).

Fig. 3.3
figure 3

Paediatric lymphoma response evaluation. Whole body MRI with DWI. 3D maximal intensity projection (MIP) of b-800 images. (a) Interim study post two cycles of treatment, (b) baseline image. See also Figs. 3.4 and 3.5. At baseline, there are extensive thoracic lymphadenopathy showing enlargement and increased DWI signal, in keeping with known diagnosis of classical Hodgkin lymphoma (stage 2). Blue arrows on both images show prominent right inguinal lymph nodes, which have high DWI signal but with normal morphology (see Fig. 3.5). These are confirmed benign on FDG PET and ultrasound (not shown). Orange arrows show ‘false-positive’ areas of high DWI signal at the sternum and anterior ends of ribs on the interim study and left anterior ilium of the baseline study; these are thought to be as a result of combination of field inhomogeneity and imperfect fat suppression, as well as respiratory motion on the interim study. This is confirmed by absence of signal abnormalities on other structural sequences (not shown). Dotted green and blue lines correspond to approximate levels of axial images in Figs. 3.4 and 3.5

Fig. 3.4
figure 4

Paediatric lymphoma response evaluation. Bottom row showing baseline appearances with intermediate T2 signal (middle column) enlarged anterior mediastinal nodes; these demonstrate high DWI signal on b-800 images (left hand column) and corresponding low ADC value (right hand column). There is reduction in size and DWI signal on the interim study (top row)

Fig. 3.5
figure 5

Paediatric lymphoma response evaluation. Bottom row showing baseline appearances of the inguinal level. Small right inguinal lymph nodes (red arrows) have normal morphology including retained fatty hilum, corroborated to be benign on FDG PET and ultrasound (not shown). These show inherent ‘physiological’ high DWI signal. These remain unchanged on the interim response scan (top row). Green arrows show retracted right testis in the right inguinal canal as an incidental finding

More sophisticated methods of DWI modelling may allow extraction of additional biologically relevant imaging markers. For instance, diffusion tensor imaging (DTI) exploits the directionality of water molecular diffusion and could be used to study tumour microstructural changes with treatment [40]. Intravoxel incoherent motion (IVIM) explores the bulk motion of water flow within tumour microvasculature (as supposed to water diffusion), which may be dominant in low b-values images and may be modelled through bi-exponential decay of DWI signal loss [41, 42]. These are more resource-intensive with need for longer acquisition and post-processing time. Tumour response evaluation with these parameters had been shown to be feasible, but these remain exploratory.

4 Conclusion

With superior soft tissue contrast resolution and widespread availability, MRI has established roles in a multitude of tumour response evaluation setting. Evaluation of anatomical tumour burden change remains the main parameter for response evaluation. Tumour and MRI sequence specific parameters and criteria may be necessary in different clinical scenarios for optimal assessment.

In addition, with its versatility, advances in hardware and sequence design, a vast number of functional MRI parameters are available, which are at different stages of validation. DWI had been described briefly in this chapter, but many exciting developments are beyond the scope of this chapter. Standardisation and validation remain a challenge to implementation.