Introduction

Intrafraction motion during prostate radiation therapy could lead to inaccurate dose delivery through geometric miss of the target and excessive irradiation of organs at risk including rectum [114]. In context of prostate stereotactic ablative radiation therapy and intra-prostatic nodal boost, the dose delivered in individual fractions becomes critical. However, the effect of the intrafraction motion on the delivered treatment is not well understood, which is vital for assessing the requirements of intrafraction motion management methods. Few studies [47] have attempted to correlate the 3D prostate motion with the corresponding dosimetric impact. Li et al. [6] calculated the motion inclusive dose for patients’ treatment courses by convolving static treatment doses with corresponding motion probability functions. They acknowledged that the convolution approaches do not account for dose-per-fraction effects and inter-play effects. Therefore, correlation of individual fraction motions and their dosimetric impact was not investigated. In another study, Langen et al. [4, 5] investigated the dosimetric effect of the motion for each fraction, for helical Tomotherapy, finding no strong relationship between the motion and dosimetric impact on the target. However, the applicability of their result is also limited to helical Tomotherapy systems; whereas other delivery techniques such as intensity modulated radiation therapy (IMRT) and volumetric modulated arc therapy (VMAT) are more widely available and utilised, and may show correlation. Colvill et al. [7] in their study assessed the dosimetric impact of multi-leaf collimator (MLC) tracking and gating as intrafraction motion correction strategies and compared these with no motion correction using 20 fractions from five patients. They showed strong correlation between the motion and its dosimetric impact on the target. The authors acknowledged that their study had selection bias as most of the studied fractions were in the upper range of the motion percentiles. It should be noted that none of the existing studies investigated the use of various motion metrics on the correlation between the motion and their dosimetric impact on target coverage and OARs. This is important because quantification of prostate motion itself is likely to have a major effect on the correlation. In case of IMRT and VMAT the dosimetric effect of the motion has so far not been investigated in a study without any selection bias. In addition, the correlation between the motion and organs at risk (OARs) doses has also not been assessed to date.

The objective of this study was to quantify 3D prostate motion using novel motion metrics along with other metrics available from the literature and to correlate these motion measures with the dosimetric effect of the motion on targets and OARs. The aim was to investigate whether such metrics can be used to indicate action thresholds for prostate motion, i.e., when effects will be significant or not and hence when treatment parameters and decisions may need to take this into account.

Materials and methods

Patient cohort

Prostate intrafraction motion data for 294 fractions from a cohort of 18 patients were retrospectively investigated. Of these, 8 patients had intrafraction motion determined using a novel kV fluoroscopic imaging system, called kilovoltage intrafraction monitoring (KIM) [15]. Another 10 patients were monitored using the Calypso® system [8, 16]. Both of these trials had the necessary ethics, governance, legal, and regulatory processes completed prior to the initiation. Informed consent was obtained from all individual participants included in the study. Patients from these two trials were pooled together to increase the sample size for the analysis and differences between the measured motion from them were not significant (p < 0.05). All the fractions with dose reconstruction available from the trials, at the time of the initiation of this study, were used without any selection (or exclusion) bias. Patient treatments were delivered using VMAT and a standard departmental protocol [17] with all patients simulated with an empty rectum and a comfortably full bladder. The patient cohort comprised of two main treatment fractionation schedules, 13 patients received the conventional fractionation schedule of 80 Gy in 40 fractions and 5 patients received the second fractionation schedule involving radiation therapy boost fractions. A detailed distribution of patient cohort and treatment types and times is presented in Table 1. Patients were aligned daily with imaging before treatment and negligible inter-fraction variations were assumed.

Table 1 Summary of patient cohort

Motion monitoring

KIM real-time target position was used to monitor the intrafraction prostate motion in the KIM trial. The KIM technology uses kV fluoroscopy to monitor, in real-time, the 3D position of three radio-opaque markers implanted into the prostate target [9, 15]. It uses 2D X-ray images to accurately estimate the 3D target position at 5–10 Hz during the treatment delivery [18]. The Calypso® system uses implanted electromagnetic transponder-based markers [19] to report the position at 10 Hz. Each patient had three implanted markers. Both the KIM [9, 15] and Calypso® [19] systems have been reported to have sub-millimetre accuracy. In both trials, the motion of the centroid of the implanted markers, with no corrections, was used as a surrogate for intrafraction prostate motion. The planning target volume (PTV) margins were 5 mm posteriorly and 7 mm elsewhere on the clinical target volume (CTV). In this study PTV was modelled with the same motion as the target to represents a CTV with zero margins and therefore is a representation of the most extreme case.

Measure of prostate motion

The mean prostate motions across patient fractions were visualised using the probability of 3D displacement (see Fig. 1). This was defined as the percentage of time the 3D displacement was over a certain value, ‘x’, during an individual fraction. It provides a descriptive representation of the motion and has previously been used to present prostate motion [11]. The prostate motion from individual fractions was quantified to a single figure-of-merit through various motion metrics. These metrics comprised of 9 newly defined measures. For each prostate motion trajectory, motion was quantified by the mean of the highest x% (H x% ) of the 3D displacements (motion); x was investigated for 10, 20, 30, 40, 50, 60, 70, 80, 90 and 100%, where H100% represents the mean of all 3D displacements at a fraction while lower percentages represent the more outlying motion. Previously in the literature, Langen et al. [12] have defined and used the fraction of time the prostate displacements are: >3, >5 and > 7 mm; and Li et al. [6] have defined and investigated R95, R90, and R80, which represent that the 3D displacement is less than Rx (magnitude) during x percentage of the monitoring time. Metrics from both these studies were also evaluated here.

Fig. 1
figure 1

Probability of 3D displacement. The dashed line is the median of probability and the solid line is the mean probability. The shaded region shows the mean ± 1 standard deviation

Motion inclusive dose reconstruction

Previously, motion inclusive dose reconstruction for the trial data used in this study has been reported [79, 16]. Dose delivered in the presence of prostate motion, with and without motion compensation, during each fraction was estimated using a motion-synchronized isocenter shift dose reconstruction method developed and validated by Poulsen et al. [20]. In this study, the estimated dose delivered without any motion compensation after the initial pre-treatment beam-prostate alignment (i.e. first intrafractional shift is always zero) was used to investigate the effect of motion on the dosimetry. It should be noted that the motion inclusive dose reconstruction system assumes that prostate and surrounding OARs move together rigidly and ignores any differential motion between the prostate and OARs.

Analysis

For all patients, intrafraction motion was visualised through the probability of 3D displacements. The dosimetric impact of motion was investigated through correlation between the motion metrics and variations in PTV D95% (minimum dose to 95% of the PTV), CTV D99% (minimum dose to 99% of the CTV), rectum V65% (volume of rectum receiving at least 65% of the prescribed dose) and bladder V65% (volume of bladder receiving at least 65% of the prescribed dose) from the planned values in the presence of motion. The analysis performed in this study is schematically presented in Fig. 2. PTV D95% was evaluated as it is representative of the CTV coverage without margin. Rectum and bladder were investigated as these are OAR for prostate radiation therapy. We specifically investigated V65% as this is a critical dose-volume criteria used to infer plan quality within our local planning protocol [17]. The correlations were evaluated using the Pearson’s correlation coefficient and the coefficient of determination (R 2) was reported. An action level threshold value of motion metric was estimated by segmenting the results (motion metric and measure of the dosimetric effect) into two groups using an iterative linear regression model fitting procedure. The slopes of the two segmented groups were compared and also the differences in the dosimetric effect in these two groups were statistically compared using the F test (ANOVA). The Wilcoxon rank sum test was used to evaluate the differences between variations in PTV D95% and CTV D99%.

Fig. 2
figure 2

Schematic of methods used in this study. Correlation of prostate motion of individual fractions and corresponding dosimetric impact: the dose delivered in the presence of intrafraction motion was estimated using motion inclusive dose reconstruction which utilises the original treatment plan and motion from individual fractions. This estimated dose was compared with the original planned dose and the differences (dosimetric impact metrics) were correlated with motion metrics used to quantify intrafraction motion. Note Fx. stands for fraction

Results

Prostate motion

The prostate motion for the patients is presented, through probability of 3D displacement, in Fig. 1. The quantification of this motion using various metrics was performed and mean values (across all the fractions) and 5th and 95th percentiles values are presented in Fig. 3. In the calculation of the percentage of time that 3D motion is greater than a certain value, only the >3 mm metric had considerable sample size (>30) with non-zero values. The other two metrics >5 and >7 mm with small sample size (<30) were not considered further in this study. There is large variability across fractions in all the motion metrics as seen in Fig. 3. For example, averages (ranges) of some of the metrics are: mean of all motion is 1.5 mm (0.3–9.9 mm); mean of highest 50% of motion is 1.9 mm (0.4–15.4 mm); the percentage of the time motion  > 3 mm is 9.4% (0–100%); and R80 is 1.9 mm (0.4–16.0 mm). Moreover, values of the three R80, R90 and R95 metrics are similar and this is also true for means of highest and means of lowest. This might be due to small motion in the majority of fractions analysed in this study as seen in Fig. 1.

Fig. 3
figure 3

Results: Measure of various motion metrics. Comparison of 3D displacement metrics: a mean measures, Hm% represent means of highest m% and 100% represents the mean of all the values; b the percentage of the time motion is greater than a certain value measures, n values gives number of data points in individual categories with non-zero values; c Measures of displacement such that 3D displacement is less than Rx (magnitude) during × percentage time. The marker represents mean value (across all the fractions) and error bar represents 5th and 95th percentile values

Dosimetric impact of the intrafraction motion

Correlation between the dosimetric impact of motion, differences in PTV D95% (CTV + 0 mm margin) and CTV D99%, rectum V65% and bladder V65% from the planned values, and the motion metrics of all the fractions are presented in Table 2. The PTV D95% had substantially stronger correlation with the measured motion metrics than CTV D99%, rectum V65% and bladder V65%, for all the metrics. Mean of highest 50 and 60% of motion had the strongest correlations with the differences in PTV D95% with R 2 value of 0.81. Mean of highest 20 and 30% motion and R90 had highest correlations with CTV D99% with R 2 value of 0.62. The distribution of differences in the target coverage with respect to the mean of highest 50% of motion is presented in Fig. 4. The PTV D95% and CTV D99% had some correlation (R 2 = 0.58) with each other and the differences between these two values were significant (p < 0.05). In the case of OARs, for rectum V65% correlation was highest with the percentage of the time motion >3 mm with R 2 value of 0.58, whilst for bladder V65% correlation was highest with mean of highest 90% of motion and overall means of the motion with R 2 values of 0.69. As noted earlier dose reconstruction system has a limitation in terms of estimation of OAR doses because it assumes the same rigid motion as for the prostate. However, at present the dose reconstruction method is one of few methods that allow for potential estimation of OARs doses.

Table 2 Correlation (R 2, and 95% confidence intervals) of various motion measures and the variations from the planned CTV and PTV coverage and rectum and bladder doses
Fig. 4
figure 4

Scatter plot showing differences between planned and delivered PTV D95% and CTV D99% versus the mean of the highest 50% motion at 294 treatment fractions. Correlation (R 2) of motion with PTV D95% and CTV D99% were 0.81 and 0.58 respectively

Action threshold for the intrafraction motion

The motion measure, mean of highest 50% of motion, which had one of the strongest correlations with the differences in PTV D95%, was investigated for action threshold for prostate motion. An iterative linear regression model estimated 3.0 mm (95% confidence interval: 2.7–3.2 mm) as a value of mean of highest 50% as a threshold for PTV D95% in the data-set of this study. For the values smaller than or equal to 3.0 mm, the slope of regression was 1.0%/mm (R 2 value of 0.40) and for the values greater than 3.0 mm, the slope was 2.9%/mm (R 2 value of 0.92). Therefore, the dosimetric impact (per unit increase in motion) and correlation of prostate motion is substantially more when the mean of highest 50% of motion is greater than 3.0 mm. Additionally, 95% confidence intervals for the slopes of the two groups were non-overlapping and this strengthens the use of the threshold value of 3.0 mm. The ANOVA found dosimetric impact to be significantly higher (p < 0.05) when the mean of highest 50% of motion is greater than 3.0 mm in comparison to when the value is less than or equal to 3.0 mm. Summaries of these two groups and results are presented in Table 3.

Table 3 Comparison of fractions with the mean of highest 50% of motion > 3.0 and ≤ 3.0 mm

Discussion

The dosimetric impact of intrafraction 3D prostate motion on the target coverages’ (CTV and PTV) and organs at risk doses (rectum and bladder) of the corresponding fraction during VMAT treatments, has been investigated for the first time. The present study established strong correlation (R 2 > 0.75) between a number of motion metrics (mean of highest 20, 30, 40, 50, 60, 70, 80 and 90% of motion, R90 and R80) and the dosimetric coverage (PTV D95%). These results are particularly useful because clinically an understanding of the dosimetric consequences of the motion is of interest, rather than the motion itself, for evaluating motion management needs. Amongst these metrics, the mean of highest 50% of motion was one of the best indicator of the dosimetric impact of intrafraction prostate motion. Use of this metric for our data shows the action level threshold of 3.0 mm and if the mean of highest 50% was greater than 3.0 mm then the PTV D95% was compromised on an average by 6.2% for the individual fraction. The under-dose will typically be present near the periphery of the PTV. This knowledge, where clinically justified, could be used for treatment in the remaining fractions.

The motion and dose metrics were investigated per fraction and this study did not present the cumulative affect over a number of fractions. Clearly, reducing the number of treatment fractions will increase the impact of individual fractions on the cumulative dose. These metrics will be useful to guide intervention when multiple days are shown to express large motion; towards, for example, reviewing daily CBCT for variable organ at risk volume change to potentially modify patient immobilisation or diet, or modifying (adaptively replanning) the treatment plan to respond to areas of deviated dose delivery from intended.

Only a few studies [47] have attempted to correlate motion measures to its dosimetric impact. For helical tomotherapy, Langen et al. [4, 5] investigated the dosimetric effect of the motion on each treatment fraction, using a 4D dose calculation engine, similar to the one used in this study. Their study did not find a strong relationship between motion and its dosimetric impact on the target (R 2 were less than 0.50). A common motion metric, the percentage of the time motion >3 mm could also not be correlated well with the dosimetric impact in this study. The observed motion was similar in both studies with the average (range) percentage of time with >3 mm motion of 9.4% (0–100%) in this study and 13.2% (0–99%) in the study by Langen et al. Similar to Langen et al. [4], this study also found that dose to the CTV, which has added margins, is affected less by motion than the PTV dose, and that variations in the PTV and CTV dose were correlated. One of the purposes of margins to the CTV is to ensure CTV coverage in the presence of intrafraction motion.

Colvill et al. [7] in their study with a selective set of data showed that the mean intrafraction motion and variation in PTV D95% and CTV D99% from the planned values dose reduction are highly correlated (R 2 values were 0.94 and 0.72 respectively). Similarly our study found that for the fractions with large intrafraction motion (H50% > 3.0 mm), the motion and the dosimetric impact PTV D95% was highly correlated (R 2 values was 0.92). In terms of intra-prostatic lesion (IPL) boosted treatments, Pommer et al. [21] found that in the IPL boosted approach, intrafraction motion had a significantly larger (p < 0.001) impact on the dosimetry than for the conventional approach. The majority of the fractions in this study were from standard fractionation and therefore the impact of motion in nodal boost fractions was not investigated.

Previously, studies have investigated various factors that can affect the intrafraction prostate motion. The status of the rectum (such as filling, gas) has been shown to be the most significant predictor for intrafraction prostate motion [2, 13, 14]. Bladder filling has been found to have correlation with a slight posterior drift of the prostate when little or no gas is present [14]. Other studies have found strong correlation between prostate motion and observation time (>5 min) [2, 3, 11, 12]. In standard fractionation, fraction number has been found to have no effect on the intrafraction prostate motion [3, 12, 22]. The primary objective of this study was to investigate various motion measures and their correlation with dosimetric impact of the motion. Therefore the effect of these various factors on the motion was not evaluated in this study.

One of the limitations of this study is that the dose reconstruction performed utilised translational motion only. Also, for all the treatment fractions, patient anatomy has been assumed to be the same as at treatment planning. Rotational as well as deformation components of the motion can substantially affect the dose delivery during the treatments, this also needs to be investigated [23, 24]. Additionally, as noted earlier, another limitation is that the dose reconstruction system assumes that the prostate and surrounding OARs move together rigidly. Moreover, there are factors other than motion magnitude that are likely to have dosimetric impact such as the timing and the direction of the target motion and treatment plan parameters. These require a comprehensive investigation, which is out of the scope of the current study. In addition, the delivery and planning techniques were identical across the patients in this study. Though the patients from the MLC tracking trial had real-time adaptation in their actual treatment delivery but for the purpose of current study that adaptation is not considered for the estimation of the dosimetric deviation in the presence of motion.

Conclusions

This study established that motion metrics can be used as a surrogate to represent the dosimetric impact of prostate motion on target dose. These metrics could be used in studies to report intrafraction prostate motion and thereby indirectly present dosimetric consequences of the reported motion. The mean of highest 50% of motion was the best indicator of the dosimetric impact of intrafraction prostate motion. The results also confirm that the PTV coverage is significantly (p < 0.05) more compromised by the motion than the CTV which has margins to ensure adequate coverage. Finally, it should be noted that the findings from this study are based on a small number of patients (18) and ought to be independently validated in a study involving a larger cohort of patients.