1 Introduction

The fundamental problem in cancer radiotherapy consists in achieving the best trade-off between maximizing the tumour cell kill while sparing the surrounding normal tissue. Indeed, finding strategies aimed at improving the outcome of cancer radiation treatments is a subject widely investigated in the literature related to clinical practice and to mathematical applications. Among the methods meant to improve the treatment efficacy we mention: the use of chemical agents able to enhance the tumour response to radiation or to reduce the normal tissue response (Tannock and Goldenberg 1998; Prasanna et al. 2012), techniques to achieve the optimal volume distribution of the radiation dose intensity (Li et al. 2012), and procedures to optimize the dose-fractionation protocols with respect to the fraction sizes and the overall treatment time (Jones and Dale 1999; Fowler 2010, 2012; Bertuzzi et al. 2013b; Astrahan 2008; Yang and Xing 2005).

Protocol optimization methods are based on models of the radiation response of tumours and normal tissues. The most frequently used model to represent the relation between a single radiation dose and the fraction of cells surviving the irradiation is the linear-quadratic (LQ) model (Thames 1985; Fowler 1989; Jones and Dale 1999) that expresses the lethal damage produced by a single irradiation as the sum of two terms proportional to the dose and to the squared dose, respectively according to the radiosensitivity parameters \(\alpha \) and \(\beta \). The linear term accounts for non repairable lesions to DNA while the quadratic term accounts for lethal misrepair events following DNA double strand breaks (Hlatky et al. 1994). The LQ model combines a simple formalism, derived from biophysical principles, with the possibility of representing both tumour and normal tissue responses during different radiotherapy treatments.

When multiple radiation doses are delivered and the repopulation due to the regrowth of surviving cells is taken into account, the survival fraction is expressed by more complex expressions including also the treatment time (Fowler et al. 2003a; Fowler 2008). The LQ model has been extended to include, beyond the repopulation, other processes characterizing the cell response to radiation, i.e. the repair of the radiation damage, the redistribution of cells among the cell-cycle phases, the reoxygenation of tissues (Wong and Hill 1998). A recent review by O’Rourke et al. (2009) examines the LQ formalism with emphasis on the modelling of repopulation and redistribution mechanisms. A resensitization term, intended to account for both the redistribution and the reoxygenation, has been included in the LQ model leading to the LQR model proposed by Brenner et al. (1995). The kinetic effects of repopulation and reoxygenation have been modeled in studies where the geometry of the tumour mass was explicitly taken into account (Düchting et al. 1992, 1995). The diffusion/consumption of oxygen in the tumour cell aggregate has been represented in models of the radiation response of tumour cords (Bertuzzi et al. 2008) and of multicellular tumour spheroids (Bertuzzi et al. 2010). Simulation models with cell-cycle structure have been proposed to account for the different phase-specific radiosensitivities of cells (Dionysiou et al. 2004; Ribba et al. 2006). A modified model, the linear-quadratic-linear (LQL) model, has been proposed by Guerrero and Li (2004) to provide a better fit to dose-response data at high fractional radiation doses (Guerrero and Li 2004; Astrahan 2008).

The LQ and LQR models have been applied to a variety of experimental cell population data, in order to estimate the model parameters (Brenner et al. 1995). However, the assessment of these parameters from clinical data may be critical in highly heterogeneous populations such as the human tumours (Qi et al. 2006; Jones and Sanghera 2007).

Following the LQ formalism, recent papers propose radiotherapeutic strategies in the context of optimization techniques. For instance, the LQR model has been used by Lee et al. (2006) in a mixed integer programming procedure for improving the \(3\)-D distribution of the radiation dose by determining the optimal beam angles and intensities in the intensity-modulated radiation therapy (IMRT). Optimal adaptive fractionation schemes accounting for variations of the relative positions between tumour and healthy tissues during the treatment have been derived by Lu et al. (2008a, b). An application of cancer treatment optimization has been presented in the paper by Ledzewicz and Schättler (2012), where a model of the tumour dynamics under radio and anti-angiogenic therapy is analyzed, and an optimal control problem is set with the objective of minimizing the tumour volume subject to constraints limiting the negative effects on healthy tissues.

In the framework of radiotherapy protocol optimization, different approaches have been proposed. For instance, the LQ model with the repopulation term has been used by Fowler (2007, 2008) to investigate optimal schedules for head and neck cancer, taking into account both the early reacting normal tissues and the late complications. In these papers, the author proposed an empirical procedure in order to optimize the total duration of the treatment, keeping the late tissue damage fixed and using uniform fractionation schedules. Yang and Xing (2005), using the complete LQR model with parameter values taken from the literature, investigated by a numerical procedure (simulated annealing) optimal radiotherapy schemes for fast proliferating and slowly proliferating tumours. The optimization procedure searches for the highest tumour biologically effective dose (BED) over the total treatment length, while the BED of the late normal tissue is kept constant. Interestingly, the resulting optimal fractionation scheme was found to be not necessarily uniform. In a previous paper, we proposed the analytical formulation of an optimal radiotherapy problem, describing the tumour and normal tissue responses by the LQ model that included the repopulation term and the sublethal damage term due to incomplete repair (Bertuzzi et al. 2013b). Assuming that the number of treatment weeks is fixed and that some rule is assigned to distribute the total tolerable damage to healthy tissues over the weeks, the optimization problem can be formulated with respect to the fraction sizes of a single week (one fraction/day, five fractions/week). Unlike the uniform protocols, consisting of five equal fractions, used in clinical practice, a single fraction dose per week was found to be optimal for slowly proliferating tumours, either taking into account the sublethal damage due to incomplete repair or not. Concerning fast proliferating tumours, in the absence of the incomplete repair term the optimal weekly scheme equalled the standard uniform fractionation. Including the incomplete repair term in the model led to non-uniform schemes, though the optimal fraction sizes (and then the log cell kill) were negligibly affected.

In the present paper, we reconsider the problem of finding the optimal radiotherapy scheme, over a fixed treatment time, that maximizes the tumour damage satisfying constraints related to the tolerable damage to the early and late responding normal tissues. On the basis of the results of (Bertuzzi et al. 2013b; Papa and Sinisgalli 2013) we do not include the sublethal damage term due to incomplete repair in the cell response model. A new constraint consisting of an upper bound for the daily fraction is introduced here in order to strengthen the normal tissue constraints, especially with respect to late complications possibly occurring months or years after irradiation. Indeed, while the LQ model is commonly accepted to express acute reactions of tissues, the prediction of late complications by the same model is a controversial issue, especially at high fraction doses (Yang and Xing 2005; Astrahan 2008; Brenner 2008; Kirkpatrick et al. 2008, 2009; Macchia et al. 2010; Ling et al. 2010). The problem is analytically formulated and, using classical non-linear programming results, we determine the optimal fractionation scheme as a function of the tumour class and of the other model parameters. In order to simplify the problem solution we followed a procedure that allows to successively reduce the number of candidates and in Sect. 3 we find that only two sets of candidates exist: a finite set consisting of isolated points having particular structures and a possibly dense set of non-structured points (Bertuzzi et al. 2013a). In Sect. 4, the influence of normal tissue and tumour parameters on the structured and non-structured candidates is analyzed. In Sect. 5, the influence of the constraint on the fraction size is examined, showing how the candidates change as the upper bound changes. Collecting the results of Sects. 35, we get the extremals table in terms of tumour type, fraction upper bound and normal tissue parameters. Finally, in Sect. 6, the comparison of the cost function (log cell kill) among the extremals provides the optimal solution for different tumour classes. To illustrate and complement the analytical results on the basis of the literature relevant to radiotherapy treatments, we give numerical examples of application of the proposed procedure, determining the optimal fractionation schemes and the log cell kill they produce on specific tumour types in comparison with clinical fractionation schemes.

2 Formulation of a constrained radiotherapy problem and optimality conditions

We address the problem of finding the fractionated radiotherapy scheme that maximizes the overall tumour damage, keeping within given admissible levels the damages to the normal tissues as well as the size of each daily dose fraction. The LQ model is assumed to describe the response to radiation of a homogeneous cell population, so that the logarithmic fraction of surviving cells is written as

$$\begin{aligned} \ln (S)= - \alpha \sum _{k = 1}^{5\nu } d_{k} - \beta \sum _{k = 1}^{5\nu } d_{k}^{2}+\frac{\ln (2)}{T_{P}} (T - T_{K}) H(T - T_{K}). \end{aligned}$$
(2.1)

The first two terms represent the cumulated effect of instantaneous lethal damages, where \(\alpha , \beta > 0\) are the LQ radiosensitivity parameters of the population, accounting for non-repairable lesions to DNA and, respectively, for the lethal misrepair events occurring in the repair process of DNA double strand breaks (Hlatky et al. 1994). Moreover \(d_{k} \ge 0\) is the radiation dose at day \(k\)-th, \(k=1,\ldots ,5\nu ,\,\nu \) is the number of treatment weeks assumed integer and assigned. The positive term in (2.1) accounts for cell repopulation, where \(T_{P}\) is the repopulation doubling time, \(T_{K}\) is the kick-off time of compensatory proliferation, \(T = 7 \nu - 3\) is the (fixed) overall treatment length, and \(H\) is the Heaviside function.

Representing by the LQ model the response to radiation of both tumour and surrounding normal tissues, the problem is that of minimizing the tumour survival (2.1) with respect to the doses \(d_{k}\), under the constraint of keeping the damages to the early and late responding normal tissues below maximal admissible levels \(C_{e}, C_{l} > 0\):

$$\begin{aligned}&\alpha _{e} \sum _{k = 1}^{5\nu } d_{k} + \beta _{e} \sum _{k = 1}^{5\nu } d_{k}^{2}-\frac{\ln (2)}{T_{Pe}} (T - T_{Ke}) H(T - T_{Ke}) \le C_{e},\end{aligned}$$
(2.2)
$$\begin{aligned}&\alpha _{l} \sum _{k = 1}^{5\nu } d_{k} + \beta _{l} \sum _{k = 1}^{5\nu } d_{k}^{2}\le C_{l}, \end{aligned}$$
(2.3)

where the parameters have been indexed by subscripts “e” and “l” when referring to the early and late tissues respectively. The cell repopulation term is absent in constraint (2.3) as it is negligible for late responding tissues (Fowler 2012). While using the LQ model is rather accepted to express acute complications after radiation therapy, predicting late complications by the same model is a controversial issue, especially for high fraction doses (Yang and Xing 2005; Astrahan 2008; Brenner 2008; Kirkpatrick et al. 2008, 2009; Macchia et al. 2010; Ling et al. 2010). For this reason, in addition to the constraints (2.2) and (2.3), already envisaged by Bertuzzi et al. (2013b), we introduce a constraint directly limiting the level of radiation, setting an upper bound \(d_{M}\) on the dose fraction size:

$$\begin{aligned} d_{k}\le d_{M},\quad k=1,\ldots ,5\nu . \end{aligned}$$

In order to simplify the optimization problem reducing the number of variables and, at the same time, to strengthen the constraints (2.2) and (2.3), we consider the total damage equi-distributed over the treatment weeks. So we can formulate the optimization problem over a single week, assuming the obtained solution repeated for each treatment week. Let us introduce the notations

$$\begin{aligned}&\rho = \frac{\alpha }{\beta }, \quad \rho _{e} = \frac{\alpha _{e}}{\beta _{e}}, \quad \rho _{l} = \frac{\alpha _{l}}{\beta _{l}}, \nonumber \\&\quad k_{e} = \frac{1}{\nu \beta _{e}} \left[ C_{e}+ \frac{\ln (2)}{T_{Pe}} (T - T_{Ke}) H(T - T_{Ke})\right] , \quad k_{l} = \frac{C_{l}}{\nu \beta _{l}}, \end{aligned}$$
(2.4)

recalling that \(\rho _e>\rho _l\) (Fowler 2012). We have the following optimization problem.

Problem 1

Minimize the function:

$$\begin{aligned} J(d) = - \rho \sum _{k = 1}^{5} d_{k} - \sum _{k = 1}^{5} d_{k}^{2}, \end{aligned}$$
(2.5)

on the admissible set:

$$\begin{aligned} D&= \left\{ d \in R^{5}| g_{e}(d) = \rho _{e} \sum _{k = 1}^{5} d_{k} + \sum _{k = 1}^{5} d_{k}^{2} - k_{e}\le 0,\right. \\&\quad \left. g_{l}(d) = \rho _{l} \sum _{k = 1}^{5} d_{k} + \sum _{k = 1}^{5} d_{k}^{2} - k_{l} \le 0,\quad d_{k} \in [0, d_{M}],\ k=1,\ldots ,5\right\} . \end{aligned}$$

\(\square \)

The same formulation holds as long as the weekly damages to the normal tissues \(k_{e},k_{l}\) are assigned, even though not necessarily constant over the weeks of treatment. A basic result comes from the Weierstrass theorem that states the existence of optimal solutions for Problem 1. The result is crucial, however, as the problem is not convex and only necessary conditions, provided by the Kuhn–Tucker theorem, are available (see, e.g. Pierre 1969). The Lagrangian function associated to Problem 1 is

$$\begin{aligned} L(d, \lambda _{0}, \eta _{e}, \eta _{l}, \eta , \mu ) = \lambda _{0} J(d) + \eta _{e} g_{e}(d) + \eta _{l} g_{l}(d) - \sum _{k = 1}^{5} \eta _{k} d_{k} + \sum _{k = 1}^{5} \mu _{k} (d_{k} - d_{M}), \end{aligned}$$

where \(\lambda _{0},\eta _{e},\eta _{l}\) are scalar multipliers and \(\eta ,\mu \) are \(5\)-dimensional vectors with components \(\eta _{k}, \mu _{k}, k=1,\ldots ,5\), respectively. Introducing the notations

$$\begin{aligned} \delta (\lambda _{0},\eta _{e},\eta _{l})=-\lambda _{0} \rho + \eta _{e} \rho _{e}+\eta _{l}\rho _{l},\quad \sigma (\lambda _{0},\eta _{e}, \eta _{l})=2(- \lambda _{0} + \eta _{e} + \eta _{l}),\quad \quad \end{aligned}$$
(2.6)

the necessary minimum and admissibility conditions are

$$\begin{aligned}&\frac{\partial L}{\partial d_{k}}=\delta (\lambda _{0}, \eta _{e}, \eta _{l})+\sigma (\lambda _{0}, \eta _{e}, \eta _{l}) d_{k} - \eta _{k} + \mu _{k}=0,\quad k=1,\ldots ,5, \quad \end{aligned}$$
(2.7)
$$\begin{aligned}&\eta _{k} d_{k}=0,\quad k=1,\ldots ,5, \end{aligned}$$
(2.8)
$$\begin{aligned}&\mu _{k} (d_{k}-d_{M})=0,\quad k=1,\ldots ,5, \end{aligned}$$
(2.9)
$$\begin{aligned}&\eta _{e} g_{e}(d)=0, \end{aligned}$$
(2.10)
$$\begin{aligned}&\eta _{l}g_{l}(d)=0, \end{aligned}$$
(2.11)
$$\begin{aligned}&g_{e}(d) \le 0,\quad g_{l}(d) \le 0, \end{aligned}$$
(2.12)
$$\begin{aligned}&0 \le d_{k} \le d_{M}, \quad k=1,\ldots ,5,\end{aligned}$$
(2.13)
$$\begin{aligned}&\lambda _{0}, \eta _{e}, \eta _{l}, \eta _{k}, \mu _{k} \ge 0,\quad k=1,\ldots , 5, \end{aligned}$$
(2.14)

with \(\lambda _{0},\eta _{e},\eta _{l},\eta _{k},\mu _{k},k=1, \ldots ,5\), not simultaneously equal to zero.

In order to find the extremal solutions of system (2.7)–(2.14) we carried out a procedure in three sequential steps that allows to successively restrict the admissible set reducing the number of candidates. The adopted procedure is the following:

  1. (i)

    solve Eqs. (2.7)–(2.9) with respect to \(d,\eta ,\mu \), assuming \(\lambda _{0},\eta _{e},\eta _{l}\) fixed;

  2. (ii)

    solve Eqs. (2.10), (2.11) with respect to \(\eta _{e},\eta _{l}\) for \(\lambda _{0}>0\) (\(\lambda _{0}=0\) not possible), taking into account constraints \(g_{e}(d)\le 0,g_{l}(d)\le 0,d_{k}\ge 0,k=1,\ldots ,5\), and (2.14);

  3. (iii)

    impose \(d_{k}\in [0,d_M],\,k=1,\ldots ,5\).

Among the extremals so determined, the optimal solutions can be obtained by direct comparison of the cost function values, and are classified in terms of tumour and normal tissue parameters, and of \(d_M\).

3 Classes of structured and non-structured solutions

Performing step (i), we consider the subsystem made up of Eqs. (2.7), (2.8) and (2.9), assuming \(\lambda _{0},\eta _{e},\eta _{l}\) fixed, as we intend to focus on the solutions with respect to the subset of variables \(d,\eta ,\mu \). Fixing the triple \(\lambda _{0},\eta _{e},\eta _{l}\), the quantities \(\delta \) and \(\sigma \) in (2.6) become fixed coefficients of Eq. (2.7). The following theorem classifies the problem solutions in terms of \(\delta \) and \(\sigma \).

Theorem 1

Problem 1 admits two sets of extremal candidates: “structured” and “non-structured”. The set of structured candidates is associated to values of \(\lambda _{0},\eta _{e},\eta _{l}\) such that \(\sigma \ne 0\) and it consists at most of \(3^{5}\) structures \((\)including the trivial vector \(d = 0)\) that can be grouped into \(21\) mutually exclusive classes. Structures in each class are characterized by the number of doses equal to zero\(,\) \(d_{M}\) and \(A\in (0, d_{M}),\) with

$$\begin{aligned} A=-\frac{\delta }{\sigma }, \end{aligned}$$
(3.1)

independently of the dose positions. Having the same value of the cost function \(J,\) all the structures of the same class are equivalent. The classes are listed in Table 1, where \(d(i,j),\,i,j=0,\ldots ,5,\,0 \le i+j \le 5,\) denotes the class whose elements contain \(i\) doses equal to \(A\) and \(j\) doses equal to \(d_{M},\) while \(A(i, j)\) denotes the value of the intermediate dose of the structure \(d(i, j),\,i\ne 0.\)

The set \(\{\mathring{d}\}\) of non-structured candidates is associated to values of the multipliers \(\lambda _{0},\eta _{e},\eta _{l}\) such that \(\sigma =\delta =0\), and consequently \(\eta =\mu =0\), making Eq. (2.7) identically satisfied.

Table 1 Classes of equivalent structures for Problem 1

Proof

Let us multiply each equation \(\frac{\partial L}{\partial d_{k}}=0\) in (2.7) by \(d_{k}\,(d_{k} - d_{M})\). In view of (2.8) and (2.9), we obtain

$$\begin{aligned} d_{k}\,(d_{k} - d_{M}) [\delta + \sigma d_{k}]=0,\quad k=1,\ldots ,5, \end{aligned}$$

and, for values of the multipliers \(\lambda _{0},\eta _{e},\eta _{l}\) such that \(\sigma \ne 0\), we get three values for \(d_{k}\):

$$\begin{aligned} d_{k}&= 0, \nonumber \\ d_{k}&= d_{M},\\ d_{k}&= -\frac{\delta }{\sigma }=A.\nonumber \end{aligned}$$
(3.2)

For \(A\in (0,d_M)\), the values (3.2) are distinct and their \(3^{5}\) dispositions with repetition in a vector \(d\in R^5\) give all the possible structured candidates. As vectors containing the same number of \(A\) and \(d_M\) are indistinguishable with respect to \(J\), i.e. they are equivalent, the vectors can be grouped into \(21\) mutually exclusive classes of equivalent structures. In each class, a single structure can be chosen as representative, as shown in Table 1.

Let us now reconsider the original system (2.7)–(2.9) supposing that the fixed values of the multipliers \(\lambda _{0},\eta _{e},\eta _{l}\) are such that \(\sigma = 0\).

If \(\delta \ne 0\) it can be verified that the solutions are \(d(0,0)\) and \(d(0,5)\), already listed in Table 1. If instead \(\delta = 0\), Eqs. (2.7)–(2.9) imply \(\eta _{k} = \mu _{k} = 0,\,k=1,\ldots ,5\), and the system is identically satisfied, providing no information about the value of \(d_{k}\) or the structure of \(d\). Therefore, when \(\sigma =\delta =0\) the system of necessary conditions admits a set of “non-structured” solutions \(\{\mathring{d}\}\) associated to multipliers \(\eta =\mu =0\) with dose values defined only by constraints (2.10) and (2.11), as fully specified in the next sections. \(\square \)

As yet, structured and non-structured candidates given by Theorem 1 depend on \(\lambda _{0},\eta _{e},\eta _{l}\) that still have to be found. Next, we show that some of the \(2^{3}\) combinations of \(\lambda _{0},\eta _{e},\eta _{l}\) originated by (2.14) are inconsistent with the necessary or admissibility conditions and must be discarded.

4 Influence of normal tissue and tumour parameters on the extremals

We go through the second step of the procedure outlined in Sect. 2, selecting structured and non-structured candidates on the basis of normal tissues and tumour parameters. We find that the dose fractions amplitude is related to the normal tissue parameters, while the multipliers are affected by the tumour parameter \(\rho \). Extremal candidates identified during the present section must undergo the upper bound check at step (iii).

The next two corollaries demonstrate what values of \(\lambda _{0},\eta _{e},\eta _{l}\) have to be excluded to obtain non negative \(\eta ,\mu \) and admissible vectors \(d\).

Corollary 1

No extremals exist for \(\lambda _{0}=0.\)

Proof

Let us exclude \(\lambda _{0}=0\) by contradiction. Setting \(\lambda _{0}=0\) in (2.6), from (2.14) it follows

$$\begin{aligned} \eta _{e} \rho _{e}+\eta _{l}\rho _{l}\ge 0,\quad 2(\eta _{e} + \eta _{l})\ge 0. \end{aligned}$$

Then, no extremals exist for \(\eta _{e} \ge 0,\eta _{l} \ge 0\). In fact, if \(\eta _{e}=\eta _{l}=0\), both the quantities \(\delta (0,0,0)\) and \(\sigma (0,0,0)\) vanish and Eq. (2.7) imply

$$\begin{aligned} \eta _{k}=\mu _{k},\quad k=1,\ldots ,5. \end{aligned}$$

If \(\eta _{k}=\mu _{k}>0\), the complementarity conditions (2.8) and (2.9) require \(d_{k}=0\) and \(d_{k}=d_{M}\) at the same time, which is absurd. If instead \(\eta _{k}=\mu _{k}=0\), all the multipliers are zero which is excluded by the Kuhn–Tucker Theorem.

As it cannot be \(\eta _e=\eta _l=0\), at least one multiplier \(\eta _{e}\) or \(\eta _{l}\) must be positive, which means both \(\delta (0,\eta _{e},\eta _{l})\) and \(\sigma (0,\eta _{e},\eta _{l})\) are strictly positive and Eq. (2.7) imply

$$\begin{aligned} \eta _{k}>\mu _{k},\quad k=1,\ldots ,5. \end{aligned}$$

But then (2.8) impose \(d_{k}=0\) for any \(k\), incompatible with \(g_e(d) = 0\) or \(g_l(d) = 0\). \(\square \)

So, \(\lambda _{0}\) must be strictly positive and we can set \(\lambda _{0}=1\). Concerning the multipliers \(\eta _{e}\) and \(\eta _{l}\), we have the following result.

Corollary 2

If the dose fraction upper bound is such that

$$\begin{aligned} d_{M}>\min \{A_{e}(5,0),A_{l}(5,0)\}, \end{aligned}$$
(4.1)

where

$$\begin{aligned} A_{e}(5,0)=-\frac{\rho _{e}}{2}+ \sqrt{\left( \frac{\rho _{e}}{2}\right) ^{2} + \frac{k_{e}}{5}}, \quad A_{l}(5,0)=-\frac{\rho _{l}}{2} + \sqrt{\left( \frac{\rho _{l}}{2}\right) ^{2} + \frac{k_{l}}{5}}, \quad \end{aligned}$$
(4.2)

no extremals exist for \(\eta _{e}=\eta _{l}=0,\) i.e. extremals must satisfy at least one of the conditions \(g_e(d)=0,g_l(d)=0.\) On the contrary\(,\) if

$$\begin{aligned} d_{M} \le \min \{A_{e}(5,0), A_{l}(5,0)\}, \end{aligned}$$
(4.3)

the only extremal \(d(0,5)\) exists\(,\) and it is the optimum.

Proof

Let us suppose \(\eta _{e}=\eta _{l}=0\) in (2.6). It follows \(\delta (1,0,0)\), \(\sigma (1,0,0)<0\) and from Eq. (2.7) we deduce \(\mu _{k}>\eta _{k},\, k=1,\ldots ,5\), so that \(\mu _{k}>0\). Hence, Eq. (2.9) yield the only solution \(d_{k}=d_{M}, k=1,\ldots ,5\), that is the structure \(d(0,5)\). The structure \(d(0,5)\) can be extremal if and only if it satisfies constraints (2.12) that become

$$\begin{aligned} 5d_{M}^{2}+\rho _{e}5d_{M}-k_{e}\le 0,\quad 5 d_{M}^{2}+ \rho _{l}5d_{M}-k_{l}\le 0, \end{aligned}$$
(4.4)

which, taking into account (4.2), are equivalent to the inequalities

$$\begin{aligned} d_{M}\le A_{e}(5,0),\quad d_{M} \le A_{l}(5,0). \end{aligned}$$

So, the pair \(\eta _{e}=\eta _{l}=0\) provides the only candidate \(d(0,5)\), which clearly cannot be an extremal if the upper bound \(d_M\) is set according to (4.1). That proves the first part of the corollary.

When instead \(d_M\) is chosen according to (4.3), \(d(0,5)\) is an extremal as it satisfies (4.4) along with all the necessary and admissibility conditions, and we are going to prove that no other extremals can exist. Let us proceed supposing \(\eta _{e}\) or \(\eta _{l}\) strictly positive. According to the complementarity conditions (2.10), (2.11), only “boundary” vectors, i.e. vectors satisfying \(g_e(d)=0\) or \(g_l(d)=0\), can be extremals. Boundary vectors enjoy the property of having at least one entry greater than or equal to \(\min \{A_{e}(5,0), A_{l}(5,0)\}\). The latter statement can be proved by means of a nonlinear programming argument. Precisely, we set the problem of finding the minimum value of the maximal entry of \(d\) on the early or late constraint boundary, proving that the minimum equals \(A_{e}(5,0)\) or \(A_{l}(5,0)\) respectively. Without loss of generality, on each boundary the problem can be solved choosing one particular ordering of the entries, for instance \(d_5\ge d_4\ge d_3\ge d_2\ge d_1\ge 0\). Minimizing \(d_5\) on each boundary, \(g_e(d)=0\) or else on \(g_l(d)=0\), under the previous constraint, leads to the minimum \(d_{5} = A_{e}(5,0)\) on \(g_{e}(d) = 0\), attained for \((A_{e}(5,0), A_{e}(5,0), A_{e}(5,0), A_{e}(5,0), A_{e}(5,0))\), while the minimum \(d_{5} = A_{l}(5,0)\) on \(g_{l}(d) = 0\), attained for \((A_{l}(5,0), A_{l}(5,0), A_{l}(5,0), A_{l}(5,0), A_{l}(5,0))\). As a consequence of this property, if \(d_M<\min \{A_{e}(5,0), A_{l}(5,0)\}\) no boundary vectors verify the upper bound \(d_{k}\le d_M,\,k=1,\ldots ,5\), meaning that no extremals with positive \(\eta _{e}\) or \(\eta _{l}\) can exist. If instead \(d_M=\min \{A_{e}(5,0), A_{l}(5,0)\}\), only the boundary vector \(d\) with \(d_{k}=\min \{A_{e}(5,0), A_{l}(5,0)\},\,k = 1,\ldots ,5\), can be an extremal, as it satisfies the upper bound, and it coincides with the extremal \(d(0,5)\). In conclusion, when \(d_{M}\) is chosen according to (4.3), no other extremal than \(d(0,5)\) exists, so it is theoptimum. \(\square \)

From a geometrical point of view, when \(d_M\le \min \{A_{e}(5,0),A_{l}(5,0)\}\), the box \(0 \le d_{k} \le d_M,\,k=1,\ldots ,5\), is included in the intersection region of the healthy tissues constraints, and satisfying (2.13) implies automatically satisfying (2.12), so that the optimization problem could be reformulated without these latter constraints. Then the result of Corollary 2 is rather intuitive since the maximal damage to the tumour is just achieved by making the dose fractions as large as possible and equal to the maximum value \(d_M\).

Therefore, the present study is carried out under the hypothesis \(d_M > \min \{A_{e}(5,0), A_{l}(5,0)\}\). As stated in Corollary 2, three alternatives are left for the multipliers \(\eta _{e},\eta _{l}\):

$$\begin{aligned}&(1) \quad \eta _{e} > 0,\quad \eta _{l} = 0, \nonumber \\&(2) \quad \eta _{l} > 0,\quad \eta _{e} = 0, \\&(3) \quad \eta _{e} > 0,\quad \eta _{l} > 0.\nonumber \end{aligned}$$
(4.5)

Merging (4.5) with (2.10) and (2.11), the extremals are found from the following systems:

$$\begin{aligned}&(1) \quad g_{e}(d) = 0,\quad g_{l}(d) \le 0, \nonumber \\&(2) \quad g_{l}(d) = 0,\quad g_{e}(d) \le 0, \\&(3) \quad g_{e}(d) = 0,\quad g_{l}(d) = 0.\nonumber \end{aligned}$$
(4.6)

In other words, all the structured and non-structured extremals are “boundary” points as they must satisfy at least one of the constraints \(g_{e}(d),g_{l}(d)\) with the equality sign.

A few comments about the related value of the cost function follow. Any point solution of system (3) of (4.6) has sum of the doses and sum of the squared doses independent of the point itself and given by

$$\begin{aligned} \sum _{k = 1}^{5} d_{k}&= \frac{k_{e} - k_{l}}{\rho _{e} - \rho _{l}} \triangleq S,\end{aligned}$$
(4.7)
$$\begin{aligned} \sum _{k = 1}^{5} d_{k}^{2}&= \frac{\rho _e k_{l} - \rho _{l} k_{e}}{\rho _{e} - \rho _{l}} \triangleq Q. \end{aligned}$$
(4.8)

Consequently, all the points belonging to the intersection of the boundaries of the normal tissue constraints yield the same value of \(J\) independently of the point

$$\begin{aligned} J(d) = - k_{e} \frac{\rho - \rho _{l}}{\rho _{e} - \rho _{l}} - k_{l} \frac{\rho _{e} - \rho }{\rho _{e} - \rho _{l}}. \end{aligned}$$
(4.9)

On the other hand, it is easy to see that points satisfying system (1) of (4.6) have a cost function

$$\begin{aligned} J(d)=-k_{e}+(\rho _{e}-\rho )\sum _{k=1}^{5}d_{k}, \end{aligned}$$
(4.10)

with a total dose satisfying the inequality

$$\begin{aligned} \sum _{k=1}^{5}d_{k}\ge S. \end{aligned}$$
(4.11)

Similarly, points satisfying system (2) are such that

$$\begin{aligned} J(d) = - k_{l} + (\rho _{l} - \rho ) \sum _{k = 1}^{5} d_{k}, \end{aligned}$$
(4.12)

with

$$\begin{aligned} \sum _{k = 1}^{5} d_{k} \le S. \end{aligned}$$
(4.13)

We observe that evaluating (4.10) and (4.12) for a total dose equal to \(S\), the same quantity (4.9) is obtained. Because of (4.11) and (4.13), it is thus clear that the cost function of any candidate has to be compared with the quantity (4.9) and that the relative ordering of (4.9), (4.10) and (4.12) depends on the sign of the differences \(\rho _{e} - \rho \) and \(\rho _{l} - \rho \).

4.1 Influence of the normal tissue parameters on the structured extremals

Postponing the analysis of non-structured candidates, let us examine the implications of (4.6) on the structured candidates listed in Table 1, noting that the six structures \(d(0,j),\,j = 0,\ldots ,5\), contain only zero or \(d_M\) values, while the remaining \(15\) structures contain at least one dose equal to \(A\in (0,d_M)\).

Concerning the first group, we observe that neither \(d(0,0)\) is an extremal, as it does not satisfy \(g_e(d)=0\) or \(g_l(d)=0\), nor \(d(0,5)\) can be an extremal, as a consequence of Corollary 2. The structures \(d(0,j)\), for \(j \ne 0,5\), have fixed entries independent of the options (4.6), and they can be considered as extremal candidates only when \(d_{M}\) is such that

$$\begin{aligned} j d_{M}^{2} + j \rho _{e} d_{M} - k_{e}=0, \end{aligned}$$
(4.14)

or such that

$$\begin{aligned} j d_{M}^{2} + j \rho _{l} d_{M} - k_{l}=0. \end{aligned}$$
(4.15)

Note that (4.14) and (4.15) are linear with respect to \(j\), so that for \(d_M\) assigned there can exist at most one index \(j\) satisfying (4.14) and one index \(j\) satisfying (4.15).

Among the second group of structures \(d(i,j),\,i=1,\ldots ,5,\, j = 0,\ldots , (5-i)\), the unknown value of the dose \(A(i,j)\) is explicitly determined by the constraints being active in (4.6), without going through the computation of \(\eta _e,\eta _l\) and of the expression (3.1). The non-active constraint (if present) along with the upper bound \(d_M\) determine whether \(d(i,j)\) belongs to the admissible domain. In practice, in case (1) of (4.6), \(A(i,j)\) is the real positive root of the second degree equation

$$\begin{aligned} iA^{2}(i,j) + i\rho _{e} A(i,j) - k_{e} + j d_{M}^{2} + j \rho _{e} d_{M}=0, \end{aligned}$$
(4.16)

and it is given by the following function of the early tissue parameters plus \(d_M\):

$$\begin{aligned} A(i,j) = -\frac{\rho _{e}}{2} + \sqrt{\left( \frac{\rho _{e}}{2}\right) ^{2} + \frac{k_{e} - jd_M (d_M +\rho _{e})}{i}} \triangleq A_{e}(i,j). \end{aligned}$$
(4.17)

In case (2) of (4.6), \(A(i,j)\) is the positive root of

$$\begin{aligned} iA^{2}(i,j) + i\rho _{l} A(i,j) - k_{l} + j d_{M}^{2} + j \rho _{l} d_{M} = 0, \end{aligned}$$
(4.18)

given by the following function of the late tissue parameters plus \(d_M\):

$$\begin{aligned} A(i,j) = -\frac{\rho _{l}}{2} + \sqrt{\left( \frac{\rho _{l}}{2}\right) ^{2} + \frac{k_{l} - jd_M (d_M + \rho _{l})}{i}} \triangleq A_{l}(i,j). \end{aligned}$$
(4.19)

For a given pair \(i,j\), according to (4.17) and (4.19), \(A(i,j)\) can simultaneously satisfy (4.16) and (4.18) only for suitable parameter values such that the same root \(A_{e}(i,j) = A_{l}(i,j)\) is obtained from all the three systems in (4.6).

By means of the expressions (4.17) and (4.19), two sets comprehensive of all the possible values of the intermediate dose \(A\) are identified. If, for some \(i,j\), one of these values equals zero or \(d_{M}\), it cannot be assigned to \(A(i, j)\) conventionally defined in the open interval \((0, d_{M})\) in Theorem 1, although certainly lying in the admissible range of \(d_{k}\).

Noting that the equality \(d_{M} = A_{e}(h, 0)\) coincides with (4.14) written for \(j = h\) and that \(d_{M} = A_{l}(h, 0)\) coincides with (4.15) written for \(j = h\), we prove the following properties, valid for the appropriate indexes:

$$\begin{aligned}&d_{M} = A_{e}(h, 0) \Leftrightarrow \begin{array}{l} A_{e}(i, j) = d_{M},\quad \hbox {with }i + j = h, \\ A_{e}(i, j) = 0,\quad \hbox {with }j = h, \end{array}\end{aligned}$$
(4.20)
$$\begin{aligned}&d_{M} = A_{l}(h, 0) \Leftrightarrow \begin{array}{l} A_{l}(i, j) = d_{M},\quad \hbox {with }i + j = h, \\ A_{l}(i, j) = 0,\quad \hbox {with }j=h. \end{array} \end{aligned}$$
(4.21)

Property (4.20) can be proved starting from the equation defining \(A_{e}(i, j)\):

$$\begin{aligned} iA_{e}^{2}(i,j) + i\rho _{e} A_{e}(i,j) + j d_{M}^{2} + j \rho _{e} d_{M} - k_{e} = 0. \end{aligned}$$
(4.22)

If \(d_{M} = A_{e}(h, 0)\), setting \(j = h\) in (4.22) we immediately obtain \(A_{e}(i, h) = 0\), for any \(i\). Moreover, rewriting (4.22) for \(j = h - i\) and taking into account (4.14) with \(j = h\), we obtain

$$\begin{aligned} i \left( d_{M} + A_{e}(i, j) + \rho _{e}\right) \left( d_{M} - A_{e}(i, j)\right) = 0,\quad i + j = h, \end{aligned}$$
(4.23)

which, being \((d_{M} + A_{e}(i, j) + \rho _{e}) > 0\), (4.23) implies \(A_{e}(i, j) = d_{M}\) for any admissible pair \(i,j\) such that \(i + j = h\).

Concerning the inverse implications in (4.20), given a pair \(i,j\) such that \(A_{e}(i, j) = 0\), for \(j = h\), Eq. (4.22) equals Eq. (4.14), so that \(d_{M} = A_{e}(h, 0)\). Moreover, given a pair \(i,j\) with \(i + j = h\) and such that \(A_{e}(i, j) = d_{M}\), from (4.22), the equality \(d_{M} = A_{e}(h, 0)\) follows. Property (4.21) can be proved applying the same arguments to the “late” doses \(A_{l}(i, j)\).

From property (4.20) it follows that when \(d_{M} = A_{e}(h, 0)\), the group of structures \(d(i, j)\) having \(A(i, j) = A_{e}(i, j)\) for \(i > 0\) and \(j = h\) plus the group of structures \(d(i, j)\), with \(A(i, j) = A_{e}(i, j)\) for \(i > 0\) and \(i + j = h\), both coincide with the structure \(d(0, h)\) which is the only extremal candidate, coherently with the conventional definition of \(A(i, j)\). A similar implication comes from (4.21) for the group of structures \(d(i, j)\) with \(A(i, j) = A_{l}(i, j)\) that are all coinciding with \(d(0, h)\).

With this specification, we can extend the group of structures \(d(i, j)\) with \(i > 0\) to include the structures \(d(0, j)\) with \(j \ne 0, 5\), using the notations \(d_{e}(i, j)\) and \(d_{l}(i, j)\), for \(i=1,\ldots ,5,\,j=0,\ldots , (5-i)\), to indicate the vectors having \(i\) entries equal to \(A_{e}(i,j) \in [0, d_{M}]\) and \(A_{l}(i,j) \in [0, d_{M}]\) respectively, and \(j\) entries equal to \(d_M\).

In principle, at most \(15\) different structures \(d_{e}(i,j)\) plus \(15\) different structures \(d_{l}(i,j)\) might be expected from Eq. (4.6). Still, the following corollary proves that, because of the non-active normal tissue constraint, the number of possible extremals is actually smaller.

Corollary 3

Given a pair \(i,j,\,i=1,\ldots ,5,\,j=0,\ldots ,(5-i),\) only one of the vectors \(d_{e}(i,j),d_{l}(i,j)\) satisfies both normal tissue constraints providing an acceptable candidate\(,\) and in particular the vector containing the minimum value between \(A_{e}(i,j)\) and \(A_{l}(i,j)\).

Proof

Let us consider a pair \(i,j\), with \(i=1,\ldots ,5,\,j=0, \ldots , (5-i)\). By the definition of \(d_{e}(i, j)\), it is \(g_{e}(d_{e}(i, j)) = 0\) and by the definition of \(d_{l}(i, j)\), it is \(g_{l}(d_{l}(i, j)) = 0\). Nevertheless, \(d_{e}(i, j)\) is acceptable if and only \(g_l(d_{e}(i,j)) \le 0\), which amounts to say \(A_{e}(i, j) \le A_{l}(i, j)\). On the other hand, \(d_{l}(i, j)\) is acceptable if and only if \(g_e(d_{l}(i,j)) \le 0\), which amounts to say \(A_{l}(i, j) \le A_{e}(i, j)\). In summary, given a pair \(i,j\), if the parameters are such that \(A_{e}(i,j) = A_{l}(i,j)\), the two vectors coincide and may come from the three systems (4.6). Conversely, only \(d_{e}(i, j)\) can be acceptable when \(A_{e}(i,j) < A_{l}(i,j)\) (given by system 1 of (4.6)), or only \(d_{l}(i, j)\) can be acceptable when \(A_{l}(i,j) < A_{e}(i,j)\) (given by system 2). \(\square \)

Corollary 3 establishes that, for each \(i,j\), only one vector between \(d_{e}(i,j)\) and \(d_{l}(i,j)\) can satisfy both normal tissues constraint, leading to at most \(15\) structured candidates in all. Let us define the sum of the doses and the sum of the squared doses for \(d_{e}(i,j)\) as

$$\begin{aligned} S_{e}(i,j) = iA_{e}(i,j)+jd_{M},\quad Q_{e}(i,j) = iA_{e}^{2}(i,j)+jd_{M}^{2}, \end{aligned}$$
(4.24)

and for \(d_{l}(i,j)\) as

$$\begin{aligned} S_{l}(i,j) = iA_{l}(i,j)+jd_{M},\quad Q_{l}(i,j) = iA_{l}^{2}(i,j)+jd_{M}^{2}. \end{aligned}$$
(4.25)

The acceptability of \(d_{e}(i,j)\) or \(d_{l}(i,j)\) can be checked exploiting properties of the quantities (4.24), (4.25). Let for instance \(d_{l}(i,j)\) be the acceptable vector solution of \(g_{l}(d_{l}(i,j)) = 0,\,g_{e}(d_{l}(i,j)) \le 0\). Then, taking the differences \(g_{e}(d_{l}(i,j)) - g_{l}(d_{l}(i,j))\) and \(\rho _{e} g_{l}(d_{l}(i,j)) - \rho _{l} g_{e}(d_{l}(i,j))\), the following inequalities are obtained for \(d_{l}(i,j)\):

$$\begin{aligned}&S_{l}(i,j) \le S,\end{aligned}$$
(4.26)
$$\begin{aligned}&Q_{l}(i,j) \ge Q. \end{aligned}$$
(4.27)

So, if \(d_l(i,j)\) is acceptable, it is \(A_{l}(i,j)\le A_{e}(i,j)\) and (4.26), (4.27) hold. On the other hand, the acceptability of \(d_{e}(i,j)\) can equivalently be verified by means of the inequality \(A_{e}(i,j)\le A_{l}(i,j)\) or of the inequalities

$$\begin{aligned}&S_{e}(i,j) \ge S,\\&Q_{e}(i,j) \le Q. \end{aligned}$$

Observing the definitions (4.7), (4.8) of \(S\) and \(Q\), it can be seen that the acceptability of \(d_{e}(i,j)\) or \(d_{l}(i,j)\) depends on the sign of the differences \(k_{e} - k_{l}\) and \(\rho _e k_{l} - \rho _{l} k_{e}\). Indeed, these quantities along with the global parameter \(v\) defined as

$$\begin{aligned} v \triangleq \frac{(k_{e} - k_{l})^{2}}{(\rho _{e} - \rho _{l})(\rho _{e}k_{l} - \rho _{l}k_{e})} = \frac{S^{2}}{Q}, \end{aligned}$$
(4.28)

determine the geometry of the admissible domain \(D\), so determining the acceptability of \(d_{e}(i,j)\) or \(d_{l}(i,j)\). Table 2 depicts the relationship between the model parameters and the geometrical properties of \(D\). The details about this result are reported in Appendix.

Table 2 Relationship between normal tissue parameters and geometry of the admissible domain

The geometric prevalence of one constraint over the other means that points satisfying the prevalent constraint necessarily make the other strictly satisfied. Therefore, when the early constraint prevails the acceptable vectors are \(d_{e}(i, j)\) and, similarly, when the late constraint prevails the acceptable vectors are \(d_{l}(i, j)\).

The next corollary concerns the acceptability of \(d_{e}(i,j)\) and \(d_{l}(i,j)\) when the normal tissue parameters are such that no constraint prevails. However, the corollary is not exhaustive providing no information about the structures such that \(j\ne 0,\,i+j>[v]\), which may or may not be acceptable depending on the value of \(d_M\).

Corollary 4

For \(k_{e} - k_{l} > 0,\,\rho _e k_{l} - \rho _{l} k_{e} > 0,\) and \(1 \le v \le 5,\) the acceptable vectors are

  • \(d_{l}(i,j),\,i = 1, \ldots , [v],\,j = 0,\ldots , (5 - i),\,i+j \le [v]\),

  • \(d_{e}(i,0),\,i = \min \{[v]+1,5\},\ldots ,5\),

where \([v]\) denotes the integer part of \(v.\)

Proof

If \(k_{e} - k_{l} > 0,\rho _e k_{l} - \rho _{l} k_{e} > 0\) and \(1 \le v \le 5\), the boundaries \(g_{e}(d) = 0\) and \(g_{l}(d) = 0\) intersect each other and there is no prevalent constraint (see Appendix). In order to establish whether \(d_{e}(i, j)\) or instead \(d_{l}(i, j)\) is acceptable for a given \(i,j\), we firstly set \(j = 0\) and consider \(d_{e}(i, 0)\) and \(d_{l}(i, 0),i=1,\ldots ,5\). Proceeding as in the proof of Corollary 4 of (Bertuzzi et al. 2013b), we obtain

$$\begin{aligned} \min \{A_{e}(i, 0), A_{l}(i, 0)\}={\left\{ \begin{array}{ll} A_{l}(i, 0),\quad i=1\ldots [v], \\ A_{e}(i, 0),\quad i = \min \{[v] + 1, 5\},\ldots ,5,\end{array}\right. } \end{aligned}$$
(4.29)

and, only when \(v\) is integer, it is \(A_{e}(i, 0) = A_{l}(i, 0)\) for \(i = v\). Thus, for \(j = 0\), the acceptable vectors are \(d_{l}(i, 0)\) for \(i \le [v]\), and \(d_{e}(i, 0)\) for \(i > [v]\).

To complete the proof, let us rewrite \(S_{l}(i, j)\) in (4.25) as \(S_{l}(i, t - i)\), introducing the number of non zero doses \(t = i + j,\,t = 1, \ldots , 5\). We intend to study the behaviour of \(S_{l}(i, t - i)\) fixing \(t\) and letting \(i\) vary. Let us then consider \(i\) as a continuous variable \(z > 0\) writing

$$\begin{aligned} S_{l}(z, t - z) = z \left( -\frac{\rho _{l}}{2} + \sqrt{\left( \frac{\rho _{l}}{2}\right) ^{2} + \frac{k_{l} - (t - z) d_M (d_M + \rho _{l})}{z}}\right) + (t -z) d_{M},\nonumber \\ \end{aligned}$$
(4.30)

which is physically meaningful when \(A_{l}(z, t - z)\) is real and positive, that is for \(z\) such that \(k_{l} - (t - z) d_{M} (d_{M} + \rho _{l}) > 0\). Introducing the function

$$\begin{aligned} R(z) = \left( \frac{\rho _{l}}{2}\right) ^{2} + \frac{k_{l} - (t - z) d_M (d_M + \rho _{l})}{z}, \end{aligned}$$

the derivative of (4.30) with respect to \(z\) is given by

$$\begin{aligned} \frac{\partial S_{l}(z, t - z)}{\partial z} = \frac{\left( \sqrt{R(z)} - \left( d_{M} + \frac{\rho _{l}}{2}\right) \right) ^{2}}{2 \sqrt{R(z)}}, \end{aligned}$$

that is obviously non-negative. Then, \(S_{l}(i, t - i)\) is not decreasing as \(i\) increases and we have:

$$\begin{aligned} S_{l}(i, t - i) \le S_{l}(t, 0),\quad t = 1,\ldots ,5,\ i=1,\ldots ,t. \end{aligned}$$

The vectors \(d_{l}(t, 0),\,t = 1, \ldots , [v]\), are acceptable in view of (4.29) and, from (4.26), they satisfy \(S_{l}(t, 0) \le S\). In conclusion, it is

$$\begin{aligned} S_{l}(i, t - i) \le S_{l}(t,0) \le S,\quad t = 1,\ldots , [v],\ i=1,\ldots ,t, \end{aligned}$$

and, being \(t = i + j\), the acceptability of all the vectors \(d_{l}(i, j)\), with \(i + j \le [v]\), is so proved. \(\square \)

4.2 Influence of the tumour parameter on structured and non-structured extremals

We focus now on the effect of the tumor parameter \(\rho \), showing that the possible existence of an extremal can be determined comparing the tumour parameter to the normal tissues parameters \(\rho _{l},\rho _{e}\). In particular, the next corollary identifies the values of \(\rho \) that make it possible to find a pair \(\eta _{e},\eta _{l}\) consistent with (4.5) and \(\eta \ge 0,\mu \ge 0\).

Corollary 5

If \(A_{l}(5, 0) \le A_{e}(5, 0),\) the structure \(d_{l}(5,0)\) is an extremal for any \(\rho \). Otherwise, if \(A_{e}(5, 0) \le A_{l}(5, 0)\) the structure \(d_{e}(5,0)\) is an extremal for any \(\rho \).

Furthermore:

  • the structures \(d_{l}(i, j),\,i = 1,\ldots , 4,\,j = 0, \ldots , (5 - i)\), can be extremals if and only if \(\rho \le \rho _l\);

  • the structures \(d_{e}(i, j),\,i=1,\ldots ,4,\,j = 0, \ldots , (5 - i)\), can be extremals if and only if \(\rho \le \rho _e\);

  • the non-structured set \(\{\mathring{d}\}\), can provide extremals if and only if \(\rho _{l} \le \rho \le \rho _{e}\).

Proof

Let us start examining the non-structured set \(\{\mathring{d}\}\), recalling that its points are associated to multipliers \(\eta _{e},\eta _{l}\) satisfying \(\delta = \sigma = 0\) (proof of Theorem 1). Then, (2.7)–(2.9) imply \(\eta = \mu = 0\). From the definitions (2.6) of \(\delta \) and \(\sigma \), we notice that the equations \(\delta = 0\) and \(\sigma = 0\) are linear with respect to \(\eta _{e},\eta _{l}\) and yield

$$\begin{aligned} \eta _{e} = \frac{\rho - \rho _{l}}{\rho _{e} - \rho _{l}},\quad \eta _{l} = \frac{\rho _{e} - \rho }{\rho _{e} - \rho _{l}}. \end{aligned}$$
(4.31)

Clearly, non-structured extremals \(\{\mathring{d}\}\) do not exist for \(\rho < \rho _{l}\) or for \(\rho > \rho _{e}\), since \(\eta _{e}\), or respectively \(\eta _{l}\), are negative. So, multipliers \(\eta _{e},\eta _{l}\) associated to \(\{\mathring{d}\}\) are non-negative if and only if \(\rho _{l} \le \rho \le \rho _{e}\). Then (4.31) with (2.10), (2.11) allow to determine the points of \(\{\mathring{d}\}\), whenever the set is actually non-empty (see next Theorem 2).

Coming to the structured vectors, which are associated to multipliers \(\eta _{e},\eta _{l}\) such that \(\sigma \ne 0\) (Theorem 1), we exploit Eq. (2.7) written for the vectors \(d_{e}(i, j)\) or \(d_{l}(i, j)\) taking into account the complementarity conditions (2.8), (2.9). The \(k\)-th equation in (2.7) assumes three different expressions depending on the value of the corresponding \(k\)-th entry \(d_{k}\). In particular, from (2.7), (2.8), (2.9) we get: \(j\) equations associated to \(d_{k} = d_M\)

$$\begin{aligned} \delta (\eta _{e}, \eta _{l}) + \sigma (\eta _{e}, \eta _{l}) d_M + \mu _k = 0, \end{aligned}$$
(4.32)

where \(\eta _{k} = 0\); \(5 - i - j\) equations associated to \(d_{k}=0\)

$$\begin{aligned} \delta (\eta _{e}, \eta _{l}) - \eta _k=0, \end{aligned}$$
(4.33)

where \(\mu _{k} = 0\); \(i\) remaining equations associated to entries equal to \(A_{e}(i, j)\) in \(d_{e}(i, j)\), or equal to \(A_{l}(i, j)\) in \(d_{l}(i, j)\). Both \(A_{e}(i, j)\) and \(A_{l}(i, j)\), for some values of the normal tissue parameters, can be equal to \(0\) or \(d_{M}\) leading again to Eq. (4.32) or (4.33). Conversely, when \(A_{e}(i, j) \ne 0, d_{M}\) and \(A_{l}(i, j) \ne 0, d_{M}\) we get \(i\) equations where \(\eta _{k} = \mu _{k} = 0\), and precisely for \(d_{e}(i, j)\)

$$\begin{aligned} \delta (\eta _{e}, \eta _{l}) + \sigma (\eta _{e}, \eta _{l}) A_{e}(i, j) = 0, \end{aligned}$$
(4.34)

or for \(d_{l}(i, j)\)

$$\begin{aligned} \delta (\eta _{e}, \eta _{l}) + \sigma (\eta _{e}, \eta _{l}) A_{l}(i, j) = 0. \end{aligned}$$
(4.35)

The proof develops now imposing \(\eta _{k}, \mu _{k} \ge 0\) in Eqs. (4.32)–(4.35) so to obtain conditions on \(\delta ,\sigma \), that is conditions on \(\eta _{e},\eta _{l}\), and eventually \(\rho \).

Let us begin from \(d_{e}(i, j)\) and \(d_{l}(i, j)\) for \(i \ne 5\). When the normal tissue parameters are such that \(A_{e}(i, j) \ne 0, d_{M}\) and \(A_{l}(i, j) \ne 0, d_{M}\), either from (4.34) and from (4.35), we have \(\delta > 0,\sigma < 0\) or \(\delta < 0,\sigma > 0\), because \(A_{e}(i, j) > 0,A_{l}(i, j) > 0\) and because \(\sigma = \delta = 0\) provides only the already considered non-structured extremals. Moreover, at least one of Eq. (4.32) or (4.33) is present for \(i \ne 5\) and, in order to make \(\eta \) and \(\mu \) non-negative, it must be \(\delta >0,\sigma <0\), i.e.

$$\begin{aligned} \rho < \eta _{e} \rho _{e} + \eta _{l} \rho _{l},\quad \eta _{e} + \eta _{l}<1. \end{aligned}$$
(4.36)

If \(A_{e}(i, j)\ne A_{l}(i, j)\), vectors \(d_{l}(i, j)\) can only satisfy (2.10) with \(\eta _{e} = 0\), whereas vectors \(d_{e}(i, j)\) are necessarily associated to \(\eta _{l} = 0\). So, (4.36) immediately imply that vectors \(d_{l}(i, j)\), with \(A_{l}(i, j) < A_{e}(i, j)\), can be extremals if and only if \(\rho < \rho _{l}\) (and \(\eta _{l} < 1\)), while vectors \(d_{e}(i, j)\), with \(A_{e}(i, j) < A_{l}(i, j)\), can be extremals if and only if \(\rho < \rho _{e}\) (and \(\eta _{e} < 1\)). Suppose now \(A_{e}(i, j) = A_{l}(i, j)\) so that the extremal \(d_{e}(i, j) = d_{l}(i, j)\) may be provided in principle by all the three cases of (4.5), with each case associated to an interval for \(\rho \) derived from (4.36). The widest range in which these extremals exist is

$$\begin{aligned} \rho < \eta _{e} \rho _{e} + \eta _{l} \rho _{l} < \rho _{e} - \eta _{l} (\rho _{e} - \rho _{l}). \end{aligned}$$

and, recalling \(\rho _{e} > \rho _{l}\), we recognize that vectors \(d_{e}(i,j) = d_{l}(i,j)\) (automatically acceptable) can be extremals if and only if \(\rho < \rho _{e}\).

If \(A_{e}(i, j)\) equals \(0\) or \(d_{M}\), or similarly if \(A_{l}(i, j)\) equals \(0\) or \(d_{M}\), for some \(i,j\), only equations of the type (4.32) or (4.33) are present, and they require \(\delta \ge 0,\sigma <0\) to have both \(\mu _k,\eta _k \ge 0\) (\(\delta =\sigma =0\) not possible for structured extremals). Then, the vector \(d_{l}(i, j)\), with \(A_{l}(i, j) < A_{e}(i, j)\), can be extremal if and only if \(\rho <\rho _l\), as \(\sigma <0\) implies \(\eta _{l}<1\) while \(\delta \ge 0\) implies \(\rho \le \rho _{l}\eta _{l}<\rho _{l}\). On the other hand, the vector \(d_{e}(i, j)\), with \(A_{e}(i, j) \le A_{l}(i, j)\), can be extremal if and only if \(\rho <\rho _{e}\), because the widest range of \(\rho \) implied by \(\delta \ge 0,\sigma <0\) is

$$\begin{aligned} \rho \le \eta _{e} \rho _{e} + \eta _{l} \rho _{l} < \rho _{e} - \eta _{l} (\rho _{e} - \rho _{l}) < \rho _{e}. \end{aligned}$$

So far, we have proved that the structures \(d_{l}(i, j)\) and \(d_{e}(i, j)\), for \(i = 1,\ldots ,4,\,j = 0,\ldots , (5 - i)\) can be extremals in the open intervals \(\rho < \rho _{l}\) and \(\rho <\rho _{e}\), respectively. Nevertheless, vectors \(d_{l}(i, j)\) can be extremals also for \(\rho = \rho _{l}\), as well as vectors \(d_{e}(i, j)\) can be extremals also for \(\rho = \rho _{e}\), which occurs when they are associated to multipliers \(\eta _{e},\eta _{l}\) such that \(\delta =\sigma =0\). Then, they belong to the set \(\{\mathring{d}\}\) of points without a particular structure. In particular, according to (4.31), for \(\rho = \rho _{l}\) vectors \(d_{l}(i, j)\) belong to \(\{\mathring{d}\}\) with \(\eta _{e} = 0\) and \(\eta _{l} = 1\), whereas for \(\rho = \rho _{e}\) vectors \(d_{e}(i, j)\) belong to \(\{\mathring{d}\}\) with \(\eta _{e} = 1\) and \(\eta _{l} = 0\).

To complete the proof, we consider the vectors \(d_{e}(5, 0)\) (when \(A_{e}(5,0) \le A_{l}(5,0)\)) and \(d_{l}(5,0)\) (when vice versa \(A_{l}(5,0) \le A_{e}(5,0)\)) which turn out to be extremals for any value of \(\rho \). Indeed, according to (4.34) and (4.35), it must be \(\delta > 0,\sigma < 0\) or \(\delta < 0,\sigma > 0\), and from (2.8), (2.9) it results \(\eta = \mu = 0\). Let us firstly suppose \(A_{e}(5, 0)\ne A_{l}(5, 0)\), assuming for instance the acceptable vector is \(d_{l}(5,0),\,A_{l}(5,0) < A_{e}(5,0)\). This vector verifies (4.35) for \(\rho < \rho _{l}\) with \(\eta _{e} = 0,\,\eta _{l} < 1\) (consistent with \(\delta > 0,\sigma < 0\)) but also for \(\rho > \rho _{l}\) with \(\eta _{e} = 0,\eta _{l} > 1\) (consistent with \(\delta < 0,\sigma > 0\)) and it is actually an extremal, as \(A_{l}(5,0)\) satisfies the upper bound \(d_{M}\) by the hypothesis (4.1). An identical argument applies to \(d_{e}(5,0)\) when \(A_{e}(5,0) < A_{l}(5,0)\) proving it is an extremal if and only if \(\rho < \rho _{e}\) and \(\rho > \rho _{e}\). Clearly, the acceptable vector between \(d_{l}(5,0)\) and \(d_{e}(5,0)\) keeps being extremal for \(\rho = \rho _{l}\) (with \(\eta _{e} = 0,\eta _{l} = 1\)) or, respectively, \(\rho = \rho _{e}\) (with \(\eta _{e} = 1,\eta _{l} = 0\)) as a vector in the non structured set \(\{\mathring{d}\}\). When instead \(A_{e}(5, 0)=A_{l}(5, 0)\) the structured vector \(d_{e}(5, 0) = d_{l}(5, 0)\) is indeed extremal for any \(\rho \), because it is always possible to find a non negative pair \(\eta _{e},\eta _{l}\) consistent with the necessary conditions on \(\delta ,\sigma \). \(\square \)

For any value of \(\rho \), Corollary 5 allows to exclude a subgroup of candidates and, at the same time, to select the candidates that can be extremals provided they satisfy the possible non-active normal tissue constraint (Corollaries 3 and 4), as well as the fraction upper bound (Sect. 5). The next theorem concludes this section giving a complete description of the set \(\{\mathring{d}\}\) of non-structured candidates for \(\rho \) in the range \([\rho _{l}, \rho _{e}]\), also establishing on the basis of normal tissue parameters whether the set is empty or not.

Theorem 2

For \(\rho _{l} < \rho < \rho _{e},\) the set \(\{\mathring{d}\}\) is non-empty if and only if \(k_{e} - k_{l} > 0,\,\rho _e k_{l} - \rho _{l} k_{e} > 0,\,1 \le v \le 5,\) and it is given by

$$\begin{aligned} \{\mathring{d}\}=\{d \in R^{5} | g_{e}(d)=0, \, g_{l}(d)=0, d \ge 0\}. \end{aligned}$$

For \(\rho = \rho _{l},\) the set \(\{\mathring{d}\}\) is non-empty if and only if \(k_{e} - k_{l} > 0,\,\rho _e k_{l} - \rho _{l} k_{e} \le 0\) or \(k_{e} - k_{l} > 0,\,\rho _e k_{l} - \rho _{l} k_{e} > 0,\,v \ge 1,\) and it is given by

$$\begin{aligned} \{\mathring{d}\}=\{d \in R^{5} | g_{l}(d)=0, \, g_{e}(d)\le 0, d \ge 0\}. \end{aligned}$$

For \(\rho = \rho _{e},\) the set \(\{\mathring{d}\}\) is non-empty if and only if \(k_{e} - k_{l} \le 0\) or \(k_{e} - k_{l} > 0,\,\rho _e k_{l} - \rho _{l} k_{e} > 0,\,v \le 5,\) and it is given by

$$\begin{aligned} \{\mathring{d}\}=\{d \in R^{5} | g_{e}(d)=0, \, g_{l}(d)\le 0, d \ge 0\}. \end{aligned}$$

Points of \(\{\mathring{d}\}\) satisfying the upper bound \(d_{M}\) are optimal solutions.

Proof

First of all, we remind that the condition \(\rho _{l} \le \rho \le \rho _{e}\) is necessary for the existence of \(\{\mathring{d}\}\) and that all its points are associated to the same pair \(\eta _{e},\eta _{l}\) in (4.31). Another interesting property is that the cost function \(J\) is constant over the whole \(\{\mathring{d}\}\) with the following value

$$\begin{aligned} J(\mathring{d}) = - k_{e} \frac{\rho - \rho _{l}}{\rho _{e} - \rho _{l}} - k_{l} \frac{\rho _{e} - \rho }{\rho _{e} - \rho _{l}}. \end{aligned}$$
(4.37)

Indeed, as \(\delta = 0\) and \(\sigma = 0\), summing (2.10) and (2.11) it results

$$\begin{aligned} \sum _{k = 1}^{5} d_{k}^{2} + \rho \sum _{k = 1}^{5} d_{k} = \eta _{e} k_{e} + \eta _{l} k_{l} \end{aligned}$$

which, recalling (4.31) and the definition (2.5) of \(J\), provides (4.37).

When \(\rho \) belongs to the interval \((\rho _{l}, \rho _{e})\), it results \(\eta _{e}, \eta _{l} > 0\) and then, in view of (2.10), (2.11), the points of \(\{\mathring{d}\}\) must satisfy both normal tissue constraints with the equality sign, i.e. they must satisfy system (3) of (4.6), which admits solutions \(d \ge 0\) when \(k_{e} - k_{l} > 0,\,\rho _e k_{l} - \rho _{l} k_{e} > 0,\,1 \le v \le 5\) (Table 2). In such a case, points of \(\{\mathring{d}\}\) are optimal solutions as any other (structured) point possibly coming from case (1) or (2) of (4.6) would result in a cost function greater than or equal to \(J(\mathring{d})\). In fact, points satisfying system (1) of (4.6) have the cost function (4.10) which, in view of (4.11) and for \(\rho < \rho _{e}\), is such that \(J(d) \ge J(\mathring{d})\). Similarly, for points (2) of (4.6) it is \(J(d) \ge J(\mathring{d})\), according to (4.12) and taking into account (4.13) and \(\rho > \rho _{l}\).

When \(\rho = \rho _{l}\), from (4.31) we get \(\eta _{e} = 0,\eta _{l} = 1\) so that acceptable points of \(\{\mathring{d}\}\) belong to the boundary of the late constraint and are defined by system (2) of (4.6). Solutions \(d \ge 0\) of system (2) of (4.6) exist for \(k_{e} - k_{l} > 0,\,\rho _e k_{l} - \rho _{l} k_{e} \le 0\) or \(k_{e} - k_{l} > 0,\,\rho _e k_{l} - \rho _{l} k_{e} > 0,\,v \ge 1\) (see bottom rows of Table 2). Moreover, the cost function (4.37) turns out to be \(J(\mathring{d}) = - k_{l}\). This value is the minimum value that \(J\) can assume when \(\rho = \rho _{l}\), because evaluating \(J\) in any other point satisfying system (1) or (3) of (4.6) by means of (4.9), (4.10) and (4.11), it results \(J(d) \ge - k_{l}\).

Otherwise, when \(\rho = \rho _{e}\) it is \(\eta _{e} = 1,\,\eta _{l} = 0\) so that points of \(\{\mathring{d}\}\) belong to the boundary of the early constraint defined by system (1) of (4.6), which admits solutions \(d \ge 0\) only for \(k_{e} - k_{l} \le 0,\,\rho _e k_{l} - \rho _{l} k_{e} > 0\) or \(k_{e} - k_{l} > 0,\,\rho _e k_{l} - \rho _{l} k_{e} > 0,\,v \le 5\) (see top rows of Table 2). For all these points, it is \(J(\mathring{d}) = - k_{e}\) and, evaluating (4.9), (4.12) and (4.13), it can be verified that \(J(d) \ge - k_{e}\) for any other point satisfying system (2) or (3) of (4.6). \(\square \)

The totality of extremal candidates for different intervals of the tumour parameter \(\rho \) and for the normal tissue parameter values characterizing different geometries of the domain \(D\) are summarized in Table 3.

Table 3 Extremal candidates with respect to the tumour and normal tissue parameters

5 Influence of the upper bound \(d_M\) on the extremals

Aim of this section is to identify the extremals of Problem 1, by verifying what are the candidates of Table 3 satisfying the constraint \(d_{k} \in [0, d_{M}],\,k = 1, \ldots , 5\). Let us examine the second column of Table 3, reporting the situation of no prevalent normal tissue constraint, starting from the structured candidates.

In order to check for any \(i = 1, \ldots , 5,\,j = 0,\ldots ,5-i\), if it is \(A_{e}(i, j), A_{l}(i, j)\in [0, d_{M}]\), it suffices to compare the upper limit \(d_{M}\) to the elements of the subset for \(j=0\), i.e. to the quantities \(A_{e}(i, 0), A_{l}(i, 0),\,i = 1,\ldots ,5\), that do not depend on \(d_{M}\) itself, but exclusively on the normal tissue parameters. Although a general ordering of these quantities cannot be defined a priori, as it depends on the normal tissue parameter values, the five minima \(A_{l}(i, 0),\,i = 1,\ldots ,[v]\) and \(A_{e}(i, 0),\,i=\min \{[v]+1,5\},\ldots ,5\), taken within “early-late” couples (see Corollary 4) are univocally ordered, and the next Theorem shows that the comparison of \(d_{M}\) only with these values allows us to select the structured extremals.

Theorem 3

For a given integer \(u = 1, \ldots , 4\), if

$$\begin{aligned} d_{M} \in \Big (\min \{A_{e}(u + 1, 0), A_{l}(u + 1, 0)\}, \, \min \{A_{e}(u, 0), A_{l}(u, 0)\}\Big ], \end{aligned}$$
(5.1)

the structured extremals are

$$\begin{aligned} \left\{ \begin{array}{ll} d_{e}(i, j),&{}\quad \hbox {if }A_{e}(i, j) \le A_{l}(i, j), \\ d_{l}(i, j),&{}\quad \hbox {else }, \end{array} \right. \quad j=0,\ldots ,u,\ i=u+1-j,\ldots ,5-j.\nonumber \\ \end{aligned}$$
(5.2)

Proof

It is useful to preliminarily settle some properties holding for \(i = 1,\ldots , 5,\,j = 0, \ldots ,5-i\) and for both \(A_{e}(i, j),\,A_{l}(i, j)\). Without loss of generality, we consider for instance the values \(A_{e}(i, j)\), and applying the same argument used to derive properties (4.20), we obtain for any index \(h=1,\ldots ,5\)

$$\begin{aligned} d_{M} \ge A_{e}(h, 0) \Leftrightarrow A_{e}(i, j) \le d_{M},\quad i+j=h, \end{aligned}$$
(5.3)

and, for any \(m=1,\ldots ,4\)

$$\begin{aligned} d_{M} \le A_{e}(m, 0) \Leftrightarrow A_{e}(i, j) \ge 0,\quad j=m. \end{aligned}$$
(5.4)

Moreover, from (4.17) it is evident that \(A_{e}(1, 0)>A_{e}(2,0)>\cdots >A_{e}(5, 0)\), which along with (5.3) and (5.4) imply the following admissibility conditions for \(A_{e}(i, j)\):

$$\begin{aligned}&d_{M} \ge A_{e}(h, 0) \Rightarrow A_{e}(i, j) \le d_{M},\quad h \le i + j \le 5,\end{aligned}$$
(5.5)
$$\begin{aligned}&d_{M} \le A_{e}(m, 0) \Rightarrow A_{e}(i, j) \ge 0,\quad 1 \le j \le m. \end{aligned}$$
(5.6)

Note that, being \(A_{e}(i, 0)\) positive by definition, property (5.6) can be extended including \(j=0\) in the range of \(j\), becoming

$$\begin{aligned} d_{M} \le A_{e}(m, 0) \Rightarrow A_{e}(i, j) \ge 0, \quad 0 \le j \le m. \end{aligned}$$
(5.7)

Still from (5.3), (5.4), we can derive also the opposite conditions about the non-admissibility of \(A_{e}(i, j)\):

$$\begin{aligned}&d_{M} < A_{e}(h, 0) \Rightarrow A_{e}(i, j) > d_{M}, \quad 1 \le i + j \le h,\end{aligned}$$
(5.8)
$$\begin{aligned}&d_{M} > A_{e}(m, 0) \Rightarrow A_{e}(i, j)<0,\quad m \le j \le 4. \end{aligned}$$
(5.9)

Keeping in mind that in the current setting it is \(1\le v \le 5\) and that in view of Corollary 3, only structured vectors having doses equal to \(\min \{A_{e}(i, j), A_{l}(i, j)\}\) can be accepted, \(d_M\) is chosen as in (5.1) for \(u = 1,\ldots ,4\) and we can prove that for any \(u\), from properties (5.5)–(5.9) it follows \(\min \{A_{e}(i, j), A_{l}(i, j)\} \in [0, d_{M}]\) for \(j=0,\ldots , u,\,i = u + 1 - j, \ldots , 5 - j\), while it follows \(\min \{A_{e}(i, j), A_{l}(i, j)\} \notin [0, d_{M}]\) for all the remaining pairs \(i,j\). This is equivalent to prove (5.2).

Let us denote by \(A_{p}(u + 1, 0)\) and \(A_{P}(u, 0)\) the quantities \(\min \{A_{e}(u + 1, 0), A_{l}(u + 1, 0)\}\) and respectively \(\min \{A_{e}(u, 0), A_{l}(u, 0)\}\), where the subscripts \(p,P\) can be equal to ‘e’ or ‘l’, depending on being the minimum of ‘e’ or ‘l’ type (which in turns depend on the value of \(v\) in \([1,5]\)). Let us similarly denote by \(A_{\bar{p}}(u + 1, 0)\) and \(A_{\bar{P}}(u, 0)\) the quantities \(\max \{A_{e}(u + 1, 0), A_{l}(u + 1, 0)\}\) and \(\max \{A_{e}(u, 0), A_{l}(u, 0)\}\). Reminding that \(1 \le i + j \le 5\) and \(0 \le j \le 4\), in what follows we intend that a given property has to be disregarded whenever \(i + j\) or \(j\) is out of its allowed range, as the related structure is missing.

Let us initially assume \(d_{M}\) within the open interval \((A_{p}(u + 1, 0), A_{P}(u, 0))\). So, it is \(d_{M} < A_{P}(u, 0) \le A_{\bar{P}}(u, 0)\), and (5.8) implies \(A_{e}(i, j),\,A_{l}(i, j) > d_{M}\), for \(1 \le i + j \le u\), indicating that the related structures are not admissible. Moreover, (5.7) guarantees \(A_{e}(i, j),\,A_{l}(i, j) > 0\), for \(j = 0, \ldots , u\). Furthermore, being \(d_{M} > A_{p}(u + 1, 0)\), from (5.9) it results \(A_{p}(i, j) < 0\), for \(j = u + 1,\ldots ,4\), and the related structures have to be excluded. The structures \(d_{\bar{p}}(i, j)\), with \(j = u + 1, \ldots , 4\), have to be excluded as well, either directly, because \(A_{\bar{p}}(i, j) < 0\), or because, being \(A_{\bar{p}}(i, j) > 0 > A_{p}(i, j)\), they are not minimal. Exploiting again \(d_{M} > A_{p}(u + 1, 0)\), from (5.5) it certainly is \(A_{p}(i, j) < d_{M}\), for \(u + 1 \le i + j \le 5\), but not necessarily \(A_{\bar{p}}(i, j) < d_{M}\). Nevertheless, it is guaranteed that \(\min \{A_{p}(i, j), A_{\bar{p}}(i, j)\} < d_{M}\), for \(u + 1 \le i + j \le 5\).

Summarizing, it is \(\min \{A_{e}(i, j), A_{l}(i, j)\} \in (0, d_{M})\), for \(j = 0, \ldots , u,\,i = u + 1 - j, \ldots , 5 - j\), whereas \(\min \{A_{e}(i, j), A_{l}(i, j)\} \notin (0, d_{M})\) for all the remaining indexes; then the vectors (5.2) are all (and only) the structured extremals.

To complete the proof we just need to show that the set of structured extremals is given by (5.2), even for \(d_{M} = A_{P}(u, 0)\). It is \(A_{P}(u, 0)\le A_{\bar{P}}(u, 0)\), so that from (5.3), (5.4) we get \(A_{\bar{P}}(i, j) \ge A_{P}(i, j) = d_{M}\) for \(i + j = u\), and \(A_{\bar{P}}(i, j) \ge A_{P}(i, j) = 0\) for \(j = u\), which means that the structures \(d_{P}(i, j)\) with \(i + j = u\), and \(d_{P}(i, j)\) with \(j = u\), are extremals (for the same index pair, \(d_{\bar{P}}(i, j)\) can be extremal only when it coincides with \(d_{P}(i, j)\)). Note that the vectors \(d_{P}(i, j)\), with \(i + j = u\), and the vectors \(d_{P}(i, j)\), with \(j = u\), are all equal to the vector having \(u\) entries equal to \(d_{M}\) and \(5 - u\) zeroes, so that it is unnecessary to explicitly include \(d_{P}(i, j)\), with \(i + j = u\), in the set (5.2). \(\square \)

Remark 1

Linking together all the intervals in (5.1) for \(u=1,\ldots ,4\), we get the total variability range of \(d_{M}\), except the interval \((A_{l}(1, 0), \infty )\). However, it is evident from (5.5) and (5.9) that for \(d_M>A_{l}(1, 0)\) only the structures with \(j = 0\), and precisely \(d_{l}(i, 0),\,i = 1, \ldots , [v]\) and \(d_{e}(i, 0),\,i = \min \{[v] + 1,\, 5\},\ldots , 5\), can be extremals. So, as it could be expected, setting \(d_M>A_{l}(1, 0)\), we find the same structured extremals found in the absence of a dose upper bound (Bertuzzi et al. 2013b).

In order to select the extremals with respect to \(d_M\) and classify them in terms of \(\rho \) and \(v\), we enunciate a theorem, deferring the proof to the paper by Conte and Papa (2013) for the sake of brevity.

Theorem 4

When \(k_{e} - k_{l} > 0,\,\rho _{e} k_{l} - \rho _{l} k_{e} > 0\) and \(v\in [1,5],\) the minimum point solution of the optimization problem

$$\begin{aligned} \min _{d\in \tilde{D}} \{d_5\} \end{aligned}$$
(5.10)

with

$$\begin{aligned} \tilde{D}=\left\{ d\in R^5\, |\, g_{l}(d) = 0,\, g_{e}(d) \le 0, \, d_5\ge d_4\ge d_3 \ge d_2 \ge d_1\ge 0 \right\} , \end{aligned}$$

is the vector \(d_{R}\) having \([v]\) components equal to \(R_{1[v]},\) one component equal to \(S-[v]R_{1[v]}\) \((\)provided that \(v<5),\) and \(5-[v]-1\) components equal to zero \((\)provided that \(v<4),\) where

$$\begin{aligned} R_{1[v]}=\frac{S}{[v]+1}\left( 1+\sqrt{\frac{[v]+1-v}{v[v]}} \right) . \end{aligned}$$

Moreover\(,\) \(R_{1[v]}\) is the minimum value of the cost function \(d_{5},\) and it holds

$$\begin{aligned} R_{1[v]} >S-[v]R_{1[v]}\ge 0. \end{aligned}$$
(5.11)

The optimum vector \(d_{R}\) satisfies both \(g_{e}(d_{R})=0\) and \(g_{l}(d_{R})=0\).

Proof

See Conte and Papa (2013). \(\square \)

The previous result is significant for Problem 1 as it shows that the quantity \(R_{1[v]}\) acts as a threshold: when \(d_M<R_{1[v]}\), there exist no points of \(g_{l}(d) = 0,\,g_{e}(d) \le 0,\,d \ge 0\) satisfying the upper bound, and therefore only the “early” constraint boundary contains extremals.

From Theorem 4, we deduce the position of \(R_{1[v]}\) within the range of \(d_M\). Precisely, for \(v\in [1,5)\), it is

$$\begin{aligned} A_{e}([v]+1, 0) < R_{1[v]} \le A_{l}([v], 0). \end{aligned}$$
(5.12)

Indeed, \(d_{R}\) satisfies the equations

$$\begin{aligned} {[v]}R_{1[v]}^{2}+ [v]\rho _{e} R_{1[v]} + (S-[v]R_{1[v]})^{2} +\rho _{e}(S-[v]R_{1[v]}) - k_{e}=0, \end{aligned}$$
(5.13)

and

$$\begin{aligned} {[v]}R_{1[v]}^{2}+ [v]\rho _{l} R_{1[v]} + (S-[v]R_{1[v]})^{2} +\rho _{l}(S-[v]R_{1[v]}) - k_{l} = 0. \end{aligned}$$
(5.14)

Then, taking into account (5.11), we can easily get (5.12) from the equations obtained subtracting from (5.13), Eq. (4.16) written for \(A_e([v]+1,0)\), and subtracting from (5.14), Eq. (4.18) written for \(A_l([v],0)\). Moreover, for \(v=5\), it is \(R_{1 5} \equiv A_{l}(5, 0) \equiv A_{e}(5, 0)\) (Conte and Papa 2013), but then this value falls below the range of interest for \(d_{M}\).

Table 4 summarizes the extremals of Problem 1 reporting how they change as \(\rho ,v\) and \(d_{M}\) change. Since the intervals of \(d_{M}\) identified so far are delimited by the values \(A_{e}(5, 0),\ldots , A_{e}(\min \{[v] + 1, 5\}, 0), R_{1[v]}, A_{l}([v], 0), \ldots , A_{l}(1, 0)\) which depend on \(v\), a different extremal table in terms of \(\rho \) and \(d_{M}\) might be laid down for \(v\) belonging to subintervals of \([1, 5]\) between two consecutive integers. We propose here a unique table for an arbitrary value of \(v\in [1,5]\). So, the heading row of the table reporting an interval for \(v\), is intended to specify whether the related \(d_M\) interval has to be taken into account or not. The first and fourth table columns (when they exist) can be multiple columns and have to be repeated, according to \(u\), a number of times still dependent on \(v\). The interval in the third column reduces to a single point for \(v\equiv [v]\), since in that case \(R_{1[v]}\equiv A_{l}([v], 0)\) (Conte and Papa 2013). Note that, on the basis of the chosen subdivision in \(d_M\) intervals, when \(d_{M} \ge R_{1 [v]}\) it is not always possible to specify whether an extremal is an “early” vector or a “late” vector, because the minimum between \(A_{e}(i, j),\,A_{l}(i, j)\) can be different within the same \(d_M\) interval. Nevertheless, as we will see in the next section, the chosen subdivision is the minimal one allowing to determine the optimal solution for any \(d_M\).

Table 4 Extremals with respect to the tumour parameter \(\rho \) and the value of \(d_M\) in the absence of a prevalent constraint

As far as the set \(\{\mathring{d}\}\) of non-structured candidates is concerned, it can contain infinite extremals which are all equivalent with respect to the value of \(J\) (Theorem 2). When such points do verify the upper bound (Theorem 4), they are actually extremals, and it is possible to select a single representative vector satisfying all the admissibility constraints. In particular, in Table 4 it has been reported the representative that satisfies the dose upper bound for the largest range of \(d_{M}\) values. Thus, for \(\rho = \rho _{e}\) the chosen representative is \({\mathring{d}}_{e}(5,0)\), because \(d_{M} > \min \{A_{e}(5, 0), A_{l}(5, 0)\} = A_{e}(5, 0)\), whereas for \(\rho _{l} \le \rho < \rho _{e}\) the representative vector is \({\mathring{d}}_{R}\).

For problems in which one constraint prevails over the other (first or third columns of Table 3) all the extremals belong to the same (early or late) boundary, so that the tables of the extremals become remarkably simpler. We omit writing down the tables giving only some hints for their construction. The extremal tables can be simply derived by means of Theorem 3, for prevalent early constraint and \(\rho <\rho _{e}\), or else for prevalent late constraint and \(\rho <\rho _{l}\). Moreover, when the early constraint is prevalent the only extremal is \(d_{e}(5, 0)\) for \(\rho >\rho _{e}\) (see Corollary 5), while the point \(d_{e}(5, 0)\) is chosen as the representative for the set of non-structured extremals for \(\rho = \rho _{e}\). Similarly, in case of prevalent late constraint and for \(\rho \ge \rho _{l}\) the (unique or representative) extremal is \(d_{l}(5, 0)\).

6 Optimal solutions

Still considering the more general setting of normal tissue constraints with intersecting boundaries, and hence starting from Table 4, we can determine the optimal solution for any value of \(\rho ,d_{M}\) and \(v\). This is immediate for \(\rho >\rho _{e}\) because only one extremal exists and it necessarily coincides with the optimal solution, whereas for \(\rho \le \rho _{e}\) a few properties of the cost function \(J\) have to be taken into account. The first property concerns non-structured extremals, which, provided they exist, have minimum \(J\) and are optimal solutions, according to Theorem 2.

When only structured extremals exist, they are “early” vectors only, for \(d_{M} < R_{1 [v]}\) and \(\rho < \rho _{e}\), or mixed “early” and “late” vectors, for \(d_{M} \ge R_{1 [v]}\) and \(\rho <\rho _{l}\). In the first case, we have to select vectors with the smallest sum of the doses in that the cost function (4.10) decreases with \(\sum _{k = 1}^{5} d_{k}\) for \(\rho <\rho _{e}\). For \(i + j\) fixed, \(S_{e}(i, j)\) in Eq. (4.24) increases when \(i\) increases, as shown in the proof of Corollary 4, and, similarly, it can be proved that, for a fixed \(j,\,S_{e}(i, j)\) increases when \(i\) increases. Then, the vector \(d_{e}(1, u)\), having the smallest total dose, is the optimal solution for \(d_{M}\) in \((A_{e}(u+1, 0), A_{e}(u, 0)],\,[v] + 1 \le u \le 4\). Furthermore, \(d_{e}(1, [v])\) is the optimal solution for \(d_{M}\) in \((A_{e}([v]+1, 0), R_{1 [v]})\).

The only part of Table 4 that remains to be analyzed is \(d_{M} \ge R_{1 [v]}\) and \(\rho <\rho _{l}\). For \(\rho < \rho _{l}\) any late structure results in a cost function lower than the cost function of early structures, because \(J\) in (4.12) is lower than in (4.9) and (4.10). Then, it suffices to pick out in each \(d_{M}\) interval, the late vector of the group having the smallest \(S_{l}(i, j)\) verifying that \(A_{l}(i, j) \le A_{e}(i, j)\). Taking into account the behaviour of \(S_{l}(i, j)\), the “late” vectors with the smallest \(S_{l}(i, j)\) are: \(d_{l}(1, 0)\) for \(d_{M} > A_{l}(1, 0)\); \(d_{l}(1, u)\) for \(d_{M}\) in \((A_{l}(u+1, 0), A_{l}(u, 0)]\); \(d_{l}(1, [v])\) for \(d_{M}\) in \([R_{1 [v]}, A_{l}([v], 0)]\). For \(d_{M} > A_{l}([v], 0)\), the mentioned vectors certainly satisfy the early constraint (as reported in Table 4), whereas for \(d_{M}\) in \([R_{1 [v]}, A_{l}([v], 0)]\) we have to verify that \(A_{l}(1, [v]) \le A_{e}(1, [v])\). This inequality can be proved taking into account the definition of \(A_{l}(1, [v])\)

$$\begin{aligned} A_{l}^{2}(1, [v]) + [v] d_{M}^{2} + \rho _{l} A_{l}(1, [v]) + [v] \rho _{l} d_{M} - k_{l} = 0, \end{aligned}$$
(6.1)

and the definition of \(d_{R}\) in (5.14). Subtracting Eq. (6.1) from Eq. (5.14), we obtain

$$\begin{aligned} (S - [v] R_{1 [v]}) - A_{l}(1, [v]) = \frac{\rho _{l} + d_{M} + R_{1 [v]}}{\rho _{l} + A_{l}(1, [v]) + (S - [v] R_{1 [v]})} [v] (d_{M} - R_{1 [v]}).\nonumber \\ \end{aligned}$$
(6.2)

Since \(d_{M} \ge A_{l}(1, [v])\) when it belongs to the interval \([R_{1[v]}, A_{l}([v], 0)]\), and since in addition \(R_{1[v]} > S - [v] R_{1[v]}\) [see (5.11)], Eq. (6.2) implies \(A_{l}(1, [v]) + [v] d_{M} \le S\), which amounts to say \(A_{l}(1, [v]) \le A_{e}(1, [v])\), as seen in Sect. 4.

Taking into account all the properties proved so far, we report the optimal solutions in the absence of a prevalent constraint in Table 5.

The non-structured solutions reported in Table 5 for \(\rho \in [\rho _{l}, \rho _{e}]\) deserve few comments. When \(\rho = \rho _{l}\) or \(\rho = \rho _{e}\) we get limit conditions in which the LQ responses of tumour and normal tissue coincide. So, when for instance it is \(\rho = \rho _{l}\), any admissible point belonging to the “late” boundary yields the same cost function value equal to the minimum value. For \(\rho \in (\rho _{l}, \rho _{e})\) and \(d_{M} \ge R_{1 [v]}\), again we get an infinite set of optimal solutions consisting of admissible points of the intersection between the “early” and “late” boundaries.

Table 5 Optimal solutions with respect to the tumour parameter \(\rho \) and the value of \(d_M\) in the absence of a prevalent constraint

We finally consider the situation of prevalent “late” constraint. Table 6 depicts the optimal solutions in this case and it is simply obtained exploiting the properties of the cost function (4.12) and of the total dose \(S_{l}(i, j)\). Figure 1 illustrates the behaviour of the five optimal doses as a function of \(d_{M}\) for slowly proliferating tumours (\(\rho < \rho _{l}\)), according to (4.19). Table 6 and Fig. 1 are still valid for prevalent “early” constraint, as long as the appropriate “early” quantities and parameters are used.

Fig. 1
figure 1

Patterns of optimal dose fractions as functions of \(d_{M}\) when the late constraint prevails and for tumours with \(\rho <\rho _l\). Upper panel Behaviour of the daily dose fractions with the (arbitrary) choice \(d_1\ge d_2\ge d_3\ge d_4\ge d_5\). Lower panel Optimal weekly schedules corresponding to six significant values of \(d_M\) (indicated by vertical lines)

Table 6 Optimal solutions with respect to the tumour parameter \(\rho \) and the value of \(d_M\) in case of prevalent late constraint

It is worth noting that the optimal solutions of Table 6 for prevalent late constraint coincide with the optimal solutions of Table 5 when \(v=5\). Then the geometry of the admissible domain is such that the only admissible point on the “early” boundary is \(d_e(5,0)\) and it is \(d_e(5,0)\equiv d_l(5,0)\) (see Appendix). Moreover, any other point on the “late” boundary makes the early constraint strictly satisfied, which means that for \(v=5\) the late constraint prevails although not strictly. This is a meaningful situation from the applicative point of view, as we see in the next section.

6.1 Numerical examples of optimal schedules

Tables 5 and 6 depict the optimal schemes of dose fractionation in a single week of treatment for different tumour classes and for different values of \(d_M\). Hence, according to the present formulation, which assumes the total maximal damages to normal tissues equi-distributed over an assigned number of weeks, we obtain overall radiotherapy schedules consisting in the repetition of the same optimal scheme for any treatment week. It is interesting to compare our optimal schedules to the literature, providing examples based on real clinical protocols applied to specific tumours.

All the results in the present section refer to values of the normal tissue parameters commonly found in the literature (Fowler 2010; Yang and Xing 2005). In particular, we assume for the early responding normal tissue \(\rho _e=10\) Gy, \(\alpha _e=0.35\) Gy\(^{-1},\,T_{Ke}=7\) days and \(T_{Pe}=2.5\) days. As for the late normal tissue, the radiosensitivity ratio is set to \(\rho _l=3\) Gy, while the other parameters do not need to be specified, as the compensatory repopulation is negligible (Fowler 2012).

Concerning the maximal admissible damages to normal tissues, it is usual in the literature to express them by means of the quantity called BED introduced by Barendsen (1982) and defined as the total radiation dose proportional to the logarithmic cell kill globally produced by a reference protocol with equal dose fractions \({{\bar{d}}}\). Denoting by BED\(_e\), BED\(_l\) the quantities related to the early and late tissue respectively, the maximal damages are computed as \(C_{e}=\alpha _e \mathrm{BED}_e\) and \(C_{l}=\alpha _l \mathrm{BED}_l\) so that, referring to conventional protocols of the kind one fraction/day, five fractions/week, delivered over \(\nu \) weeks, we can write

$$\begin{aligned} \mathrm{BED}_e&= 5 \nu {{\bar{d}}} \left( 1 + \frac{{\bar{d}}}{\rho _{e}}\right) - \frac{\ln (2)}{\alpha _e T_{Pe}} (T - T_{Ke}) H(T - T_{Ke}),\end{aligned}$$
(6.3)
$$\begin{aligned} \mathrm{BED}_l&= 5 \nu {{\bar{d}}} \left( 1 + \frac{{\bar{d}}}{\rho _{l}}\right) . \end{aligned}$$
(6.4)

In view of (2.4), \(k_{e}\) and \(k_{l}\) are quantified as

$$\begin{aligned} k_e&= 5 \rho _e {\bar{d}} \left( 1 + \frac{{\bar{d}}}{\rho _{e}}\right) ,\end{aligned}$$
(6.5)
$$\begin{aligned} k_l&= 5 \rho _l {\bar{d}} \left( 1 + \frac{{\bar{d}}}{\rho _{l}}\right) , \end{aligned}$$
(6.6)

which imply \(k_{e} > k_{l}\) and \(\rho _e k_{l} > \rho _l k_{e}\), being \(\rho _{e} > \rho _{l}\), and \(v = 5\). Thus, adopting the BED formalism as a measure of the damage to normal tissues, we obtain optimal solutions that fall within Table 6 and are such to produce on the late tissue the maximal damage set as admissible.

The examples are organized as follows. First, we single out a specific tumour class setting the related radiosensitivity and repopulation parameters. It is evident from Table 6 that in order to determine the optimal treatment, we only need to distinguish between \(\rho <\rho _l\) and \(\rho \ge \rho _l\), which is a favourable feature, as in general the tumour parameters are only approximately known because of the high heterogeneity of tumour response to radiation. Next, we select a uniform clinical radiotherapy protocol as a reference protocol and, from (6.4), we determine the tolerable reference value of BED\(_l\) accordingly. Keeping fixed this value of BED\(_l\) and the number of treatment weeks \(\nu \), we let \(d_M\) increase and we recompute the optimal values of the daily fractions in each week enhancing the tumour damaging efficiency. Finally, the fractionation schemes obtained for each \(d_M\) are compared with respect to the induced tumour damage expressed in terms of “log cell kill”:

$$\begin{aligned} \log _{10} \left( \frac{1}{S}\right) = \alpha \, \log _{10} (\mathrm{e}) \left( \,\sum _{k = 1}^{5\nu } d_{k} + \frac{1}{\rho } \sum _{k = 1}^{5\nu } d_{k}^{2} - \frac{\ln (2)}{\alpha T_{P}} (T - T_{K}) H(T - T_{K})\right) .\nonumber \\ \end{aligned}$$
(6.7)

For the numerical application to specific tumours, we focus on the values \(\rho = 10\) Gy (Fowler 2008, 2012) and \(\rho =1.5\) Gy (Brenner and Hall 1999; Yang and Xing 2005) of the radiosensitivity ratio, typically associated to head and neck or lung cancers (fast proliferating) and to prostate cancer (slowly proliferating), respectively.

Let us consider \(\rho =10\) Gy and a reference protocol with dose fractions \({\bar{d}}\). It is evident from Table 6 that the optimal solution for any tumour having \(\rho \ge \rho _{l}\) coincides with the reference protocol and it is independent of \(d_{M}\), provided \({\bar{d}}\le d_M\). For instance, we take as a reference protocol the so-called “strong standard” fractionation schedule, \(35\, \mathrm{F} \times 2\, \mathrm{Gy} = 70\, \mathrm{Gy}/46\) days (\(\nu = 7,\,{\bar{d}} = 2\) Gy) that yields BED\(_{l} = 116.7\) Gy and BED\(_{e} = 53.1\) Gy (Fowler 2008; Yang and Xing 2005). Consequently, as long as \(d_M\ge 2\) Gy, the optimal weekly scheme consists of five fractions equal to 2 Gy. Moreover, assuming \(\alpha =0.35\) Gy\(^{-1},\,T_{K}=21\) days, \(T_{P}=3\) days (Fowler 2012), the optimal protocol results in a tumour log cell kill equal to 10.26.

On the other hand, for slowly proliferating tumours (\(\rho <\rho _{l}\)) the optimal solution tends to be hypofractionated (fewer and higher fractions) if \(d_{M}\) is allowed to increase, resulting in higher log cell kill and lower \(\mathrm{BED}_e\) compared to uniform regimes. For a comparison with the clinical literature, we have chosen two reference protocols used in prostate cancer treatment: the already mentioned “strong standard” (Fowler et al. 2003b) and the (shorter) protocol by Lukka et al. (2005) (NCIC protocol), consisting of \(20\, \mathrm{F} \times 2.625\, \mathrm{Gy} = 52.5\, \mathrm{Gy}/25\) days (\(\nu = 4,\,{\bar{d}} = 2.562\) Gy) and characterized by BED\(_{l} = 98.44\) Gy and BED\(_{e} = 52.02\) Gy. For the tumour parameters, beyond \(\rho = 1.5\) Gy, we assume \(\alpha =0.1\) Gy\(^{-1},\,T_{K}=300\) days, \(T_{P}=40\) days (Yang and Xing 2005).

Table 7 Comparison of optimal schedules iso-effective with respect to the late complications, BED\(_{l} = 116.667\) Gy, and with fixed number of weeks, \(\nu = 7\), for tumours having \(\rho =1.5\) Gy and for different values of \(d_{M}\)
Table 8 Comparison of optimal schedules iso-effective with respect to the late complications, BED\(_{l} = 98.438\) Gy, and with fixed number of weeks, \(\nu = 4\), for tumours having \(\rho =1.5\) Gy and for different values of \(d_{M}\)

Tables 7 and 8 report the optimal schedules for different values of the maximum tolerable fraction size \(d_{M}\). The values of \(d_{M}\) in the first columns have been actually used as daily doses in clinical treatments of prostate cancers (Ritter et al. 2009), though not necessarily in uniform schedules. For each \(d_{M}\), the tables report the optimal week scheme along with the resulting total number of fractions and the total dose over \(\nu \) weeks. Moreover, the tumour log cell kill (LCK) and \(\mathrm{BED}_e\) are computed, along with their relative (per cent) variations with respect to the corresponding reference quantities. In Fig. 2 these latter relative variations are plotted as functions of \(d_{M}\) for both reference protocols. The numerical application of the proposed procedure to slowly proliferating tumours evidences, for \(d_{M}>{\bar{d}}\), the optimality of hypofractionated schedules that result in increased log cell kill and reduced damage to the early responding normal tissue than conventional uniform protocols. As additional advantages we mention the reduction of the total radiation dose and of the number of fractions delivered. As \(d_{M}\) increases, higher dose fractions are permitted, making the advantages of non-uniform fractionation schemes more marked, particularly in Table 7 referring to the “strong standard” protocol. For either reference protocol, we observe that any weekly scheme obtained for a given value of \(d_{M}\) (including the reference) is still an admissible solution for higher values of \(d_{M}\), but clearly it is not the optimum.

Fig. 2
figure 2

Relative variations of tumour log cell kill (squares) and BED\(_e\) (circles) as \(d_{M}\) increases. Left panel “strong standard” protocol (Fowler et al. 2003b); right panel NCIC protocol (Lukka et al. 2005)

In conclusion, the results of Tables 7, 8 are in agreement with clinical studies evidencing the improvement achievable in log cell kill using few dose fractions with high size for slowly proliferating tumours. Indeed, the practical use of non-uniform hypofractionated protocols in the treatment of prostate tumours has been experienced since many years, still remaining a controversial issue because of the possible occurrence of late complications acknowledged by the literature. One of the earliest hypofractionated protocols is reported by Collins et al. (1991) who used 6 fractions of 6 Gy over three weeks in prostate carcinoma treatments between 1964 and 1984. In more recent years, the availability of advanced radiotherapeutic techniques, such as IMRT, has made it possible to selectively irradiate the tumour with very high dose fractions. Examples of once weekly irradiation schemes, very similar to those obtained for \(d_M>A_l(1,0)\) (see last row of Table 8), can be found in the papers by Menkarios et al. (2011) and Tang et al. (2008). Other “extreme” hypofractionated treatments with very high daily doses up to 10.5 Gy have been reported by Ritter et al. (2009).

7 Concluding remarks

We addressed the problem of finding the optimal weekly scheme of radiotherapy dose fractions, that achieves the best trade-off between maximizing the tumour cell kill while sparing the normal tissues. Representing the cell response to radiation by means of the LQ model, we set constraints on the maximal admissible damage to early and late responding normal tissues. Furthermore, an upper limit on the dose fraction size is introduced to strengthen the normal tissue constraints.

In this work, we provide a framework to analytically determine the optimal fractionation of the radiation dose as a function of the tumour type and of the fraction upper bound, as well as of the normal tissue parameters. However, the generality of this approach required some simplifying assumptions: (i) the overall treatment time is assigned, (ii) the cumulative damage to the normal tissues over the whole treatment is assigned so that, in view of (i), the weekly damage is also assigned, even though not necessarily equi-distributed.

Concerning assumption (ii), the maximum tolerable damage to normal tissues is usually expressed in terms of the BED (Barendsen 1982; Yang and Xing 2005; Fowler 2010), so that its value becomes dependent on the treatment protocol and on the model assumed to represent the damage. The analytical study has been developed without this restriction, to give the optimal solution in terms of general model parameters, letting them vary taking positive values. However, we stress that the practical applicability of the obtained results is limited by the difficulty in assessing the model parameter values for highly heterogeneous populations such as the human tumours.

Despite the simplicity of the LQ model and the considered simplifying assumptions, the analytical determination of the optimal solutions proved to be rather complex.

A first remark on the obtained results is that the optimal protocol produce the maximal tolerable damage to at least one normal tissue.

Another remark concerns the influence of the tumour \(\alpha /\beta \) ratio on the fractionation scheme. Indeed, when \(d_{M}\) is sufficiently large we recognize, by means of the mathematical formulation of the problem, that hypofractionation is convenient when \(\alpha /\beta \) is small, whereas the optimal fractionation tends to the uniform scheme for large \(\alpha /\beta \). This result is consistent with the clinical and bio-mathematical literature (Fowler 2010; Yang and Xing 2005; Brenner and Hall 1999; Fowler et al. 2003b) and confirms the results obtained by Bertuzzi et al. (2013b).

An interesting result is how the value of \(d_{M}\) influences the optimal solution for slowly proliferating tumours, such that \(\rho <\rho _{l}\). We notice that when \(d_{M}\) decreases the number of positive doses tends to increase, and in particular the positive doses are all equal to \(d_{M}\) except one, which increases continuously from zero to \(d_{M}\) when \(d_{M}\) decreases. So, as the upper bound decreases getting “stricter”, even for slowly proliferating tumors the optimal protocol tends to become uniform. Figure 1 illustrates the behaviour of the optimal solution just described, reporting different optimal weekly schedules for different value of \(d_M\).

The same solution behaviour with respect to \(d_{M}\) characterizes the optimal solution for tumours having \(\rho \in (\rho _{l}, \rho _{e})\), as long as \(d_{M}\) is strictly less than \(R_{1 [v]}\). On the other hand, when \(d_{M} \ge R_{1 [v]}\), all the points belonging to the intersection between “late” and “early” boundaries that satisfy the upper bound \(d_{M}\) provides optimal fractionation schemes, in that they equivalently produce the maximum damage to the tumour.

Some numerical examples of application of our procedure are proposed to illustrate the behaviour of the optimal solutions with respect to the upper bound \(d_{M}\) and to quantify the tumour damages. In comparison to conventional reference protocols, the optimal fractionation schemes obtained by the present study show an appreciable advantage in terms of tumour cell kill.