Abstract
The current pandemic situation due to COVID-19 demonstrates the need for epidemiologic models to represent infection events as accurately as possible. An important factor is the mobility of the affected individuals which can be investigated with discrete or continuous spatial models. In this contribution, parameter estimation via adjoint functions is presented to fit a reaction-diffusion PDE system with epidemiological SIS model to data sets. For this purpose static and dynamic optimization methods are used to solve an L 2-norm based least squares problem. An artificial data set is generated to test the accuracy of the procedure. Subsequently, the PDE system is adapted to this data set using methods of optimal control theory. Unknown parameters like diffusivity and transmission rate can be determined. The noise in the data set is also taken into account by fitting the initial conditions. The results show that the method is well suited for this purpose and should be further used with real data sets.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
1 Introduction
The example of the current COVID-19 pandemic clearly shows the significant influence of mobility on the spread of a disease. Mathematical-epidemiological models can address this using various techniques. The movement of people between separate patches such as airports, islands, cities etc. can be represented using Lagrangian movement for short-term stays or Eulerian movement for long-term migrations [5]. The modelling here is done via ordinary differential equations (ODE). However, since this point-by-point distribution of pathogens does not reflect reality on its own, spatial spread need to be taken into account as well. This can be achieved with reaction-diffusion systems which contain partial differential equations (PDE) [2, 10]. Consequently, we consider a system of the form
The goal is to fit this model to data sets. Unfortunately, several parameters are unknown in the epidemiological context, such as the transmission rate or even the parameters describing mobility. In addition, noisy data may be expected, for this the initial value condition shall be adjusted.
In this contribution, a parameter estimation via adjoint functions is tested. This corresponds to techniques from static and dynamic optimization. To investigate the accuracy of the method, we consider an artificially generated data set. Numerical simulations are performed to fit the model to this data set.
2 Model
In the following we consider the set Ω = (0, a) × (0, b) as spatial coordinates and a time axis (0, T) with resulting domain V = Ω × (0, T). To model a spatial spread of an infectious disease, we use an epidemiological SIS model. The resulting reaction-diffusion system reads as
The functions S, I, N ∈ C 2, 1(V ) represent the densities of the compartments of susceptible (S) and infected (I) individuals and the total population density N = S + I in coordinate x at time t.
Here, e.g. \(\partial _t S=\frac {\partial S}{\partial t}\) stands for the time derivative of S and \(\Delta S = \text{div}(\text{grad } S) = \frac {\partial ^2 S}{\partial x_1^2} + \frac {\partial ^2 S}{\partial x_2^2}\) stands for the Laplace operator. For the two compartments initial value conditions are given by S 0, I 0 ∈ C 2( Ω). At the boundary ∂ Ω Neumann boundary conditions are implied, whereby ∂ ν S denotes the derivative of S to the direction of the unit outward normal ν. In context, this means that none of the individuals leaves the region Ω. We also assume that ∫Ω I(x, 0) dx > 0 holds with S 0, I 0 ≥ 0. We define
which stands for the total number of individuals at time t = 0. Due to the Neumann boundary conditions the Gauss’s theorem delivers
Thus, the total population is constant with value \(\overline {N}\).
The parameters β, γ > 0 represent the transmission and recovery rates of the corresponding disease and κ S, κ I > 0 the diffusivity of the corresponding compartments. For simplicity, we assume that κ s = κ I holds and β, γ are constants independent of x. For the derivation of such a model in one dimensional case and the operation of epidemiological models, we refer to [5].
SIS-based reaction-diffusion systems as in (1) have already been studied in [1, 3, 6,7,8,9]. The existence of a global and unique solution is shown, also for cases in which κ S ≠ κ I holds and β, γ are Hölder continuous functions over Ω. In [1] a Basic Reproduction Number is established on Sobolev space H 1( Ω) by
There is shown, that if \(\mathcal {R}_0<1\) holds, the unique disease-free equilibrium \(\text{DFE}=\left ( \frac {\overline {N}}{\vert \Omega \vert },0 \right )\) is globally asymptotically stable and unstable for \(\mathcal {R}_0 > 1\). The expression | Ω| here stands for the corresponding measure. On the other hand, for \(\mathcal {R}_0>1\) the existence of a unique endemic equilibrium EE is shown.
Furthermore, we set κ := κ S = κ I and substitute S = N − I. If we additionally define \(u:=\frac {I}{N}\), we receive a reduced system with f(u) := β(1 − u)u − γu
The simplifying assumptions and the normalization are used to test the presented parameter fitting via adjoint functions. It is clear that in realistic situations much more complex models should be used.
3 Adjoint System
We now want to fit model (3) to data sets using adjoint functions known from optimal control theory. In the epidemiological context, this means parameter estimation of the transmission rate β > 0 and diffusivity κ > 0. The recovery rate γ > 0 can be assumed to be the reciprocal of the average infection duration and thus does not need to be fitted. Furthermore, we assume that the data is noisy and therefore the initial condition u 0 ∈ C 2( Ω) has to be adjusted. In the following, the function u DATA contains the available data points and \(u_0^{\mathit {DATA}}\) the supposedly noisy initial value of the data set at t = 0.
We introduce an objective function \(J : \mathbb {R}^2 \times C^2(\Omega ) \rightarrow \mathbb {R}\)
The function u stands for the solution of the reaction-diffusion PDE system (3). The objective function includes the L 2-norm \(\Vert g \Vert _{L_Y^2}:=\left ( \int _Y g(y)^2 \, d y \right )^{1/2}\) and corresponding normalizing weights \(w_0:=1/ \Vert u^{\mathit {DATA}} \Vert _{L_V^2}^2\) respectively \(w_2:=1/\Vert u_0^{\mathit {DATA}} \Vert _{L_\Omega ^2}^2\). The convex and radially unbounded regularization term w 1(β 2 + κ 2) depends on a very small choosen weight w 1 whose influence is investigated in the subsequent simulations. Assuming one already has initial guess \(\hat {\beta },\hat {\kappa }\) for the parameters, a term of the form \(w_1\left ((\beta -\hat {\beta })^2 + (\kappa -\hat {\kappa })^2 \right )\) can be used alternatively.
This leads to a minimization problem with dynamic constraints
A Lagrange function is introduced containing adjoint functions z ∈ C 2, 1(V )
whereby \(g:=w_0 \left ( u - u^{\mathit {DATA}} \right )^2\) and \(\psi :=w_1 (\beta ^2 + \kappa ^2)+w_2 \int _\Omega \left ( u_0 - u_0^{\mathit {DATA}} \right )^2 \, dx\).
The necessary condition for a minimum \((\beta ^*,\kappa ^*,u_0^*,u^*,z^*)\) is fulfilled, if
holds true. It should be noted that Gâteaux derivatives are needed for the derivatives of L to the directions u 0, u and z. This leads to the following system in \((\beta ^*,\kappa ^*,u_0^*,u^*,z^*)\):
-
(i)
0 = ∂ β ψ +∫V z∂ β f dxdt, (Optimality Condition)
0 = ∂ κ ψ +∫V z Δu dxdt,
-
(ii)
\(u_0=u_0^{\mathit {DATA}}-\frac {z(x,0)}{2w_2}\), (Optimal Initial Condition)
-
(iii)
∂ t z = −∂ u g − z∂ u f − κ Δz, (Adjoint Equation)
z = 0, t = T, (Transversality Condition)
∂ ν z = 0, x ∈ ∂ Ω, (Adjoint Neumann Boundary Condition).
When L is derived in the z direction, the original PDE system (3) is recovered.
4 Numerical Simulations
From the analysis in Sect. 3, the gradient of L with respect to β and κ reads
and we obtain the adjoint equation
The latter must be solved backward in time t due to the transversality condition. This is done using the forward-backward sweep method, see [4]. The performed algorithm can be found in Appendix 1. Solving the PDEs is done using finite differences
and an explicit Euler-scheme
on the domain V = Ω × (0, T) with Ω = (0, a) × (0, b). The Neumann boundary conditions are implemented by \(u_{k+1,j}^n=u_{k,j}^n\) etc., if index (k, j) stands for a point at the rectangular boundary ∂ Ω. In the following simulations we use the setting
-
h := 0.1, τ := 0.001, a := 3, b := 2, T := 1
-
\(x_1^i=ih\): i = 0, …, 30 \(x_2^j=jh\): j = 0, …, 20 t n = nτ: n = 0, …, 1000 .
To test the procedure an artificial data set is generated with initial condition
whereby \(\delta _{(\tilde {x}_1,\tilde {x}_2)}(x_1^i,x_2^j)=1\), if \((x_1^i,x_2^j)=(\tilde {x}_1,\tilde {x}_2)\) and else \(\delta _{(\tilde {x}_1,\tilde {x}_2)} = 0\). Subsequently, the state variable PDE (3) is solved with β := 0.3, κ := 0.2 and γ := 0.1. The received solution is called \(\overline {u}\) in the following. To simulate noisy data, a normally distributed \(q_{i,j}^n \sim \mathcal {N}(0,\sigma ^2)\) is generated, so that the desired data set is calculated by
5 Results and Conclusions
The application of the presented method is tested in three simulations with different initial values β 0, κ 0. The initial value for the initial condition u 0 is taken from the desired data set u DATA. The resulting Table 1 and Fig. 1 in Appendix 2 show adequate parameter estimates. A test run without artificial noise on the data set resulted in the original values β = 0.3 and κ = 0.2. The simulations also show the effect of the weight w 1 of the regularization term on the minimization of the objective function J. Despite this disturbance, better results are obtained than without it. The prerequisite for this is a correspondingly small choice for w 1 which influences the convexity of the objective function in the respective parameters.
The present simulations show that the applied method works very well in this toy problem with self-generated data set. In principle, the procedure is suitable to perform such parameter estimations. In the next step, the method should be tested with real data sets. Depending on the disease, much more sophisticated epidemiological models may also be required. Mobility movements between patches, such as daily commuting or travelling, should also be added to the model. With respect to the PDE solution, other solution methods should also be tested, since the simple Euler method may be numerically unstable. In addition, a simple rectangular area was assumed in our example. In real cases, appropriate adjustments are necessary here.
References
Allen, L.J.S., Bolker, B.M., Lou, Y., Nevai, A.L.: Asymptotic profiles of the steady states for an SIS epidemic reaction–diffusion model. Discrete Contin. Dynam. Syst., 21, 1–20 (2008)
Britton, N.F.: Reaction-Diffusion Equations and Their Applications to Biology. Academic Press, London (1986)
Deng, K., Wu, Yixiang: Dynamics of a susceptible–infected–susceptible epidemic reaction–diffusion model. Proceedings of the Royal Society of Edinburgh: Section A Mathematics, 146(5), 929–946 (2016)
Lenhart, S., Workman, J.T.: Optimal control applied to biological models. CRC Press, 2007.
Martcheva, M.: An Introduction to Mathematical Epidemiology. Springer, New York (2015)
Peng, R.: Asymptotic profiles of the positive steady state for an SIS epidemic reaction–diffusion model. J. Diff. Eqns, 247, 1096–1119 (2009)
Peng, R., Liu, S.: Global stability of the steady states of an SIS epidemic reaction–diffusion model. Nonlin. Analysis, 71, 239–247 (2009)
Peng, R., Yi, F.: Asymptotic profile of the positive steady state for an SIS epidemic reaction–diffusion model: effects of epidemic risk and population movement. Physica D, 259, 8–25 (2013)
Peng, R., Zhao, X.: A reaction–diffusion SIS epidemic model in a time-periodic environment. Nonlinearity, 25, 1451–1471 (2012)
Webb, G.F.: A Reaction-Diffusion Model for a Deterministic Diffusive Epidemic. J. Math. Anal. Appl., 84, 150–161 (1986)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendices
Appendix 1
Algorithm 1 Pseudocode for the parameter estimation via adjoint functions
Appendix 2
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Heidrich, P., Götz, T. (2022). Parameter Estimation via Adjoint Functions in Epidemiological Reaction-Diffusion Models. In: Ehrhardt, M., Günther, M. (eds) Progress in Industrial Mathematics at ECMI 2021. ECMI 2021. Mathematics in Industry(), vol 39. Springer, Cham. https://doi.org/10.1007/978-3-031-11818-0_16
Download citation
DOI: https://doi.org/10.1007/978-3-031-11818-0_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-11817-3
Online ISBN: 978-3-031-11818-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)