Abstract
The weighted essentially non-oscillatory (WENO) methods are popular and effective spatial discretization methods for nonlinear hyperbolic partial differential equations. Although these methods are formally first-order accurate when a shock is present, they still have uniform high-order accuracy right up to the shock location. In this paper, we propose a novel third-order numerical method for solving optimal control problems subject to scalar nonlinear hyperbolic conservation laws. It is based on the first-disretize-then-optimize approach and combines a discrete adjoint WENO scheme of third order with the classical strong stability preserving three-stage third-order Runge–Kutta method SSPRK3. We analyze its approximation properties and apply it to optimal control problems of tracking-type with non-smooth target states. Comparisons to common first-order methods such as the Lax–Friedrichs and Engquist–Osher method show its great potential to achieve a higher accuracy along with good resolution around discontinuities.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
We consider the optimal control problem
with the tracking-type functional
where
and \(y=y(T,x;u_0)\) is the scalar entropy solution at the final time \(T>0\) of the nonlinear hyperbolic conservation law (later referred to as state equation)
Here, \(u_0\in {\mathcal {U}}_{ad}\subseteq L^{\infty }({\mathbb {R}})\) is the control and \(y_d \in L^2({\mathbb {R}})\) denotes a given target towards which we strive to optimize. We assume that the flux function satisfies \(f \in C^m({\mathbb {R}})\) with sufficiently large \(m\in {\mathbb {N}}\) and is convex, the admissible set \({\mathcal {U}}_{ad}\) is non-empty, convex and closed, and the region of integration I in (2) is a bounded interval. Weak solutions to (4) are in general not unique, which implies that the physically relevant solution has to be chosen. As a fact we cite the well-known result from [21], which states that for \(u_0 \in L^{\infty }({\mathbb {R}}) \cap BV({\mathbb {R}})\) there exists a unique entropy solution in the sense of Krǔzkov in the class \(C([0,T],L_{loc}^1({\mathbb {R}})) \cap L^{\infty }({\mathbb {R}}\times [0,T])\). Using well-posedness and compactness properties of this solution, the existence of a minimizer \(u_0^{min}\) in (1) can be established under some natural additional assumptions on the class of admissible data \({\mathcal {U}}_{ad}\), see e.g. [5, Theorem 2.1] and [16, Proposition A.1]. In general, uniqueness is not guaranteed due to the occurrence of discontinuous solutions in (4), which can be equal for different initial values. An illustrative example can be found in [5]. These statements generalize to the case where a regularization term \({\mathcal {R}}(u)\) is added to the objective function [28].
In this work, we focus on the numerical treatment of optimal control problems (1) governed by hyperbolic conservation laws, which has been studied amongst others in [1, 2, 5, 6, 10,11,12, 16,17,18, 22,23,24, 29,30,31]. We will follow the first-disretize-then-optimize approach, i.e., Eq. (4) is first discretized in space and time by applying a weighted essentially non-oscillatory (WENO) scheme and a strong stability preserving Runge-Kutta (SSPRK) method. This leads to a finite dimensional optimal control problem, for which the first-order discrete optimality system can be derived and solved by existing optimization solvers such as nonlinear Newton-type algorithms. In spite of the large size of the resulting problems, the flexibility of this approach naturally allows the incorporation of additional constraints and bounds. Further advantages are the direct use of automatic differentiation techniques and the computation of discrete adjoints, which are consistent with the discrete optimal control problem. Symmetric approximations of Hessian matrices can be easily derived and result in a computational speedup.
The application of common methods from nonlinear optimization requires the computation of directional derivatives of the target functional J with respect to the control. An efficient computation of the gradient can be effectuated by using the so-called adjoint approach, in which the derivative is represented via the adjoint state. The crucial issue of hyperbolic conservation laws is the possible formation of shocks even for smooth initial data, for which reason the classical adjoint calculus does not apply. To overcome these difficulties, nonstandard variational concepts have been developed in [4, 29, 31], which incorporate the shock sensitivity in order to derive rigorous optimality conditions. The resulting non-conservative equation has been studied in [3, 7, 29]. Their numerical resolution is intricate, since the interior boundary condition defined on a set of Lebesgue measure zero—existing for the continuous setting—is not present for the discrete counterpart. This inherent problem has been addressed in [1, 10,11,12]. The theory is, however, restricted to differentiable monotone schemes which have sufficiently large numerical diffusion and are of first order only.
To avoid unwanted smearing of the solution by large numerical diffusion and to overcome the lower order restriction of monotone schemes, we propose a novel approach based on WENO schemes introduced in [20, 25, 26]. These schemes have proven to approximate hyperbolic equations comprising both shocks and complex smooth solution structure with higher accuracy and adequate stability along with good resolution around discontinuities. Although these methods are formally first-order accurate when a shock is present, they still have uniform high-order accuracy right up to the shock location. WENO schemes are extensions of the ENO procedure, i.e., they perform essentially non-oscillatory, but overcome shortcomings of the ENO approximation, see [27] for a detailed discussion. By employing a global flux-splitting, the numerical flux function becomes classically differentiable and therefore allows to develop discrete adjoint WENO methods of higher order. Since the third-order WENO method is often applied in applications, we consider this method in the context of optimal control in more detail. We prove that the discrete adjoint WENO3 method is third-order consistent in space for smooth solutions. A fully discrete method is derived by applying a third-order SSPRK method. We present numerical results and study the approximation behaviour of the adjoint WENO3 scheme. Finally, we solve an optimal control problem with discontinuous target and compare the performance of our novel scheme to common first-order schemes such as the modified Lax–Friedrichs and the Engquist–Osher scheme. Further examples can be found in [9].
2 Adjoint equation and reversible solutions
In this section, we briefly recall some theoretical basics in order to set up appropriate adjoint equations for hyperbolic conservation laws. As pointed out in [4, Example 1], the solution operator \(S_t: u_0 \mapsto y(t,\cdot ;u_0)\) is generically not differentiable in \(L_{loc}^1({\mathbb {R}})\), for which reason the classical adjoint calculus does not apply. However, in [29] it has been shown that entropy solutions to hyperbolic conservation laws admit a generalized differentiable structure called shift-differentiability. Under suitable assumptions, a generalized Taylor expansion in \(L_{loc}^1\) of the form
exists for all \(\delta u_0 \in L^{\infty }({\mathbb {R}})\), where \(T_u: \delta u \in L^\infty ({\mathbb {R}}) \mapsto (\delta y^T, \delta x_1,\ldots ,\delta x_N) \in L^r(I)\times {\mathbb {R}}^N\), \(r\in (1,\infty ]\), is a bounded linear operator and \(S_y^{(x_i)}\) is the shift variation defined by
where \(\Omega _i=[\min (x_i,x_i+\delta x_i),\max (x_i,x_i+\delta x_i)]\) and \(x_1,\ldots ,x_N\) denote the locations of the down-jumps of the entropy solution. The important advantage of shift-variations is that this framework allows to develop an adjoint calculus for hyperbolic conservation laws by using an averaged sensitivity equation which avoids the linearization of (4) in the usual way, see [31] for further details. The directional derivative of J in (2) in the direction of \(\delta u_0\) can then be represented by
where p is the solution of the adjoint equation
Here, \(p^{T}(x)\) is given by
where \(X_s\) is the set of locations where \(y(T,\cdot )\) possesses a shock and \([w(x)] :=w(x-)-w(x+)\), which naturally incorporates the shock sensitivity.
Equation (8) is a linear transport equation with, in general, discontinuous coefficients. It admits multiple solutions, which requires the selection of the correct adjoint state. This is achieved by so-called reversible solutions that are defined along generalized characteristics [8]. An illustrative demonstration is given in Example 2.1. Under suitable technical assumptions and for appropriate end data \(p^T\) it can be shown that there exists a unique reversible solution to (8) that is bounded, \(L^{\infty }\)-stable, and TV-stable [29, Theorem 4.2.10 and Corollary 4.2.11]. In what follows, we will work with formulation (8) to derive a discrete adjoint WENO3 method.
Example 2.1
Let \(f(y)= \frac{1}{2} y^2\), \(u_0(x)=-\text {sign}(x)\), \(T=0.5\), and \(y_d(x)=0\) with \(x \in {\mathbb {R}}\). It is well-known that the unique entropy solution is given by \(y(t,x)=-\text {sign}(x)\), \(t \in [0,T]\), and hence we have
The area that is not occupied by the classical characteristics is called shock funnel. It is represented by the grey-coloured triangle in Fig. 1. In this region, p takes the constant value zero. The adjoint remains constant along the classical backwards characteristics outside this region. Hence, the reversible solution p is given by
3 Discrete adjoint WENO3 method
In order to discretize (4) in space, we now consider solutions with compact support [a, b] for the entire time interval [0, T] and take \(y(a)=y(b)=0\) as boundary conditions. Thus, using as many stencil points outside as needed and the compactness of the solution, the implementation of the zero boundary condition does not bear any difficulty. The interval [a, b] is partitioned into subintervals \([x_{j-1/2},x_{j+1/2}]\) of the same size \(\Delta x\) and with midpoints \(x_{j}\) for \(j=1,\ldots ,N\). Setting \({\varvec{u}}_{0}:=(u_0(x_1),\ldots ,u_0(x_N))^{\mathsf{T}}\) and defining spatial approximations \({\varvec{y}}(t):=(y_1(t),\ldots .,y_N(t))^{\mathsf{T}}\) with \(y_j(t) \approx y(t,x_j)\), a spatial semi-discretization of (4) reads
where the nonlinear operator \(F_{\Delta x}: {\mathbb {R}}^N \rightarrow {\mathbb {R}}^N\) represents the discretization of \(\partial _xf(y)\). We choose a conservative finite difference
where \({\hat{f}}_{j+1/2}: {\mathbb {R}}^m \rightarrow {\mathbb {R}}\) denotes the numerical flux at \(x_{j+1/2}\), which is (at least) a Lipschitz continuous function of m neighboring values \(y_i(t)\). In order to avoid the convergence of the scheme towards entropy violating solutions, we apply a global flux splitting
Using the simple Lax–Friedrichs splitting \(f^{\pm }(y)=(f(y) \pm \alpha y)/2\) with \(\alpha :=\max _{u} |f'(u)|\) yields the desired properties \((f^+)'(y)\ge 0\) and \((f^-)'(y)\le 0\). Then, the numerical flux functions of the WENO3 method [26] are defined by
where the weights are
The smoothness indicators are given by
and the linear weights are set to \(\gamma _1^+\!=\!\gamma _2^-\!=\!1/3,\;\gamma _1^-\!=\!\gamma _2^+\!=\!2/3\). Note that \(0<\varepsilon \ll 1\) is chosen in order to avoid the denominator becoming zero. It is set to \(\varepsilon \!=\!10^{-6}\) in our numerical calculations. We would like to emphasize the observation that by construction the numerical fluxes \({\hat{f}}^\pm\) have the same smoothness dependency on its arguments as that of the physical flux function f(y).
Next we will derive the associated adjoint WENO3 scheme. Let \(f\in C^2({\mathbb {R}})\), i.e., there exists the Fréchet derivative of \(F_{\Delta x}\) defined in (13). The continuous optimal control problem is approximated by
where \(U_{ad}=\{{\varvec{u}}\in {\mathbb {R}}_N:\,\text {TV}({\varvec{u}})\le C\}\) is the discrete admissible set. Then, applying the common Lagrangian approach in \({\mathbb {R}}^N\) with multipliers \({\varvec{p}}(t)=(p_1(t),\ldots ,p_N(t))^{\mathsf{T}}\), the adjoint equation to (12) reads
where \(\nabla _{{\varvec{y}}}F_{\Delta x}\) is the Fréchet derivative of \(F_{\Delta x}\) and gradients are treated as row vectors. The initial condition (the adjoint equation works backwards in time) is the discrete counterpart to (9). Observe that the interior boundary condition does not appear here. A short calculation yields the componentwise description
with the coefficients
The indices of the numerical flux functions are directly related to their arguments, e.g. \({\hat{f}}^{+}_{j+3/2}(y_j,y_{j+1},y_{j+2})\) due to (15). For later use, we note that \(\sum _{i=-2,\ldots ,2}L_{i,j}({\varvec{y}})=0\).
We will now study the consistency order of the adjoint WENO3 scheme, i.e., how accurately does the semi-discretization (20) approximate the continuous adjoint equation (8) in the case of smooth solutions. Inserting exact solution values \(p(t,x_j)\) and \(y(t,x_j)\) (still denoted by \(y_j\) to simplify notation) in the semi-discrete scheme (21) gives the residual-type local spatial errors
Taylor expansion around \(x_j\) yields
where we have already used that the sum of the \(L_{i,j}\) disappears. The method is said to have adjoint consistency order q if \(r_j(t)={\mathcal {O}}(\Delta x^q)\). In what follows, we will show that the adjoint WENO3 scheme satisfies all conditions for order \(q=3\).
First, we have to calculate \(\partial _{y_j}L_{i,j}\), i.e., particularly the derivatives of the numerical flux functions defined in (15), (16). Since \(\omega _1^\pm +\omega _2^\pm =1\) for all \({\varvec{y}}(t)\), we deduce \(\partial _{y_k}\omega _1^\pm =-\partial _{y_k}\omega _2^\pm\). Introducing the notation
we find
and
We have the following three lemmata.
Lemma 3.1
Suppose \(f(y),\;y(t,\cdot )\in C^2({\mathbb {R}})\). Then
Proof
Taylor expansion gives \({\bar{f}}^{\pm }_j={\mathcal {O}}(\Delta x^2)\). It remains to show that \(\partial _{y_k} \omega _1^\pm ={\mathcal {O}}(\Delta x)\). Indeed, we have
Since \(\beta _i^\pm ={\mathcal {O}}(\Delta x^2)\) for smooth flux functions f(y), the two quotients are bounded by \(({\tilde{\omega }}_i^\pm )^{-1}={\mathcal {O}}(\varepsilon ^2)\), \(i=2,1\), respectively, for \(\Delta x\rightarrow 0\). Taylor expansions of the derivatives \(\partial _{y_k} {\tilde{\omega }}_i^\pm =-2\gamma _i^\pm (\varepsilon +\beta _i^\pm )^{-3} \partial _{y_k}\beta _i^\pm\), \(i=1,2\), show \({\mathcal {O}}(\Delta x)\) for these terms and therefore also for \(\partial _{y_k} \omega _1^\pm\). \(\square\)
Lemma 3.2
Let \(\{x_{j-1},x_{j},x_{j+1}\}\) and \(\{x_{j},x_{j+1},x_{j+2}\}\) be two neighboring stencils and \(w_{i,j}^\pm\), \(w_{i,j+1}^\pm\), \(i=1,2,\) the corresponding weights. Suppose \(f(y),\;y(t,\cdot )\in C^3({\mathbb {R}})\). Then
Proof
We consider the weights \(w_{1,j}^+\) and \(w_{1,j+1}^+\). Analogous calculations can be done for the other cases. We set \(h(x):=f^+(y(x))\) and define \(h_j:=f^+(y(x_j))\). Then
Due to the strict positivity of the weights, it remains to study the asymptotic behaviour of the numerator. Using the definitions (17) and (18), we have
with the smoothness indicators
Taylor expansion at \(x_j\) yields in (32)
which shows the assertion. \(\square\)
Lemma 3.3
Suppose \(f(y),\;y(t,\cdot )\in C^2({\mathbb {R}})\). Then
Proof
We first consider \(\omega _1^+ - \gamma _1^+\). The difference can be expressed by
The denominator is bounded from below by \((\gamma _1^+ + \gamma _2^+)\varepsilon ^{-2}=\varepsilon ^{-2}>0\). Further, we deduce for the numerator
Let \(h(x):=f^+(y(x))\) and define \(h_j:=f^+(y(x_j))\). Inserting the smoothness indicators \(\beta _1^+=(h_j-h_{j-1})^2\) and \(\beta _2^+=(h_{j+1}-h_{j})^2\), Taylor expansion at \(x_j\) yields
Putting this together with the bound for the denominator stated above gives \(\omega _1^+ - \gamma _1^+ = {\mathcal {O}}(\Delta x^3/\varepsilon )\), from which we can conclude the proof. The same arguments apply to the second difference \(\omega _1^- - \gamma _1^-\). \(\square\)
We are now ready to state the main result of this section.
Theorem 3.1
Let \(f(y),\;y(t,\cdot )\in C^3({\mathbb {R}})\) and \(p(t,\cdot )\in C^4({\mathbb {R}})\). Then the adjoint WENO3 scheme (20) is adjoint consistent of order three, i.e., \(r_j(t)={\mathcal {O}}(\Delta x^3)\) in (24).
Proof
Let us define \(d_k:=\sum _{i=-2,\ldots ,2}i^{k+1}\partial _{y_j}L_{i,j}({\varvec{y}}(t))\), \(k=0,1,2,\) and denote by \(w_{i,m}^\pm\) the weights that correspond to the stencils \(\{x_{m-1},x_{m},x_{m+1}\}\), \(m=j-1,j,j+1\). From (22), we calculate
which gives by using (26), (27) for different stencils and Lemma 3.1 for all terms \(\partial _{y_j} \omega _1^\pm {\bar{f}}^{\pm }_m\) with \(m=j-1,j,j+1\),
Eventually, Lemma 3.2 and the property \(w_{1,j}^\pm +w_{2,j}^\pm =1\) yields
Analogously, we derive
and
Lemma 3.2 directly shows that \(d_1={\mathcal {O}}(\Delta x^3)\). Using \(w_1^\pm +w_2^\pm =1\) and again Lemma 3.2, the linear combinations of the weights in \(d_2\) can be simplified to \(2-3w_{1,j}^-\) and \(3w_{1,j}^+-1\) up to order \({\mathcal {O}}(\Delta x^4)\), respectively. Applying now Lemma 3.3 with \(\gamma _1^+=1/3\) and \(\gamma _1^-=2/3\) to these expressions gives \(d_2={\mathcal {O}}(\Delta x^3)\).
In a last step, we use the asymptotic expressions for \(d_i\), \(i=0,1,2,\) to calculate the residual-type local spatial error
This concludes the proof. \(\square\)
4 Numerical experiments
In this section, we will present some numerical examples for Burgers equation, i.e., we study problems with the nonlinear flux function \(f(y)=\frac{1}{2} y^2\) in (4). The first example with smooth initial data and solution is chosen in order to check to third-order convergence of the discrete adjoint WENO3 method as stated in Theorem 3.1. In the second example, the approximation property of the discrete adjoint in the case of a shock in the initial solution is investigated and compared to approximations computed by means of the first-order modified Lax–Friedrichs (LF) and Engquist–Osher (EO) schemes. These schemes read
with \(y_j^n\approx y(n\Delta t,x_j)\), \(n_T\,\Delta t=T\), and numerical fluxes given by
Applying a standard Lagrangian approach and discrete adjoint calculus, the discrete adjoint schemes can be derived from [16, Prep. 3.1] as
with the coefficients
for the LF scheme and
for the EO scheme. Convergence of these schemes has been intensively studied in [1, 10,11,12, 29]. The choice \(\gamma \!=\!1\) leads to the classical LF method. Stability requirements for the adjoint LF and EO schemes yield the optimal value \(\gamma ^\star \!=\!0.5\) together with the CFL-condition \(\Delta t \le \gamma ^\star \Delta x/\sup |f'(y)|\), see e.g. [16]. Then, both schemes converge for Lipschitz continuous end data \(p^T(x)\) in (8). The stronger condition \(\Delta t \le \gamma ^\star (\Delta x)^{2-q}/\sup |f'(y)|\), \(0<q<1\), ensures the convergence of the modified LF scheme for discontinuous end data, too [11, 12]. Convergence for slightly modified end data and less numerical viscosity has been recently studied in [1].
In order to get a fully discrete scheme for WENO3, the differential equation (12) is numerically solved by the three-stage third-order strong-stability-preserving Runge-Kutta method SSPRK3, which offers good stability properties [13, 14, 16, 19]. In the Shu-Osher representation, it reads
The corresponding adjoint time discretization has the form (see e.g. [16])
We note that the adjoint scheme has only order two, which is the upper barrier for three-stage third-order SSPRK methods [16].
In the final experiment, we solve an optimal control problem with a discontinuous target, proposed in [16]. The discrete adjoint \({\varvec{p}}^0\) provides gradient information, which can be directly used to set up the following algorithm:
-
0.
Given a control \({\varvec{u}}_0:={\varvec{u}}^{(j)}\) at iteration j.
-
1.
Compute the discrete adjoint \({\varvec{p}}^0({\varvec{u}}^{(j)})\) and update \({\varvec{u}}^{(j+1)}={\varvec{u}}^{(j)}-\alpha _j{\varvec{p}}^0({\varvec{u}}^{(j)})\) with \(\alpha _j\) such that Armijo’s condition
$$\begin{aligned} J({\varvec{y}}^{n_T}({\varvec{u}}^{(j+1)}),{\varvec{y}}_d) \le J({\varvec{y}}^{n_T}({\varvec{u}}^{(j)}),{\varvec{y}}_d) - \frac{1}{2}\alpha _j \Vert {\varvec{p}}^0({\varvec{u}}^{(j)}) \Vert ^2_{L^2(I)} \end{aligned}$$is fulfilled. If it is not satisfied, choose \(\alpha _j:=0.95\,\alpha _j\) and check the condition again.
-
2.
Stop if \(|J({\varvec{y}}^{n_T}({\varvec{u}}^{(j+1)}),{\varvec{y}}_d)- J({\varvec{y}}^{n_T}({\varvec{u}}^{(j)}),{\varvec{y}}_d)|\le tol\). Otherwise set \(j:=j+1\) and proceed with step 1.
In general, taking the adjoint as a decent direction may increase the complexity of the optimization process due to the production of additional discontinuities [5, 23, 24]. A careful choice of the initial guess \({\varvec{u}}_0\) can remedy this serious problem. We follow the approach proposed in [17] and first solve the conservation law
where \(y_d\) is the target given in (1). The initial guess is then chosen as \(u_0=z(T,-x)\). Formally, as pointed out in [17], (52) is obtained by reverting t and x in (4) and taking \(y_d\) as initial condition. The advantage of this approach is that it delivers a control whose entropy solution is close to the target and the location of the discontinuities almost coincide. Hence, the production of additional discontinuities within each iteration step is avoided, which improves the performance of the algorithm drastically. We will exemplify the influence of the choice of the initial guess in our optimal control problem.
4.1 Order test for the discrete adjoint for smooth data
This section is devoted to numerically verify the third-order convergence of the adjoint WENO3 scheme. For this purpose, we choose the computational domain \(\Omega _T=(0,0.5]\times [-1.5,1.5]\) and the objective functional
with the smooth initial data
The exact solution y(t, x) can be directly computed from the method of characteristics, i.e., \(y(t,x)=u_0(x_0(x(t),t))\) with \(x_0(x(t),t)\) being the solution of the nonlinear equation \(x(t)=x_0+u_0(x_0)\,t\). A reference solution \(y_T\approx y(0.5,x)\) at the final time is computed by Newton’s method with a high tolerance \(10^{-14}\).
Since shocks are not present, we find \(p^T(x)=y(0.5,x)\) in (8). We also note that the characteristics curves of the adjoint problem coincide with the characteristic curves of the forward problem. Thus, the corresponding reversible solution p(0, x) at time \(t\!=\!0\) is given by \(u_0(x)\), which serves as reference solution for the adjoint.
We use a sequence of spatial meshes with a number of grid points \(N=150\cdot 2^i,\;i=0,\ldots ,6,\) and set \(\Delta t=0.5\,\Delta x\). In order to keep the temporal error below \({\mathcal {O}}((\Delta x)^3)\), we apply the classical fourth-order four-stage explicit Runge-Kutta method (ERK4). Its adjoint time discretization has also order four [15] for smooth solutions and therefore the overall scheme is suitable to check the order three of the adjoint WENO3 method. We also present results for the forward WENO3 method to document the error of the approximated starting value \({\varvec{p}}^{n_T}={\varvec{y}}^{n_T}\). The \(L^\infty\)-errors collected in Table 1 clearly show asymptotic order three of the spatial WENO3 discretization for both forward and adjoint numerical solution.
4.2 Approximation of the discrete adjoint in the case of shocks
We now consider discontinuous solutions with shocks. Our test case is taken from Example 2.1 with computational domain \(\Omega _T=(0,0.5]\times [-1,1]\). The reversible solution p(0, x) at \(t\!=\!0\) is given by (11). We apply the above described forward and adjoint LF, EO, and WENO3 schemes with \(\Delta x=0.01,\;0.002\), and \(\Delta t=0.25\,\Delta x\). The corresponding numerical approximations \({\varvec{p}}^0\) are shown in Fig. 2.
The first-order LF and EO schemes smear out the discontinuities, but deliver \(L^\infty\)-stable approximations and thus respect the analytical property of the adjoint. In the spirit of WENO schemes, the adjoint WENO3 delivers a quite sharp resolution of the shocks at the price of bounded over- and undershoots of around \(5\%\). In Table 2, we plot the \(L^\infty\)-error in the shock funnel for \(x\in [-0.3,0.3]\). All schemes converge quite rapidly. Note that convergence in the shock funnel is not always achieved since the interior boundary condition at shock positions as given in (9) does not appear on the discrete level, see the discussions in [1, 11, 12].
4.3 Optimal control problem with discontinuous target
We consider the optimal control problem (1) with the objective functional [16]
and the discontinuous target \(y_d\) defined by
The optimal control \(u_0^\star\), which serves as a reference solution, is
We will present results for two mesh sizes \(\Delta x=0.005,\;0.002\), and time steps \(\Delta t=0.25\,\Delta x\). The initial guess for the control is computed from (52) with the individual method under consideration. For WENO3 and the coarser mesh size, it is shown in Fig. 3 together with the corresponding state solution.
In Fig. 4, the results of the gradient based optimization procedure described above for tolerances \(tol_1=10^{-5}\), \(tol_2=10^{-7}\), and mesh size \(\Delta x=0.005\) for the adjoint WENO3 method are plotted. The value of the objective functional decreases from \(4.75*10^{-4}\) for \(tol_1\) to \(3.18*10^{-4}\) for \(tol_2\), resulting in a better approximation of the target \(y_d\). We can conclude that the adjoint WENO3 method allows to recover the initial data together with the final state solution adequately. The shock of the target is sharply resolved and the rarefaction of the initial data is also recovered. In order to compare these results with those obtained from the LF and EO schemes, we perform 50 iterations of the optimization algorithm for both mesh sizes. The calculated optimal controls and their corresponding final state solutions are collected in Fig. 5. The adjoint WENO3 method resolves the shock sharply. In contrast, the LF method is too diffusive and only provides an unsatisfactory shock resolution. The numerical artifacts around the shocks are huge. The optimized final state solution obtained by the EO scheme possesses very small numerical artifacts, but the shock is less sharply resolved and the spike of the target is slightly smeared out. In Table 3, we depict the iteration history for all runs of the optimization. In every case, the LF method performs poorer than the others. In terms of a low cost functional, the adjoint WENO3 method performs best. We also see the influence of the initial guess on the performance of the algorithm. This is due to the fact that the use of \({\varvec{u}}_0=0\) as starting control value produces artificial discontinuities within each iteration step.
5 Summary
We have developed a novel adjoint WENO3 scheme to provide approximations of the gradient for optimal control problems governed by hyperbolic conservation laws and proved third-order consistency in space for sufficiently smooth solutions. The adjoint WENO3 method is able to sharply resolve discontinuities of reversible solutions. For an exemplary optimal control problem with discontinuous target, the method works very well and outperforms common first-order methods as the Lax–Friedrichs and Engquist–Osher schemes.
Availability of data and material
The data are available on request.
Code availability
All calculations have been done in Matlab.
References
Aguilar, S.P., Schmitt, J.M., Ulbrich, S., Moos, M.: On the numerical discretization of optimal control problems for conservation laws. Control Cybern. 2/2019 (2019)
Banda, M.K., Herty, M.: Adjoint IMEX-based schemes for control problems governed by hyperbolic conservation laws. Comput. Optim. Appl. 51(2), 909–930 (2012)
Bouchut, F., James, F.: One-dimensional transport equations with discontinuous coefficients. Nonlinear Anal. 32(7), 891–933 (1998)
Bressan, A., Marson, A.: A variational calculus for discontinuous solutions of systems of conservation laws. Commun. Partial Differ. Equ. 20(9–10), 1491–1552 (1995)
Castro, C.M., Palacios, F., Zuazua, E.: An alternating descent method for the optimal control of the inviscid Burgers equation in the presence of shocks. Math. Models Methods Appl. Sci. 18(3), 369–416 (2008)
Chertock, A., Herty, M., Kurganov, A.: An Eulerian–Lagrangian method for optimization problems governed by multidimensional nonlinear hyperbolic PDEs. Comput. Optim. Appl. 59(3), 689–724 (2014)
Conway, E.D.: Generalized solutions of linear differential equations with discontinuous coefficients and the uniqueness question for multidimensional quasilinear conservation laws. J. Math. Anal. Appl. 18(2), 238–251 (1967)
Dafermos, C.M.: Generalized characteristics and the structure of solutions of hyperbolic conservation laws. Indiana Univ. Math. J. 26, 1097–1119 (1977)
Frenzel, D.: Weighted Essentially Non-Oscillatory Schemes in Optimal Control Problems Governed by Nonlinear Hyperbolic Conservation Laws. Ph.D. thesis, Technical University of Darmstadt, Department of Mathematics, published in Reihe Mathematik, Verlag Dr. Hut, ISBN 978-3-8439-4454-0 (2020)
Giles, M.: Discrete adjoint approximations with shocks. In: Hou, T.Y., Tadmor, E. (eds.) Hyperbolic Problems: Theory. Numerics, Applications: Proceedings of the Ninth International Conference on Hyperbolic Problems, pp. 185–194. Springer, Berlin, Heidelberg (2003)
Giles, M., Ulbrich, S.: Convergence of linearized and adjoint approximations for discontinuous solutions of conservation laws. Part 1: linearized approximations and linearized output functionals. SIAM J. Numer. Anal. 48(3), 882–904 (2010)
Giles, M., Ulbrich, S.: Convergence of linearized and adjoint approximations for discontinuous solutions of conservation laws. Part 2: adjoint approximations and extensions. SIAM J. Numer. Anal. 48(3), 905–921 (2010)
Gottlieb, S., Ketcheson, D., Shu, C.-W.: Strong Stability Preserving Runge–Kutta and Multistep Time Discretizations. World Scientific Publishing Company (2011)
Gottlieb, S., Shu, C.-W., Tadmor, E.: Strong stability-preserving high-order time discretization methods. SIAM Rev. 43, 89–112 (2001)
Hager, W.W.: Runge–Kutta methods in optimal control and the transformed adjoint system. Numer. Math. 87, 247–282 (2000)
Hajian, S., Hintermüller, M., Ulbrich, S.: Total variation diminishing schemes in optimal control of scalar conservation laws. IMA J. Numer. Anal. 39(1), 105–140 (2017)
Herty, M., Kurganov, A., Kurochkin, D.: Numerical method for optimal control problems governed by nonlinear hyperbolic systems of PDEs. Commun. Math. Sci. 13(1), 15–48 (2015)
Herty, M., Kurganov, A., Kurochkin, D.: On convergence of numerical methods for optimization problems governed by scalar hyperbolic conservation laws. In: Klingenberg, C., Westdickenberg, M. (eds.) Theory. Numerics and Applications of Hyperbolic Problems I, pp. 691–706. Springer, Cham (2018)
Hintermüller, M., Strogies, N.: On the consistency of Runge Kutta methods up to order three applied to the optimal control of scalar conservation laws. Springer Proc. Math. Stat. 235, 119–154 (2018)
Jiang, G.-S., Shu, C.-W.: Efficient implementation of weighted ENO schemes. J. Comput. Phys. 126, 202–228 (1996)
Kružkov, S.N.: First order quasilinear equations with several independent variables. Math. USSR-Sbornik 10(2), 217–243 (1970)
Kurochkin, D.: Numerical Method For Constrained Optimization Problems Governed By Nonlinear Hyperbolic Systems Of PDEs. PhD thesis, University of New Orleans (2015)
Lecaros, R., Zuazua, E.: Tracking control of 1D scalar conservation laws in the presence of shocks. In: Ancona, V., Strickland, E. (eds.) Trends in Contemporary Mathematics, pp. 195–219. Springer, Cham (2014)
Lecaros, R., Zuazua, E.: Control of 2D scalar conservation laws in the presence of shocks. Math. Comput. 85(299), 1183–1224 (2016)
Liu, X.-D., Osher, S., Chan, T.: Weighted essentially non-oscillatory schemes. J. Comput. Phys. 115, 200–212 (1994)
Shu, C.-W.: Essentially non-oscillatory and weighted essentially non-oscillatory schemes for hyperbolic conservation laws. In: Quarteroni, A. (ed.) Advanced Numerical Approximation of Nonlinear Hyperbolic Equations. Lecture Notes in Mathematics, vol. 1697, pp. 325–432. Springer, Berlin (1998)
Shu, C.-W.: Essentially non-oscillatory and weighted essentially non-oscillatory schemes. In: Acta Numerica, vol. 29 , pp. 701–762. Cambridge University Press (2020)
Ulbrich, S.: Optimal Control of Partial Differential Equations (Chemnitz, 1998), Volume 133 of Internat. Ser. Numer. Math., chapter On the existence and approximation of solutions for the optimal control of nonlinear hyperbolic conservation laws, pp. 287–299. Birkhäuser, Basel (1998)
Ulbrich, S.: Optimal Control of Nonlinear Hyperbolic Conservation Laws with Source terms. Technische Universitát München, Germany, Habilitation, Zentrum Mathematik (2001)
Ulbrich, S.: A sensitivity and adjoint calculus for discontinuous solutions of hyperbolic conservation laws with source terms. SIAM J. Control Optim. 41, 740–797 (2002)
Ulbrich, S.: Adjoint-based derivative computation for the optimal control of discontinuous solutions of hyperbolic conservation laws. Syst. Control Lett. 48(3–4), 313–328 (2003)
Funding
Open Access funding enabled and organized by Projekt DEAL. This work was supported by the Graduate School CE within the Centre for Computational Engineering at Technische Universität Darmstadt and by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) within the collaborative research centre TRR154 “Mathematical modeling, simulation and optimisation using the example of gas networks” (Project-ID 239904186, TRR154/2-2018, TP B01).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Frenzel, D., Lang, J. A third-order weighted essentially non-oscillatory scheme in optimal control problems governed by nonlinear hyperbolic conservation laws. Comput Optim Appl 80, 301–320 (2021). https://doi.org/10.1007/s10589-021-00295-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10589-021-00295-2
Keywords
- Nonlinear optimal control
- Discrete adjoints
- Hyperbolic conservation laws
- WENO schemes
- Strong stability preserving Runge–Kutta methods