Abstract
In this paper, we employ the reduced basis method for the efficient and reliable solution of parametrized optimal control problems governed by elliptic partial differential equations. We consider the standard linear-quadratic problem setting with distributed control and two-sided control constraints, which play an important role in many industrial and economical applications. For this problem class, we propose two different reduced basis approximations and associated error estimation procedures. In our first approach, we directly consider the resulting optimality system, introduce suitable reduced basis approximations for the state, adjoint, control, and Lagrange multipliers, and use a projection approach to bound the error in the reduced optimal control. For our second approach, we first reformulate the optimal control problem using two slack variables, we then develop a reduced basis approximation for both slack problems by suitably restricting the solution space, and derive error bounds for the slack based optimal control. We discuss benefits and drawbacks of both approaches and substantiate the comparison by presenting numerical results for a model problem.
Access provided by CONRICYT-eBooks. Download chapter PDF
Similar content being viewed by others
3.1 Introduction
Optimal control problems governed by partial differential equations (PDEs) appear in a wide range of applications in science and engineering, such as heat phenomena, crystal growth, and fluid flow (see, e.g., [4, 8]). Their solution using classical discretization techniques such as finite elements (FE) or finite volumes can be computationally expensive and time-consuming. Often, additional parameters enter the problem, e.g., material or geometry parameters in a design exercise.
Previous work on reduced order methods for optimal control problems considered distributed but unconstrained controls or constrained but scalar controls. Elliptic optimal control problems with distributed control have been considered recently by Negri et al. [10]. The proposed error bound is based on the Banach-Nečas-Babuška (BNB) theory applied to the first order optimality system. The approach thus provides a combined bound for the error in the state, adjoint, and control variable, but it is only applicable to problems without control constraints. Since the bound requires the very costly computation of a lower bound to the inf-sup constant, Negri et al. [11] compute error estimates using a heuristic interpolant surrogate of that constant.
Based on the ideas in Tröltzsch and Volkwein [13], Kärcher and Grepl [6] proposed rigorous and online-efficient control error bounds for reduced basis (RB) approximations of scalar elliptic optimal control problems. These ideas are extended and improved in [7] to distributed control problems.
In a recent paper [1], we employed the RB method as a surrogate model for the solution of distributed and one-sided constrained optimal control problems governed by parametrized elliptic partial differential equations. In this paper we extend this work to two-sided control constraints. After stating the problem in Sect. 3.2 we present the following contributions:
-
In Sect. 3.3 we extend previous work on reduced basis methods for variational inequalities in [1, 3] to the optimal control setting with two-sided control constraints. While we can derive an offline-online decomposable RB optimality system, we are only able to derive a partially offline-online decomposable control error bound that depends on the FE dimension of the control.
-
In Sect. 3.4 we build on the recent work in [1, 14] and propose an RB slack approach for optimal control. We introduce two slack formulations for the optimal control problem, which we obtain by shifting the optimal control by each constraint. We are thus able to derive an offline-online decomposable RB optimality systems and control error bound. The evaluation of this bound is independent of the FE dimension of the problem, but requires the solution of three RB systems.
In Sect. 3.5 we propose a greedy sampling procedure to construct the RB spaces and in Sect. 3.6 we assess the properties of our methods by presenting numerical results for a Graetz flow problem.
3.2 General Problem Statement and Finite Element Discretization
In this section we introduce the parametrized linear-quadratic optimal control problem with elliptic PDE constraint and a constrained distributed control. We introduce a finite element (FE) discretization for the continuous problem and recall the first-order necessary (and in the convex setting sufficient) optimality conditions.
3.2.1 Preliminaries
Let Y e with H 0 1(Ω) ⊂ Y e ⊂ H 1(Ω) be a Hilbert space over the bounded Lipschitz domain \(\varOmega \subset \mathbb{R}^{d},d \in \{ 1,2,3\}\), with boundary Γ.Footnote 1 The inner product and induced norm associated with Y e are given by (⋅ , ⋅ ) Y and \(\Vert \cdot \Vert _{Y } = \sqrt{(\cdot, \cdot )_{Y}}\). We assume that the norm \(\Vert \cdot \Vert _{Y }\) is equivalent to the H 1(Ω)-norm and denote the dual space of Y e by Y e′. We also introduce the control Hilbert space U e = L 2(Ω), together with its inner product (⋅ , ⋅ ) U , induced norm \(\Vert \cdot \Vert _{U} = \sqrt{(\cdot, \cdot )_{U}}\), and associated dual space U e′.Footnote 2 Furthermore, let \(\mathcal{D} \subset \mathbb{R}^{P}\) be a prescribed P-dimensional compact parameter set in which the P-tuple (input) parameter μ = (μ 1, …, μ P ) resides.
We directly consider a FE “truth” approximation for the exact infinite-dimensional optimal control problem. To this end, we define two conforming FE spaces Y ⊂ Y e and U ⊂ U e and denote their dimensions by \(\mathcal{N}_{Y } =\dim (Y )\) and \(\mathcal{N}_{U} =\dim (U)\). We assume that the truth spaces Y and U are sufficiently rich such that the FE solutions guarantee a desired accuracy over \(\mathcal{D}\).
We next introduce the μ-dependent bilinear form \(a(\cdot,\cdot;\mu ): Y \times Y \rightarrow \mathbb{R}\), and shall assume that a(⋅ , ⋅ ; μ) is (1) continuous for all \(\mu \in \mathcal{ D}\) with continuity constant γ a (μ) < ∞ and (2) coercive for all \(\mu \in \mathcal{ D}\) with coercivity constant α a (μ) > 0. Furthermore, we introduce the μ-dependent continuous linear functional \(f(\cdot;\mu ): Y \rightarrow \mathbb{R}\) and the bilinear form \(b(\cdot,\cdot;\mu ): U \times Y \rightarrow \mathbb{R}\) with continuity constant γ b (μ) < ∞.
In anticipation of the optimal control problem, we introduce the parametrized control constraints u a (μ), u b (μ) ∈ U and a desired state y d ∈ D. Here, \(D \subset L^{2}(\varOmega _{\!_{D}}\!)\) is a suitable FE space for the observation subdomain \(\varOmega _{\!_{ D}}\! \subset \varOmega\). Furthermore, we note that the semi-norm \(\vert \cdot \vert _{D}\) for y ∈ L 2(Ω) is defined by \(\vert \cdot \vert _{D} =\Vert \cdot \Vert _{L^{2}(\varOmega _{D})}\).
The involved bilinear and linear forms as well as the control constraint are assumed to depend affinely on the parameter. For example we require for all w, v ∈ Y and all parameters \(\mu \in \mathcal{ D}\) that \(a(w,v;\mu ) =\sum _{ q=1}^{Q_{a}}\varTheta _{a}^{q}(\mu )\,a^{q}(w,v)\) and \(u_{a}(x;\mu ) =\sum _{ q=1}^{Q_{ua}}\varTheta _{ua}^{q}(\mu )\,u_{a}^{q}(x)\) for some (preferably) small integers Q a and Q ua . Here, the coefficient functions \(\varTheta _{\bullet }^{q}(\cdot ):\mathcal{ D} \rightarrow \mathbb{R}\) are continuous and depend on μ, whereas the continuous bilinear and linear forms, e.g., a q(⋅ , ⋅ ) and u a q ∈ U do not depend on μ. Although we choose y d (x) to be parameter-independent, our approach directly extends to an affinely parameter-dependent y d (x; μ) (see Kärcher et al. [7]).
For the development of a posteriori error bounds we also require additional ingredients. We assume that we are given a positive lower bound \(\alpha _{a}^{\mathrm{LB}}(\mu ):\mathcal{ D} \rightarrow \mathbb{R}_{+}\) for the coercivity constant α a (μ) of a(⋅ , ⋅ ; μ) such that \(\alpha _{a}^{\mathrm{LB}}(\mu ) \leq \alpha _{a}(\mu )\ \forall \mu \in \mathcal{ D}\). Furthermore, we assume that we have upper bounds available for the constant \(C_{D}^{\mathrm{UB}} \geq C_{D} =\sup _{w\in Y \setminus \{0\}}\frac{\vert w\vert _{D}} {\Vert w\Vert _{Y }} \geq 0\ \forall \mu \in \mathcal{ D},\) and the continuity constant of the bilinear form b(⋅ , ⋅ ; μ): \(\gamma _{b}^{\mathrm{UB}}(\mu ) \geq \gamma _{b}(\mu )\ \forall \mu \in \mathcal{ D}.\) Here, the constant C D depends on the parameter, since later we use \(\vert \cdot \vert _{D} =\Vert \cdot \Vert _{L^{2}(\varOmega _{D}(\mu ))}\) (see Sect. 3.6). In our setting, it is possible to compute these constants (or their bounds) efficiently using an offline-online procedure (see [7, 12]).
3.2.2 Abstract Formulation of Linear-Quadratic Optimal Control Problems and the First-Order Optimality Conditions
We consider the following FE optimal control problem in weak form with u a (μ) < u b (μ)
where U +: = { ρ ∈ U; ρ ≥ 0 almost everywhere} and we dropped the μ-dependence of the state and control \((\hat{y},\hat{u})\) for the sake of readability. We note that the last line of (P) is equivalent to \(\hat{u}\) being in the convex admissible set
In the following we call problem (P) the “primal” problem, for which the existence and uniqueness of the solution is standard (see, e.g., [4]). The derivation of the necessary and sufficient first-order optimality system is straightforward: Given \(\mu \in \mathcal{ D}\), the optimal solution (y, p, u, σ, σ b ) ∈ Y × Y × U × U × U satisfies
Note that we follow a first-discretize-then-optimize approach here, for a more detailed discussion see [4, Sect. 3.2.4]).
In the following we comment on the FE-setting in this paper. We assume that the state variable is discretized by P 1, i.e., continuous and piecewise linear, and the control variable by P 0, i.e., piecewise constant finite elements. Next, we introduce two bases for the FE spaces Y and U, such that
where \(\phi _{i}^{y} \geq 0,\;i = 1,\ldots,\mathcal{N}_{Y }\), and \(\phi _{i}^{u} \geq 0,\;i = 1,\ldots,\mathcal{N}_{U}\), are the usual hat and bar basis functions. Using these basis functions we can express the functions y, p ∈ Y and u, σ, σ b ∈ U as, e.g., \(y =\sum _{ i=1}^{\mathcal{N}_{Y }}y_{i}\phi _{i}^{y}\). The corresponding FE coefficient vectors are given by, e.g., \(\underline{y} = (y_{1},\ldots,y_{\mathcal{N}_{Y }})^{T} \in \mathbb{R}^{\mathcal{N}_{Y }}\). Note that by definition of U + and since ϕ i u ≥ 0, the condition ρ ∈ U + in (3.2d) and (3.2e) translates into the condition ρ ≥ 0 for the corresponding coefficient vector. Further, we also introduce the control mass matrix M U with entries (M U ) ij = (ϕ i u, ϕ j u) U , which is for a P 0 control discretization a positive diagonal matrix. Hence the point-wise and the ‘weak’ (averaged) constraint formulations are equivalent \(u(x) \geq u_{a}(x;\mu )\, \Leftrightarrow (u,\rho )_{U} \geq (u_{a}(\mu ),\rho )_{U}\ \forall \rho \in U^{+}\). However, this is in general not true for other control discretizations, e.g., P 1.
Based on the truth FE primal problem (P) we derive an RB primal problem (P N ) and a rigorous a posteriori error bound for the error between the truth and RB control approximation in Theorem 1.
3.3 Reduced Basis Method for the Primal Problem
3.3.1 Reduced Basis Approximation
To begin, we define the RB spaces Y N ⊂ Y, U N , Σ N , Σ b, N ⊂ U as well as the convex cones Σ N + ⊂ U +, Σ b, N + ⊂ U + as follows: given N parameter samples μ 1, …, μ N, we set
where we assume that the basis functions, \(\zeta _{1}^{\bullet },\ldots \zeta _{N_{\bullet }}^{\bullet }\), are linearly independent and span+{⋅ } indicates the cone spanned by non-negative combinations of the elements, i.e.
Note that we employ integrated spaces for the state and adjoint as well as for the control (see Remarks 1 and 2). For the spaces Y N and U N we additionally assume that the basis functions are orthogonal, i.e., (ζ i y, ζ j y) Y = δ ij and (ζ i u, ζ j u) U = δ ij , where δ ij is the Kronecker delta. This orthogonality is favorable to keep the condition of the RB algebraic linear systems small [12]. In addition, we do not orthogonalize the basis \(\zeta _{i}^{\sigma },\,\zeta _{i}^{\sigma _{b}}\) of the cones Σ N +, Σ b, N + ⊂ U +, because this non-negativity is used in the definition of the reduced problem (P N ) of (P).Footnote 3 Although the conditions ζ i σ ∈ Σ N + and \(\zeta _{i}^{\sigma _{b}} \in \varSigma _{ b,N}^{+}\) appear to be much more restrictive than \(\zeta _{i}^{\sigma },\,\zeta _{i}^{\sigma _{b}} \in U^{+}\), we observe in numerical tests (not shown) that the RB approximations converge to the FE solutions with a similar rate as the control approximations. In addition, the RB approximations are comparable to the best possible approximations derived by projecting σ to Σ N or Σ N +, analogously for σ b . We describe the greedy sampling approach to construct the RB spaces in Sect. 3.5. Next, given the RB spaces in (3.3) we derive the RB primal problem
The last line of (P N ) defines the admissible set for u N : \(U_{\mathrm{ad},N} =\{\psi \in U_{N};(u_{a}(\mu ),\rho )_{U} \leq (\psi,\rho )_{U}\ \forall \rho \in \varSigma _{N}^{+},\ (u_{b}(\mu ),\rho )_{U} \geq (\psi,\rho )_{U}\ \forall \rho \in \varSigma _{b,N}^{+}\}\), which is in general not a subset of U ad in (3.1). Analogously to the primal problem (P) we obtain the RB optimality system: Given \(\mu \in \mathcal{ D}\), the optimal solution (y N , p N , u N , σ N , σ b, N ) ∈ Y N × Y N × U N ×Σ N ×Σ b, N satisfies
Remark 1 (Existence, Uniqueness, Integrated Space Y N )
Since (P N ) is a linear-quadratic optimal control problem over the closed convex admissible set U ad,N , the existence and uniqueness of the RB optimal control u N follows from standard arguments (see, e.g., [4, Theorem 1.43]). Also note that we use a single “integrated” reduced basis trial and test space Y N for the state and adjoint equations as one ingredient to ensure stability of the system (3.4), see e.g. Kärcher [5].
Remark 2 (Stability, Integrated Space U N )
For the stability of the RB solutions we need to show that the RB inf-sup constants
are bounded away from zero. We guarantee that β N , β b, N ≥ β > 0 by enriching the RB control space with suitable supremizers [9]. Here, these supremizers are just the multiplier snapshots σ(μ n), σ b (μ n), 1 ≤ n ≤ N; we thus have β N = β b, N = β = 1.
3.3.2 Primal Error Bound
We next propose an a posteriori error bound for the optimal control. The bound is based on [1], which uses an (1) RB approach for variational inequalities of the first kind [3], and (2) an RB approach for optimal control problems with a PDE constraint [7]. Before stating the main result, we define the following approximation errors (omitting μ-dependencies) of the RB primal system (3.4)
as well as the residuals in the next definition.
Definition 1 (Residuals)
The residuals of the state equation, the adjoint equation w.r.t. (3.2a)– (3.2c) are defined for all \(\mu \in \mathcal{ D}\) by
Theorem 1 (Primal Error Bound)
Let u and u N be the optimal controls of the FE primal problem (P) and of the RB primal problem ( P N ), respectively. Then the error in the optimal control satisfies for any given parameter \(\mu \in \mathcal{ D}\)
where \(\varDelta _{N}^{\mathrm{pr}}(\mu ):= c_{1}(\mu ) + \sqrt{c_{1 } (\mu )^{2 } + c_{2 } (\mu )}\) with nonnegative coefficients
and \(\delta _{1}\! =\!\Vert [u_{a} -u_{N}]_{+}\Vert _{U}\!,\ \delta _{2}\! =\! ([u_{a} -u_{N}]_{+},\sigma _{N})_{U}\!,\ \delta _{1b}\! =\!\Vert [u_{N} -u_{b}]_{+}\Vert _{U}\!,\ \delta _{2b}\! =\! ([u_{N} -u_{b}]_{+},\sigma _{b,N})_{U}\!\) .
Here [⋅ ]+ = max(⋅ , 0) denotes the positive part (a.e.). Note that we sometimes use r • instead of r •(⋅ ; μ) and omit the μ-dependencies on the r.h.s. of (3.6) for a better readability.
Proof
This proof follows the proof of the primal error bound from [1]. Since the FE optimal solution (y, p, u, σ, σ b ) satisfies the optimality conditions (3.2), we obtain the following error-residual equations:
From (3.7a) with ϕ = e y , (3.7b) with φ = e p , and α a LB(μ) ≤ α a (μ) we infer that
Choosing ϕ = e p , φ = e y , ψ = e u in (3.7), adding (3.7b) and (3.7c), and subtracting (3.7a) results in
Next, we bound (e σ , e u ) U and \(-(e_{\sigma _{b}},e_{u})_{U}\). We first consider (e σ , e u ) U and note that
where, except for the second term, all terms are nonpositive, see (3.2d) and (3.4d). Hence
Analogously, we bound \(-(e_{\sigma _{b}},e_{u})_{U} \leq \Vert e_{\sigma _{b}}\Vert _{U}\;\delta _{1b} +\delta _{2b}\). Most significantly, it remains to bound the terms \(\Vert e_{\sigma }\Vert _{U}\) and \(\Vert e_{\sigma _{b}}\Vert _{U}\), which we achieve in two steps: First, we relate \(\Vert e_{\sigma }\Vert _{U}\) and \(\Vert e_{\sigma _{b}}\Vert _{U}\) with \(\Vert e_{\sigma } - e_{\sigma _{b}}\Vert _{U}\) by
If we employ (σ, σ b ) U = 0, (σ N , σ b ) U ≥ 0, and (σ, σ b, N ) U ≥ 0, we obtain
Second, we focus on the optimality residual (3.7c), use the inf-sup stability of (⋅ , ⋅ ) U and (3.8) to derive \(\Vert e_{\sigma } - e_{\sigma _{b}}\Vert _{U} \leq \Vert r_{u}\Vert _{U'} +\lambda \Vert e_{u}\Vert _{U} + \frac{\gamma _{b}^{\mathrm{UB}}} {\alpha _{a}^{\mathrm{LB}}} \left (\Vert r_{p}\Vert _{Y '} + C_{D}^{\mathrm{UB}}\vert e_{y}\vert _{D}\right ).\) Next, we employ the inequalities (3.8) and (3.11) in (3.9) to obtain
It thus follows from applying Young’s inequality to the \(\vert e_{y}\vert _{D}\)-terms in (3.12) that
where c 1(μ) and c 2(μ) are given in (3.6). Solving the last inequality for the larger root yields \(\Vert e_{u}\Vert _{U} \leq c_{1}(\mu ) + \sqrt{c_{1 } (\mu )^{2 } + c_{2 } (\mu )} =\varDelta _{ N}^{\mathrm{pr}}(\mu )\).
We note that most of the ingredients of the primal error bound Δ N pr(μ) introduced in Theorem 1 are standard, i.e., the dual norms of state, adjoint, and control residuals, as well as coercivity and continuity constants or rather their lower and upper bounds [7, 12]. The only non-standard terms are δ 1, δ 2, δ 1b and δ 2b , which measure the constraint-violation of the RB optimal control u N . As a result, the online computational cost to evaluate δ •—and hence the error bound Δ N pr(μ)—depends on the FE control dimension \(\mathcal{N}_{U}\), requiring \(\mathcal{O}((N_{U} + N_{\sigma } + N_{\sigma _{b}})\mathcal{N}_{U})\) operations.
3.4 Slack Problem and the Primal-Slack Error Bound
In this section we introduce a reformulation of the original primal problem by means of a slack variable. We extend the ideas presented for the one-sided control-constrained problem in [1] to the two-sided control-constrained problem. First, we reformulate the original optimization problem (P) by replacing the control variable with a slack variable that depends on one of the two constraints u a (μ) or u b (μ). Second, we use snapshots of the slack variable to construct an associated convex cone, leading to strictly feasible approximations w.r.t. either the lower or upper constraint. Third, we derive two RB slack problems by restricting the RB-slack coefficients to a convex cone. And finally, we propose an a posteriori \(\mathcal{N}\)-independent error bound for RB slack approximation w.r.t. either the lower or upper constraint in Theorem 2.
3.4.1 FE and RB Slack Problem
We consider the FE optimization problem (P) and introduce the slack variable s ∈ U + given by
together with the corresponding FE coefficient vector s = u − u a (μ), where we state the slack variable w.r.t. u a (μ). Here, we again omit the explicit dependence of u and s on the parameter μ. We note that, by construction, the feasibility of u w.r.t. u a (μ) is equivalent to M U s ≥ 0, which in turn is equivalent to s ≥ 0, if we are using P 0 elements.
If we substitute u by s + u a (μ) in (P), we obtain the “slack” optimization problem
Analogously, we define a slack variable s b = u b (μ) −u w.r.t. u b (μ) and recast (P) w.r.t. s b . We do not state this minimization problem explicitly since it is analogous to (S).
In the following we derive two RB slack problems w.r.t. u a (μ) and u b (μ). We start with the former and reuse the RB space Y N , introduced in Sect. 3.3.1, for the state and adjoint variables. Furthermore, for the RB approximation of the slack variable s we simply introduce an RB slack space S N and a convex cone S N + by shifting the control snapshots of (P) with the control constraint u a (μ)
We assume that the snapshots \(\zeta _{1}^{s},\ldots,\zeta _{N_{s}}^{s}\) are linearly independent and not orthogonalized. Further, we need to consider a Lagrange multiplier σ b s ∈ U + for the constraint u b (μ) by incorporating the RB space Σ b, N ⊂ U, as well as the convex cone Σ b, N + from (3.3d) and (3.3e). Overall, for an RB approximation s N ∈ S N + ⊂ U + of s, we have s N ≥ 0. From the definition of the slack variable s = u − u a (μ), see (3.13), we derive the control approximation u s: = s N + u a (μ) that satisfies u s ≥ u a (μ). However, we can not conclude u s ≤ u b (μ) since the slack approximation s N is constructed—as the slack variable s in (3.13)—using information from u a (μ) but not u b (μ).
Overall, employing the RB spaces in (S) results in the RB slack problem
As in the RB primal problem (P N ), the existence and uniqueness of the RB optimal control follows from the same arguments as in the Remark 1. Next, we derive the optimality conditions for s N ; however, we here follow the ‘first-discretize-then-optimize’ approach that will eventually lead to a feasible—w.r.t. u a (μ)—approximation of the control. We perform two steps.
First, we use the RB-representations of y N s, s N with their RB-coefficient vectors y N s, s N to discretize (s N ). Since the algebraic RB slack problem is simple to derive, we only state the main crucial condition that s N ≥ 0. Next, we derive the first-order optimality conditions. We introduce a discrete Lagrange multiplier \(\hat{\underline{\omega }}_{N} \in \mathbb{R}^{N_{S}},\ \hat{\underline{\omega }}_{N} \geq 0\), ensuring the non-negativeness of \(\hat{\underline{s}}_{N}\) and derive the following necessary (and here sufficient) first-order optimality system: Given \(\mu \in \mathcal{ D}\), the optimal RB slack solution coefficients \((\underline{y}_{N}^{s},\underline{s}_{N},\underline{p}_{N}^{s},\underline{\sigma }_{b,N}^{s},\underline{\omega }_{N}) \in \mathbb{R}^{N_{Y }} \times \mathbb{R}^{N_{Y }} \times \mathbb{R}^{N_{S}} \times \mathbb{R}^{N_{\sigma _{b}}} \times \mathbb{R}^{N_{S}}\) satisfy (omitting all μ-dependencies)
where the reduced basis matrices and vectors are given by
and 1 ≤ i, j ≤ N • [see (3.3) and (3.14)].
Second, by solving (3.15) we have s N ≥ 0 and through the definition of s we obtain a feasible—w.r.t. u a (μ)—approximation for the control by u s = s N + u a (μ). In order to derive an error bound for \(\Vert u - u^{s}\Vert _{U}\) we need, however, to analogously repeat the RB reduction for the second RB slack problem with s b = u b (μ) − u. There we likewise introduce the RB space S b, N , as well as its convex cone S b, N + and follow the previous steps to obtain s b, N ≥ 0. Using this, we obtain a control approximation \(u^{s_{b}} = u_{ b}(\mu ) - s_{b,N}\) that is feasible w.r.t. the constraint u b (μ).
3.4.2 Primal-Slack Error Bound
In the following we will focus on the primal-slack error bound for \(\Vert u - u^{s}\Vert _{U}\) w.r.t. u a (μ). Similarly to the primal error bound in Theorem 1 we use residuals and properties of the bilinear and linear forms to derive a quadratic inequality in \(\Vert u - u^{s}\Vert _{U}\). We consider the following RB primal-slack approximation (y N s, p N s, u s, σ N , σ b, N ) ∈ Y N × Y N × U ad ×Σ N + ×Σ b, N +, which depends on the solutions of the RB primal and slack problem. We define the corresponding errors e y s = y − y N s, e p s = p − p N s, e u s = u − u s. Further, we revisit Definition 1 and insert on the r.h.s. of (3.5) the approximation (y N s, p N s, u s, σ N , σ b, N ) to obtain on the l.h.s. the residuals r y s, r p s, r u s.
We state the main result in the following theorem.
Theorem 2 (Primal-Slack Error Bound)
Let u, s N , and s b, N be the optimal solutions of the FE primal problem (P) and the RB slack problems ( s N ) and its equivalent w.r.t. u b (μ), respectively. Then the error in the optimal control satisfies for all parameters \(\mu \in \mathcal{ D}\)
where \(\varDelta _{N}^{\mathrm{pr-sl}}(\mu ):= c_{1}^{s}(\mu ) + \sqrt{c_{1 }^{s }(\mu )^{2 } + c_{2 }^{s }(\mu )}\) with nonnegative coefficients
Proof
Let the FE primal solution (y, p, u, σ, σ b ) satisfy the optimality conditions (3.2). We follow the proof of Theorem 1, and derive analogously to (3.9) the inequality
We first focus on (e σ , e u s) U and exploit the feasibility of u s w.r.t. u a (μ). Again we have (e σ , e u s) U = −(σ, u a (μ) − u) U − (σ, s N ) U + (σ N , u a (μ) − u) U + (σ N , s N ) U , where the first three terms are non-positive and hence (e σ , e u s) U ≤ (σ N , s N ) U . In order to bound \(-(e_{\sigma _{b}},e_{u}^{s})_{U}\), we need to solve the second RB slack problem for \(u^{s_{b}}\) and derive
We restate that \(\Vert e_{\sigma _{b}}\Vert _{U} \leq \Vert e_{\sigma } - e_{\sigma _{b}}\Vert _{U} + \sqrt{2(\sigma _{N },\sigma _{b,N } )_{U}}\) and \(\Vert e_{\sigma } - e_{\sigma _{b}}\Vert _{U}\) is bounded by \(\Vert e_{\sigma } - e_{\sigma _{b}}\Vert _{U} \leq \Vert r_{u}^{s}\Vert _{U'} +\lambda \Vert e_{u}^{s}\Vert _{U} + \frac{\gamma _{b}^{\mathrm{UB}}} {\alpha _{a}^{\mathrm{LB}}} \left (\Vert r_{p}^{s}\Vert _{Y '} + C_{D}^{\mathrm{UB}}\vert e_{y}^{s}\vert _{D}\right ).\) Using the bounds for \((e_{\sigma },e_{u}^{s})_{U} - (e_{\sigma _{b}},e_{u}^{s})_{U}\) and the inequalities (3.8) in (3.17) we obtain
It thus follows from employing Young’s inequality to the \(\vert e_{y}^{s}\vert _{D}\)-term that \(\Vert e_{u}^{s}\Vert _{U}^{2} - 2c_{1}^{s}(\mu )\Vert e_{u}^{s}\Vert _{U} - c_{2}^{s}(\mu ) \leq 0\), where c 1 s(μ) and c 2 s(μ) are given in (3.16). Solving the last inequality for the larger root yields \(\Vert e_{u}^{s}\Vert _{U} \leq c_{1}^{s}(\mu ) + \sqrt{c_{1 }^{s }(\mu )^{2 } + c_{2 }^{s }(\mu )} =\varDelta _{ N}^{\mathrm{pr-sl}}(\mu )\).
3.5 Greedy Sampling Procedure
The reduced basis spaces for the two-sided control-constrained optimal control problem in Sects. 3.3.1, and 3.4.1 are constructed using the greedy sampling procedure outlined in Algorithm 1. Suppose \(\varXi _{\mathrm{train}} \subset \mathcal{ D}\) is a finite but suitably large parameter train sample, μ 1 ∈ Ξ train is the initial parameter value, N max the maximum number of greedy iterations, ε tol,min > 0 is a prescribed desired error tolerance, and \(\varDelta _{N}^{\bullet }(\mu )/\Vert u_{N}^{\bullet }(\mu )\Vert _{U}\), • ∈ {pr, pr−sl}, is the primal or primal-slack error bound from (3.6) or (3.16) with u N • ∈ {u N , u s}.
Algorithm 1 Greedy sampling procedure
We make two remarks: First, by using the bounds Δ N •(μ), • ∈ {pr, pr−sl} we only refer to the bounds derived for the primal error \(\Vert u - u_{N}\Vert _{U}\) and the slack error \(\Vert u - u^{s}\Vert _{U}\) w.r.t. to u a (μ). Therefore, using the primal-slack bound Δ N pr−sl(μ) in the greedy sampling procedure, we expect not only to construct an accurate RB space S N for s N but also an accurate RB space S b, N for s b, N . Second, we comment on two special cases: (1) if one control constraint is fully active in each greedy step, i.e. we have, e.g., u(μ n) = u a , n = 1, …, N, we set S N = {} and s N = 0 (analogously for u b we set S b, N = {} and s b, N = 0); and (2) if the control constraint is never active, i.e., for all snapshots σ(μ n) = σ b (μ n) = 0, n = 1, …, N, we set Σ N = Σ N + = Σ b, N = Σ b, N + = {} and σ N = σ b, N = 0.
3.6 Numerical Results: Graetz Flow with Parametrized Geometry and Lower and Upper Control Constraints
We consider a Graetz flow problem, which describes a heat convection and conduction in a duct. The main goal of this example is to demonstrate the different properties of the approximations and their error bounds. The problem is parametrized by a varying Péclet number μ 1 ∈ [5, 18] and a geometry parameter μ 2 ∈ [0. 8, 1. 2]. Hence, the parameter domain is \(\mathcal{D} = [5,18] \times [0.8,1.2]\). The parametrized geometry is given by Ω(μ) = [0, 1. 5 + μ 2] × [0, 1] and is subdivided into three subdomains Ω 1(μ) = [0. 2μ 2, 0. 8μ 2] × [0. 3, 0. 7], Ω 2(μ) = [μ 2 + 0. 2, μ 2 + 1. 5] × [0. 3, 0. 7], and Ω 3(μ) = Ω(μ)∖{Ω 1(μ) ∪Ω 2(μ)}. A sketch of the domain is shown in Fig. 3.1. We impose boundary condition of homogeneous Neumann and of non-homogeneous Dirichlet type: y n = 0 on Γ N (μ), and y = 1 on Γ D (μ). Thus the trial space is given by \(Y (\mu ) \subset Y _{\mathrm{e}}(\mu ) =\{ v \in H^{1}(\varOmega (\mu ));v\vert _{\varGamma _{D}(\mu )} = 1\}\). The amount of heat supply in the whole domain Ω(μ) is regulated by the distributed control u ∈ U(μ) ⊂ U e(μ) = L 2(Ω(μ)) and bounded by the lower and upper constraints u a = −0. 5 and u b = 1. 25. The observation domain is \(\varOmega _{\!_{D}}\!(\mu ) =\varOmega _{1}(\mu ) \cup \varOmega _{2}(\mu )\) and the desired state is given by y d = 0. 5 on Ω 1(μ) and y d = 2 on Ω 2(μ).
Overall, the parametrized optimal control problem is given by
for the given parabolic velocity field β(x) = (x 2(1 − x 2), 0)T. The regularization parameter λ is fixed to 0. 01.
After recasting the problem to a reference domain Ω = Ω(μ ref) = [0, 2. 5] × [0, 1] for μ ref = (5, 1), and introducing suitable lifting functions that take into account the non-homogeneous Dirichlet boundary conditions, we can reformulate the problem in terms of the parameter-independent FE space Y ⊂ Y e = H 0 1(Ω) and U ⊂ U e = L 2(Ω) [12]. We then obtain the affine representation of all involved quantities with Q a = Q f = 4, Q b = Q d = Q u = Q yd = 2, and Q ua = 1. The details of these calculations are very similar to the details presented by Rozza et al. [12] and Kärcher [5], and are thus omitted. The inner product for the state space is given by \((w,v)_{Y } = \frac{1} {\mu _{1}^{\mathrm{ref}}} \int _{\varOmega }\nabla w \cdot \nabla v\;\mathrm{d}x + \frac{1} {2}(\int _{\varOmega }\beta (x) \cdot \nabla w\,v\;\mathrm{d}x +\int _{\varOmega }\beta (x) \cdot \nabla v\,w\;\mathrm{d}x)\) and we obtain a lower bound α a LB(μ) for the coercivity constant by the so-called min-theta approach [12]. Note that for the control space we obtain a parameter-dependent inner product (⋅ , ⋅ ) U(μ) from the affine geometry parametrization. Hence the control error is measured in the parameter-dependent energy norm \(\Vert \cdot \Vert _{U(\mu )}\). The derivations of the primal and primal-slack error bounds remain the same in this case and they bound the control error in the energy norm.
Although the introduction of a domain parametrization seems to add an entirely new μ-dependence to the primal and the slack problems (P) and (S), the reductions and the error bound derivations can be analogously derived w.r.t. (⋅ , ⋅ ) U(μ) instead of \((\cdot,\cdot )_{U(\mu ^{\mathrm{ref}})}\). Also the definition of the integrated space U N in (3.3) remains, while in the inf-sup condition of Remark 2 we use (⋅ , ⋅ ) U(μ) instead of (⋅ , ⋅ ) U .
We choose a P 1 discretization for the state and adjoint, and a P 0 discretization for the control to obtain \(\dim (Y ) =\mathcal{ N}_{Y } \approx 11,000\) and \(\dim (U) =\mathcal{ N}_{U} \approx 22,000\). The chosen discretization induces a discretization error of roughly 2%. In Fig. 3.2 we present control snapshots and associated active sets for two different parameters displayed on the reference domain Ω(μ 2 ref = 1). We observe strongly varying control solutions and active sets.
We construct the RB spaces using the greedy procedure described in Algorithm 1 by employing an equidistant train sample \(\varXi _{\mathrm{train}} \subset \mathcal{ D}\) of size 30 ⋅ 30 = 900 (log-scale in μ 1 and lin-scale in μ 2) and stop the greedy enrichment after 30 steps. We also introduce a test sample with 10 ⋅ 5 (log × lin) equidistant parameter points in \([5.2,17.5] \times [0.82,1.17] \subset \mathcal{ D}\).
In Fig. 3.3 we present, as a function of N, the resulting energy norm errors and bounds over Ξ test. Here, the errors and bounds are defined as follows: the primal-slack bound is the maximum of \(\varDelta _{N}^{\mathrm{pr-sl}}(\mu )/\Vert u(\mu )\Vert _{U(\mu )}\) over Ξ test, the primal bound is the maximum of \(\varDelta _{N}^{\mathrm{pr}}(\mu )/\Vert u(\mu )\Vert _{U(\mu )}\) over Ξ test, and the u s and u N errors are the maxima of \(\Vert u(\mu ) - u^{s}(\mu )\Vert _{U(\mu )}/\Vert u(\mu )\Vert _{U(\mu )}\) and \(\Vert u(\mu ) - u_{N}(\mu )\Vert _{U(\mu )}/\Vert u(\mu )\Vert _{U(\mu )}\) over Ξ test, respectively. We observe that both errors and both bounds decay very similarly. Quantitatively, the error bounds are comparable throughout all N, since the dominating primal-slack terms \(\Vert r_{u}^{s}\Vert _{U'(\mu )}\) and \(\lambda \Vert u^{s} - u^{s_{b}}\Vert _{ U(\mu )}\) are comparable to the dominating primal terms \(\Vert r_{u}\Vert _{U'(\mu )}\) and λ(δ 1 + δ 1b ), resulting in Δ N pr(μ) ≈ Δ N pr−sl(μ).
We briefly report the computational timings: the solution of the FE optimization problem takes ≈ 4 s (for a discretization error of 2%). The RB primal problem, for N = 25, is solved in ≈ 0. 066 s and the RB slack problem is solved faster in ≈ 0. 029 s, since dim(S N ) = 25. We turn to the evaluation of error bounds: the primal bound takes 0. 01 s, whereas the primal-slack bound, given σ N , σ b, N , takes 0. 0065 s. From this we can conclude that for N = 25 the overall cost for one primal bound evaluation is roughly 0. 076 s = 0. 066 s + 0. 01 s and for the primal-slack bound evaluation is roughly 0. 13 ≈ 0. 029 + 0. 029 + 0. 066 + 0. 0065 s, since it relies on three RB solutions.
3.7 Conclusions
In this paper we extended the ideas from [1] to propose two certified reduced basis approaches for distributed elliptic optimal control problems with two-sided control constraints: a primal and a primal-slack approach. Albeit the reduction for the primal approach was straightforward, the primal-slack approach needed more consideration. We proposed for each constraint a corresponding RB slack problem with an additional Lagrange multiplier. The primal a posteriori error bound from [1] could be extended for the two-sided case by special properties of the Lagrange multipliers of the two-sided problem. The primal-slack error bound also relies on these properties and in addition uses three RB solutions to derive an \(\mathcal{N}\)-independent error bound. Both the primal and slack RB approximation can be evaluated efficiently using the standard offline-online decomposition. However, on the one hand the primal error bound depends on the FE control dimension and on the other hand the primal-slack error bound relies on three reduced order optimization problems.
Notes
- 1.
The subscript “e” denotes the “exact” infinite-dimensional continuous problem setting.
- 2.
The framework of this work directly extends to Neumann boundary controls U e = L 2(∂Ω) or finite dimensional controls \(U_{\mathrm{e}} = \mathbb{R}^{m}\). Also distributed controls on a subdomain Ω U ⊂ Ω or Neumann boundary controls on a boundary segment Γ U ⊂ ∂Ω are possible.
- 3.
Alternative methods to deal with the non-negativity can be found in [2].
References
Bader, E., Kärcher, M., Grepl, M.A., Veroy, K.: Certified reduced basis methods for parametrized distributed optimal control problems with control constraints. SIAM J. Sci. Comput. 38(6), A3921–A3946 (2016)
Balajewicz, M., David, A., Farhat, C.: Projection-based model reduction for contact problems. Int. J. Numer. Methods Eng. 106(8), 644–663 (2016)
Haasdonk, B., Salomon, J., Wohlmuth, B.: A reduced basis method for parametrized variational inequalities. SIAM J. Numer. Anal. 50(5), 2656–2676 (2012)
Hinze, M., Pinnau, R., Ulbrich, M., Ulbrich, S.: Optimization with PDE Constraints. Mathematical Modelling: Theory and Applications, vol. 23. Springer, Berlin (2009)
Kärcher, M.: Certified reduced basis methods for parametrized PDE-constrained optimization problems. PhD thesis, RWTH Aachen University (2016)
Kärcher, M., Grepl, M.A.: A certified reduced basis method for parametrized elliptic optimal control problems. ESAIM Control Optim. Calculus Var. 20(2), 416–441 (2013)
Kärcher, M., Grepl, M.A., Veroy, K.: Certified reduced basis methods for parameterized distributed optimal control problems. Technical report (2014)
Lions, J.L.: Optimal Control of Systems Governed by Partial Differential Equations. Springer, New York (1971)
Maday, Y., Patera, A.T., Rovas, D.V.: A blackbox reduced-basis output bound method for noncoercive linear problems. In: Cioranescu, D., Lions, J.L. (eds.) Studies in Mathematics and Its Applications, vol. 31, pp. 533–569. Elsevier Science B.V., Amsterdam (2002)
Negri, F., Rozza, G., Manzoni, A., Quarteroni, A.: Reduced basis method for parametrized elliptic optimal control problems. SIAM J. Sci. Comput. 35(5), A2316–A2340 (2013)
Negri, F., Manzoni, A., Rozza, G.: Reduced basis approximation of parametrized optimal flow control problems for the stokes equations. Comput. Math. Appl. 69(4), 319–336 (2015)
Rozza, G., Huynh, D.B.P., Patera, A.T.: Reduced basis approximation and a posteriori error estimation for affinely parametrized elliptic coercive partial differential equations. Arch. Comput. Methods Eng. 15(3), 229–275 (2008)
Tröltzsch, F., Volkwein, S.: POD a-posteriori error estimates for linear-quadratic optimal control problems. Comput. Optim. Appl. 44, 83–115 (2009)
Zhang, Z., Bader, E., Veroy, K.: A slack approach to reduced-basis approximation and error estimation for variational inequalities. C.R. Math. 354(3), 283–289 (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this chapter
Cite this chapter
Bader, E., Grepl, M.A., Veroy, K. (2017). A Certified Reduced Basis Approach for Parametrized Optimal Control Problems with Two-Sided Control Constraints. In: Benner, P., Ohlberger, M., Patera, A., Rozza, G., Urban, K. (eds) Model Reduction of Parametrized Systems. MS&A, vol 17. Springer, Cham. https://doi.org/10.1007/978-3-319-58786-8_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-58786-8_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-58785-1
Online ISBN: 978-3-319-58786-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)