Introduction

Model predictive control (MPC) needs to solve at each sampling instant an optimal control problem with the current system state \(\bar{x}_{0}\) as initial value. MPC optimization is almost exclusively based on the so-called direct approach, which first discretizes the continuous time system to obtain a discrete time optimal control problem (OCP). This OCP has as optimization variables a state trajectory \(X = [x_{0}^{\top },\ldots,x_{N}^{\top }]^{\top }\) with \(x_{i} \in \mathbb{R}^{n_{x}}\) for \(i = 0,\ldots,N\) and a control trajectory \(U = [u_{0}^{\top },\ldots,u_{N-1}^{\top }]^{\top }\) with \(u_{i} \in \mathbb{R}^{n_{u}}\) for \(i = 0,\ldots,N - 1\). For simplicity of presentation, we restrict ourselves to the time-independent case, and the OCP we treat in this article is stated as follows:

$$\underset{X,\,U}{\text{minimize}}\quad \sum_{i=0}^{N-1} L(x_{i},u_{i})\; +\; E\left(x_{N}\right)$$
(1a)
$$\text{subject to}\quad x_{0} - \bar{x}_{0} = 0,$$
(1b)
$$x_{i+1} - f(x_{i},u_{i}) = 0,\quad i = 0,\ldots,N-1,$$
(1c)
$$h(x_{i},u_{i}) \leq 0,\quad i = 0,\ldots,N-1,$$
(1d)
$$r\left(x_{N}\right) \leq 0.$$
(1e)

The MPC objective is stated in Eq. (1a), the system dynamics enter via Eq. (1c), while path and terminal constraints enter via Eqs. (1d) and (1e). All functions are assumed to be differentiable and to have appropriate dimensions (\(h(x,u) \in \mathbb{R}^{n_{h}}\) and \(r(x) \in \mathbb{R}^{n_{r}}\)). Note that \(\bar{x}_{0} \in \mathbb{R}^{n_{x}}\) is not an optimization variable, but a parameter upon which the OCP depends via the initial value constraint in Eq. (1b). The optimal solution trajectories depend only on this value and can thus be denoted by \(X^{{\ast}}(\bar{x}_{0})\) and \(U^{{\ast}}(\bar{x}_{0})\). Obtaining them, in particular the first control value \(u_{0}^{{\ast}}(\bar{x}_{0})\), as fast and reliably as possible for each new value of \(\bar{x}_{0}\) is the aim of all MPC optimization algorithms. The most important dividing line is between convex and non-convex optimal control problems (OCP). If the OCP is convex, algorithms exist that find a global solution reliably and in computable time. If the OCP is not convex, one usually needs to be satisfied with approximations of locally optimal solutions. The OCP (1) is convex if the objective (1a) and all components of the inequality constraint functions (1d) and (1e) are convex and if the equality constraints (1c) are linear.

We typically speak of linear MPC when the OCP to be solved is convex, and otherwise of nonlinear MPC.
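
As an illustration of the discretization step in the direct approach, the following Python sketch (all names hypothetical) obtains a discrete-time map f from a continuous-time model \(\dot{x} = f_c(x,u)\) by one classical Runge-Kutta step per sampling interval; practical tools use more sophisticated integrators with sensitivity computation.

import numpy as np

def rk4_step(f_c, x, u, h):
    # One classical Runge-Kutta step of length h for xdot = f_c(x, u);
    # this yields the discrete-time map f(x, u) used in OCP (1).
    k1 = f_c(x, u)
    k2 = f_c(x + 0.5 * h * k1, u)
    k3 = f_c(x + 0.5 * h * k2, u)
    k4 = f_c(x + h * k3, u)
    return x + (h / 6.0) * (k1 + 2 * k2 + 2 * k3 + k4)

# Example: a controlled Van der Pol oscillator as continuous-time model
f_c = lambda x, u: np.array([x[1], (1 - x[0]**2) * x[1] - x[0] + u[0]])
f = lambda x, u: rk4_step(f_c, x, u, h=0.05)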

General Algorithmic Features for MPC Optimization

In MPC we would ideally have the solution to each new optimal control problem instantly, which is impossible due to computational delays. Several ideas help to deal with this issue.

Off-line Precomputations and Code Generation

As consecutive MPC problems are similar and differ only in the value \(\bar{x}_{0}\), many computations can be done once and for all before the MPC controller execution starts. Careful preprocessing and code optimization of the model routines are essential, and many tools automatically generate custom solvers in low-level languages. The generated code has fixed matrix and vector dimensions, performs no online memory allocations, and contains a minimal number of if-then-else statements to ensure a smooth computational flow.

Delay Compensation by Prediction

When we know how long the computations for solving an MPC problem will take, it is a good idea not to address a problem starting at the current state, but to predict the state the system will have reached by the time the problem is solved and to use this predicted state as initial value. The prediction can be made using the MPC system model and the open-loop control inputs that will be applied in the meantime. This feature is used in many practical MPC schemes with non-negligible computation time.
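
A minimal sketch of this prediction step, assuming a discrete-time model f and a buffer containing the controls already committed for the delay interval (names hypothetical):

def predict_initial_state(f, x_now, pending_controls):
    # Simulate forward over the expected computation delay using the
    # controls that will be applied in the meantime; the next MPC problem
    # is then solved for the predicted state instead of the current one.
    x = x_now
    for u in pending_controls:   # one entry per sampling interval of delay
        x = f(x, u)
    return x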

Division into Preparation and Feedback Phase

A third ingredient of several MPC algorithms is to divide the computations in each sampling time into a preparation phase and a feedback phase. The more CPU-intensive preparation phase is performed with a predicted state \(\bar{x}_{0}\), before the most current state estimate, say \(\bar{x}_{0}'\), is available. Once \(\bar{x}_{0}'\) is available, the feedback phase quickly delivers an approximate solution to the optimization problem for \(\bar{x}_{0}'\).

Warmstarting and Shift

An obvious way to transfer solution information from one solved MPC problem to the next one uses the existing optimal solution as an initial guess to start the iterative solution procedure of the next problem. We can either directly use the existing solution without modification for warmstarting or we can first shift it in order to account for the advancement of time, which is particularly advantageous for systems with time-varying dynamics or objectives.
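
A possible shift initialization in Python (hypothetical helper; the appended terminal guess, here obtained by repeating the last control and simulating one more step, is one common heuristic among several):

import numpy as np

def shift_initialization(X, U, f):
    # X has rows x_0,...,x_N and U has rows u_0,...,u_{N-1}. Drop the
    # first state and control and append a guess for the new end of the
    # horizon; f is the discrete-time model returning a numpy array.
    U_new = np.vstack([U[1:], U[-1:]])                 # u_1,...,u_{N-1},u_{N-1}
    X_new = np.vstack([X[1:], f(X[-1], U[-1])[None, :]])
    return X_new, U_new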

Iterating While the Problem Changes

A last important ingredient of some MPC algorithms is the idea of working on the optimization problem while it changes: rather than iterating the optimization procedure to convergence on an MPC problem that becomes increasingly outdated during the iterations, each new iteration works with the most current problem information.

Convex Optimization for Linear MPC

Linear MPC is based on a linear system model of the form \(x_{i+1} = Ax_{i} + Bu_{i}\) and convex objective and constraint functions in (1a), (1d), and (1e). The most widespread linear MPC setting uses a convex quadratic objective function and affine constraints and solves the following quadratic program (QP):

$$\underset{X,\,U}{\text{minimize}}\quad \frac{1}{2}\sum_{i=0}^{N-1}\begin{bmatrix} x_{i} \\ u_{i}\end{bmatrix}^{\top}\begin{bmatrix} Q & S \\ S^{\top} & R \end{bmatrix}\begin{bmatrix} x_{i} \\ u_{i}\end{bmatrix}\; +\; \frac{1}{2}x_{N}^{\top}Px_{N}$$
(2a)
$$\text{subject to}\quad x_{0} - \bar{x}_{0} = 0,$$
(2b)
$$x_{i+1} - Ax_{i} - Bu_{i} = 0,\quad i = 0,\ldots,N-1,$$
(2c)
$$b + Cx_{i} + Du_{i} \leq 0,\quad i = 0,\ldots,N-1,$$
(2d)
$$c + Fx_{N} \leq 0.$$
(2e)

Here, b, c are vectors and Q, S, R, P, C, D, F matrices, and the matrices \(\begin{bmatrix} Q & S \\ S^{\top} & R \end{bmatrix}\) and P are symmetric and positive semi-definite to ensure the QP is convex.

Sparsity Exploitation

The QP (2) has a specific sparsity structure that can be exploited in different ways. One way is to reduce the variable space by a procedure called condensing and then to solve a smaller-scale QP instead of (2). Another way is to use a banded matrix factorization.

Condensing

The constraints (2b) and (2c) can be used to eliminate the state trajectory X. This yields an equivalent but smaller-scale QP of the following form:

$$\underset{U \in \mathbb{R}^{Nn_{u}}}{\text{minimize}}\quad \frac{1}{2}\begin{bmatrix} U \\ \bar{x}_{0}\end{bmatrix}^{\top}\begin{bmatrix} H & G \\ G^{\top} & J \end{bmatrix}\begin{bmatrix} U \\ \bar{x}_{0}\end{bmatrix}$$
(3a)
$$\text{subject to}\quad d + K\bar{x}_{0} + MU \leq 0.$$
(3b)

The number of inequality constraints is the same as in the original QP (2) and given by \(m = Nn_{h} + n_{r}\). Note that in the simplest case without inequalities (m = 0), the solution \(U^{{\ast}}(\bar{x}_{0})\) of the condensed QP can be obtained by setting the gradient of the objective to zero, i.e., by solving \(HU^{{\ast}}(\bar{x}_{0}) + G\bar{x}_{0} = 0\). The factorization of a dense matrix H of dimension \(Nn_{u} \times Nn_{u}\) needs \(O(N^{3}n_{u}^{3})\) arithmetic operations, i.e., the computational cost of condensing-based algorithms typically grows cubically with the horizon length N.
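
The following sketch builds the condensed matrices H and G for the simplest case without mixed cost term (assuming S = 0) and without inequalities; the commented last line recovers \(U^{\ast}(\bar{x}_{0})\) from \(HU^{\ast} + G\bar{x}_{0} = 0\). All names are illustrative:

import numpy as np

def condense(A, B, Q, R, P, N):
    # Build prediction matrices X = Phi @ x0bar + Gamma @ U for the
    # stacked states x_0,...,x_N, then the condensed Hessian H and
    # coupling matrix G; the mixed cost term S is assumed zero here.
    nx, nu = B.shape
    Phi = np.zeros(((N + 1) * nx, nx))
    Gamma = np.zeros(((N + 1) * nx, N * nu))
    Phi[0:nx] = np.eye(nx)
    for i in range(1, N + 1):
        Phi[i*nx:(i+1)*nx] = np.linalg.matrix_power(A, i)
        for j in range(i):
            Gamma[i*nx:(i+1)*nx, j*nu:(j+1)*nu] = \
                np.linalg.matrix_power(A, i - 1 - j) @ B
    Qbar = np.kron(np.eye(N + 1), Q)
    Qbar[-nx:, -nx:] = P                      # terminal weight
    Rbar = np.kron(np.eye(N), R)
    H = Gamma.T @ Qbar @ Gamma + Rbar
    G = Gamma.T @ Qbar @ Phi
    return H, G

# Unconstrained solution of the condensed QP:
# U_opt = np.linalg.solve(H, -G @ x0bar)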

Banded Matrix Factorization

An alternative way to deal with the sparsity is best sketched using the example of a sparse convex QP (2) without inequality constraints (2d) and (2e). We define the vector of Lagrange multipliers \(Y = [y_{0}^{\top },\ldots,y_{N}^{\top }]^{\top }\) and the Lagrangian function by

$$\mathcal{L}(X,U,Y) = y_{0}^{\top}(x_{0} - \bar{x}_{0}) + \frac{1}{2}x_{N}^{\top}Px_{N} + \sum_{i=0}^{N-1}\left(\frac{1}{2}\begin{bmatrix} x_{i} \\ u_{i}\end{bmatrix}^{\top}\begin{bmatrix} Q & S \\ S^{\top} & R \end{bmatrix}\begin{bmatrix} x_{i} \\ u_{i}\end{bmatrix} + y_{i+1}^{\top}(x_{i+1} - Ax_{i} - Bu_{i})\right).$$
(4)

If we reorder all unknowns that enter the Lagrangian and collect them in the vector

$$\displaystyle{W = [y_{0}^{\top },x_{ 0}^{\top },u_{ 0}^{\top },y_{ 1}^{\top },x_{ 1}^{\top },u_{ 1}^{\top },\ldots,y_{ N}^{\top },x_{ N}^{\top }]^{\top }}$$

the optimal solution \(W^{{\ast}}(\bar{x}_{0})\) is uniquely characterized by the first-order optimality condition

$$\displaystyle{\nabla _{W}\mathcal{L}(W^{{\ast}}) = 0.}$$

Due to the linear quadratic dependence of \(\mathcal{L}\) on W, this is a block-banded linear equation in the unknown W:

$$\begin{bmatrix}
0 & I \\
I & Q & S & -A^{\top} \\
 & S^{\top} & R & -B^{\top} \\
 & -A & -B & 0 & I \\
 & & & I & Q & \ddots \\
 & & & & \ddots & \ddots \\
 & & & & & \ddots & 0 & I \\
 & & & & & & I & P
\end{bmatrix} W^{\ast} = \begin{bmatrix} \bar{x}_{0} \\ 0 \\ 0 \\ 0 \\ \vdots \\ \vdots \\ \vdots \\ 0\end{bmatrix}.$$

Because the above matrix has nonzero elements only on a band of width \(2n_{x} + n_{u}\) around the diagonal, it can be factorized with a computational cost of order \(O(N(2n_{x} + n_{u})^{3})\).
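
The following sketch assembles and solves this KKT system for illustration; it builds the matrix densely, whereas a production solver would use a banded factorization or an equivalent Riccati recursion to realize the O(N) cost (names hypothetical):

import numpy as np

def solve_eq_kkt(A, B, Q, S, R, P, N, x0bar):
    # Assemble the block-banded KKT system in the ordering
    # W = [y_0, x_0, u_0, y_1, x_1, u_1, ..., y_N, x_N] and solve it.
    nx, nu = B.shape
    blk = 2 * nx + nu                       # one (y_i, x_i, u_i) block
    n = N * blk + 2 * nx
    K = np.zeros((n, n))
    rhs = np.zeros(n)
    rhs[:nx] = x0bar                        # from  x_0 - x0bar = 0
    for i in range(N):
        y  = slice(i * blk, i * blk + nx)
        x  = slice(i * blk + nx, i * blk + 2 * nx)
        u  = slice(i * blk + 2 * nx, (i + 1) * blk)
        y1 = slice((i + 1) * blk, (i + 1) * blk + nx)
        x1 = slice((i + 1) * blk + nx, (i + 1) * blk + 2 * nx)
        K[y, x] = np.eye(nx)                             # constraint on x_i
        K[x, y] = np.eye(nx); K[x, x] = Q; K[x, u] = S; K[x, y1] = -A.T
        K[u, x] = S.T; K[u, u] = R; K[u, y1] = -B.T
        K[y1, x] = -A; K[y1, u] = -B; K[y1, x1] = np.eye(nx)
    yN = slice(N * blk, N * blk + nx)
    xN = slice(N * blk + nx, N * blk + 2 * nx)
    K[xN, yN] = np.eye(nx); K[xN, xN] = P                # terminal block
    return np.linalg.solve(K, rhs)          # banded solver would be O(N)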

Treatment of Inequalities

An important observation is that both the uncondensed QP (2) and the equivalent condensed QP (3) typically fall into the class of strictly convex parametric quadratic programs: the solution \(U^{{\ast}}(\bar{x}_{0}),X^{{\ast}}(\bar{x}_{0})\) is unique and depends piecewise affinely and continuously on the parameter \(\bar{x}_{0}\). Each affine piece of the solution map corresponds to one active set and is valid on one polyhedral critical region in parameter space. This observation forms the basis of explicit MPC algorithms which precompute the map \(u_{0}^{{\ast}}(\bar{x}_{0})\), but it can also be exploited in online algorithms for quadratic programming, which are the focus of this section.

We sketch the different ways to treat inequalities only for the condensed QP (3), but they can equally be applied in sparse QP algorithms that directly address (2). The optimal solution \(U^{{\ast}}(\bar{x}_{0})\) for a strictly convex QP (3) is – together with the corresponding vector of Lagrange multipliers, or dual variables \(\lambda ^{{\ast}}(\bar{x}_{0}) \in \mathbb{R}^{m}\) – uniquely characterized by the so-called Karush-Kuhn-Tucker (KKT) conditions, which we omit for brevity. There are three big families of solution algorithms for inequality-constrained QPs that differ in the way they treat inequalities: active set methods, interior point methods, and gradient projection methods.

Active Set Methods

The optimal solution of the QP is characterized by its active set, i.e., the set of inequality constraints (3b) that are satisfied with equality at this point. If one knew the active set for a given problem instance \(\bar{x}_{0}\), it would be easy to find the solution. Active set methods work with guesses of the active set which they iteratively refine. In each iteration, they solve one linear system corresponding to a given guess of the active set. If the KKT conditions are satisfied, the optimal solution is found; if they are not, another division into active and inactive constraints needs to be tried. A crucial observation is that an existing factorization of the linear system can be reused to a large extent when only one constraint is added to or removed from the guess of the active set. Many different active set strategies exist, three of which we mention: Primal active set methods first find a feasible point and then add or remove active constraints, always keeping the primal variables U feasible. Due to the difficulty of finding a feasible point first, they are difficult to warmstart in the context of MPC optimization. Dual active set strategies always keep the dual variables λ nonnegative. They can easily be warmstarted in the context of MPC. Parametric or online active set strategies ensure that all iterates stay primal and dual feasible and move along a straight line in parameter space from a solved QP problem to the current one, only updating the active set when crossing the boundary between critical regions, as implemented in the online QP solver qpOASES.

Active set methods are very competitive in practice, but their worst case complexity is not polynomial. They are often used together with the condensed QP formulation, for which each active set change is relatively cheap. This is particularly advantageous if an existing factorization can be kept between subsequent MPC optimization problems.
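
The core linear-algebra step shared by these strategies can be sketched as follows for the condensed QP (3): for a fixed guess of the active set, the corresponding equality-constrained QP is solved and the KKT conditions are checked. A minimal illustration with hypothetical names; real implementations update factorizations instead of refactorizing from scratch:

import numpy as np

def active_set_step(H, g, Mmat, d_aff, active):
    # For the guessed active set, solve  min 1/2 U'HU + g'U
    # s.t. M_a U + d_a = 0, where g = G @ x0bar and d_aff = d + K @ x0bar.
    Ma, da = Mmat[active], d_aff[active]
    n, na = H.shape[0], Ma.shape[0]
    KKT = np.block([[H, Ma.T], [Ma, np.zeros((na, na))]])
    sol = np.linalg.solve(KKT, np.concatenate([-g, -da]))
    U, lam_a = sol[:n], sol[n:]
    primal_ok = np.all(Mmat @ U + d_aff <= 1e-9)  # inactive rows feasible?
    dual_ok = np.all(lam_a >= -1e-9)              # multipliers nonnegative?
    # If both checks pass, U is optimal; otherwise the active set guess
    # must be modified and another linear system solved.
    return U, lam_a, primal_ok and dual_ok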

Interior Point Methods

Another approach is to replace the KKT conditions by a smooth approximation that uses a small positive parameter τ > 0:

$$HU^{\ast} + G\bar{x}_{0} + M^{\top}\lambda^{\ast} = 0,$$
(5a)
$$\left(d + K\bar{x}_{0} + MU^{\ast}\right)_{i}\lambda_{i}^{\ast} + \tau = 0,\quad i = 1,\ldots,m.$$
(5b)

These conditions form a smooth nonlinear system of equations that uniquely determines a primal-dual solution \(U^{{\ast}}(\bar{x}_{0},\tau )\) and \(\lambda ^{{\ast}}(\bar{x}_{0},\tau )\) in the interior of the feasible set. They are not equivalent to the KKT conditions, but for \(\tau \rightarrow 0\), their solution tends to the exact QP solution. An interior point algorithm solves the system (5a) and (5b) by Newton’s method. Simultaneously, the path parameter τ, which was initially set to a large value, is iteratively reduced, making the nonlinear set of equations a closer approximation of the original KKT system. In each Newton iteration, a linear system needs to be factorized and solved, which constitutes the major computational cost of an interior point algorithm. For the condensed QP (3) with dense matrices H, M, the cost per Newton iteration is of order \(O(N^{3})\). But the interior point algorithm can also be applied to the uncondensed sparse QP (2), in which case each iteration has a runtime of order \(O(N)\). In practice, for both cases, 10–30 Newton iterations usually suffice to obtain very accurate solutions. As an interior point method always needs to start with a high value of τ and then reduce it during the iterations, warmstarting is of minor benefit. There exist efficient code generation tools that export convex interior point solvers as plain C-code, such as CVXGEN and FORCES.
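
A minimal sketch of such an interior point iteration for the condensed QP (3), assuming that the starting point U = 0 is strictly feasible; practical solvers add infeasible-start handling, predictor-corrector steps, and convergence tests:

import numpy as np

def ip_solve(H, g, Mmat, d_aff, tau0=1.0, sigma=0.2, iters=30):
    # Newton iterations on the relaxed KKT system (5a)-(5b) with a
    # geometrically decreasing path parameter tau; g = G @ x0bar and
    # d_aff = d + K @ x0bar. Assumes d_aff < 0, so U = 0 is interior.
    n, m = H.shape[0], Mmat.shape[0]
    U, lam, tau = np.zeros(n), np.ones(m), tau0
    for _ in range(iters):
        s = d_aff + Mmat @ U                   # s < 0 in the interior
        F = np.concatenate([H @ U + g + Mmat.T @ lam, s * lam + tau])
        J = np.block([[H, Mmat.T],
                      [lam[:, None] * Mmat, np.diag(s)]])
        step = np.linalg.solve(J, -F)
        dU, dlam = step[:n], step[n:]
        # Fraction-to-boundary rule: keep lam > 0 and s < 0.
        alpha, ds = 1.0, Mmat @ dU
        for v, dv in ((lam, dlam), (-s, -ds)):
            neg = dv < 0
            if np.any(neg):
                alpha = min(alpha, 0.995 * np.min(-v[neg] / dv[neg]))
        U, lam = U + alpha * dU, lam + alpha * dlam
        tau *= sigma                           # follow the central path
    return U, lam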

Gradient Projection Methods

Gradient projection methods do not need to factorize any matrix but only evaluate the gradient of the objective function \(HU^{[k]} + G\bar{x}_{0}\) in each iteration. They can only be implemented efficiently if the feasible set is simple in the sense that a projection \(\mathcal{P}(U)\) onto this set is very cheap to compute, as, e.g., for upper and lower bounds on the variables U, and if an upper bound \(L_{H} > 0\) on the eigenvalues of the Hessian H is known. The simple gradient projection algorithm starts with an initialization \(U^{[0]}\) and proceeds as follows:

$$\displaystyle{U^{[k+1]} = \mathcal{P}\left (U^{[k]} - \frac{1} {L_{H}}(HU^{[k]} + G\bar{x}_{ 0})\right ).}$$

An improved version of the gradient projection algorithm is called the optimal or fast gradient method and achieves the best possible iteration complexity among all gradient-type methods. All variants of gradient projection algorithms are easy to warmstart. Though they are not as versatile as active set or interior point methods, they have short code sizes and can offer advantages on embedded computational hardware, such as the code generated by the tool FiOrdOs.
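
A minimal sketch of the plain gradient projection method for box constraints on U, with \(g = G\bar{x}_{0}\); the spectral norm of H serves as the eigenvalue bound \(L_{H}\), and the projection is a simple clip:

import numpy as np

def gradient_projection(H, g, lb, ub, U0, iters=500):
    # Projected gradient method for  min 1/2 U'HU + g'U  s.t. lb <= U <= ub.
    L = np.linalg.norm(H, 2)       # largest eigenvalue of symmetric H
    U = U0.copy()
    for _ in range(iters):
        U = np.clip(U - (H @ U + g) / L, lb, ub)   # gradient step + projection
    return U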

Optimization Algorithms for Nonlinear MPC

When the dynamic system \(x_{i+1} = f(x_{i},u_{i})\) is not affine, the optimal control problem (1) is non-convex, and we speak of a nonlinear MPC (NMPC) problem. NMPC optimization algorithms only aim at finding a locally optimal solution of this problem, and they usually do it in a Newton-type framework. For ease of notation, we summarize problem (1) in the form of a general nonlinear programming problem (NLP):

$$\underset{X,\,U}{\text{minimize}}\quad \Phi(X,U)$$
(6a)
$$\text{subject to}\quad G_{\mathrm{eq}}(X,U,\bar{x}_{0}) = 0,$$
(6b)
$$G_{\mathrm{ineq}}(X,U) \leq 0.$$
(6c)

Let us first discuss a fundamental choice that regards the problem formulation and number of optimization variables.

Simultaneous vs. Sequential Formulation

When an optimization algorithm addresses problem (6) iteratively, it works intermediately with nonphysical, infeasible trajectories that violate the system constraints (6b). Only at the optimal solution is the constraint residual brought to zero and a physical simulation achieved. We speak of a simultaneous approach to optimal control because the algorithm solves the simulation and the optimization problem simultaneously. Variants of this approach include direct collocation and direct multiple shooting.

On the other hand, the equality constraints (6b) could easily be eliminated by a nonlinear forward simulation for given initial value \(\bar{x}_{0}\) and control trajectory U, similar to condensing in linear MPC. Such a forward simulation generates a state trajectory \(X_{\mathrm{sim}}(\bar{x}_{0},U)\) such that for the given value of \(\bar{x}_{0},U\) the equality constraints (6b) are automatically satisfied: \(G_{\mathrm{eq}}(X_{\mathrm{sim}}(\bar{x}_{0},U),U,\bar{x}_{0}) = 0\). Inserting this map into the NLP (6) allows us to formulate an equivalent optimization problem with a reduced variable space:

$$\underset{U}{\text{minimize}}\quad \Phi(X_{\mathrm{sim}}(\bar{x}_{0},U),U)$$
(7a)
$$\text{subject to}\quad G_{\mathrm{ineq}}(X_{\mathrm{sim}}(\bar{x}_{0},U),U) \leq 0.$$
(7b)

When solving this reduced problem with an iterative optimization algorithm, we sequentially simulate and optimize the system, and we speak of the sequential approach to optimal control.
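
A minimal sketch of the forward simulation underlying the sequential approach, with hypothetical stage and terminal cost callables; the reduced objective can then be handed to a general-purpose NLP solver, with derivatives typically supplied by algorithmic differentiation:

import numpy as np

def simulate(f, x0bar, U):
    # Forward simulation X_sim(x0bar, U): the dynamics (6b) hold by
    # construction, so the states are eliminated from the problem.
    X = [np.asarray(x0bar)]
    for u in U:
        X.append(f(X[-1], u))
    return np.array(X)

def reduced_objective(L_stage, E_term, f, x0bar, U):
    # Objective (7a) of the reduced problem, as a function of U only.
    X = simulate(f, x0bar, U)
    return sum(L_stage(X[i], U[i]) for i in range(len(U))) + E_term(X[-1])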

The sequential approach has a lower dimensional variable space and is thus easier to use with a black-box NLP solver. On the other hand, the simultaneous approach leads to a sparse NLP and is better able to deal with unstable nonlinear systems. In the remainder of this section, we thus only discuss the specific structure of the simultaneous approach.

Newton-Type Optimization

In order to simplify notation further, we collect and reorder all optimization variables U and X in a vector \(V = [x_{0}^{\top },u_{0}^{\top },\ldots,x_{N-1}^{\top },u_{N-1}^{\top },x_{N}^{\top }]^{\top }\) and use the same problem function names as in (6) also with the new argument V.

As in the section on linear MPC, we can introduce multipliers Y for the equalities and λ for the inequalities and define the Lagrangian

$$\mathcal{L}(V,Y,\lambda) = \Phi(V) + Y^{\top}G_{\mathrm{eq}}(V,\bar{x}_{0}) + \lambda^{\top}G_{\mathrm{ineq}}(V).$$
(8)

All Newton-type optimization methods try to find a point satisfying the KKT conditions by using successive linearizations of the problem functions and Lagrangian. For this aim, starting with an initial guess \((V ^{[0]},Y ^{[0]},\lambda ^{[0]})\), they generate sequences of primal-dual iterates \((V ^{[k]},Y ^{[k]},\lambda ^{[k]})\).

An observation that is crucial for the efficiency of all NMPC algorithms is that, at the current iterate, the Hessian of the Lagrangian is a block-diagonal matrix of the form

$$\nabla_{V}^{2}\mathcal{L}(\cdot) = \begin{bmatrix}
Q_{0}^{[k]} & S_{0}^{[k]} \\
S_{0}^{[k],\top} & R_{0}^{[k]} \\
 & & Q_{1}^{[k]} & S_{1}^{[k]} \\
 & & S_{1}^{[k],\top} & R_{1}^{[k]} \\
 & & & & \ddots \\
 & & & & & P^{[k]}
\end{bmatrix}.$$

This block sparse matrix structure makes it possible to use in each Newton-type iteration the same sparsity-exploiting linear algebra techniques as outlined in section “Convex Optimization for Linear MPC” for the linear MPC problem.

Major differences exist in how the inequality constraints are treated; the two big families of Newton-type optimization methods are sequential quadratic programming (SQP) methods and nonlinear interior point (NIP) methods.

Sequential Quadratic Programming (SQP)

A first variant to iteratively solve the KKT system is to linearize all nonlinear functions at the current iterate \((V ^{[k]},Y ^{[k]},\lambda ^{[k]})\) and to find a new solution guess from the solution of a quadratic program (QP):

$$\underset{V}{\text{minimize}}\quad \Phi_{\mathrm{quad}}(V;V^{[k]},Y^{[k]},\lambda^{[k]})$$
(9a)
$$\text{subject to}\quad G_{\mathrm{eq},\mathrm{lin}}(V,\bar{x}_{0};V^{[k]}) = 0,$$
(9b)
$$G_{\mathrm{ineq},\mathrm{lin}}(V;V^{[k]}) \leq 0.$$
(9c)

Here, the subscript “lin” in the constraints \(G_{\cdot,\mathrm{lin}}\) expresses that a first-order Taylor expansion at \(V^{[k]}\) is used, while the QP objective is given by \(\Phi _{\mathrm{quad}}(V ;V ^{[k]},Y ^{[k]},\lambda ^{[k]}) = \Phi _{\mathrm{lin}}(V ;V ^{[k]}) + \frac{1}{2}(V - V ^{[k]})^{\top }\nabla _{V }^{2}\mathcal{L}(\cdot )\,(V - V ^{[k]})\). Note that the QP has the same sparsity structure as the QP (2) resulting from linear MPC, with the only difference that all matrices are now time varying over the MPC horizon. If the Hessian matrix is positive semi-definite, this QP is convex, so that global solutions can be found reliably with any of the methods from section “Convex Optimization for Linear MPC.” The solution of the QP along with the corresponding constraint multipliers gives the next SQP iterate \((V ^{[k+1]},Y ^{[k+1]},\lambda ^{[k+1]})\). Apart from the presented “exact Hessian” SQP variant, which has quadratic convergence speed, several other SQP variants exist that make use of other Hessian approximations. A particularly useful Hessian approximation for NMPC is possible if the original objective function \(\Phi (V )\) is convex quadratic; the resulting SQP variant is called the generalized Gauss-Newton method. In this case, one can just use the original objective as cost function in the QP (9a), resulting in convex QP subproblems and (often fast) linear convergence speed.
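
A compact sketch of such an SQP loop for the equality-constrained case in the Gauss-Newton spirit (convex quadratic objective, so its exact Hessian is used); inequalities and globalization are omitted for brevity, and the dense KKT solve stands in for the structure-exploiting methods of the linear MPC section. All names are illustrative:

import numpy as np

def solve_eqqp(H, g, J, r):
    # Equality-constrained QP  min 1/2 dV'H dV + g'dV  s.t. r + J dV = 0,
    # solved via its (here dense) KKT system.
    n, m = H.shape[0], J.shape[0]
    KKT = np.block([[H, J.T], [J, np.zeros((m, m))]])
    sol = np.linalg.solve(KKT, np.concatenate([-g, -r]))
    return sol[:n], sol[n:]                  # step dV and multipliers Y

def gauss_newton_sqp(V0, grad_obj, hess_obj, geq, jac_geq, iters=10):
    # One full SQP iteration per loop pass: linearize the dynamics,
    # keep the (constant) Hessian of the convex objective, solve the QP.
    V = V0.copy()
    for _ in range(iters):
        H, g = hess_obj(V), grad_obj(V)
        J, r = jac_geq(V), geq(V)            # linearization of G_eq
        dV, Y = solve_eqqp(H, g, J, r)
        V = V + dV
    return V, Y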

Nonlinear Interior Point (NIP) Method

In contrast to SQP methods, an alternative way to address the solution of the KKT system is to replace the last nonsmooth KKT conditions by a smooth nonlinear approximation, with τ > 0:

$$\nabla_{V}\mathcal{L}(V^{\ast},Y^{\ast},\lambda^{\ast}) = 0,$$
(10a)
$$G_{\mathrm{eq}}(V^{\ast},\bar{x}_{0}) = 0,$$
(10b)
$$G_{\mathrm{ineq},i}(V^{\ast})\,\lambda_{i}^{\ast} + \tau = 0,\quad i = 1,\ldots,m.$$
(10c)

We collect all variables in a vector \(W = [V ^{\top },Y ^{\top },\lambda ^{\top }]^{\top }\) and write the above set of equations compactly as

$$\displaystyle{ G_{\mathrm{NIP}}(W,\bar{x}_{0},\tau ) = 0. }$$
(11)

The resulting root finding problem is then solved with Newton’s method, for a decreasing sequence of path parameters \(\tau^{[k]}\). The NIP method thus proceeds exactly as an interior point method for convex problems, with the only difference that all problem functions have to be re-linearized in each iteration. An excellent software implementation of the NIP method is the code IPOPT.

Continuation Methods and Tangential Predictors

In nonlinear MPC, a sequence of OCPs with different initial values \(\bar{x}_{0}^{[0]},\bar{x}_{0}^{[1]},\bar{x}_{0}^{[2]},\ldots\) is solved. For the transition from one problem to the next, it is beneficial to take into account the fact that the optimal solution \(W^{{\ast}}(\bar{x}_{0})\) depends almost everywhere differentiably on \(\bar{x}_{0}\). The concept of a continuation method is most easily explained in the context of an NIP method with fixed path parameter \(\bar{\tau }> 0\). In this case, the solution \(W^{{\ast}}(\bar{x}_{0},\bar{\tau })\) of the smooth root finding problem \(G_{\mathrm{NIP}}(W^{{\ast}}(\bar{x}_{0},\bar{\tau }),\bar{x}_{0},\bar{\tau }) = 0\) from Eq. (11) is smooth with respect to \(\bar{x}_{0}\). This smoothness can be exploited by making use of a tangential predictor in the transition from one value of \(\bar{x}_{0}\) to another. Unfortunately, the interior point solution manifold is strongly nonlinear at points where the active set changes, and the tangential predictor is not a good approximation when we linearize at such points.
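
A tangential predictor follows from the implicit function theorem applied to Eq. (11): differentiating \(G_{\mathrm{NIP}}(W^{\ast}(\bar{x}_{0}),\bar{x}_{0},\bar{\tau}) = 0\) with respect to \(\bar{x}_{0}\) gives a linear system for the first-order change of \(W^{\ast}\). A minimal sketch, with the Jacobians supplied by the caller:

import numpy as np

def tangential_predictor(W, x0_old, x0_new, G_W, G_x):
    # G_W and G_x are the Jacobians of G_NIP with respect to W and x0bar,
    # evaluated at the old solution W = W*(x0_old) for fixed taubar.
    dW = np.linalg.solve(G_W, -G_x @ (x0_new - x0_old))
    return W + dW          # first-order prediction of W*(x0_new)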

Generalized Tangential Predictor and Real-Time Iterations

In fact, the true NLP solution is not determined by a smooth root finding problem (10a)–(10c) but by the (nonsmooth) KKT conditions. The solution manifold has smooth parts when the active set does not change, but non-differentiable points occur whenever the active set changes. We can deal with this fact naturally in an SQP framework by solving one QP of form (9) in order to generate a tangential predictor that is also valid in the presence of active set changes. In the extreme case that only one such QP is solved per sampling time, we speak of a real-time iteration (RTI) algorithm. The computations in each iteration can be subdivided into two phases, the preparation phase, in which the derivatives are computed and the QP is condensed, and the feedback phase, which only starts once \(\bar{x}_{0}^{[k+1]}\) becomes available and in which only a condensed QP of form (3) is solved, minimizing the feedback delay. This NMPC algorithm can be generated as plain C-code, e.g., by the tool ACADO. Another class of real-time NMPC algorithms based on a continuation method can be generated by the tool AutoGenU.
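
The structure of a real-time iteration can be sketched as follows; all callables are user-supplied placeholders, and the division matches the preparation and feedback phases described above:

def rti_loop(prepare_qp, wait_for_state, solve_condensed_qp,
             apply_control, shift, V_guess, steps=100):
    # One SQP iteration per sampling time, split into a CPU-intensive
    # preparation phase and a fast feedback phase.
    for _ in range(steps):
        qp_data = prepare_qp(V_guess)             # derivatives + condensing
        x0_new = wait_for_state()                 # current estimate arrives
        u0 = solve_condensed_qp(qp_data, x0_new)  # feedback: QP of form (3)
        apply_control(u0)
        V_guess = shift(V_guess, u0)              # warmstart next problem
    return V_guess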

Recommended Reading

Many of the algorithmic ideas presented in this article can be used in different combinations than those presented, and several other ideas had to be omitted for the sake of brevity. Some more details can be found in the following two overview articles on MPC optimization: Binder et al. (2001) and Diehl et al. (2009). The general field of numerical optimal control is treated in Bryson and Ho (1975), Betts (2010), and the even broader field of numerical optimization is covered in the excellent textbooks (Biegler 2010; Fletcher 1987; Gill et al. 1999; Nesterov 2004; Nocedal and Wright 2006; Wright 1997). General purpose open-source software for MPC and NMPC is described in the following papers: FORCES (Domahidi et al. 2012), CVXGEN (Mattingley and Boyd 2009), qpOASES (Ferreau et al. 2008), FiOrdOs (Richter et al. 2011), AutoGenU (Ohtsuka and Kodama 2002), ACADO (Houska et al. 2011), and IPOPT (Wächter and Biegler 2006).