
1 Introduction

We begin with the Hamilton-Jacobi (HJ) initial value problem

$$\displaystyle{ \left \{\begin{array}{@{}l@{\quad }l@{}} \frac{\partial \varphi } {\partial t}(x,t) + H(\nabla _{x}\varphi (x,t))\; =\; 0\quad &\mbox{ in }\mathbb{R}^{n} \times (0,+\infty ) \\ \varphi (x,0)\; =\; J(x) \quad &\forall x \in \mathbb{R}^{n}.\end{array} \right. }$$
(12.1)

We assume that \(J: \mathbb{R}^{n} \rightarrow \mathbb{R}\) is convex and 1-coercive, i.e., \(\lim _{\|x\|_{2}\rightarrow +\infty }\frac{J(x)} {\|x\|_{2}} = +\infty\), and that \(H: \mathbb{R}^{n} \rightarrow \mathbb{R}\) is convex and positively 1-homogeneous (we sometimes relax these assumptions).

A typical example is

$$\displaystyle{H(v) =\| v\|_{2}.}$$

Here \(\|v\|_{p} = \left (\sum _{i=1}^{n}\vert v_{i}\vert ^{p}\right )^{\frac{1} {p} }\) for \(p \geq 1\) and \(\langle x,v\rangle =\sum _{i=1}^{n}x_{i}v_{i}\).

Let us consider a convex Lipschitz function J with the property that, for Ω a convex compact subset of \(\mathbb{R}^{n}\),

$$\displaystyle{\left \{\begin{array}{@{}l@{\quad }l@{}} J(x) <0\quad &\mbox{ for any }x\ \in \ \mbox{ int }\varOmega \\ J(x) = 0\quad &\mbox{ for any }x\ \in \ (\varOmega \setminus \mbox{ int }\varOmega ) \\ J(x)> 0\quad &\mbox{ for any }x\ \in \ (\mathbb{R}^{n}\setminus \varOmega ).\end{array} \right.}$$

We call this level set initial data. Then, for t > 0, the set of points for which φ(x, t) = 0 is exactly the set of points at distance t from the boundary of Ω. In fact, given \(\bar{x}\ \notin \ \varOmega\), the closest point \(x_{opt}\) from \(\bar{x}\) to (Ω ∖ int Ω) is exactly

$$\displaystyle{ x_{opt}\; =\;\bar{ x} - t \frac{\nabla \varphi (\bar{x},t)} {\|\nabla \varphi (\bar{x},t)\|_{2}}. }$$
(12.2)
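As a quick illustration (a worked example of ours, consistent with the assumptions above), take Ω to be the closed unit ball, \(J(x) =\| x\|_{2} - 1\) and \(H(v) =\| v\|_{2}\). Then \(\varphi (x,t) =\| x\|_{2} - 1 - t\) for \(\|x\|_{2} \geq t\), so \(\varphi (\bar{x},t) = 0\) exactly when \(\|\bar{x}\|_{2} = 1 + t\), and (12.2) gives

$$\displaystyle{x_{opt} =\bar{ x} - t\, \frac{\bar{x}} {\|\bar{x}\|_{2}} = \frac{\bar{x}} {\|\bar{x}\|_{2}},}$$

the projection of \(\bar{x}\) onto the unit sphere, at distance exactly t from \(\bar{x}\).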

To solve (12.1) we use the Hopf formula [5]

$$\displaystyle{ \varphi (x,t) = (J^{{\ast}} + tH)^{{\ast}}(x) = -\min _{v\in \mathbb{R}^{n}}\{J^{{\ast}}(v) + tH(v) -\langle x,v\rangle \}, }$$

where the Fenchel-Legendre transform \(f^{{\ast}}: \mathbb{R}^{n} \rightarrow \mathbb{R} \cup \{+\infty \}\) of the convex function f is defined by

$$\displaystyle{f^{{\ast}}(v) =\sup _{x\in \mathbb{R}^{n}}\{\langle v,x\rangle - f(x)\}.}$$

Moreover, we get for free that the minimizer satisfies

$$\displaystyle{ \arg \min _{v\in \mathbb{R}^{n}}\{J^{{\ast}}(v) + tH(v) -\langle x,v\rangle \} = \nabla _{x}\varphi (x,t), }$$
(12.3)

whenever φ(⋅ , t) is differentiable at x. Let us note here that our algorithm computes not only φ(x, t) but also \(\nabla _{x}\varphi (x,t)\).

Alternatively, we can use the Hopf-Lax formula [5, 6] to solve (12.1):

$$\displaystyle{ \varphi (x,t) =\min _{z\ \in \ \mathbb{R}^{n}}\left \{J(z) + tH^{{\ast}}\left (\frac{x - z} {t} \right )\right \} }$$
(12.4)

for convex H.
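For instance (a worked example of ours with \(J = \frac{1} {2}\| \cdot \|_{2}^{2}\) and \(H =\| \cdot \|_{2}\), so that \(H^{{\ast}}\) is the indicator function of the closed unit ball), formula (12.4) becomes a projection problem with closed-form solution

$$\displaystyle{\varphi (x,t) =\min _{\|x-z\|_{2}\leq t}\frac{1} {2}\|z\|_{2}^{2} = \frac{1} {2}\left (\max (\|x\|_{2} - t,0)\right )^{2},}$$

which we use below as a reference solution when testing the split Bregman iteration.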

From (12.4) it is easy to show that if we have k different initial value problems, i = 1, …, k,

$$\displaystyle{ \left \{\begin{array}{@{}l@{\quad }l@{}} \frac{\partial \varphi _{i}} {\partial t}(x,t) + H(\nabla _{x}\varphi _{i}(x,t))\; =\; 0,\quad &\mbox{ in }\mathbb{R}^{n} \times (0,+\infty ) \\ \varphi _{i}(x,0)\; =\; J_{i}(x) \quad &\forall x \in \mathbb{R}^{n}\end{array} \right. }$$

with the usual hypotheses, then (12.4) implies, for any \(x\ \in \ \mathbb{R}^{n},\ t> 0\)

$$\displaystyle{\varphi _{i}(x,t) =\min _{z\ \in \ \mathbb{R}^{n}}\left \{J_{i}(z) + tH^{{\ast}}\left (\frac{x - z} {t} \right )\right \}.}$$

Hence

$$\displaystyle{\min _{i=1,\ldots,k}\varphi _{i}(x,t) =\min _{z\ \in \ \mathbb{R}^{n}}\left \{\min _{i=1,\ldots,k}\left \{J_{i}(z) + tH^{{\ast}}\left (\frac{x - z} {t} \right )\right \}\right \}}$$

solves the initial value problem

$$\displaystyle{ \left \{\begin{array}{@{}l@{\quad }l@{}} \frac{\partial \varphi } {\partial t}(x,t) + H(\nabla _{x}\varphi (x,t))\; =\; 0,\quad &\mbox{ in }\mathbb{R}^{n} \times (0,+\infty ) \\ \varphi (x,0)\; =\;\min _{i=1,\ldots,k}J_{i}(x) \quad &\forall x \in \mathbb{R}^{n}. \end{array} \right. }$$
(12.5)

This means that if \(\varOmega =\cup _{i=1,\ldots,k}\varOmega _{i}\), where each \(\varOmega _{i}\) is compact and convex and the \(\varOmega _{i}\) may overlap, then we can easily compute the set of all points at distance t from Ω: it is exactly the zero level set of the solution to (12.5), where each \(J_{i}\) is a level set function for \(\varOmega _{i}\). Moreover, at every point \(\bar{x}\) outside of Ω for which there is one i such that \(\varphi _{i}(\bar{x},t) <\varphi _{i'}(\bar{x},t)\) for all i′ ≠ i, the closest point \(x_{opt}\) on Ω to \(\bar{x}\) is again

$$\displaystyle{x_{opt} =\bar{ x} - t \frac{\nabla _{x}\varphi _{i}(\bar{x},t)} {\|\nabla _{x}\varphi _{i}(\bar{x},t)\|_{2}}.}$$

If there are several i for which \(\varphi _{i}(\bar{x},t)\) attains the minimum among all k of them, then \(\nabla _{x}\varphi\) will be “multivalued”, i.e., it will have jumps, but any of the \(x_{opt}\) defined above will be a closest point on Ω to \(\bar{x}\).
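The following sketch (our illustration; the per-set solvers are assumed available, e.g., one split Bregman routine per \(\varOmega _{i}\) as in Sect. 2, each returning the pair \((\varphi _{i}(x,t),\nabla _{x}\varphi _{i}(x,t))\)) shows how (12.5) and the closest-point formula combine for a union of sets:

```python
import numpy as np

def min_solution_and_closest_point(x, t, solvers):
    """Evaluate min_i phi_i(x, t) for a union of convex sets and
    return (phi, closest point on the union), per (12.5) and (12.2).

    `solvers` is a list of callables; solvers[i](x, t) must return
    (phi_i(x, t), grad_x phi_i(x, t)) -- a hypothetical interface,
    e.g., one split Bregman solver per convex piece Omega_i."""
    values, grads = zip(*(solve(x, t) for solve in solvers))
    i = int(np.argmin(values))               # minimizing index (ties possible)
    g = grads[i]
    x_opt = x - t * g / np.linalg.norm(g)    # closest point on Omega
    return values[i], x_opt
```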

2 Split Bregman

We solve the optimization problem (12.3) using the split Bregman algorithm [4, 3, 9] as follows:

$$\displaystyle\begin{array}{rcl} v^{k+1}& =& \arg \min _{ v\in \mathbb{R}^{n}}\{J^{{\ast}}(v) -\langle x,v\rangle + \frac{\lambda } {2}\|d^{k} - v - b^{k}\|_{ 2}^{2}\},{}\end{array}$$
(12.6)
$$\displaystyle\begin{array}{rcl} d^{k+1}& =& \arg \min _{ d\in \mathbb{R}^{n}}\left \{tH(d) + \frac{\lambda } {2}\|d - v^{k+1} - b^{k}\|_{ 2}^{2}\right \}{}\end{array}$$
(12.7)
$$\displaystyle\begin{array}{rcl} b^{k+1}& =& b^{k} + v^{k+1} - d^{k+1}.{}\end{array}$$
(12.8)

Here the sequences \((v^{k})_{k\in \mathbb{N}}\) and \((d^{k})_{k\in \mathbb{N}}\) both converge to \(\nabla _{x}\varphi (x,t)\); the value φ(x, t) itself is then recovered from the Hopf formula by evaluating \(\langle x,v\rangle - J^{{\ast}}(v) - tH(v)\) at the limit. Let us emphasize again that our numerical algorithm computes not only the solution φ(x, t) but also its gradient \(\nabla _{x}\varphi (x,t)\) when φ(⋅ , t) is differentiable.
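As a concrete illustration, here is a minimal sketch (ours, not the authors' production code) of (12.6)–(12.8) for \(J = \frac{1} {2}\| \cdot \|_{2}^{2}\) (so \(J^{{\ast}} = \frac{1} {2}\| \cdot \|_{2}^{2}\)) and \(H =\| \cdot \|_{2}\): step (12.6) is then a quadratic minimization with a closed-form solution, and step (12.7) is the shrink2 operator defined in the next paragraphs.

```python
import numpy as np

def shrink2(z, alpha):
    """Proximal map of alpha*||.||_2 (defined below)."""
    nz = np.linalg.norm(z)
    return np.zeros_like(z) if nz == 0 else (z / nz) * max(nz - alpha, 0.0)

def hopf_split_bregman(x, t, lam=1.0, iters=100):
    """Split Bregman iteration (12.6)-(12.8) for J = J* = 0.5*||.||_2^2
    and H = ||.||_2; returns (phi(x, t), grad_x phi(x, t))."""
    d = np.zeros_like(x)
    b = np.zeros_like(x)
    for _ in range(iters):
        # (12.6): argmin_v {0.5||v||^2 - <x,v> + (lam/2)||d - v - b||^2}
        v = (x + lam * (d - b)) / (1.0 + lam)
        # (12.7): argmin_d {t||d||_2 + (lam/2)||d - v - b||^2}
        d = shrink2(v + b, t / lam)
        # (12.8): Bregman variable update
        b = b + v - d
    # Hopf formula: phi(x,t) = <x,v> - J*(v) - t*H(v) at the limit v
    phi = x @ v - 0.5 * (v @ v) - t * np.linalg.norm(v)
    return phi, v

# Sanity check against the Hopf-Lax closed form from Sect. 1:
# phi(x,t) = 0.5 * max(||x||_2 - t, 0)^2
x, t = np.array([3.0, -4.0]), 2.0        # ||x||_2 = 5
phi, grad = hopf_split_bregman(x, t)
assert abs(phi - 0.5 * (5.0 - t) ** 2) < 1e-6
```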

Both (12.6) and (12.7), up to a change of variables, can be reformulated as finding the unique minimizer

$$\displaystyle{\arg \min _{w}\left \{\alpha f(w) + \frac{1} {2}\|w - z\|_{2}^{2}\right \}}$$

which is the proximal map of f. Equation (12.6) can be solved easily if either J or \(J^{{\ast}}\) has an easily computable proximal map, which often occurs, especially if one of them is smooth.
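For instance (our worked example with \(J = \frac{1} {2}\| \cdot \|_{2}^{2}\), so \(J^{{\ast}} = \frac{1} {2}\| \cdot \|_{2}^{2}\)), the minimization (12.6) is quadratic and, setting the gradient \(v - x -\lambda (d^{k} - v - b^{k})\) to zero, has the closed-form solution

$$\displaystyle{v^{k+1} = \frac{x +\lambda (d^{k} - b^{k})} {1+\lambda },}$$

which is the update used in the sketch above.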

Equation (12.7) can be easily solved if \(H(d) =\| d\|_{2}\) via the shrink2 operator defined by

$$\displaystyle{shrink_{2}(z,\alpha ) = \left \{\begin{array}{@{}l@{\quad }l@{}} \frac{z} {\|z\|_{2}} \max (\|z\|_{2}-\alpha,0)\quad &\mbox{ if }z\neq 0 \\ 0 \quad &\mbox{ if }z = 0 \end{array} \right.}$$

and we have

$$\displaystyle{\arg \min _{w}\left \{\alpha \|w\|_{2} + \frac{1} {2}\|w - z\|_{2}^{2}\right \}\; =\; shrink_{2}(z,\alpha ).}$$

If \(H(d) =\| d\|_{1}\), we use the shrink1 operator, defined as follows for any i = 1, …, n:

$$\displaystyle{\left (shrink_{1}(z,\alpha )\right )_{i} = \left \{\begin{array}{@{}l@{\quad }l@{}} z_{i}-\alpha \quad &\mbox{ if }z_{i}>\alpha \\ 0 \quad &\mbox{ if }\vert z_{i}\vert \leq \alpha \\ z_{i}+\alpha \quad &\mbox{ if }z_{i} <-\alpha \end{array} \right.}$$

and we have

$$\displaystyle{\arg \min _{w}\left \{\alpha \|w\|_{1} + \frac{1} {2}\|w - z\|_{2}^{2}\right \}\; =\; shrink_{ 1}(z,\alpha ).}$$
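In code, both operators are short (a minimal NumPy sketch of ours; shrink2 repeats the helper used in the split Bregman sketch above):

```python
import numpy as np

def shrink2(z, alpha):
    """Proximal map of alpha*||.||_2: isotropic (vector) shrinkage."""
    nz = np.linalg.norm(z)
    return np.zeros_like(z) if nz == 0 else (z / nz) * max(nz - alpha, 0.0)

def shrink1(z, alpha):
    """Proximal map of alpha*||.||_1: componentwise soft thresholding,
    equivalent to the three-case definition above."""
    return np.sign(z) * np.maximum(np.abs(z) - alpha, 0.0)
```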

To solve (12.7) for a more general convex, positively 1-homogeneous H, or to find the proximal map of an f of that type, we use the fact that \(H^{{\ast}}\) is the indicator function of a closed convex set \(C \subset \mathbb{R}^{n}\):

$$\displaystyle{H^{{\ast}} = I_{C}.}$$

By using the Moreau identity [7], we see that the proximal map of H can be obtained by projecting onto C. To compute this projection, we merely solve the eikonal equation with level set initial data for C via split Bregman, as above in (12.6), (12.7), (12.8), with \(H(d) =\| d\|_{2}\); this is easy using the shrink2 operator. We then use (12.2) to obtain the projection and repeat the entire iteration.
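As a sanity check on this reduction (our own sketch), the Moreau identity for convex, positively 1-homogeneous H gives \(\mathrm{prox}_{\alpha H}(z) = z -\varPi _{\alpha C}(z)\), where \(\varPi _{\alpha C}\) denotes the Euclidean projection onto αC. For \(H =\| \cdot \|_{2}\), C is the closed unit ℓ² ball, the projection is explicit, and the identity recovers shrink2:

```python
import numpy as np

def prox_via_moreau(z, alpha, project_C):
    """prox of alpha*H for H convex, positively 1-homogeneous with
    H* = I_C: the Moreau identity gives prox_{alpha H}(z) = z - Proj_{alpha C}(z),
    and Proj_{alpha C}(z) = alpha * Proj_C(z / alpha).
    `project_C` projects onto C (here supplied in closed form; in general
    it would be computed by the eikonal/split Bregman procedure plus (12.2))."""
    return z - alpha * project_C(z / alpha)

# For H = ||.||_2, C is the closed unit l2 ball: explicit projection.
proj_ball = lambda z: z / max(np.linalg.norm(z), 1.0)

z, alpha = np.array([3.0, -4.0]), 2.0    # ||z||_2 = 5
out = prox_via_moreau(z, alpha, proj_ball)
expected = (z / 5.0) * (5.0 - alpha)     # shrink2(z, alpha)
assert np.allclose(out, expected)
```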

3 Numerical Experiments

Numerical experiments on an Intel Core i5-5300U laptop running at 2.3 GHz are now presented. We consider diagonal matrices D defined by \(D_{ii} = 1 + \frac{1+i} {n}\) for i = 1, …, n, and matrices A defined by \(A_{ii} = 2\) for i = 1, …, n and \(A_{ij} = 1\) for i ≠ j. Table 12.1 presents the average time (in seconds) to evaluate the solution over 1,000,000 samples (x, t) uniformly drawn in [−10, 10]^n × [0, 10]. The evaluation is remarkably rapid: \(10^{-6}\) to \(10^{-8}\) seconds per function call on a standard laptop. Figure 12.1 depicts 2-dimensional slices at different times for the HJ equation with the weighted ℓ¹ Hamiltonian \(H =\| D\,\cdot \|_{1}\), initial data \(J = \frac{1} {2}\| \cdot \|_{2}^{2}\), and n = 8.

Fig. 12.1 Evaluation of the solution \(\varphi ((x_{1},x_{2},0,0,0,0,0,0),t)\) of the HJ-PDE with initial data \(J = \frac{1} {2}\| \cdot \|_{2}^{2}\) and Hamiltonian \(H =\| D\,\cdot \|_{1}\) for \((x_{1},x_{2}) \in [-20,20]^{2}\) at different times t. Plots for t = 0, 3, 5, and 8 are depicted in (a), (b), (c), and (d), respectively. Level lines at multiples of 10 are superimposed on the plots.

Table 12.1 Average time (in seconds) per call for evaluating the solution of the HJ-PDE with initial data \(J = \frac{1} {2}\| \cdot \|_{2}^{2}\), for several Hamiltonians and various dimensions n.

4 Summary and Future Work

We have derived a very fast and totally parallelizable method to solve a large class of high-dimensional, state-independent HJ initial value problems. We do this using the Hopf formula and convex optimization via splitting, which overcomes the “curse of dimensionality”. This is done without the use of grids or numerical approximations, yielding not only the solution but also its gradient.

We also, as a step in this procedure, very rapidly compute projections from a point in \(\mathbb{R}^{n}\), n large, onto a fairly arbitrary compact set.

In future work, we expect to extend this set of ideas to nonconvex Hamiltonians, including some that arise in differential games, to new linear programming algorithms, to fast methods for redistancing in level set methods and, hopefully, to a wide class of state dependent Hamiltonians.