Abstract
This is a brief discussion of the properties of solutions to several non-linear elliptic equations involving diffusive processes of non-local nature. These equation arise in several contexts: from continuum mechanics and phase transition, from population dynamics, from optimal control and game theory. The equations coming from continuum mechanics exhibit a variational structure and a theory parallel to the De Giorgi–Nash–Moser was necessary to show existence of regular solutions. Population dynamics suggests “porous media like equations” with a non-local pressure, and from optimal control we obtain fully non-linear equations that require methods of the type of the Krylov–Safonov–Evans theory. Finally, we discuss some non-local p and infinite Laplacian models coming from game theory.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
1 Introduction
We are interested in integral diffusion equations:
where the operator L takes the form
for some positive kernel (or measure) K(x,y) (or K x (y)).
We call the equation a diffusion equation because solutions try to revert to some sort of “integral average” of u.
Indeed, if u(x 0) is “smaller than” its surrounding values, as weighted by K(x,y), u(x 0,t) will tend to increase, if “bigger”, to decrease (i.e., u t >0 or u t <0).
We may think of the heat equation as an infinitesimal version of this process.
Indeed the Laplacian, Δu, is the limit of
In fact, if
for φ a probability density (a mollifier), the corresponding solutions u ε should converge to a solution u 0(x,t) of
where a ij are the second moments of φ.
These types of equations (and the associated non-linear ones that we will discuss shortly) have roots in different phenomena and, as their second order counterpart, they naturally divide between those with variational structure and those coming from probabilistic considerations.
A familiar example for the first case is prescribing Neumann boundary data (for instance zero). Insulating a wall implies some temperature diffusivity along the surface, expressed by inverting the Dirichlet to Neumann map. A non-local related equation is the quasi-geostrophic equation that describes the evolution of temperature on the ocean surface, due to the (one-side) atmospheric conditions.
On the probabilistic side let us recall the Levy–Khintchine formula. In an informal “black-box” approach suppose we can observe the transition probability of a distribution of particles, for any sequence of times t k , and we realize that the transition from t 1 to t 2 only depends on t 1 and t 2, in fact on t 2−t 1.
Then, for any k, we can write the transition probability from t 1 to t 2 as the composition (convolution) k times of the transition from t=0 to \(t=\frac{1}{k} (t_{2}-t_{1})\). This suggests the possibility, as δt goes to zero, of describing the process through a “heat equation”—as a properly scaled infinitesimal limit of the δt transition.
This is what the Levy–Khintchine formula asserts: That the probability density evolves according to a heat equation
consisting of a continuous part
a symmetric jump process
+⋯ an asymmetric part that we will discuss later
Here, it is required to make sense for a C 2, bounded u, i.e.,
and
Between divergence and non-divergence lie the equations invariant under translations, i.e., where the kernel \(K(x,y) = \widetilde{K}(x-y)\). In this case, the equation can be thought of as having both divergence and non-divergence structure and also, being of convolution type, they enjoy the advantage of allowing for methods of harmonic analysis.
That is the case, for instance, with the family of fractional Laplacians: For 0<α<1
The constant C(α)∼(1−α) to recuperate the standard Δu, as α goes to one.
Notice that the range of α’s is such that it makes these kernels satisfy the Levy–Khintchine condition to be an infinite divisible distribution.
In fact the fractional Laplacians are also called “stable processes”.
On the other hand, the fractional Laplacians is what we obtain as an Euler–Lagrange equation for the energy integral corresponding to the W α,2 (the L 2 norm of the “alpha” fractional derivatives of u):
And finally, convolution with the Δ α kernel corresponds after Fourier transform to the multiplier
In that sense, the fractional Laplacian serves as a basic model for the three classical methods of second order PDE’s.
-
Superposition (potential theory, harmonic analysis)
-
Energy method (calculus of variations, DeGiorgi–Nash–Moser)
-
Probabilistic (optimal control-Krylov–Safonov)
Since we are interested in regularity properties of solutions to such an “elliptic” or “parabolic” equation, the kernel K(x) should be singular at the origin to force u to be somewhat “special” in order to satisfy the equation: To know that after convolution with a smooth function u is smooth does not reflect so much on the regularity of u, at least at first glance.
In that sense, the fractional Laplacians provide a natural comparison scale of “order of differentiation” of the operator to help us develop a general setting.
2 Divergence Structure
Equations with “divergence” structure arise from continuum mechanics and calculus of variations.
A rough characterization would be that the kernel K(x,y) is symmetric.
That makes the equation
the Euler–Lagrange equation of
and thus puts the problem in the framework of weak variational solutions test functions methods, etc.:
For a test function φ(x), the bilinear form
depending on the problem at hand, must be zero, or prescribed or equal to
in the parabolic setting.
The general “non-linear calculus of variations” framework becomes then the study of the minimizers of the form:
with ϕ convex (quadratic for “uniform” fractional ellipticity).
The first, natural problem to study is that of regularity of local minimizers (the equivalent of the DeGiorgi solution of the Hilbert problem and the development of the DeGiorgi–Nash–Moser theory of regularity of solutions). Let us recall that in the second order case, the theory proceeds as follows:
A local minimizer, u, of the functional
satisfies the Euler–Lagrange equation
or, in non-divergence form:
If we would known that ∇u is continuous Shauder estimates would allow us to bootstrap the solution to higher regularity. In turn, first derivatives D e u=w satisfy
But at this point we only know that ∇u is in L 2 and, from the uniform convexity of F, that the matrix F ij (⋅)=A ij (x) is strictly positive:
But then, the celebrated DeGiorgi theorem establishes that solutions of an elliptic equation
with no regularity assumption on a ij are Hölder continuous.
In particular, ∇u is Hölder continuous and higher regularity follows.
In this context, with Chan and Vasseur [9], we develop the DeGiorgi regularity theory for the parabolic case:
Let u(x,t) be the solution of
with “ϕ symmetric and quadratic” (i.e., λ≤ϕ″≤Λ) and
Then u becomes instantaneously smooth.
As in the second order case, the central step is to prove that first derivatives, w=D x u, satisfy a “rough equation” and are Hölder continuous:
(see also related articles by Barlow, Bass, Chen, Kassman, and of Komatsu [1, 3, 14, 15]).
The study of non-local, non-linear equations with “variational structure” has several motivations:
-
What we could call surface diffusion: the quasigeostrophic equation that models ocean atmosphere interaction, the theory of semi-permeable membranes, planar fracture dynamics (see [5, 11]).
-
Problems in statistical mechanics, like phase transition problems with long range interactions (as opposed to neighbor to neighbor). See for instance the work of Giacomin–Lebowitz and of Presutti [12].
-
Material sciences, for instances polymers where many scales interact.
-
Image processing, see for instance the work of Gilboa and Osher [13].
3 Non-divergence Equations
“Non-divergence” equations arise instead from probability (Levy processes), optimal control and game theory.
Suppose for instance that particles generate at some point x 0 of a domain Ω and bounce randomly until they exit Ω.
At that moment they release an amount of energy u(y) depending on the point y where they land.
In principle to find out the expectation for future released energy u(x 0) when starting at x 0, we should just solve Lu=0 in Ω with external data u(y) and the diffusion associated to the process.
In the case of optimal control we are able to “design” the jump process (the media) to maximize the expected value u(x 0).
That is: We have a family of possible diffusion processes given by the kernels
and at each x we want to chose the optimal jump distribution
In order to achieve that we have to find a solution u 0 of the equation
with exterior data u(y).
Indeed, this equation means that “u 0 is a supersolution of all the admissible operators, and at each point is the solution of at least one of the L α .” Therefore on one hand it is better than any choice and at the same time is an admissible distribution.
In the case of second order equations, the central result of the theory is the Evans–Krylov theorem:
In that case, the family of operators are second order
the non-linear equation is
and the Evans–Krylov theorem asserts that solutions to F(D 2 u)=0 are C 2,β and thus classical (i.e., the derivatives involved are continuous).
In collaboration with Silvestre, we reproduce their theory for the corresponding non-local equations [6–8].
If the kernels K α are all comparable to the s-Laplacian:
and they are symmetric in y (no “drift”), then solutions to F(u)=0 are C 2s+β that makes the corresponding integrals convergent and the solutions “classical”.
One of the main features of the work is the proof of a theorem equivalent to the Krylov–Safonov Harnack inequality for “bounded measurable” kernels:
If w is for every x a solution of a different equation
with K x changing discontinuously with x “bounded measurable coefficients”, w is still Hölder continuous.
4 Drifts
What I want to discuss now is the relation, or interaction between diffusion and drift in the optimal control context:
For second order equations, when addressing gradient dependence of an equation, we have two different issues. On one hand semilinear equations, say, for instance
with an associated idea of drift or transport and on the other quasilinear equations:
for instance those coming from the calculus of variations.
Semilinear equations with fractional diffusions arise for instance in the case of the quasigeostrophic equation:
and assuming nice dependence on u, there is here a clear competition between diffusion and transport that becomes critical where s=1/2.
But there is a second, implicit form of drift in the asymmetry of the kernel for a Levy process:
The most general “heat equation” for an infinite divisible distribution, leaving aside the continuous part and the standard drift is
Note that the antisymmetric part has in it an extra cancellation to ensure that the process does not drift to infinity.
For quasilinear equations, one equivalent framework to the second order case is, of course, through the calculus of variations.
For instance, one defines the p- (s-Laplacian), i.e., s-derivatives in L p, as the Euler–Lagrange equation of the L p norm of the s-derivatives of a function
This p-fractional Laplacian is naturally studied through “energy” and “test functions methods” (see [10]). But the p-Laplacian also can also be written in non-divergence form as
where u nn denotes the second derivative in the direction of the gradient of u.
And this has a game-theoretical interpretation (Peres–Sheffield [16]): Let us go back to the example of expected energy release u(x) of the random particle.
Suppose that as before the random process has the (“almost continuous”) diffusion equation (δ t ∼ε 2)
i.e., the particle at position x at time t, jumps, by time t+ε 2, to a position epsilon-away, according to the radially symmetric probability density \(\varphi_{\varepsilon}(y) = \frac{1}{\varepsilon ^{n}} \varphi(y/\varepsilon )\).
Then, as discussed before, when ε goes to zero, we would get the standard “heat” equation.
But, assume now that competing players P 1,P 2 are able to impose on the jump an epsilon-drift in their preferred direction, randomly in time, trying to maximize, respectively minimize, the expected value u.
That is, depending on which player has the input, the particle at x will jump to the position (x+y), with probability density (τ i =τ 1 or τ 2, a unit vector)
As a consequence, the jump probability density φ ε has drifted in the direction τ 1 or τ 2 depending on which player imposed the drift. Here λ is the intensity of the drift and the expected value u will then satisfy the Isaac’s equation
The natural choice for τ 2 is to push the drift in the direction of ∇u, and for τ 1 in that of −∇u. Therefore, if both players use the optimal strategy, the combination of
will shift the mass of φ ε symmetrically in the directions of ±∇u, increasing the second moment in that direction so that the limiting equation, as epsilon goes to zero becomes
i.e., the non-divergence form of the fractional Laplacian.
A similar argument can be made for jump processes:
In work with Bjorland and Figalli, we have studied existence and regularity properties of this “tug of war” game for jump processes. Let me start by pointing out that there are different ways to “influence the drift” that give rise to structurally different mathematical problems. A possible one is for shifted kernels:
That is, for kernels of the form
with K 0(y) a symmetric kernel of the size of a fractional Laplacian, and A(y 1) a smooth odd function, |A(y 1)|≤1−δ.
That is, we look at the Isaac’s equation:
(i.e., each player adds the implicit drift A(y⋅ν) in his optimal direction ν 1 or ν 2).
Another possible way is that the player chooses a direction and it is this direction that suffers a random deviation (an “unsteady hand”). In that case the corresponding basic kernel \(K_{e_{1}}(y)\) should be of the form
where η may vanish outside a neighborhood of e 1.
The final operator is as before, the inf sup over all rotations of \(K_{e_{1}}(y)\).
In both cases, it follows from the non-local Harnack inequality and ABP theorem [6] that solutions are C α for some α.
In fact, let me take this opportunity to discuss informally the non-local ABP theorem, that is central to many of the developments for non-local optimal control.
The local version of the ABP theorem needed for the Harnack inequality (as presented in [4]) is the following:
Theorem 1
u≥0 in B 1, Lu=a ij (x)D ij u≤0, u(0)≤1. Then, ∃ε 0, such that |{u<2}|≥ε 0(λ,Λ)>0.
Proof
We add to u a negative paraboloid in B 1:
and construct its convex envelope in B 1:
We will estimate |{w=Γ(w)}| by below.
Indeed, in this set w is negative and
For this purpose, we use the classical A-B-P argument, i.e., we estimate the volume of the image of the gradient map: ∇Γ:B 1→ℝn. To do that, we lift from minus infinity a plane with generic slope v:
If ℓ(x)=t+〈v,x〉 with |v|≤h/3, for some value t 0, ℓ is a supporting plane of Γ(w −) at some interior point x 0∈{Γ=w}. Thus “the image of {w −=Γ(w)} by the map: x→∇Γ(x) contains the ball of “v’s” of radius \(\frac{h}{3} = \frac{\sup w^{-}}{3} \ge \frac{1}{3}\), i.e.,
We now “change variables”, from v to x
But D∇Γ=D 2 Γ, a non-negative matrix, since Γ is convex, so
with μ the largest eigenvalue of D 2 w (at a contact point w=Γ(w), D 2 w≥D 2 Γ≥0). Since all max eigenvalues of D 2 w≥0,
We then get
This is “almost” the proof of the ABP version we need for the Harnack inequality.
What we are missing is the localization property:
i.e., we need the extra fact that we can get the contact set to be inside of any cube of size 1/4 close to the origin in order to make a C-Z decomposition. For that, all we need is to change h by an h′ with: Lh′≤0 outside Q 1/4 (so that Lw≤0 outside Q 1/4 and contact cannot occur), h′(0)≤−2 so infw≤−1, and Lh′ still bounded above, so Lw is bounded above. □
5 The Corresponding ABP for Integral Diffusions [6]
As before, we assume u≥0 in B 3, Lu≤0, u(0)≤1. Now
For simplicity we will truncate K x :
and restrict ourselves to x∈B 1(0), so L is well defined.
We want to show:
We proceed as before. Consider w=u+a, with a=2(|x|2−1)∧0 and construct the convex envelope Γ(w −) in B 3
As before
The problem is how to relate |{w=Γ(w)}| with its image (no good change of variables formula).
Consider a point x 0 in w=Γ(w). We have the following local geometry
We are going to prove the following family of steps. Consider x 0 as above.
-
(a)
For some small diadic ring \(\mathbb{R}_{k} = B_{2^{-k}}\setminus B_{2^{-(k+1)}}\) \(\tilde{w}\) “grows quadratically on average” in the sense that
$$\int_{{{R}_{k}}}{\tilde{\omega }\le {{C}_{1}}}{{({{r}_{k}})}^{2}}\,\,\,\,\,\,({{r}_{k}}={{2}^{-K}}).$$ -
(b)
Of course, this does not imply that \(\tilde{w} \le c_{1}r_{k}^{2}\), but since \(0\le\tilde{\varGamma}(\tilde{w}) \le\tilde{w}\), and \(\tilde{\varGamma}\) is convex. (a) does imply that
$$\tilde{\varGamma}|_{B_{2^{-(k+1)}}} \le C_2 2^{-2k} \quad\mbox{and}\quad\nabla\tilde{\varGamma}|_{B_{2^{-(k+2)}}} \le C_22^{-k}$$(k=k(x 0) of course).
-
(c)
In particular:
$$\operatorname{Vol}\nabla\varGamma(B\ldots) = \operatorname{Vol}\nabla\tilde{\varGamma}(B_{2^{-(k+2)}}(x_0)) \le C|B_{2^{-(k+2)}} (x_0)| .$$
We now extract a covering of {w=Γ(w)} with the family of these balls B r(x)(x) and we have
But in each \(B_{r_{j}}\), \((\cdot)\widetilde{w}\) differs from Γ by at most (r j )2 in a large portion of \(B_{r(x_{j})}\) since
We divide the integration in diadic rings around x 0
Since \(\tilde{w} (x_{0}) =0\), and \(\tilde{w}\ge0\), the integrand in all of the rings is positive and
-
(a)
We first show that if C 1=MC is a large multiple of C there is at least one ring where
$$\int_{{{\mathbb{R}}_{k}}}{\tilde{\omega }\le {{C}_{1}}r_{k}^{2}.}$$
If not
∼C 1, a contradiction.
In fact, if M is large, we can start the sum from k=k 0 and we get
still a contradiction.
Of course, \(\tilde{w}\) may still be highly oscillatory but
-
(a)
In 99% of the rings, \(\tilde{w}\le100 C_{1} r_{k}^{2}\), that is, in the original configuration w stays close to its convex envelope Γ.
-
(b)
Further, since \(0\le\tilde{\varGamma}\le\tilde{w}\)
$$\int_{{{\mathbb{R}}_{k}}}{\tilde{\Gamma }\le {{C}_{\text{1}}}r_{k}^{2}}\cdot $$But \(\tilde{\varGamma}\) is convex, so this implies a bound \(\tilde{\varGamma}\le C_{2} r_{k}^{2}\) in B k+1 and ∇Γ≤r k in B k+2.
Let \(t_{0} = r_{k}^{2}\)
In turn, this implies that \(\sup_{B_{k+2}} \nabla\tilde{\varGamma}< C\ r_{k}^{2}\).
Finally, it is a general fact that
If Γ is convex in B r ,
A covering lemma completes the proof.
This proof, of course, requires in principle that the kernels be symmetric (some asymmetry is “tolerated” by the fact that the gradient of Γ is bounded, as in the second order case).
But the nature of the “game” symmetrizes the kernel:
From the “inf sup” property, for any x 0, there exists a direction ν + so that
for any μ (in particular −ν +) and vice versa, a ν − so that
for any μ, and this property is all that’s needed.
Going back to the two possible “integral drifts”, in the first case it is also possible to prove that solutions are in fact C 2s+σ, i.e., the integrals converge and the solution is classical (see [7]).
This is because the nature of the drift is such that, as the problem is rescaled the perturbation term A(x 1) drifts to infinity.
6 Non-local Infinite Laplacian
Finally, I would like to discuss briefly the “tug of war” non-local “infinite Laplacian”.
The Infinite Laplacian appears in the case when there is no diffusion left, i.e., when it is just the players taking random turns in choosing the direction of the drift (tug of war).
For the infinitesimal case, when the length of the jump is predetermined you formally get “u nn =0”, n the direction of the gradient (Peres–Schramm–Sheffield–Wilson [17]).
Also, in collaboration with Bjorland and Figalli [2], we consider the case in which the jump of the particle follows the distribution of the s-Laplacian
(That is, each player pulls in the directions ν 1 and ν 2.)
Formally, for s>1/2, the direction of the jump is given by ∇u: Since the integrals diverge, each players is “forced” to take that choice.
We prove existence, uniqueness and (some) regularity, under a monotone geometry, for s>1/2.
We assume that the domain Ω is “strip like”, i.e., between bounded Lipschitz graphs with uniform separation and pay off is respectively 1 and −1. We show that there exists a unique viscosity solution (the least supersolution and larger subsolution coincide), and it is C 2s−1.
(|x|2s−1 are the “cones” for this problem, note that for s=1/2 the theory breaks down.)
We end up with some comments:
-
The case s<1/2 seems very interesting since “∇u” does not fix the direction of the jump any more, and players will choose to jump in “non-opposite directions” most of the time.
-
Instead of prescribing boundary values, it seems more natural to prescribe upper and lower obstacles where it would be optimal for one of the players to stop playing (execute an option).
We can prove in this case similar results as to the boundary value problem discussed before [2].
References
Barlow, M.T., Bass, R.F., Chen, Zh.Q., Kassmann, M.: Non-local Dirichlet forms and symmetric jump processes. Trans. Am. Math. Soc. 361(4), 1963–1999 (2009)
Bjorland, C., Caffarelli, L., Figalli, A.: Non-local tug-of-war and the infinity fractional Laplacian. arXiv:1011.1966
Bass, R.F., Kassmann, M.: Harnack inequalities for non-local operators of variable order. Trans. Am. Math. Soc. 357(2), 837–850 (2005)
Caffarelli, L., Cabre, X.: Fully Nonlinear Elliptic Equations. Am. Math. Soc. Colloq. Publ., vol. 43. Am. Math. Soc., Providence (1995)
Caffarelli, L., Vasseur, A.: Drift diffusion equations with fractional diffusion and the quasi-geostrophic equation. Ann. Math. (2011, in press)
Caffarelli, L.A., Silvestre, L.: Regularity theory for fully nonlinear integro-differential equations. Commun. Pure Appl. Math. 62(5), 597–638 (2009)
Caffarelli, L.A., Silvestre, L.: Regularity results for nonlocal equations by approximation. Arch. Ration. Mech. Anal. (2011, in press). arXiv:0902.4030
Caffarelli, L.A., Silvestre, L.: The Evans-Krylov theorem for non local fully non linear equations. Ann. Math. (2011, in press). arXiv:0905.1339
Caffarelli, L.A., Chan, C.H., Vasseur, A.: Regularity theory for nonlinear integral operators. arXiv:1003.1699
Chambolle, A., Lindgren, E., Monneau, R.: The Hölder infinite Laplacian and Hölder extensions. Preprint (2010)
Duvant, G., Lions, J.-L.: Les inéquations en mécanique et en physique. Travaux et Recherches Mathématiques, vol. 21. Dunod, Paris (1972) (in French)
Giacomin, G., Lebowitz, J.L., Presutti, E.: Deterministic and stochastic hydrodynamic equations arising from simple microscopic model systems. In: Stochastic Partial Differential Equations: Six Perspectives. Math. Surveys Monogr., vol. 64, pp. 107–152. Am. Math. Soc., Providence (1999)
Gilboa, G., Osher, S.: Nonlocal operators with applications to image processing. Multiscale Model. Simul. 7(3), 1005–1028 (2008)
Kassmann, M.: A priori estimates for integro-differential operators with measurable kernels. Calc. Var. Partial Differ. Equ. 34(1), 1–21 (2009)
Komatsu, T.: Uniform estimates for fundamental solutions associated with non-local Dirichlet forms. Osaka J. Math. 32(4), 833–860 (1995)
Peres, Y., Sheffield, S.: Tug-of-war with noise: a game-theoretic view of the p-Laplacian. Duke Math. J. 145(1), 91–120 (2008)
Peres, Y., Schramm, O., Sheffield, S., Wilson, D.B.: Tug-of-war and the infinity Laplacian. J. Am. Math. Soc. 22(1), 167–210 (2009)
Acknowledgements
The author was partially supported by National Science Foundation Grant DMS-0654267.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Caffarelli, L. (2012). Non-local Diffusions, Drifts and Games. In: Holden, H., Karlsen, K. (eds) Nonlinear Partial Differential Equations. Abel Symposia, vol 7. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25361-4_3
Download citation
DOI: https://doi.org/10.1007/978-3-642-25361-4_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25360-7
Online ISBN: 978-3-642-25361-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)