Abstract
In 2010 Menon and Srinivasan published a conjecture for the statistical structure of solutions \(\rho \) to scalar conservation laws with certain Markov initial conditions, proposing a kinetic equation that should suffice to describe \(\rho (x,t)\) as a stochastic process in x with t fixed. In this article we verify an analogue of the conjecture for initial conditions which are bounded, monotone, and piecewise constant. Our argument uses a particle system representation of \(\rho (x,t)\) over \(0 \le x \le L\) for \(L > 0\), with a suitable random boundary condition at \(x = L\).
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
We are interested in the statistical properties of solutions to the Cauchy problem for the scalar conservation law
where \(\xi = \xi (x)\) is a stochastic process. In the special case
(1.1) is the well-known inviscid Burgers’ equation, which has often been considered with random initial data. Burgers himself, in his investigation of turbulence [8], was already moving in this direction, though it would be some time before problems in this area were addressed rigorously.
Several papers, including [4, 5, 9–11, 27], have developed a theory showing a kind of integrability for the evolution of the law of \(\rho (\cdot ,t)\) for certain initial data. This article is concerned with pushing stochastic integrability results beyond the Burgers setting to general scalar conservation laws, an effort begun in earnest by Menon and Srinivasan [22, 25]. Before stating our main result, we first survey the major developments in this subject.
This work is adapted from [19].
1.1 Background
The Burgers case \(H(p) = - p^2/2\) has seen extensive interest. Recall that if
then u(x, t) formally satisfies
and is determined by the Hopf–Lax formula [17]:
The backward Lagrangian y(x, t) is the rightmost minimizer in (1.5), and it is standard [13] that this determines both the viscosity solution u(x, t) to (1.4) and the entropy solution to \(\rho _t + \rho \rho _x = 0\),
Groeneboom, in a paper [15] concerned not with PDE but concave majorantsFootnote 1 and isotonic estimators, determined the statistics of y(x, t) for the case where \(\xi (x)\) is white noise, which we understand to mean that the initial condition for u(x, t) in (1.4) is a standard Brownian motion B(x). Among the results of [15] is a density for
expressed in terms of the Airy function. It has been observed (see e.g. [21]) that the Airy function seems to arise in a surprising number of seemingly unrelated stochastic problems, and it is notable that Burgers’ equation falls in this class.
Several key papers appeared in the 1990s; we rely on the introduction of Chabanol and Duchon’s 2004 paper [11], which recounts some of this history, and discuss those portions most relevant for our present purposes. In 1992, Sinai [27] explicitly connected Burgers’ equation with white noise initial data to convex minorants of Brownian motion. Three years later Avellaneda and Weinan [3] showed for the same initial data that the solution \(\rho (x,t)\) is a Markov process in x for each fixed \(t > 0\).
Carraro and Duchon’s 1994 paper [9] defined a notion of statistical solution to Burgers’ equation. This is a time-indexed family of probability measures \(\nu (t, \cdot )\) on a function space, and the definition of solution is stated in terms of the characteristic functional
for test functions \(\phi \). Namely, \(\hat{\nu }(t,\phi )\) must satisfy for each \(\phi \) a differential equation obtained by formal differentiation under the assumption that \(\rho \) distributed according to \(\nu (t,d\rho )\) solves Burgers’ equation. This statistical solution approach was further developed in 1998 by the same authors [10] and by Chabanol and Duchon [11]. It does have a drawback: given a (random) entropy solution \(\rho (x,t)\) to the inviscid Burgers’ equation, the law of \(\rho (\cdot , t)\) is a statistical solution, but it is not clear that a statistical solution yields a entropy solution, and at least one example is known [4] when these notions differ. Nonetheless, [9, 10] realized that it was natural to consider Lévy process initial data, which set the stage for the next development.
In 1998, Bertoin [4] proved a remarkable closure theorem for Lévy initial data. We quote this, with adjustments to match our notation.
Theorem 1.1
([4, Theorem 2]) Consider Burgers’ equation with initial data \(\xi (x)\) which is a Lévy process without positive jumps for \(x \ge 0\), and \(\xi (x) = 0\) for \(x < 0\). Assume that the expected value of \(\xi (1)\) is positive, \(\mathbb {E}\xi (1) \ge 0\). Then, for each fixed \(t > 0\), the backward Lagrangian y(x, t) has the property that
is independent of y(0, t) and is in the parameter x a subordinator, i.e. a nondecreasing Lévy process. Its distribution is the same as that of the first passage process
Further, denoting by \(\psi (s)\) and \(\varTheta (t,s)\) \((s \ge 0)\) the Laplace exponents of \(\xi (x)\) and \(y(x,t) - y(0,t)\),
we have the functional identity
Remark 1.1
The requirement \(\mathbb {E}\xi (1) \ge 0\) can be relaxed, with minor modifications to the theorem, in light of the following elementary fact. Suppose that \(\xi ^1(x)\) and \(\xi ^2(x)\) are two different initial conditions for Burgers’ equation, which are related by \(\xi ^2(x) = \xi ^1(x) + cx\). It is easily checked using (1.5) that the corresponding solutions \(\rho ^1(x,t)\) and \(\rho ^2(x,t)\) are related for \(t > 0\) by
This observation is found in a paper of Menon and Pego [24], but (as this is elementary) it may have been known previously. Using (1.14) we can adjust a statistical description for a case where \(\mathbb {E}\xi (1) \ge 0\) to cover the case of a Lévy process with general mean drift.
We find Theorem 1.1 remarkable for several reasons. First, in light of (1.6), it follows immediately that the solution \(\rho (x,t) - \rho (0,t)\) is for each fixed t a Lévy process in the parameter x, and we have an example of an infinite-dimensional, nonlinear dynamical system (the PDE, Burgers’ equation) which preserves the independence and homogeneity properties of its random initial configuration. Second, the distributional characterization of y(x, t) is that of a first passage process, where the definition of y(x, t) following (1.5) is that of a last passage process. Third, (1.13) can be used to show [23] that if \(\psi (t,q)\) is the Laplace exponent of \(\rho (x,t) - \rho (0,t)\), then
for \(t > 0\) and \(q \in \mathbb {C}\) with nonnegative real part. This shows for entropy solutions what had previously been observed by Carraro and Duchon for statistical solutions [10], namely that the Laplace exponent (1.12) evolves according to Burgers’ equation!
In 2007 Menon and Pego [24] used the Lévy–Khintchine representation for the Laplace exponent (1.12) and observed that the evolution according to Burgers’ equation in (1.15) corresponds to a Smoluchowski coagulation equation [2, 28], with additive collision kernel, for the jump measure of the Lévy process \(y(\cdot , t)\). The jumps of \(y(\cdot , t)\) correspond to shocks in the solution \(\rho (\cdot , t)\). The relative velocity of successive shocks can be written as a sum of two functions, one depending on the positions of the shocks and the other proportional to the sum of the sizes of the jumps in \(y(\cdot , t)\). Regarding the sizes of the jumps as the usual masses in the Smoluchowski equation, it is plausible that Smoluchowski equation with additive kernel should be relevant, and [24] provides the details that verify this.
It is natural to wonder whether this evolution through Markov processes with simple statistical descriptions is a miracle [23] confined to the Burgers–Lévy case, or an instance of a more general phenomenon. However, extending the results of Bertoin [4] beyond the Burgers case \(H(p) = -\frac{1}{2} p^2\) remains a challenge. A different particular case, corresponding to \(L(q) = (-H)^*(q) = |q|\), is a problem of determining Lipschitz minorants, and has been investigated by Abramson and Evans [1]. From the PDE perspective this is not as natural, since \((-H)^*(q) = |q|\) corresponds to
i.e. \(-H(p)\) takes the value 0 on \([-1,+1]\) and is equal to \(+\infty \) elsewhere. So [1], while very interesting from a stochastic processes perspective, has a specialized structure which is rather different from those cases we will consider.
The biggest step toward understanding the problem for a wide class of H is found in a 2010 paper of Menon and Srinivasan [25]. Here it is shown that when the initial condition \(\xi \) is a spectrally negative Footnote 2 strong Markov process, the backward Lagrangian process \(y(\cdot , t)\) and the solution \(\rho (\cdot , t)\) remain Markov for fixed \(t > 0\), the latter again being spectrally negative. The argument is adapted from that of [4] and both [4, 25] use the notion of splitting times (due to Getoor [14]) to verify the Markov property according to its bare definition. In the Burgers–Lévy case, the independence and homogeneity of the increments can be shown to survive, from which additional regularity is immediate using standard results about Lévy processes [18]. As [25] points out, without these properties it is not clear whether a Feller process initial condition leads to a Feller process in x at later times. Nonetheless, [25] presents a very interesting conjecture for the evolution of the generator of \(\rho (\cdot , t)\), which has a remarkably nice form and follows from multiple (nonrigorous, but persuasive) calculations.
We now give a partial statement of this conjecture. The generator \(\mathscr {A}\) of a stationary, spectrally negative Feller process acts on test functions \(J \in C^\infty _c(\mathbb {R})\) by
where b(y) characterizes the drift and \(f(y, \cdot )\) describes the law of the jumps. If we allow b and f to depend on t, we have a family of generators. The conjecture of [25] is that the evolution of the generator \(\mathscr {A}\) for \(\rho (\cdot , t)\) is given by the Lax equation
for \(\mathscr {B}\) which acts on test functions J by
An equivalent form of the conjecture (1.18) involves a kinetic equation for f. The key result in the present article verifies that this kinetic equation holds in the special case we will consider. Before we state this, let us establish our working notation.
1.2 Notation
Here we collect some of the various notation used later in the article.
Write \(H[p_1,\ldots ,p_n]\) for the nth divided difference of H through \(p_1,\ldots ,p_n\). For each n this function is symmetric in its arguments, and given by the following formulas in the cases \(n = 2\) and \(n = 3\) where \(p_1,p_2,p_3\) are distinct:
The definition for general n and standard properties can be found in several numerical analysis texts, including [16].
We write \(\delta _x\) for the usual point mass assigning unit mass to x and zero elsewhere.
For \(C > 0\) and n a positive integer, write
and \(\overline{\varDelta ^C_n}\) for the closure of this set in \(\mathbb {R}^n\). We write \(\partial \varDelta ^C_n\) for the boundary of the simplex, and \(\partial _i \varDelta ^C_n\) for its various faces:
Write \(\mathscr {M}\) for the set of finite, regular (signed) measures on [0, P], which is a Banach space when equipped with the total variation norm \(\Vert \cdot \Vert _{\textsc {tv}}\). Call its nonnegative subset \(\mathscr {M}_+\).
Let \(\mathscr {K}\) denote the set of bounded signed kernels from [0, P] to [0, P], i.e. the set of \(k : [0,P] \mapsto \mathscr {M}\) which are measurable when [0, P] is endowed with its Borel \(\sigma \)-algebra and \(\mathscr {M}\) is endowed with the \(\sigma \)-algebra generated by evaluation on Borel subsets of [0, P], and which satisfy
Observe that \(\mathscr {K}\) is a Banach space; completeness holds since a Cauchy sequence \(k_n\) has \(k_n(\rho _-,\cdot )\) Cauchy in total variation for each \(\rho _-\), and we obtain a pointwise limit \(k(\rho _-,\cdot )\). Measurability of \(k(\cdot , A)\) for each Borel \(A \subseteq [0,P]\) then holds since this is a real-valued pointwise limit of \(k_n(\cdot , A)\). Write \(\mathscr {K}_+\) for the subset of \(\mathscr {K}\) with range contained in \(\mathscr {M}_+\).
1.3 Main result
In this section we provide a statistical description of solutions to the scalar conservation law when the initial condition is an increasing, pure-jump Markov process given by a rate kernel g. For this we require some assumptions on the rate kernel g and the Hamiltonian H.
Assumption 1.1
The initial condition \(\xi = \xi (x)\) is a pure-jump Markov process starting at \(\xi (0) = 0\) and evolving for \(x > 0\) according to a rate kernel \(g(\rho _-,d\rho _+)\). We assume that for some constant \(P > 0\) the kernel g is supported on
and has total rate which is constant in \(\rho _-\):
for all \(0 \le \rho _- \le P\). In particular, \(g \in \mathscr {K}_+\).
Assumption 1.2
The Hamiltonian function \(H : [0,P] \rightarrow \mathbb {R}\) is smooth, convex, has nonnegative right-derivative at \(p = 0\) and noninfinite left-derivative at \(p = P\).
We provide for \(x \ge 0\) a statistical description consisting of a one-dimensional marginal at \(x = 0\) and a rate kernel generating the rest of the path. The evolution of the rate kernel is given by the following kinetic equation, and the evolution of the marginal will be described in terms of the solution to the kinetic equation.
Definition 1.3
We say that a continuous mapping \(f : [0,\infty ) \rightarrow \mathscr {K}_+\) is a solution of the kinetic equation
where
provided that \(\theta \mapsto \mathscr {L}^{\kappa } f(\theta , \cdot , \cdot ) \in \mathscr {K}\) is Bochner-integrable and
for all \(0 \le s \le t < \infty \).
Definition 1.4
We say that a continuous mapping \(\ell : [0,\infty ) \rightarrow \mathscr {M}_+\) is a solution of the marginal equation
where
provided that \(\theta \mapsto \mathscr {L}^0 \ell (\theta , \cdot ) \in \mathscr {M}\) is Bochner-integrable and
for all \(0 \le s \le t < \infty \).
Theorem 1.2
Suppose Assumptions 1.1 and 1.2 hold. Then the kinetic and marginal equations have unique solutions, in the sense of Definitions 1.3 and 1.4. Furthermore, the total integrals are conserved:
for all \(t \ge 0\) and all \(0 \le \rho _- \le P\). Finally, if
for all \(0 \le \rho _- < P\), then \(f(t,\cdot ,\cdot )\) has the same property for \(t > 0\).
The kernels described by the Theorem 1.2 are precisely what we need to describe the statistics of the solution \(\rho \), which brings us to our main result:
Theorem 1.3
When Assumptions 1.1 and 1.2 hold, the solution \(\rho \) to
for each fixed \(t > 0\) has \(x = 0\) marginal given by \(\ell (t,d\rho _0)\) and for \(0 < x < \infty \) evolves according to rate kernel \(f(t,\rho _-,d\rho _+)\).
Remark 1.2
Theorems 1.2 and 1.3 establish rigorously the conjectured [25] evolution according to the Lax pair (1.18), within the present hypotheses. See [25, Section 2.7] for a calculation showing the equivalence of the kinetic and Lax pair formulations, which simplifies considerably in the present case due to the absence of drift terms. The Lax pair and integrable systems approach (in the case of finitely many states, where the generator is a triangular matrix) have been further explored by Menon [22] and in a forthcoming work by Li [20].
1.4 Organization
The remainder of this article is organized as follows. In Sect. 2 we show that Theorem 1.3 will follow from a similar statistical characterization for the solution to the PDE over \(x \in [0,L]\) with a time-dependent random boundary condition at \(x = L\). The latter we can study using a sticky particle system whose dimension is random and unbounded, but almost surely finite. Elementary arguments are used to check that our candidate for the law matches the evolution of the random initial condition according to the dynamics. Next in Sect. 3 we show existence and uniqueness of the solutions to marginal and kinetic equations of Theorem 1.2, which are needed to construct the candidate law. The concluding Sect. 4 indicates some desired extensions and similar questions for future work. To keep the main development concise, proofs of lemmas have been deferred to Appendix.
2 A random particle system
As functions of x, the solutions \(\rho (x,t)\) we consider all have the form depicted in Fig. 1. From a PDE perspective this situation is standard—a concatenation of Riemann problems for the scalar conservation law—and we can describe the solution completely in terms of a particle system. Each shock consists of some number of particles stuck together, and the particles move at constant velocities according to the Rankine–Hugoniot condition except when they collide. The collisions are totally inelastic.
The utility of this particle description is lessened, however, by the fact that the dynamics are quite infinite dimensional for our pure-jump initial condition \(\xi (x)\), \(x \in [0,+\infty )\). To have a simple description as motion at constant velocities, punctuated by occasional collisions, we might argue that on each fixed bounded interval of space we have finitely many collisions in a bounded interval of time, and piece together whatever statistical descriptions we might obtain for the solution on these various intervals. Inspired by those situations in statistical mechanics where boundary conditions become irrelevant in an infinite-volume limit, we pursue a different approach. We construct solutions to a problem on a bounded space interval [0, L], and choose a random boundary condition at \(x = L\) to obtain an exact match with the kinetic equations. The involved analysis will all pertain to the following result.
Theorem 2.1
Suppose Assumptions 1.1 and 1.2 hold. For any fixed \(L > 0\), consider the scalar conservation law
with initial condition \(\xi \) (restricted to [0, L]), open boundaryFootnote 3 at \(x = 0\), and random boundary \(\zeta \) at \(x = L\). Suppose the process \(\zeta \) has \(\zeta (0) = \xi (L)\) and evolves according to the time-dependent rate kernel \(H[\rho _,\rho _+] f(t,\rho _,d\rho _+)\) independently of \(\xi \) given \(\xi (L)\). Then for all \(t > 0\) the law of \(\rho (\cdot , t)\) is as follows:
-
(i)
the \(x = 0\) marginal is \(\ell (t,d\rho _0)\), and
-
(ii)
the rest of the path is a pure-jump process with rate kernel \(f(t,\rho _-,d\rho _+)\).
To prove our main result we can send \(L \rightarrow \infty \), applying Theorem 2.1 on each [0, L], and use bounded speed of propagation to limit the respective influences of far away particles (unbounded system) or truncation with random boundary (bounded system). The argument is quite short.
Proof
(Theorem 1.3) Fix any \(t > 0\). We will write \(\hat{\rho }(x,t)\) for the solution to (1.35) over the semi-infinite x-interval \([0,+\infty )\) with initial data \(\hat{\xi }(x)\). We take the right-continuous version of the solution. For L to be specified shortly, write \(\rho (x,t)\) for the solution and \(\xi (x)\) for the initial data corresponding to (2.1).
Fix any \(x_1,\ldots ,x_k \in [0,+\infty )\), and let \(X = \max \{x_1,\ldots ,x_k\}\). Choose \(L > X + t H'(P)\). We couple the bounded and unbounded systems by requiring
allowing the random boundary \(\zeta \) to evolve independently of \(\hat{\xi }\) given \(\hat{\xi }(L)\)
Recall [13] that the scalar conservation law has finite speed of propagation. Our solutions are bounded in [0, P], so the speed is bounded by \(H'(P)\). Since \(\hat{\rho }(x,0)\) and \(\rho (x,0)\) are a.s. equal on [0, L], we have also \(\hat{\rho }(x,t) = \rho (x,t)\) a.s. for \(x \in [0,L-tH'(P)] \supset [0,X]\). From this we deduce the distributional equality
By Theorem 2.1, the latter distribution is exactly that of a process started according to \(\ell (t,d\rho _0)\), evolving according to rate kernel \(f(t,\rho _-,d\rho _+)\). This process is terminated deterministically at \(x = L\), which does not alter finite-dimensional distributions prior to \(x = L\). Since \(\rho (x,t)\) is right-continuous and has the correct finite-dimensional distributions, the result follows. \(\square \)
2.1 The dynamics
Our work in the remainder of this section is to prove Theorem 2.1. We begin by describing precisely those particle dynamics which determine the solution to the PDE.
Figure 1 illustrates a parametrization of a nondecreasing pure-jump process on \([0,+\infty )\) with heights \(\rho _0,\rho _1,\rho _2,\ldots \) and jump locations \(x_1,x_2,\ldots \). Our sign restriction on the jumps excludes rarefaction waves, and we have only constant values separated by shocks. Going forward in time, the shocks move according to the Rankine–Hugoniot condition
until they collide. We say that each shock consists initially of one particle moving at the velocity indicated above. Then result of a collision can be characterized in two equivalent ways:
-
(i)
At the first instant when \(x_i = x_{i+1}\), the ith particle is annihilated, and the velocity of the \((i+1)\)th particle changes from \(-H[\rho _i,\rho _{i+1}]\) to
$$\begin{aligned} \dot{x}_{i+1} = - H[\rho _{i-1},\rho _{i+1}]. \end{aligned}$$(2.5)In the caseFootnote 4 where several consecutive particles collide with each other at the same instant, all but the rightmost particle is annihilated. Since we only seek a statistical description for \(x \ge 0\), we annihilate the first particle when \(x_1 = 0\), replace \(\rho _0\) with \(\rho _1\), and relabel the other particles accordingly.
-
(ii)
When \(x_{i-1} < x_i = x_{i+1} = \cdots = x_{j-1} = x_j < x_{j+1}\), the particles i through j all move with common velocity \(-H[\rho _{i-1},\rho _j]\). Following Brenier et al [6, 7] we call these a sticky particle dynamics. We can additionally take the position \(x = 0\) to be absorbing: \(\dot{x_i} = 0\) whenever \(x_i = 0\).
We adopt the first viewpoint for convenience, as this is compatible with [12] (see the text following Definition 2.2 below), but a suitable argument could be given for the second alternative as well.
Definition 2.1
For L as in Theorem 2.1, the configuration space Q for the sticky particle dynamics is
A typical configuration is \(q = (x_1,\ldots ,x_n;\rho _0,\ldots ,\rho _n) \in Q_n\) when \(n>0\), or \(q = (\rho _0) \in Q_0=\{\rho _0: \ 0\le \rho _0\le P\}\) when \(n=0\).
Definition 2.2
Our notation for the particle dynamics is as follows:
-
(i)
For \(0 \le s \le t\) and \(q \in Q\), write
$$\begin{aligned} \phi _s^t q = \phi _0^{t-s} q \end{aligned}$$(2.7)for the deterministic evolution from time s to t of the configuration q according to the annihilating particle dynamics for the PDE, without random entry dynamics at \(x = L\).
-
(ii)
Given a configuration \(q = (x_1,\ldots ,x_n,\rho _0,\ldots ,\rho _n)\) and \(\rho _+ > \rho _n\), write \(\epsilon _{\rho _+} q\) for the configuration \((x_1,\ldots ,x_n,L,\rho _0,\ldots ,\rho _n,\rho _+)\).
-
(iii)
Write \(\varPhi _s^t q\) for the random evolution of the configuration according to deterministic particle dynamics interrupted with random entries at \(x = L\) according to the boundary process \(\zeta \) of (2.1), where the latter has been started at time s with value \(\rho _n\). In particular, if the jumps of \(\zeta \) between times s and t occur at times \(s < \tau _1 < \cdots < \tau _k < t\) with values \(\rho _{n+1} < \cdots < \rho _{n+k}\), then
$$\begin{aligned} \varPhi _s^t q = \phi _{\tau _k}^t \epsilon _{\rho _{n+k}} \phi _{\tau _{k-1}}^{\tau _k} \epsilon _{\rho _{n+k-1}} \cdots \phi _{\tau _1}^{\tau _2} \epsilon _{\rho _{n+1}} \phi _s^{\tau _1} q. \end{aligned}$$(2.8)
Proposition 2.1
For any s, q, the process \(\varPhi _s^t q\) is strong Markov.
This assertion follows after recognizing \(\varPhi _s^t q\) as a piecewise-deterministic Markov process described in some generality by Davis [12]. Namely, we augment the configuration space Q to add the time parameter to each component \(Q_n\), and then have a deterministic flow according to the vector field
With rate \(\int H[\rho _n,\rho _+] f(t,\rho _n,d\rho _+)\) we jump to the indicated point in \(Q_{n+1}\), and upon hitting a boundary we transition to \(Q_k\) for suitable \(k < n\), annihilating particles in the manner described above.
2.2 Checking the candidate measure
Our goal is to take q distributed according to the initial condition and exactly describe the law of \(\varPhi _0^T q\) for each \(T > 0\). Using the kinetic (1.27) and marginal equations (1.30), we construct for each time \(t \ge 0\) a candidate law \(\mu (t,dq)\) on Q as follows. Take N to be Poisson with rate \(\lambda L\), \(x_1,\ldots ,x_N\) uniform on \(\varDelta ^L_N\), and \(\rho _0,\ldots ,\rho _N\) distributed on \(\overline{\varDelta ^P_{N+1}}\) according to the marginal \(\ell \) and transitions f independently of the \(x_i\):
where \(\mu _0(t,dq) = \ell (t,d\rho _0)\) and
for \(n>0\). When we have verified (1.32), it will be immediate that the total mass of this is one.
We decompose the mapping from configurations q to solutions (as functions of x over [0, L]) into the map from q to the measure
and integration over [0, x]. We claim that when q is distributed according to \(\mu (0,dq)\), the law of the measure \(\pi (\varPhi _0^t q, \cdot )\) is identical to that of \(\pi (q', \cdot )\) where \(q'\) is distributed according to \(\mu (t,dq')\).
We now describe the structure of the proof of Theorem 2.1. Fix some time \(T > 0\) and consider \(F(t,q) = \mathbb {E}G(\varPhi _t^T q)\) where G takes the form of a Laplace functional:
for \(J \ge 0\) a continuous function on [0, L]. We aim to show that
for \(0 < t < T\), from which it will follow that
for all G of the form in (2.13). Using the standard fact that Laplace functionals completely determine the law of any random measure [18], this will suffice to show that law of \(\pi (\varPhi _0^T q, dx)\) for q distributed as \(\mu (0, dq)\) is precisely the pushforward through \(\pi \) of \(\mu (T, dq)\), and obtain the result.
We might verify (2.14) by establishing regularity for \(F(t,q) = \mathbb {E}G(\varPhi _t^T q)\) in t and the x-components of q. We speculate that it should be possible to do so if J is smooth with \(J'(0) = 0\) and we content ourselves to divide the configuration space into finitely many regions, each of which corresponds to a definite order of deterministic collisions in \(\phi \). On the other hand, the measures \(\mu (t, dq)\) enjoy considerable regularity (uniformity, in fact) in x, and we pursue instead an argument along these lines. Here continuity of F(t, q) in t will suffice.
Lemma 2.1
Let G take the form of (2.13). Then \(F(t,q) = \mathbb {E}G(\varPhi _t^T q)\) is a bounded function which is uniformly continuous in t uniformly in q:
as \(\delta \rightarrow 0+\).
To differentiate in (2.14) we will need to compare \(\mathbb {E}G(\varPhi _s^T q)\) and \(\mathbb {E}G(\varPhi _t^T q)\) for \(0 < s < t < T\). Our next observation is that when the \(t-s\) is small, we can separate the deterministic and stochastic portions of the dynamics over the time interval [s, t]. The idea is that in an interval of time [s, t], the probability of multiple particle entries is \(O((t-s)^2)\), and a single particle entry has probability \(O(t-s)\) which permits additional o(1) errors.
Lemma 2.2
Let \(0 < s < t \le T\) and \(q \in Q_n\). There exist a random variable \(\tau \in (s,t)\) a.s., with law depending on s, t, and q only through \(\rho _n\), and a constant \(C_{\lambda ,H'(P)}\) independent of q, s, t so that
We proceed to an analysis of the deterministic portion of the flow, \(\phi _s^t q\), where q is distributed according to \(\mu (t,dq)\). For this we introduce some notation. Given \(q \in Q_n\), we write
For each fixed n, we consider several subsets of the set of n-particle configurations \(Q_n\). In particular we need to separate those configurations which experience an exit at \(x = 0\) or a collision in a time interval shorter than \(t-s\). For \(i = 0,\ldots ,n-1\), write:
Figure 2 illustrates these sets in the case \(n = 2\). We observe in particular that \(\phi _s^t U\) and V differ by a \(\mu _n(t,\cdot )\)-null set, and that the terms associated with the boundary faces are related to configurations with one fewer particle.
Lemma 2.3
For each positive integer n and with errors bounded by
we have the following approximations:
for \(i = 1, \ldots , n-1\), and
Remark 2.1
It is essential that the integral over B can be approximated by an integral over \((x_1,\ldots ,x_{n-1},L;\rho _0,\ldots ,\rho _{n-1},\rho _+)\). The measure \(\mathscr {L}^* \mu \) below does not have any singular factors like \(\delta (x_n = L)\). In particular, the result of integrating over B will partially cancel with the term arising from random particle entries.
The final ingredient for the proof of Theorem 2.1 is the time derivative of \(\mu (t,dq)\), which we now record.
Lemma 2.4
For any \(n \ge 0\) and any \(0 \le s < t\) we have
where the norm is total variation and \((\mathscr {L}^* \mu _n)(t,dq)\) is defined to be the signed kernel
The expression for the measure above is to be understood formally; the correct interpretation involves replacement, not division. All of the “divisors” above are present as factors of \(\ell (t, d\rho _0) \prod _{j=1}^n f(t,\rho _{j-1}, d\rho _j)\), and the fractions indicate that the appearance of the denominator in this portion is to be replaced with the indicated numerator.
So that we know what to expect, before proceeding we note that when we sum over i in (2.25), some of the terms arising from \(\mathscr {L}^0\) and \(\mathscr {L}^{\kappa }\) cancel. Namely, the bracketed portion of (2.25) expands as
The gain terms associated with the kinetic equations we leave as they are, but note that the “loss” terms telescope, and the above may be shortened to
We are ready to prove our statistical characterization of the bounded system.
Proof
(Theorem 2.1) For times s and t with \(0 \le s < t \le T\), consider the difference \(\int F(t,q) \, \mu (t,dq) - \int F(s,q) \, \mu (s,dq)\):
Since F(t, q) is uniformly continuous in t uniformly in q by Lemma 2.1 and \(\mu \) is a probability measure, (I) \(\rightarrow 0\) as \(t-s \rightarrow 0\). Using Lemma 2.4 and the fact that \(|F| \le 1\), (II) \(\rightarrow 0\) as \(t-s \rightarrow 0\), and in fact
as \(s \rightarrow t-\) with t fixed. Using both Lemmas 2.1 and 2.4, we see (III) is \(o(t-s)\). Thus \(\int F(t,q) \, \mu (t,dq)\) is continuous in t; we will show additionally that it is differentiable from below in t with one-sided derivative equal to 0 for all \(0 < t < T\). In light of (2.29), our task is to show that \(-\int F(t,q) \, (\mathscr {L}^* \mu )(t,dq)\) approximates (I) up to an \(o(t-s)\) error.
Using Lemma 2.2 we have the following approximation of the portion of (I) involving \(\mu _n\), with error bounded by \(C_{\lambda ,H'(P)}[(t-s)^2 + (t-s) w(t-s)]\):
We have \(\int [F(t,q) \mathbf {1}_V(q) - F(t,\phi _s^t q) \mathbf {1}_U(q)] \, \mu _n(t,dq) = 0\) because the deterministic flow \(\phi _s^t\) maps U to V (modulo lower-dimensional sets) and preserve the Lebesgue measure in spatial coordinates. Making replacements using Lemma 2.3 and reordering the terms, we find with an error bounded by \(C_{L,n,\lambda ,H'(P)}[(t-s) + w(t-s)]\) that
for positive integers n. In the case \(n = 0\), we have \(\phi _s^t q = q\), and the approximation is
We observe that the bracketed portion of (2.31) nearly matches (2.27), except that part involves \(\mu _{n-1}\) and part involves \(\mu _n\). Furthermore, we have
This follows since \(\tau \) is conditionally independent of q given \(\rho _n\), and \(\tau \in (s,t)\) a.s. so that all but \(O(t-s)\) volume of the x-simplex is simply translated by \(\phi _s^\tau \) to a region of identical volume. We make the replacement indicated by (2.33) in (2.31) without changing the form of the error.
For any positive integer N we define
Summing (2.31), (2.32), and our approximation for \(\mu _n(t,\cdot ) - \mu _n(s,\cdot )\) from (2.27) gives
with an \(o(t-s)\) error. Call the right side \(\gamma _N(t)\), so that the left derivative of \(\varGamma _N\) is \(\gamma _N\). Since \(\varGamma _N\) is a Lipschitz function, we deduce
We have \(\gamma _N(t)\) bounded uniformly in t by
and thus \(\gamma _N \rightarrow 0\) uniformly in t. It follows that
as \(N \rightarrow \infty \). Recognizing the limits of \(\varGamma _N(T)\) and \(\varGamma _N(0)\) as
respectively, we have verified (2.14) and completed the proof. \(\square \)
Having shown the candidate measure \(\mu (t,dq)\) constructed using the solutions \(\ell (t,d\rho _0)\) and \(f(t,\rho _-,d\rho _+)\) given by Theorem 1.2, our next task is to verify the latter, which we undertake in the next section.
3 The kinetic and marginal equation
The primary goal of the present section is to prove Theorem 1.2 concerning the kernel \(f(t,\rho _-,d\rho _+)\) and marginal \(\ell (t,d\rho _0)\), so that the candidate measure \(\mu (t,dq)\) in the previous section is well-defined and has the properties required for the argument there.
While we introduce our notation, we also discuss the intuitive meaning of the terms of our kinetic equation, comparing with that of Menon and Srinivasan [25]. Let us write \(\mathscr {L}^{\kappa }\) for the operator on \(\mathscr {K}\) given by the right-hand side of (1.27),
so that the kinetic equation is \(f_t = \mathscr {L}^{\kappa } f\).
The first term, which we call the gain term, corresponds to the production of a shock connecting states \(\rho _-\) and \(\rho _+\) by means of collision of shocks connecting states \(\rho _-,\rho _*\) and \(\rho _*,\rho _+\). Such shocks have relative velocity given by
In the Burgers case, the second divided difference \(H[\cdot ,\cdot ,\cdot ]\) is constant, and the above is proportional to the sum of the increment \(\rho _+ - \rho _-\), analogous to mass in the Smoluchowski equation.
The second line of (3.1) we call the “loss” term, though this need not be of definite sign. To better understand this, note that when we have proven Theorem 1.2, we will know that \(f(t,\rho _-,d\rho _+)\) has total mass which is constant in \((t,\rho _-)\). In this case the loss term may be rewritten equivalently as
which corresponds precisely with the kinetic equation of [25]. The meaning of the first line of (3.3) is clear: we lose a shock connecting states \(\rho _-,\rho _+\) when a shock connecting \(\rho _+,\rho _*\) collides with this, and the relative velocity is precisely \(H[\rho _+,\rho _*] - H[\rho _-,\rho _+]\).
The second line of (3.3) is less easily understood. One would expect to find here a loss related to a shock connecting \(\rho _-,\rho _+\) colliding with \(\rho _*,\rho _-\) for \(\rho _* < \rho _-\), particularly if we were viewing f as a jump density as [25] views its corresponding n(t, y, dz). Viewed as a rate kernel, it is not clear that f should suffer such a loss: in some sense we must condition on being in state \(\rho _-\) for \(f(t,\rho _-,\cdot )\) to be relevant at all. At this point we cannot offer an intuitive kinetic reason for the second line of (3.3), but in light of the rigorous results of this article we can be assured that this is correct for our model.
Remark 3.1
The form (3.3) seems preferable in the more generic setting, since—as pointed out by the referee—the resulting equation \(f_t = \mathscr {L}^\kappa f\) has the property that
is (formally) conserved in time for each \(\rho _-\), without assuming that \(\lambda (0,\rho _-)\) is constant. This observation will be important in attempts to generalize the results of the present paper.
We return to the task at hand, showing existence and uniqueness of \(f(t,\rho _-,d\rho _+)\). The argument proceeds through an approximation scheme for \(\exp (\lambda H'(P) t) f(t,\cdot ,\cdot )\), where we can easily maintain positivity. To avoid endowing \(\mathscr {K}\) with a weak topology, we show directly the approximations are Cauchy, rather than appealing to Arzela–Ascoli.
Proof
(Theorem 1.2, part I) Write \(c = \lambda H'(P)\) and consider for each positive integer n the continuous paths \(h^n : [0,\infty ) \rightarrow \mathscr {K}\) defined by \(h^n(0) = g\) for all n and
We claim the following properties of \(h^n\):
-
(a)
\(h^n(t) \ge 0\) for all \(t \ge 0\),
-
(b)
for each \(t \ge 0\) the total integral \(\int h^n(t,\rho _-,d\rho _+)\) is constant in \(\rho _- \in [0,P)\),
-
(c)
for all \(t \ge 0\)
$$\begin{aligned} \Vert h^n(t)\Vert \le \lambda (1+c/n)^{\lceil nt \rceil } \le \lambda e^{c(t+1)} =: M(t), \end{aligned}$$(3.6)and
-
(d)
\(h^n(t,\rho _-,[0,\rho _-] \cup \{P\}) = 0\) for all t and all \(\rho _- \in [0,P)\).
Since \(h^n\) is piecewise-linear in t and the properties above are preserved by convex combinations, it suffices to verify this at \(t = j/n\) for all integers \(j \ge 0\).
We proceed by induction. Abbreviate \(h^n_j = h^n( j/n)\). The \(j = 0\) case holds by the hypotheses on g. Assume now that the claim holds for j, and consider \(j+1\). We have
In particular, for any \(\rho _+\) we have
using property (c) for case j; thus \(n(h^n_{j+1} - h^n_j) \ge 0\). This and (a) \(h^n_j \ge 0\) give \(h^n_{j+1} \ge 0\), property (a) for case \(j+1\). Furthermore, the total integral of the right-hand side of (3.7) is exactly
since (b) implies the bracketed portion integrates to 0. Also using (b) for case j, the change in the total integral from \(h^n_j\) to \(h^n_{j+1}\) is independent of \(\rho _-\), verifying (b) for case \(j+1\). Using (c) for case j we find
so that (c) holds for case \(j+1\). Finally, we observe that property (d) for \(h^n_j\) implies
and thus (d) holds for case \(j+1\).
We now consider the matter of convergence. Pairing \(\mathscr {L}^{\kappa }\) with any \(k_1,k_2 \in \mathscr {K}\) and measurable \(J(\rho _+)\) with \(|J| \le 1\), we find that
In particular \(\mathscr {L}^\kappa \) is Lipschitz on bounded sets in \(\mathscr {K}\). For brevity write
We compute for any \(m<n\) and \(t \ge 0\)
Observe that
where
We have also
Combining these, there is a constant \(C_1\) so that
for all positive integers n and times \(s \ge 0\).
The remaining term in the integrand of (3.14) is
All together,
and by Gronwall
From this we see that \(h^n\) is Cauchy, hence convergent to a continuous \(h : [0,\infty ) \rightarrow \mathscr {K}\) satisfying
for all \(t \ge 0\), having properties (a,b,d) above and \(\Vert h(t)\Vert \le M(t)\).
We define \(f(t) = e^{-ct} h(t)\), finding that
-
the mapping \(t \mapsto f(t) \in \mathscr {K}_+\) is continuous;
-
the mapping \(t \mapsto \mathscr {L}^\kappa f(t) \in \mathscr {K}\) is continuous (using the local Lipschitz property of \(\mathscr {L}^\kappa \)), and hence Bochner-integrable; and
-
the kernels f solve \(\dot{f} = \mathscr {L}^\kappa f\) with \(f(0) = g\) by Leibniz.
Using the local Lipschitz property of \(\mathscr {L}^\kappa \), the solution is unique. Properties (a,b,d) above hold for f, and (b) in particular gives for each fixed \(\rho _-\)
Thus \(\int f(t,\rho _-,d\rho _+) = \lambda \) for all \(t \ge 0\), \(0 \le \rho _- < P\). \(\square \)
We now turn to \(\ell (t,d\rho _0)\), which is intended to serve as the marginal at \(x = 0\) for our solution \(\rho (x,t)\) for fixed \(t > 0\). We define a time-dependent family of operators \(\mathscr {L}^0\) acting on measures \(\nu (d\rho _0)\) in \(\mathscr {M}\) by
Again the integration is over \(\rho _*\) only, and for each t we have \((\mathscr {L}^0 \nu )(t,\cdot ) \in \mathscr {M}\). Our evolution equation for \(\ell (t,d\rho _0)\) is the linear, time-inhomogeneous \(\ell _t = \mathscr {L}^0 \ell \).
Proof
(Theorem 1.2, part II) Note that (3.24) is exactly the forward equation for a pure-jump Markov process evolving according to a time-varying rate kernel
so we could obtain existence and uniqueness of solutions along these lines. We outline an argument similar to that employed for f, for the sake of completeness.
Define for each n the continuous path \(h^n : [0,\infty ) \rightarrow \mathscr {M}\) with \(h^n(0) = \delta _0\) and
Proceeding as in the proof for f we verify that for each j the \(h^n_j = h^n(j/n)\) are nonnegative and, since \(\mathscr {L}^0 h^n(j/n)\) has total integral zero, \(\Vert h^n_j\Vert \le (1+c/n)^{\lceil n t \rceil }\). Observing that \(\mathscr {L}^0\) is linear and bounded uniformly over t,
we easily show \(h^n\) is Cauchy on bounded time intervals, with limit \(h : [0,\infty ) \rightarrow \mathscr {M}_+\) satisfying
Take \(\ell (t) = e^{-ct} h(t)\) to find that \(\ell (t) \ge 0 \) solves
Uniqueness follows using (3.27). Finally, since \(\mathscr {L}^0 \nu \) as zero total integral for any \(\nu \), we find that \(\int \ell (t,d\rho _0) = 1\) for all t. \(\square \)
We close this section with a remark concerning the kinetic equation \(f_t = \mathscr {L}^\kappa f\) without the assumption that the initial kernel g has bounded support [0, P]. When the initial condition \(\xi (x)\) is unbounded, growing for example linearly in the case of quadratic H, the solution to the scalar conservation law \(\rho _t = H(\rho _x)\) on the semi-infinite domain should blow up in finite time. We likewise expect that the solution f to the kinetic equation will blow up, but control of certain moments prior to this blow up may allow us to run the argument of Theorem 2.1 with only superficial changes. At the moment we lack the sort of estimates one obtains in the Smoluchowski case (e.g. coming from a closed equation for the second moment), but we hope to revisit this in future work.
4 Conclusion
We review what has been accomplished: using an exact propagation of chaos calculation for a bounded system with a suitably selected random boundary condition, we have derived a complete description of the law of the solution \(\rho (x,t)\) to the scalar conservation law \(\rho _t = H(\rho )_x\) for \(x,t > 0\) when \(\rho (x,0) = \xi (x)\) is a bounded, monotone, pure-jump Markov process with constant jump rate. Notably, we have recovered the Markov property of the solution and a statistical description of the shocks simultaneously. This may be regarded as both a strength of our present analysis and a shortcoming of our present understanding (a soft argument for the preservation of the Markov property in the particle system would be illuminating).
We emphasize how our approach using random dynamics on a bounded interval has made things considerably easier: by constructing a bounded system which has exactly the right law, we are relieved of the burden of determining precisely how wrong the law would be with deterministic boundary.
Our result lends additional support to the conjecture of Menon and Srinivasan [25] and we hope that the sticky particle methods described herein will provide another approach, quite different from that of Bertoin [4], which might be adapted to resolve the full conjecture. One of the authors is attempting a similar particle approach in the non-monotone setting, and hopes to report on this in future work.
Notes
Straightforward manipulations of (1.5) relate this to a Legendre transform.
i.e. Without positive jumps.
Using Assumption 1.2 and \(\xi ,\zeta \ge 0\), the shocks and characteristics only flow outward across \(x = 0\). Any boundary condition we would assign, unless it involved negative values, would thus be irrelevant.
Which will almost surely not occur for the randomness we consider.
References
Abramson, J., Evans, S.N.: Lipschitz minorants of brownian motion and lévy processes. Probab. Theory Relat. Fields (2013). doi:10.1007/s00440-013-0497-9
Aldous, D.J.: Deterministic and stochastic models for coalescence (aggregation and coagulation): a review of the mean-field theory for probabilists. Bernoulli 5(1), 3–48 (1999). doi:10.2307/3318611
Avellaneda, M., Weinan, E.: Statistical properties of shocks in burgers turbulence. Commun. Math. Phys. 172(1), 13–38 (1995). doi:10.1007/BF02104509
Bertoin, J.: The inviscid burgers equation with brownian initial velocity. Commun. Math. Phys. 193(2), 397–406 (1998). doi:10.1007/s002200050334
Bertoin, J.: Clustering statistics for sticky particles with brownian initial velocity. J. Math. Pures Appl. 79(2), 173–194 (2000). doi:10.1016/S0021-7824(00)00147-1
Brenier, Y., Gangbo, W., Savaré, G., Westdickenberg, M.: Sticky particle dynamics with interactions. J. Math. Pures Appl. 99(5), 577–617 (2013). doi:10.1016/j.matpur.2012.09.013
Brenier, Y., Grenier, E.: Sticky particles and scalar conservation laws. SIAM J. Numer. Anal. 35(6), 2317–2328 (1998). doi:10.1137/S0036142997317353
Burgers, J.: A mathematics model illustrating the theory of turbulence. In: Von Mises, R., Von Karman, T. (eds.) Advances in Applied Mechanics, vol. 1, pp. 171–199. Elsevier, New York (1948)
Carraro, L., Duchon, J.: Solutions statistiques intrinsèques de l’équation de Burgers et processus de Lévy. Comptes Rendus de l’Académie des Sciences. Série I. Mathématique 319(8), 855–858 (1994)
Carraro, L., Duchon, J.: Équation de burgers avec conditions initiales à accroissements indépendants et homogènes. Annales de l’Institut Henri Poincare (C) Non Linear Analysis 15(4), 431–458 (1998). doi:10.1016/S0294-1449(98)80030-9
Chabanol, M.L., Duchon, J.: Markovian solutions of inviscid burgers equation. J. Stat. Phys. 114(1–2), 525–534 (2004). doi:10.1023/B:JOSS.0000003120.32992.a9
Davis, M.H.A.: Piecewise-deterministic Markov processes: a general class of nondiffusion stochastic models. J. R. Stat. Soc. Series B. Methodol. 46(3), 353–388 (1984)
Evans, L.: Partial Differential Equations. Graduate Studies in Mathematics. American Mathematical Society, Providence (2010)
Getoor, R.: Splitting times and shift functionals. Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete 47(1), 69–81 (1979). doi:10.1007/BF00533252
Groeneboom, P.: Brownian motion with a parabolic drift and airy functions. Probab. Theory Relat. Fields 81(1), 79–109 (1989). doi:10.1007/BF00343738
Hildebrand, F.B.: Introduction to Numerical Analysis, 2nd edn. Dover Publications Inc, New York (1987)
Hopf, E.: The partial differential equation \(u_t + u u_x = \mu u_{xx}\). Commun. Pure Appl. Math. 3(3), 201–230 (1950). doi:10.1002/cpa.3160030302
Kallenberg, O.: Foundations of Modern Probability. Probability and its Applications (New York), 2nd edn. Springer, New York (2002). doi:10.1007/978-1-4757-4015-8
Kaspar, D.C.: Exactly solvable stochastic models in elastic structures and scalar conservation laws. Ph.D. thesis, University of California, Berkeley (2014)
Li, L.C.: A finite dimensional integrable system arising in the study of shock clustering (2015). Preprint
Majumdar, S.N., Comtet, A.: Airy distribution function: from the area under a brownian excursion to the maximal height of fluctuating interfaces. J. Stat. Phys. 119(3–4), 777–826 (2005). doi:10.1007/s10955-005-3022-4
Menon, G.: Complete integrability of shock clustering and burgers turbulence. Arch. Ration. Mech. Anal. 203(3), 853–882 (2012). doi:10.1007/s00205-011-0461-8
Menon, G.: Lesser known miracles of burgers equation. Acta Math. Sci. 32(1), 281–294 (2012). doi:10.1016/S0252-9602(12)60017-4
Menon, G., Pego, R.L.: Universality classes in burgers turbulence. Commun. Math. Phys. 273(1), 177–202 (2007). doi:10.1007/s00220-007-0251-1
Menon, G., Srinivasan, R.: Kinetic theory and lax equations for shock clustering and burgers turbulence. J. Stat. Phys. 140(6), 1–29 (2010). doi:10.1007/s10955-010-0028-3
Novikov, D.: Hahn decomposition and Radon-Nikodym theorem with a parameter. ArXiv Mathematics e-prints (2005)
Sinai, Y.: Statistics of shocks in solutions of inviscid Burgers equation. Commun. Math. Phys. 148(3), 601–621 (1992). doi:10.1007/BF02096550
Smoluchowski, M.: Drei Vortrage über Diffusion, Brownsche Bewegung und Koagulation von Kolloidteilchen. Zeitschrift fur Physik 17, 557–585 (1916)
Acknowledgments
The authors thank the anonymous referee for suggestions improving our presentation and for pointing out a recent reference. DK was supported by NSF DMS-1106526 and a Robinson Fellowship as the research reported herein was conducted, and NSF DMS-1148284 as this article was written and revised. FR is supported by NSF Grant DMS-1407723.
Author information
Authors and Affiliations
Corresponding author
Appendix: proofs of lemmas
Appendix: proofs of lemmas
Proof
(Lemma 2.1) Boundedness of G is obvious from its definition as a Laplace functional. For continuity, we begin by considering \(G(\phi _0^t q)\) with q fixed. As t varies, \(\phi _0^t q\) varies continuously, with \(\rho \)-values unchanging and x-values changing with rate bounded by \(H'(P)\), except for finitely many times when collisions occur. Note \(G(\phi _0^t q)\) depends on \(\phi _0^t q\) via \(\pi (\phi _0^t q, \cdot )\), and the former varies continuously across these collision times. More concretely, because \(J \in C([0,L])\), J is uniformly continuous, and so
has the property that \(w_J(\delta ) \rightarrow 0\) as \(\delta \rightarrow 0+\). Using the above,
Since the exponential is uniformly continuous for nonpositive arguments, we find that \(G(\phi _0^t q)\) is continuous at \(t = 0\) uniformly in q. Write
and observe that \(w_G(\delta ) \rightarrow 0\) and \(\delta \rightarrow 0+\).
Suppose now that \(0 \le s < t \le T\); we compare \(\mathbb {E}G(\varPhi _s^T q)\) and \(\mathbb {E}G(\phi _t^T q)\). Write \(\theta = t - s\). We have \(G(\varPhi _s^T q) = G(\varPhi _{T-\theta }^T \varPhi _s^{T-\theta } q)\) a.s., coupling the random entry processes on matching intervals. Then
because the random entries occur at a rate bounded by \(\lambda H'(P)\) and \(|G| \le 1\). Using the time homogeneity and continuity for the deterministic flow,
So, at the cost of an error bounded by \(C_{\lambda ,H'(P)} \theta + w_G(\theta )\), we compare \(\mathbb {E}G(\varPhi _t^T q)\) with \(\mathbb {E}G(\varPhi _s^{T-\theta } q)\) instead. Now the configuration q is flowed on time intervals of equal length, but with random entry rates that have been shifted in time.
The random entry rate is given by \(H[\rho _n,\rho _+] f(r, \rho _n, d\rho _+)\) and f is TV-continuous in r; we have
Let us define three kernels for \(r \in [0,T-t]\) according to
where the minimum (\(\wedge \)) of two measures is defined as usual by choosing a third measure with which the former two are absolutely continuous, and taking the pointwise minimum of their Radon-Nikodym derivatives. That this extends measurably to the parametric case (i.e. involving kernels) is immediate from a parametric version of the Radon–Nikodym theorem [26, Theorem 2.3].
Using the above we construct a coupled random entry process for times \(r \in [0,T-t]\), namely let \((\zeta ^s,\zeta ^t)(r)\) be the pure-jump Markov process started at \((\rho _n,\rho _n)\) from q and evolving according to the generator \(\mathscr {L}^{z}\) acting on bounded measurable functions J(y, z) defined on \([0,P]^2\) according to
Taking J(y, z) which does not depend on z we find that \(\zeta ^s(r)\) for \(r \in [0,T-t]\) has the same law as the random boundary for \(\varPhi _s^{T-\theta } q\), and likewise taking J which does not depend on y we find the \(\zeta ^t(r)\) has the same law as the random boundary for \(\varPhi _t^T q\), verifying the coupling.
On the diagonal \(y = z\), the rate at which \(\mathscr {L}^{z}\) causes jumps that leave the diagonal (the second and third lines of (5.8)) is bounded by
and the probability that such a transition never occurs in a time interval of length \(T - t\) is bounded below by
So, coupling the random entry dynamics, we find that \(\varPhi _s^{T-\theta } q = \varPhi _t^T q\) with probability at least \(\exp [-C_{\lambda ,H'(P)} T \theta ]\). Putting the above pieces together, we find
and the proof is complete. \(\square \)
Proof
(Lemma 2.2) Using the Markov property of the random flow \(\varPhi \), we have a functional identity:
Let q be fixed, and consider the following events, whose union is of full measure for computing the final expectation above:
Observe that on \(\mathscr {E}_0\) we see only the deterministic flow \(\phi \) over the time interval (s, t):
and this occurs with probability
We prefer an expression evaluating f at only a single time, and Taylor expand the exponential around zero.
so that
and by exhaustion
with both errors bounded uniformly over Q. On \(\mathscr {E}_1\) write \(\tau \) for the first time a random entry occurs for \(\varPhi _s^t\), and \(\rho _+\) for the new boundary value, noting that the distribution of \(\tau \) depends only on q through \(\rho _n\) and that the law of \(\rho _+\) is determined by \(f(\tau ,\rho _n, \cdot )\). We have \(\tau \in (s,t)\) so that
Using the strong Markov property for the random boundary at the stopping time \(\tau \),
Since \(\mathbb {P}(\mathscr {E}_1) = O(t-s)\), we can afford to make o(1) modifications to this. Write \(w(\delta )\) for the modulus of continuity of F(t, q) in time, according to Lemma 2.1. Now \(\tau \in (s,t)\) a.s. on \(\mathscr {E}_1\), so
Next we modify the distribution from which \(\rho _+\) is selected; at present, \(\rho _+\) is selected according to the random measure
Since \(\tau \in (s,t)\) a.s. on \(\mathscr {E}_1\), the total variation difference
Let us write \(\hat{\rho }_+\) for an independent random variable distributed as
(note t instead of \(\tau \)), normalized to have unit mass. Then
Note that \(\mathbb {P}(\mathscr {E}_1)\) cancels with the normalization in (4.22), up to an \(O(t-s)\) error, and that \(\hat{\rho }_+\) is independent of \(\mathscr {E}_1\). With error bounded uniformly over \(q \in Q\) by \(C_{\lambda ,H'(P)}[(t-s)^2 + (t-s) w(t-s)]\), we have
where the remaining expectation is now taken only over \(\tau \) conditioned on \(\mathscr {E}_1\), which is therefore distributed over [s, t]. \(\square \)
Proof
(Lemma 2.3) On any of \(A_i\), \(i = 0,\ldots ,n-1\), we have \(\sigma = \sigma (q) \le t\). For such q we have
The first absolute difference is the error induced by conditioning on zero particle entries on the interval \([s,\sigma ]\), which costs less than \(\lambda H'(P) (t-s)\). Lemma 2.1 bounds the second of these by \(w(t-s)\). Using (4.27) we can replace the integrand \(F(t,\phi _s^t q)\) over \(A_i\) with \(F(t,\phi _s^\sigma q)\), i.e. the projection of q onto \((\partial _i \varDelta ^L_n) \times \overline{\varDelta ^P_{n+1}}\) in the direction of the velocity.
Write \(v_i = -H[\rho _{i-1},\rho _i]\) for the velocities. We can integrate over \(A_0\) using as a parametrization for the x-variables
which has absolute Jacobian determinant \(-v_1 = H[\rho _0,\rho _1]\). Namely,
On all but a portion of \(A_0\) with x-volume bounded by \(C_{n,L,H'(P)} (t-s)^2\) we have \(t - s < |(L - x_n)/v_n|\), and we can extend the upper limit of integration to \(\theta = t-s\) with and error bounded by this. Noting that the configurations
are equivalent under our sticky particle dynamics and relabeling, we obtain our approximation for the integral over \(A_0\).
The argument for \(A_i\), \(i = 1,\ldots ,n-1\), is similar. We use the mapping
with absolute Jacobian determinant \(v_i - v_{i+1} = H[\rho _i,\rho _{i+1}] - H[\rho _{i-1},\rho _i]\), and replace the upper limit of integration for \(\theta \) with \(t-s\) at the cost of an \(O((t-s)^2)\) error. Relabeling gives the indicated approximation.
The range of
where \((x_1,\ldots ,x_{n-1}) \in \varDelta ^L_{n-1}\), \((\rho _0,\ldots ,\rho _n) \in \overline{\varDelta ^P_{n+1}}\), and
is B, modulo sets of lower x-dimension. Using Lemma 2.1 we can replace F(t, q) by
with pointwise error over B bounded by \(C_{\lambda ,H'(P)}[w(t-s) + (t-s)]\). For all but an \(O((t-s)^2)\) portion of the x-volume, the value \(t-s\) is the minimum above, and as before we obtain the indicated approximation of the integral over B. \(\square \)
Proof
(Lemma 2.4) Essentially we are verifying the Leibniz rule, but we are unable to find a version of this to cite for kernels. We first obtain quantitative control over our linear approximations of f and \(\ell \). Namely, fix any measurable \(|J| \le 1\). We have
The difference of \(\mathscr {L}^\kappa f\) at different times can be expressed in terms of f and H again,
Noting that \(\Vert \mathscr {L}^\kappa f\Vert \le 3 \lambda ^2 H'(P)\), we integrate over \(\rho _+\), then \(\rho _*\), and find that (4.35) is bounded by
Next, for any \(|J| \le 1\),
and
We have \(\Vert \mathscr {L}^0 \ell \Vert \le 2 \lambda H'(P)\), so by integrating over \(\rho _0\) and then \(\rho _*\) we find (4.38) is bounded by
Returning to the problem of establishing our Leibniz rule, note that \(e^{-\lambda L} \mathbf {1}_{\varDelta ^L_n}(dx)\) factors from both \(\mu _n\) and \(\mathscr {L}^* \mu _n\). It will therefore suffice to obtain a bound on the \((\rho _0,\ldots ,\rho _n)\) portion.
We argue by induction. In the case \(n = 0\), we have only the difference in (4.38), and the result holds. Now suppose that the result holds for case n, and consider \(n+1\). Choose as a test function \(J(\rho _0,\ldots ,\rho _{n+1})\), which is measurable and has \(|J| \le 1\), and integrate it against
For each \(\rho _0,\ldots ,\rho _n\), the function \(J(\rho _0,\ldots ,\rho _{n+1})\) is bounded and measurable in \(\rho _{n+1}\), so \(f(t,\rho _n, d\rho _{n+1}) - f(s,\rho _n,d\rho _{n+1})\) can be replaced with
plus an error no larger than \(9 H'(P)^2 \lambda ^3 (t-s)^2\), which when integrated over the remaining variables grows by a factor \(\lambda ^n\).
In the third line of (4.41), noting \(\int J(\rho _0,\ldots ,\rho _{n+1}) f(s,\rho _n,d\rho _{n+1})\) is bounded and measurable in \((\rho _0,\ldots ,\rho _n)\), we apply the inductive hypothesis, replacing the bracketed difference by
plus an \(o(t-s)\) error. After doing this, again noting \(J(\rho _0,\ldots ,\rho _{n+1})\) is measurable in \(\rho _{n+1}\) for each fixed \(\rho _0,\ldots ,\rho _n\), we replace \(f(s,\rho _n,d\rho _{n+1})\) with \(f(t,\rho _n, d\rho _{n+1})\) at a cost of \(3\lambda ^2 H'(P)(t-s)\), which gets multiplied by the other factor \((t-s)\).
Adding the modified versions of these two lines of (4.41), we find exactly the \(\rho _0,\ldots ,\rho _{n+1}\) portion of \(\mathscr {L}^* \mu _{n+1}\) plus an \(o(t-s)\) error. \(\square \)
Rights and permissions
About this article
Cite this article
Kaspar, D.C., Rezakhanlou, F. Scalar conservation laws with monotone pure-jump Markov initial conditions. Probab. Theory Relat. Fields 165, 867–899 (2016). https://doi.org/10.1007/s00440-015-0648-2
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00440-015-0648-2