A Coq Formalization of Lebesgue Integration of Nonnegative Functions

Boldo, Sylvie; Clément, François; Faissole, Florian; Martin, Vincent; Mayero, Micaela

doi:10.1007/s10817-021-09612-0

A Coq Formalization of Lebesgue Integration of Nonnegative Functions

Published: 23 November 2021

Volume 66, pages 175–213, (2022)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Journal of Automated Reasoning Aims and scope Submit manuscript

A Coq Formalization of Lebesgue Integration of Nonnegative Functions

Download PDF

Sylvie Boldo ORCID: orcid.org/0000-0002-1970-3019¹,
François Clément^2,3,
Florian Faissole¹,
Vincent Martin⁴ &
…
Micaela Mayero⁵

356 Accesses
5 Citations
Explore all metrics

Abstract

Integration, just as much as differentiation, is a fundamental calculus tool that is widely used in many scientific domains. Formalizing the mathematical concept of integration and the associated results in a formal proof assistant helps in providing the highest confidence on the correctness of numerical programs involving the use of integration, directly or indirectly. By its capability to extend the (Riemann) integral to a wide class of irregular functions, and to functions defined on more general spaces than the real line, the Lebesgue integral is perfectly suited for use in mathematical fields such as probability theory, numerical mathematics, and real analysis. In this article, we present the Coq formalization of $\sigma $-algebras, measures, simple functions, and integration of nonnegative measurable functions, up to the full formal proofs of the Beppo Levi (monotone convergence) theorem and Fatou’s lemma. More than a plain formalization of the known literature, we present several design choices made to balance the harmony between mathematical readability and usability of Coq theorems. These results are a first milestone toward the formalization of $L^p$ spaces such as Banach spaces.

A Coq Formalization of Lebesgue Induction Principle and Tonelli’s Theorem

Nested Integrals and Rationalizing Transformations

Measure Construction by Extension in Dependent Type Theory with Application to Integration

Article 18 August 2023

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

This paper is dedicated to the Coq [25] formalization of Lebesgue integration theory. Among many applications in mathematics, we focus on the objective of building Sobolev spaces [2] that are used in numerous fields: in functional analysis [18, 62, 69], and in statistical and probabilistic mathematics [6, 29, 35, 38, 67], to name just a few.

Our main application is on the numerical resolution of partial differential equations (PDEs), using the Finite Element Method (FEM). Our final and long-term goal is to formally prove the correctness of the FEM and of parts of a library implementing it. The FEM can be applied to compute numerical approximations to solutions of many problems arising in physics, mechanics, and biology, for just a few examples. The success of the FEM is in large part due to its sound mathematical foundation, see, for instance, [22, 33, 61, 70] among the extensive literature. Prior to this work, we established in [15] a formalization of the proof of the Lax–Milgram theorem, that is a relatively simple way of proving the existence and uniqueness of the PDE solution and their FEM approximations for a wide range of problems. The Lax–Milgram theorem is set on a general Hilbert space (a complete vector space with an inner product). In the context of PDEs, the next stage is then the application of the Lax–Milgram theorem: typically, for the Poisson equation, one takes as Hilbert space a subspace of the $H^1$ Sobolev space, see, for instance, [33, Sect. 3.2]. The $L^p$ Lebesgue space is the space of functions whose absolute value to the power $p\ge 1$ is integrable, and $H^1$ is defined as functions in $L^2$ having a weak derivative also in $L^2$. We recall that $L^p$ is a Banach space (a complete normed vector space), and $L^2$ and $H^1$ are Hilbert spaces. This paper deals with the construction of the Lebesgue integral for nonnegative measurable functions, a first step toward the formalization of $L^p$, $H^1$, and other Sobolev spaces. Future work will include the formal definition and the proof that they are indeed complete normed vector spaces.

As far as the integral is concerned, several options are available, e.g., see [19]. The choice must be driven by the properties required for our future developments. As mentioned before, we are more interested in the completeness of the considered functional spaces (like $L^p$), than in the ability to integrate the most exotic irregular functions. On the one hand, the Riemann integral is thus clearly not satisfactory as it is not compatible with limit: the limit of Riemann-integrable functions is not necessarily a Riemann-integrable function. On the other hand, the gauge (Henstock–Kurzweil) integral [4, 42, 47] has attractive properties, e.g., it is often considered as the easiest powerful integral to teach. Unfortunately, its main drawback is that defining a complete normed vector space of HK-integrable functions is not as obvious as with the Lebesgue $L^p$ spaces [39, 57]. This led us to choose the Lebesgue integral, which has the additional desirable property of being very general: it is neither limited to functions defined on Euclidean spaces, nor to the Lebesgue measure on $\mathbb {R}^n$.

There are also several ways to build the integral of real-valued, or complex-valued, functions for the Lebesgue measure. First, the Daniell approach [26, 36] allows the extension of an elementary integral defined for elementary functions to a larger class of functions by means of continuity and linearity. When applied to the Riemann integral for continuous real-valued functions with compact support, it yields an integral equivalent to the Lebesgue integral for the Lebesgue measure. Second, a not so different alternate path consists in the completion of the normed vector space of continuous functions with compact support, and the extension of the Riemann integral which is uniformly continuous [16, 28]. Third, and the option we chose to follow, is a modern form of the original works of Lebesgue [48]. The Riemann integral is based on subdivisions of the domain of the function to integrate. In contrast, the Lebesgue approach focuses on the codomain. For each preimage, we need to provide its measure, whatever its irregularity.

This article covers the main concepts of measure theory such as the definitions of $\sigma $-algebra, measurability of functions, measure, and simple functions. Then, the integral is built following the Lebesgue scheme: first for nonnegative simple functions, then extended to all nonnegative measurable functions by taking the supremum. The definition of the integral of a function with arbitrary sign can be made by the difference, when possible, of the integrals of the positive and negative parts of the function; this is out of the scope of this paper and will be tackled in future work. The objective of this paper is to formally prove the main results on nonnegative measurable functions: the Beppo Levi (monotone convergence) theorem, and Fatou’s lemma.

From a mathematical point of view, given a measure space defined by a set X, a $\sigma $-algebra $\varSigma $, and a measure $\mu $, the two statements can be expressed in a mathematical setting as follows.

These are the cornerstones of our intended future work, such as the building of the $L^p$ Lebesgue spaces as Banach spaces, the proofs of Lebesgue’s dominated convergence theorem and of the Tonelli–Fubini theorems, and also the construction of the Lebesgue measure (for instance, through Carathéodory’s extension theorem [20, 29]). As a consequence, we do not yet need technical results on subset systems such as the Dynkin $\pi $–$\lambda $ theorem [29], or the monotone class theorem [24], that are popular tools for the extension of some property to the whole $\sigma $-algebra (e.g., the uniqueness of a measure).

Interactive theorem proving is more and more being used and adapted for formalizing real and numerical analysis. Real-life applications, such as hybrid systems or cyber-physical systems, are critical and rely on advanced analysis results. Until now, only the Riemann integral was available in Coq. As useful as the Riemann integral is, the Lebesgue integral is necessary for the numerical analysis we are examining. In addition, even though the Lebesgue integral exists in other theorem provers (see Sect. 10), we have decided to formalize it in Coq. Indeed, it is crucial for our future work to be able to merge results both from numerical analysis and from computer arithmetic (to bound rounding errors for instance). For that, we plan to rely on the Flocq library, which does not have a comparable equivalent in other theorem provers.

We use the Coquelicot library [13], a modernization of the real standard library of Coq, including a formalization of ${\overline{\mathbb {R}}}$, described in more detail in Sect. 2.1. This library provides classical real numbers which correspond to the real analysis we deal with. For this reason, we have also decided, as basic choices of our formalization, to use classical logic and to rely on the following axioms: strong excluded middle and functional extensionality. These choices are described in Sect. 2.2 and discussed in Sect. 9.

The mathematical definitions and proofs were mainly taken from textbooks [37, 40, 51], detailed and compiled in a research report [23] in order to ease the formalization in Coq. The Coq code is available at

http://lipn.univ-paris13.fr/MILC/CoqLIntp/index.php, or in the public repository

https://lipn.univ-paris13.fr/coq-num-analysis/tree/LInt_p.1.0/Lebesgue, where the tag LInt_p.1.0 corresponds to the code of this article.

The paper is organized as follows. Section 2 presents the main basic Coq choices on which our formalization is based. The sequel is our own contribution. Section 3 details auxiliary results on reals. The concept of measurability is discussed in Sect. 4, and that of measure in Sect. 5. Section 6 is devoted to simple functions, and Sect. 7 to integration of nonnegative functions and the main theorems. The case of the Dirac measure is studied in Sect. 8. Concerns about proof engineering are discussed in Sect. 9. Section 10 presents some state of the art of the formalization of the integral. Section 11 concludes and gives some perspectives.

2 Coquelicot Library and Other Basic Coq Choices

We first briefly review the few proof packages used in this work, and some technical and logical choices we made. These are discussed further in Sects. 9.1 and 9.2.

2.1 The Coquelicot Library and ${\overline{\mathbb {R}}}$

The Coquelicot library is a conservative extension of the Coq real standard library (Reals), with total functions for limit, derivative, and Riemann integral [13, 49, 50]. The features used here are the generic topology, the hierarchy of algebraic structures based on canonical structures, and the extended real numbers.

Generic topology. The Coquelicot topology is defined using filters [17, 21]. Intuitively, filters can be seen as sets of neighborhoods. For instance, the filter on type corresponds to the most intuitive neighborhoods of $\infty $.

It is used to define the convergence of sequences.

Algebraic hierarchy. Coquelicot also defines an algebraic hierarchy based on canonical structures. A useful level here is , that formalizes the mathematical concept of uniform space [17, 68]: it is a generalization of metric space with an abstraction of balls. In a uniform space , the property characterizes its open subsets.

Extended real numbers. Coquelicot provides a definition of the extended real numbers ${\overline{\mathbb {R}}}=\mathbb {R}\cup \{-\infty ,\infty \}$. The formal definition contains three constructors: for real numbers, for $\infty $ and for $-\infty $. Conversely, the function returns the real number for finite numbers and 0 for $\pm \infty $.

In addition to this definition, coercions from to and vice versa, an order with and , total operations such as , , , , , and with their properties are provided.

In particular, this means that addition on ${\overline{\mathbb {R}}}$ is a total function [13] that always returns a value. For instance, $\infty +(-\infty )$ (i.e., $\infty -\infty $) is 0, making some statements unintuitive, see also Sect. 4.5. However, the case of multiplication is not an issue as the convention $0\times \pm \infty =\pm \infty \times 0=0$ is widely adopted for measure theory and Lebesgue integration, because it yields more compact statements.

2.2 Axioms

Real analysis, as most mathematics, uses classical logic, and measure theory and Lebesgue integration are no exception. For this reason, we chose to conduct this formalization in a full-flavored classical framework.

We did not add our own axioms. In addition to the axioms defining , we require some classical properties from the standard library, listed here with the theorems we use.

We rely on many times, including for instance the definition of the characteristic function in Sect. 4.1. We have a brief use of dependent types in Sect. 6.2 related to simple functions, and we then rely on and . Last, we rely on at a single point in the proof of Lemma , and this is explained in Sect. 5.3.

3 Auxiliary Results About the Reals

From now on, we present our own contributions. A global dependency graph of our Coq files and results is given in Fig. 1 page 38, with links back to the appropriate sections.

The theorems described in this section are not dedicated to the Lebesgue integral and could be part of a support library. In Sect. 3.1, we show the expression of open subsets of $\mathbb {R}$ and $\mathbb {R}^2$ with a countable topological basis. Section 3.2 deals with sums on ${\overline{\mathbb {R}}}$. Section 3.3 presents some additional results about limits.

3.1 Second-Countability of Real Numbers

In Sect. 4.4, we need to characterize and decompose the open subsets of $\mathbb {R}$. More precisely, we build generators of the $\sigma $-algebras of $\mathbb {R}$ and $\mathbb {R}^2$ that contain the open subsets. Such generators need to be of countable size to comply with the properties of $\sigma $-algebras. Thus, the concepts of topological basis and second-countability appeared necessary.

Recall that a topological basis allows to express any open subset of a topological space as the union of a subfamily of the basis. A topological space is called second-countable when it admits a countable topological basis. Euclidean spaces $\mathbb {R}^n$ are second-countable. Indeed, the open boxes with rational boundaries form such a countable topological basis. In the case of $\mathbb {R}$, this is expressed as follows. For any open subset A of $\mathbb {R}$, there exists a sequence of pairs of rationals $(q_1^n,q_2^n)_{n\in \mathbb {N}}$ such that A is the union of the corresponding open intervals, $A=\bigcup _{n\in \mathbb {N}}(q_1^n,q_2^n)$.

The mathematical proof is well known, but the road to formalization was tedious.

Countability. We define bijections from $\mathbb {N}$ to $\mathbb {N}^2$, $\mathbb {Z}$, $\mathbb {Q}$ and $\mathbb {Q}^2$. It is not enough to prove they have the same size, we need “perfect” bijections, meaning inverse functions from one type to the other, handling correctly special cases such as zero.

Connected components. Given a subset A of $\mathbb {R}$ and a real x, we define the bounds of the largest possible interval included in A and containing x (a.k.a. the connected component of x in A).

The functions and are total functions from the Coquelicot library that compute the greatest lower bound and the least upper bound of a subset of reals.

We prove many properties such as belonging to the closure of its own connected component, , and belonging to the interior of its own connected component for points of open subsets, that is .

Using density of rational numbers. Given an open subset A of $\mathbb {R}$, we prove through density of $\mathbb {Q}$ in $\mathbb {R}$ that A contains at most a countable number of connected components.

Countability appears in this lemma through the rationality of y (otherwise, the theorem would be trivial by using x).

In addition, using again the density of $\mathbb {Q}$ in $\mathbb {R}$, we can take rational bounds for these intervals (by taking countable unions of intervals with rational bounds to recover each initial interval with real bounds). And then, using countability of $\mathbb {Q}^2$, we have a bijection from the integers to the rational bounds of the open intervals, and these serve as topological basis.

Second-countability. Given an open subset A of $\mathbb {R}$, we want to exhibit the $q_i^n$’s such that $A=\bigcup _{n\in \mathbb {N}}(q_1^n,q_2^n)$. This means we need to choose among the possible intervals of the topological basis the useful ones by relying on a property P. Then, A is equivalent to the countable union of the such that holds.

The same property holds for $\mathbb {R}^2$. We can define a topological basis for $\mathbb {R}^2$ (from the tensor product of the topological basis of $\mathbb {R}$) and prove

3.2 About Sums of Extended Real Numbers

Integrals of simple functions are defined in Sect. 6.2 as sums of extended reals. Even if we only sum nonnegative extended reals, we decided to use only as discussed in Sect. 9.1. But as in mathematics, the addition on as defined by Coquelicot is not associative. Indeed, $\infty +(\infty -\infty )=\infty $, while $(\infty +\infty )-\infty =0$. Our design choice therefore implies that big operators [5] cannot be used.

Let us begin with sums of a finite number of values. The definition goes as expected, with an equivalent alternative using for lists instead of functions.

In addition, we found it useful to define an “applied” sum that takes a function f and a list $\ell $ and returns the sum of the images $\varSigma _{i\in \ell }f(i)$.

The curly brackets around mean that this argument is implicit and need not be specified, as Coq can guess it from the type of the list .

This definition allows us to use extensionality either on the list , on the function , or on the application , which turned out to be more practical than what this obvious definition seems. Examples of use are the following lemmas (that do not need nonnegativity). The first one mixes two applications.

The second one focuses on the statement $\varSigma _{i\in \ell _1}f(i)=\varSigma _{i\in \ell _2}f(i)$. Such result is obvious when $\ell _1=\ell _2$, but it is also possible to prove it when the lists are identical except for items i of the lists such that $f(i)=0$, as these do not impact the final sums. Indeed, the sums may be the same even if the two lists are different (and of different lengths for instance). The function is defined later in Sect. 6.1. It has type and selects the elements of a list that have a given property, without changing otherwise the order in the lists.

When values are nonnegative, associativity is back and we have the expected theorems on sums.

For the sake of brevity, we have defined the properties and for nonnegative functions and lists.

The most interesting theorem about sums of lists is the ability to swap the order of a double summation,

$$\begin{aligned} \sum _{i_1 \in \ell _1} \sum _{i_2 \in \ell _2} f (i_1, i_2) = \sum _{i_2 \in \ell _2} \sum _{i_1 \in \ell _1} f (i_1, i_2). \end{aligned}$$

3.3 About Limits

We also need some additional results on limits and suprema.

First of all, the sums defined in Sect. 3.2 have a finite number of terms. But the main theorems to come rely on infinite sums (i.e., series). The most common definition is the limit of the finite partial sums, i.e., in Coquelicot [13]. Nevertheless, by virtue of the least-upper-bound property in $\mathbb {R}$ and ${\overline{\mathbb {R}}}$, when a sequence is increasing (which happens when adding only nonnegative values), the supremum is also the limit, and we may equivalently use instead. This has proved more convenient and more suited to our needs. So theorems of Sect. 7 such as the Beppo Levi theorem rely on .

Next, we are interested in the limit inferior of sequences in ${\overline{\mathbb {R}}}$. But, Coquelicot only provides of type , and nothing for sequences. Therefore, we defined a minor variant of the desired type and proved a few lemmas by directly copying their proofs from those for in Coquelicot.

4 Measurability

We present now the formalization of $\sigma $-algebras, which are defined as an inductive type. They characterize measurable subsets, and particular attention is paid to $\mathbb {R}$, ${\overline{\mathbb {R}}}$ and $\mathbb {R}^2$, where the open subsets generate the Borel measurable subsets.

The issue of subsets is briefly addressed in Sect. 4.1. Section 4.2 is devoted to the measurability of subsets, and Sect. 4.3 to Cartesian products. The Borel subsets of $\mathbb {R}$ and ${\overline{\mathbb {R}}}$ are detailed in Sect. 4.4. And Sect. 4.5 deals with the measurability of functions.

4.1 Subsets and Characteristic Functions

We consider a generic set defined in Coq as . Usually, subsets of are defined in Coq as having type , or . We choose , and this is discussed in Sect. 9.2. Then, the power set of has type .

Given a subset A, we define its characteristic function (or indicator function) ${\mathbb {1}}_{A}$ that maps elements of A to 1, and others to 0.

Indeed, it is very convenient for direct use in arithmetic expressions without exhibiting the membership conditional in a dependent type or an assumption. It is used a lot in the context of simple functions in Sect. 6.

The characteristic function is also convenient to simulate the restriction of a numerical function to a subset, for instance, in Sect. 4.5. More precisely, the mathematical function ${f}_{|_{A}}$ could be formalized either as a record with a dependent type or as a total function. We have explored the first way which became impractical as proofs creep into our statements and prevent some rewritings. The total function is then $f\times {\mathbb {1}}_{A}$, which is the correct value when needed and 0 elsewhere. This is perfectly suited to our context, as integrating zero has no impact.

4.2 Measurability of Subsets

The design choice for the measurability of subsets, i.e., the definition of $\sigma $-algebra, is a central issue for this paper. Even though several equivalent definitions are possible, the use of an inductive type has proved successful, with several proofs done by induction.

Among several possible informal definitions [23, Section 8.6], a $\sigma $-algebra is a subset of the power set that contains the empty set and is closed under complement and countable unions. In fact, a $\sigma $-algebra can be really huge and it is very convenient to represent it with a smaller collection G of so-called generators, and to consider the smallest $\sigma $-algebra containing G. This corresponds to the informal concept of generated $\sigma $-algebra. Indeed, in many situations, it is sufficient to establish a property on G to have it on the whole $\sigma $-algebra generated by G.

While this may suggest the use of a record, we rely on an inductive type. More precisely, we “start” with a collection of generators . Then, a subset is measurable if it is either a generator, empty, the complement of a measurable subset, or the countable union of measurable subsets. This design choice is discussed in Sect. 9.3. Note that the issue of generators is at the center of Sect. 4.3 for Cartesian products and discussed in Sect. 4.4 for the Borel subsets of real numbers.

From this definition, we then prove various lemmas, relying on our classical setting, such as measurability of the full set, and of countable intersections.

A mathematically unexpected, but quite useful theorem is the following.

Constant properties (that do not depend on a variable), be they true or false, are measurable as both (the full set) and (the empty set) are measurable. When decomposing a subset to prove its measurability, this comes in handy.

In many situations, several collections of generators are possible, and switching between them may be convenient for the proof at hand. In fact, if $G_1$ is included in the $\sigma $-algebra generated by $G_2$, and vice versa, then both generated $\sigma $-algebras are the same. This yields the following extensionality result.

We now define what is a $\sigma $-algebra, but this definition is hardly used later on as we rely mostly on the previous inductive. A $\sigma $-algebra is formally defined as a subset of the power set that is equal to the $\sigma $-algebra induced by itself as generator.

We have the equivalence with one of the commonly used mathematical definitions: $\mathcal {S}$ is a $\sigma $-algebra when it contains the empty set and is closed under complement and countable unions.

We can of course prove that the basic $\sigma $-algebras are indeed compliant with our definition, be it the discrete $\sigma $-algebra (the whole power set), or the trivial $\sigma $-algebra (reduced to $\{\varnothing ,E\}$).

An immediate consequence of the extensionality result about generators is the idempotence of $\sigma $-algebra generation. Indeed, the $\sigma $-algebra generated by a given generated $\sigma $-algebra is the very same $\sigma $-algebra. This may be expressed in Coq as , showing that our definition is indeed a $\sigma $-algebra in the mathematical sense. In addition, the definition by induction gives us for free that our definition represents the smallest generated $\sigma $-algebra.

To sum up, in our development, the measurability of subsets of is built by induction from a generator , providing a $\sigma $-algebra.

4.3 Cartesian Product and Measurability

Although we do not deal with the Tonelli–Fubini theorems in this paper, the Cartesian product is used in Sect. 4.5 to establish measurability of the addition and multiplication of two measurable numerical functions.

Given two measurable spaces, i.e., two sets E and F and their associated generators $G_E$ and $G_F$, it is natural to ask the question of measurability on the Cartesian product $E\times F$; but with which $\sigma $-algebra? Among other possibilities, the tensor product of the two $\sigma $-algebras is of paramount interest, since it makes both canonical projections (the maps $((x_E,x_F)\mapsto x_E)$ and $((x_E,x_F)\mapsto x_F)$) measurable. It is the $\sigma $-algebra generated by the Cartesian products of measurable subsets of E and F.

Unfortunately, on the matter of generator, simply taking the Cartesian products of elements of $G_E$ and $G_F$ is not correct: in this case, for instance, one cannot prove the measurability of $A_E \times F$, for $A_E\in G_E$. We need to add the full sets to the initial generator, using the following definition.

And we prove this satisfies the desired property.

4.4 Borel Subsets of Real Numbers

We specify now an important class of $\sigma $-algebras. When the measurable space has also a topological space structure (e.g., in Coquelicot, see Sect. 2.1), one usually selects the Borel $\sigma $-algebra. It is generated by all the open subsets, or equivalently by all the closed subsets, and has the nice property of providing measurability for continuous functions (see Sect. 4.5).

Lebesgue integration theory is essentially meant for real-valued functions (or with codomain ${\overline{\mathbb {R}}}$, $\mathbb {R}^n$, or $\mathbb {C}^n$). Thus, we need to equip $\mathbb {R}$, and ${\overline{\mathbb {R}}}$, with their Borel $\sigma $-algebras, and we have some leeway in choosing the generators, instead of all open subsets. Now, we present our choice and also prove that other possibilities define the same $\sigma $-algebras.

Borel subsets on $\mathbb {R}$. Among many possibilities, we pick the closed intervals (of the form [a, b], with $a\leqslant b$ reals) for $\mathbb {R}$.

This choice of is somewhat arbitrary and could be changed. Thus, we introduce an anonymous that will be used in the sequel of the paper for the definition of measurable subsets of $\mathbb {R}$.

Other choices for $\mathbb {R}$ include the open intervals, or of the form [a, b), or the open left-rays of the form $(-\infty ,b)$. We proved, for instance, that measurability on $\mathbb {R}$ generated by closed intervals (our definition) is the same as measurability generated by open intervals.

The proof is a call to the generator extensionality lemma and then relies on basic properties of measurability (closedness under complement and countable union), and on the definition of a nested sequence of closed intervals (from ) whose union is an open interval (from ), thanks to the Archimedean property of $\mathbb {R}$. Moreover, from the density of $\mathbb {Q}$ in $\mathbb {R}$, we may only consider open intervals with rational endpoints.

And finally, more interestingly from a mathematical viewpoint, we prove that our measurable subsets on $\mathbb {R}$ (based on closed intervals) are indeed the Borel subsets generated by from .

The proof is simply an application of lemma from Sect. 3.1 stating that any open subset is the countable union of open intervals with rational endpoints. This is needed in Sect. 4.5 where the measurability of the addition of two measurable real-valued functions relies on the continuity of the addition in $\mathbb {R}$.

Borel subsets on $\mathbb {R}^2$. Combining the generator for a Cartesian product of Sect. 4.3 and the second-countability of $\mathbb {R}^2$ of Sect. 3.1, we have an equivalence result for the Borel subsets of $\mathbb {R}^2$.

Here, stands for the open subsets of $\mathbb {R}^2$. The canonical structures of Coquelicot deduce that $\mathbb {R}^2$, as product of two s, is a .

Borel subsets on ${\overline{\mathbb {R}}}$. For ${\overline{\mathbb {R}}}$, the generators we choose are the closed right-rays (of the form $[a,\infty ]$, $a\in {\overline{\mathbb {R}}}$), but we also define an anonymous .

We proved the equivalence with the measurability defined by closed left-rays (of the form $[-\infty ,a]$). Unlike $\mathbb {R}$, the measurability of the addition of two measurable ${\overline{\mathbb {R}}}$-valued functions does not rely on continuity anymore (see Sect. 4.5), and we did not prove that our measurable subsets on ${\overline{\mathbb {R}}}$ (based on closed rays) are indeed the Borel subsets generated by the open subsets of ${\overline{\mathbb {R}}}$, as we do not need it for now.

Next, we proved that measurability is compatible with scaling.

Note that $\ell $ may be any extended real, even 0 or $\pm \infty $. So one may imagine the numerous subcases to ensure this lemma.

4.5 Measurability of Functions

From the measurability of subsets defined above, we can now define the measurability of a function.

General case. Given two sets E and F and associated generators $G_E$ and $G_F$, a function $f:E\rightarrow F$ is measurable when for every measurable subset A, the subset $f^{-1}(A)$ is measurable, i.e., $\{x\,|\,f(x)\in A\}$ is measurable. Note that $f^{-1}$ is obviously understood as a function from the power set of F to the one of E. The measurability is then defined in Coq as follows.

We then prove some basic properties. For instance, it is enough to consider the generators to ensure the measurability of a function.

When and are also (from Coquelicot, see Sect. 2.1), the use of Borel $\sigma $-algebras (generated by the open subsets) ensures that continuous functions are measurable. As explained in Sect. 2.1, the continuity definition is based on filters.

This is simply due to the fact that the inverse image of an open subset by a continuous function is an open subset.

Case of numerical functions. Now let us consider the case of numerical functions, with codomain $\mathbb {R}$, or ${\overline{\mathbb {R}}}$. The definition relies on the generators and defined above.

Later on, we have to deal with piecewise-defined functions, and in such a situation, it is interesting to treat each piece separately, and to use the restriction defined in Sect. 4.1 as the multiplication by the characteristic function. The following result, simple but useful, states that given a measurable subset A and a measurable function g, given a function f equal to g on A, then $f\times {\mathbb {1}}_{A}$ is measurable. Its proof is rather easy given the proved properties of the measurability of subsets.

The main mathematical result of the rest of this section is the compatibility of measurability of functions with algebraic operations (addition, scalar multiplication and multiplication); the most complex one being the addition. From the mathematical standpoint, when extended real values are involved, it is assumed that these operations are well defined. In Coq, when using operations on from Coquelicot that are total functions, the situation is different, and somewhat more complex as explained below.

Functions to $\mathbb {R}$. Let us prove first the measurability of the sum of two measurable real-valued functions.

The proof uses the compatibility of measurability with the composition of functions: if both f and g are measurable, then so is $f\circ g$. This is applied to $f{\mathop {=}\limits ^{\mathrm {def.}}}((x,y)\mapsto x+y)$ of type $\mathbb {R}^2\rightarrow \mathbb {R}$ and $g{\mathop {=}\limits ^{\mathrm {def.}}}(x\mapsto (f_1(x),f_2(x)))$ of type $\mathbb {R}\rightarrow \mathbb {R}^2$.

Measurability on $\mathbb {R}^2$ relies on , the generator of the Borel subsets of $\mathbb {R}^2$ defined in Sect. 4.4. The proof is based on the generator equivalence between and , and on the continuity of addition. This proof was not difficult, but happened to be much higher-level than expected. The multiplication of real-valued functions is treated exactly in the same way.

Scalar multiplication for measurable functions is deduced from a similar theorem about scalar multiplication for measurable subsets. In the end, the measurable real-valued functions form an algebra (over the field $\mathbb {R}$); however, we have not stated it (with canonical structures for instance) as we have no use for it, but all the needed lemmas are proved.

Functions to ${\overline{\mathbb {R}}}$. Let us consider now the addition of measurable extended real-valued functions. The semantics of $+_{{\overline{\mathbb {R}}}}$ is more complex, as it raises the question of what is $\infty -\infty $. We rely on the Coquelicot definition of . As a total function, it returns 0 in this special case, see Sect. 2.1. The proof for $\mathbb {R}$ was based on the continuity of $+$; but that cannot be used here, as $+_{{\overline{\mathbb {R}}}}$ is not continuous on the whole set ${\overline{\mathbb {R}}}^2$ (there are problems at infinity, even for the total function).

In order to stick closely to the mathematics, we rely on a way to express the legality of addition: the property that basically prevents adding $\infty $ and $-\infty $. Thus, we prove the following theorem.

The proof is a little tedious as it splits E into all the possible cases using : when both $f_1(x)$ and $f_2(x)$ are finite, the previous theorem on $\mathbb {R}$ is used. Otherwise, the preimages of $\pm \infty $ are measurable since singletons are (as closed subsets). Thus, we are able to finish all the cases.

Among the peculiarities of Coq compared to mathematics, note that a simpler theorem can be devised.

It states the same conclusion, but without assuming the legality of addition. Indeed, the total function $(x\mapsto f_1(x)+_{{\overline{\mathbb {R}}}}f_2(x))$, with value 0 when both operands are infinite opposites, is actually measurable. This subtlety when considering $\infty -\infty $ is related to total functions, a design choice that prevents dependent types but may give strange results when out of the domain of the function. This strangeness also exists in the Coq standard library of reals [54] when considering the division as a total function, making a valid real. This hard question would be solved more naturally in other provers, for instance, in PVS relying on TCCs (Type-Correctness Conditions) [60]. To conclude, the main problem with this theorem is that it does not state what the mathematicians read in it, so we have decided not to use it.

The multiplication of two functions taking their values in ${\overline{\mathbb {R}}}$ is treated similarly. However, it does not raise the same issues as addition, because Coquelicot and mathematics for measure theory use the same convention $\pm \infty \times 0=0$, see Sect. 2.1. Multiplication by a scalar is deduced from a similar theorem on measurable subsets. Note that in contrast to the case of $\mathbb {R}$, measurable functions with values in do not form an algebra, as is not associative, see Sect. 3.2.

5 Measure

A measurable space with a $\sigma $-algebra can be equipped with a measure. A measure is a mapping from measurable subsets to nonnegative extended real values that satisfies additivity properties. Some well-known measures are the Lebesgue measure, the counting measure, the Dirac measure (see Sect. 8), and numerous probability measures (that take values in the interval [0, 1]).

Measure theory is a general abstract setting that applies to any measure, and the axiomatization of their fundamental properties is formalized here with an instantiation in Sect. 8.

5.1 Specification and Basic Properties

Given a measurable space defined by a set and a generator of type (see Sect. 4.2), our design choice is to specify measures as a type containing a map together with the fundamental properties making this map a measure.

The measure is defined as a record. For the sake of brevity, we want to use it directly as a function, so we have a coercion (hence the symbol ) between the type and .

The first two properties and are self-explanatory. Using standard mathematical notations ($\uplus $ denotes the disjoint union), the $\sigma $-additivity of a map $\mu $ means that for any sequence $(A_n)_{n\in \mathbb {N}}$ of pairwise disjoint measurable subsets of E, we have $\mu \left( \biguplus _{n\in \mathbb {N}}A_n\right) =\sum _{n\in \mathbb {N}}\mu (A_n)$. Note that infinite summations in ${\overline{\mathbb {R}}}_+$ are formalized as the supremum of partial sums (see Sect. 3.3).

From these fundamental axioms, we prove several other properties of measures among which monotony (i.e., $A\subseteq B\Rightarrow \mu (A)\leqslant \mu (B)$, for measurable subsets A and B), and the weakening of $\sigma $-additivity into (finite) additivity, for finite unions of pairwise disjoint subsets. For instance, the special case of the union of two disjoint subsets simplifies into

Another interesting result is the following decomposition of the measure of a measurable subset using a countable partition of the set .

The proof derives directly from $\sigma $-additivity. A weakened version for finite partitions is useful to establish additivity of the integral of nonnegative simple functions in Sect. 6.3.

5.2 Boole’s Inequality and Continuity from Below

The $\sigma $-additivity and additivity properties described in Sect. 5.1 deal with the union of pairwise disjoint measurable subsets. When the union is not disjoint, the equality becomes an inequality, and the resulting subadditivity property is called Boole’s inequality. The proof path we have followed first addresses the finite case, then establishes an important intermediate result known as continuity from below, and finally deals with the infinite case of $\sigma $-subadditivity.

Let us first consider finite subadditivity. It states that for any finite sequence $(A_n)_{n\in [0..N]}$ of measurable subsets of E, we have

$$\begin{aligned} \mu \left( \bigcup _{n \in [0..N]} A_n \right) \leqslant \sum _{n \in [0..N]} \mu (A_n). \end{aligned}$$

The proof is performed by induction on the parameter N and uses several previously proved results, such as additivity and monotony of measures, and compatibility of measurability with finite union and intersection. A specialization for the case $N=2$, called , will be handy in the sequel.

The next step is technical, it allows to transform any countable union of subsets into a pairwise disjoint union, while keeping equal the partial unions. When the input sequence $(A_n)_{n\in \mathbb {N}}$ is nondecreasing, the new sequence of pairwise disjoint subsets somehow corresponds to “nested onion peels”: $B_0{\mathop {=}\limits ^{\mathrm {def.}}}A_0$, and for all $n\in \mathbb {N}$, $B_{n+1}{\mathop {=}\limits ^{\mathrm {def.}}}A_{n+1}\setminus A_n$. The Coq formalization is quasiliteral.

We prove several properties of layers, such as compatibility with partial and countable union (i.e., $\biguplus _{n\in I}B_n=\bigcup _{n\in I}A_n$ with , for $I=[0..N]$ and $I=\mathbb {N}$), and compatibility with measurability (i.e., the layers of a sequence of measurable subsets are measurable).

Our main application of layers and their properties is the continuity from below of measures. This results states that for any nondecreasing sequence $(A_n)_{n\in \mathbb {N}}$ of measurable subsets of E (i.e., $A_n\subseteq A_{n+1}$), we have

$$\begin{aligned} \mu \left( \bigcup _{n \in \mathbb {N}} A_n \right) \leqslant \lim _{n \rightarrow \infty } \mu (A_n). \end{aligned}$$

Note that monotony of measures allows to replace the limit by a supremum (see Sect. 3.3). Again, the Coq formalization is straightforward.

The proof simply stems from finite additivity and $\sigma $-additivity of measures and from careful use of the properties of layers.

Finally, let us consider $\sigma $-subadditivity, i.e., Boole’s inequality. It states that for any sequence $(A_n)_{n\in \mathbb {N}}$ of measurable subsets of E, we have

$$\begin{aligned} \mu \left( \bigcup _{n \in \mathbb {N}} A_n \right) \leqslant \sum _{n \in \mathbb {N}} \mu (A_n). \end{aligned}$$

It is formalized using .

The proof is an application of continuity from below to the sequence of partial unions ($B_N{\mathop {=}\limits ^{\mathrm {def.}}}\bigcup _{n\in [0..N]}A_n)$. In Coq, partial unions are defined using existential quantification that makes the proof process convenient and fluid.

Then, the proof resumes by applying finite subadditivity to the nondecreasing sequence , and using properties of the supremum.

5.3 Negligible Subsets

The concepts of negligible subset and property satisfied almost everywhere are of major importance in Lebesgue integration theory. They are the key ingredients to obtain the positive definiteness property (i.e., $\Vert u\Vert =0\Rightarrow u=0$) of the norm in $L^p$ Lebesgue spaces, which will be the subject of future developments.

A subset A of E is said to be negligible for the measure $\mu $, or simply $\mu $-negligible, when it is included in a measurable subset B of measure 0.

We prove several simple results about negligible subsets. For instance, measurable subsets of measure 0 are negligible, and subsets of negligible subsets are negligible. The negligibility of the countable union of negligible subsets is a bit more challenging; it is a consequence of Boole’s inequality.

This lemma is the only one where we rely on the choice property, see Sect. 2.2. The reason is as follows. Given a natural number n, as we have , we deduce the existence of a B containing $A_n$ that is both measurable and of measure 0. But the use of Boole’s inequality and of the measurability of a countable union of sets require a sequence of these B. So we rely on to go from “for each n, we have a B” to a sequence of type with the expected properties.

A property is said to hold $\mu $-almost everywhere () when its complement is $\mu $-negligible.

We prove some simple results about $\mu $-almost everywhere properties. For instance, the countable intersection of properties holding $\mu $-almost everywhere holds $\mu $-almost everywhere. This derives from . An important instantiation of is the equality $\mu $-almost everywhere, used in Sect. 7.4.

6 Simple Functions

Simple functions are real-valued functions that attain only a finite number of values. But, unlike step functions used for Riemann sums, each value may be taken here on a nonconnected subset.

This is a very simple mathematical definition, but it will require some proof engineering to have a usable formal definition. Another mathematical equivalent definition is that a simple function is a finite linear combination of indicator functions, and can be expressed as

$$\begin{aligned} f = \sum _{y \in f (E)} y \times {\mathbb {1}}_{f^{-1}(\{y\})}, \end{aligned}$$

(3)

where ${\mathbb {1}}_{}$ is the characteristic function (see Sect. 4.1). This definition is impractical in Coq as it sums over f(E) that may be infinite in general. Only the property of f makes this subset finite. We choose to have a data structure that allows us to access the possible values, in order to be able to compute the integral of simple functions, and we choose to have them as a list. Indeed, the Coq List library is rather comprehensive, even if not perfectly suited for our use. We also finally choose to have simple functions of type and not ; this is discussed in Sect. 9.4.

We consider an ambient set now required to be inhabited. The empty case is not of interest here, and it would mean empty lists that make the following functions fail. Instead of having additional hypotheses on the lists, it was easier not to consider empty types. Given a function and a list, the property states that the values taken by the function belong to the list.

Note that this list is far from unique: the elements may be in any order, can be taken several times, and useless values may be in the list. Hence, the need for a canonical list that is computed in Sect. 6.1, in order to integrate nonnegative simple functions, as described in Sect. 6.2. The positive linearity of the integral is shown in Sect. 6.3.

6.1 Canonical Representation

As explained above, the property does not specify a unique list. To enforce uniqueness, we need a strictly sorted list, with only the useful values.

where (from the Coq standard library) is the inductive definition of a sorted list using the relation. Here, we require the strict order to prevent duplicates.

The related proofs are then threefold. First, we need to prove that only one list fits the requirement.

The proof is not difficult, but slightly tedious. An intermediate lemma states that if two lists have the same elements (using ) and are both with , then they are equal.

Second, to recover the fact that our simple functions are indeed a finite linear combination of indicator functions, we also prove that

implies the same equality as (3), but for y in the list: $f=\sum _{y\in \ell }y\times {\mathbb {1}}_{f^{-1}(\{y\})}$.

Last but not least, we need to be able to build this canonical list using several intermediate steps.

Let us explain these functions. The function takes advantage of the axiom to select the values of a list having a given property. The function then allows us to select only the useful values of the list (such that there exists a preimage to it). The function from the Coq standard library removes duplicates (the decidability of equality on real numbers is given). We redefined the sort function and call it with the nonstrict order .

The canonizer function is then a successive call to , and . Note that these operations are actually commuting, thus any ordering would have been correct. We choose the one that eases the proofs. In particular, is the last called function as it will imply an easy proof that the final list is sorted. The function is called first as few lemmas are available on it.

The correctness of this canonizer is then proved.

For instance, consider the case of the characteristic function of some proper subset A (distinct from the empty set and from the full set, thus assuming that type is neither empty, nor representing a singleton). This function actually takes the values 0 and 1 (see Sect. 4.1): it is a simple function. Starting with any list of real numbers containing 0 and 1, e.g., , we may show the property . And thus, from the previous lemma, we have , where the canonized list is simply .

6.2 Integration of Nonnegative Simple Functions

Following the definition of simple functions, we retain those for which preimages of singletons are measurable and thus admit a measure, possibly infinite. Those measurable simple functions are collected into the set $\mathcal {SF}$, and the subset of nonnegative ones is denoted $\mathcal {SF}_+$. The needed tools for integrating in $\mathcal {SF}_+$ are sums on ${\overline{\mathbb {R}}}$ as defined in Sect. 3.2, and a measure $\mu $ as defined in Sect. 5. The integral of $f\in \mathcal {SF}_+$ is defined by

$$\begin{aligned} \int _{\mathcal {SF}_+}f \, \mathrm{d}\mu {\mathop {=}\limits ^{\mathrm {def.}}}\sum _{y \in f (E)} y \times \mu \left( f^{-1}(\{y\}) \right) . \end{aligned}$$

(4)

We first need to specify simple functions of $\mathcal {SF}$ that have measurable preimages.

Note that the list is in as we need to get a hand on it to compute the integral. A weak existential is not strong enough for our purpose. In Coq, the notation means there exists such that , but the type is . This means it is a strong existential, i.e., it is possible to pick up the witness .

Note also that since singletons are Borel subsets of $\mathbb {R}$ (as closed subsets), we are able to prove measurability of functions in $\mathcal {SF}$.

Then, the definition of the integral in $\mathcal {SF}_+$ is straightforward from a proof of type .

Note the required hypothesis that encompasses both the proof that f is a valid simple function, and the list witness $\ell $ on which the definition depends. Then, returns the first part of this proof, that is the list , in order to sum on it. This dependent type is only inside the library and is not to be used outside: final users will make use only of a total function for the Lebesgue integral. This limited use of dependent types has not proved inconvenient.

We first prove that the value of the integral does not depend on the chosen list/proof ( and in the next lemma).

A first easy example of integration is the relationship between this integral and the characteristic function. The characteristic function has two possible values (0 and 1), so it is a simple function. When the subset A is measurable, ${\mathbb {1}}_{A}$ belongs to $\mathcal {SF}_+$, and its integral is, as expected, the measure of A.

6.3 Linearity of the Integral of Simple Functions

Then comes a proof that is unexpectedly complex, the additivity of the integral in $\mathcal {SF}_+$ : $\int _{\mathcal {SF}_+}(f+g)\,\mathrm{d}\mu =\int _{\mathcal {SF}_+}f\,\mathrm{d}\mu +\int _{\mathcal {SF}_+}g\,\mathrm{d}\mu $.

Alternate proofs are available, e.g., see [37, 62], but were not considered in this work. We choose a proof using a change of variable, from the sum of values taken by each function f and g to the values taken by their sum $f+g$. The main difficulty is that the canonical list $\ell _{f+g}$ associated with $f+g$ has nothing to do with any kind of “addition” of the lists $\ell _f$ and $\ell _g$ associated with f and g.

The first stage is a lemma stating that $\mathcal {SF}$ is closed under addition.

For that, we rely on

that gathers all possible sums from two lists. When applied to $\ell _f$ and $\ell _g$, the result may be too large a list, but no useful value is missing. Thus, we may strip unwanted values by applying the previously defined canonizer(see Sect. 6.1).

The second stage is a couple of lemmas coming from the fact that the subsets $f^{-1}(\{y\})$ for $y\in f(E)$ constitute a partition of the domain E of the function f. First, a specialization of the finite version of the lemma (see Sect. 5.1) for preimages by functions f and g provides

Note that this result is first proved with the assumption that y is actually a value taken by f. But this premise can be dropped as for all other values of y, the equality to show simplifies into the trivial $0=0$.

Then, the result is lifted to the integral level,

$$\begin{aligned} \sum _{y \in f (E)} y \times \mu \left( f^{-1}(\{y\}) \right) = \sum _{y \in f (E)} y \left( \sum _{z \in g (E)} \mu \left( f^{-1}(\{y\}) \cap g^{-1}(\{z\}) \right) \right) , \end{aligned}$$

which is formalized as

The third stage consists in applying the latter lemma to justify the rewritings in the equations below. First, for both integrals in the sum.

$$\begin{aligned} \int _{\mathcal {SF}_+}f \, \mathrm{d}\mu + \int _{\mathcal {SF}_+}g \, \mathrm{d}\mu= & {} \sum _{y \in f (E)} y \, \mu \left( f^{-1}(\{y\}) \right) + \sum _{z \in g (E)} z \, \mu \left( g^{-1}(\{z\}) \right) \\= & {} \sum _{y \in f (E)} \sum _{z \in g (E)} y \, \mu \left( f^{-1}(\{y\}) \cap g^{-1}(\{z\}) \right) \\&+ \sum _{z \in g (E)} \sum _{y \in f (E)} z \, \mu \left( g^{-1}(\{z\}) \cap f^{-1}(\{y\}) \right) \\= & {} \sum _{y \in f (E)} \sum _{z \in g (E)} (y + z) \, \mu \left( f^{-1}(\{y\}) \cap g^{-1}(\{z\}) \right) . \end{aligned}$$

And then, for the integral of the sum, where the lemma is applied with $=f+g$ and $=f$.

$$\begin{aligned} \int _{\mathcal {SF}_+}(f + g) \, \mathrm{d}\mu= & {} \sum _{t \in (f + g) (E)} t \, \mu \left( (f + g)^{-1}(\{t\}) \right) \\= & {} \sum _{t \in (f + g) (E)} t \, \sum _{y \in f (E)} \mu \left( (f + g)^{-1}(\{t\}) \cap f^{-1}(\{y\}) \right) \\= & {} \sum _{y \in f (E)} \sum _{t \in (f + g) (E)} t \, \mu \left( f^{-1}(\{y\}) \cap (f + g)^{-1}(\{t\}) \right) . \end{aligned}$$

Finally, the last step of the additivity proof is to connect both sets of equalities by establishing the following “horrible” change of variable formula

$$\begin{aligned}&\sum _{z \in g (E)} (y + z) \, \mu \left( f^{-1}(\{y\}) \cap g^{-1}(\{z\}) \right) \\&\quad = \sum _{t \in (f + g) (E)} t \, \mu \left( f^{-1}(\{y\}) \cap (f + g)^{-1}(\{t\}) \right) , \end{aligned}$$

that is formalized as

The key ingredient here is that sums may be restricted to only their nonzero terms, which makes the change of variable $z\mapsto t=y+z$ (for fixed y) a bijection.

An interesting point is that this lemma is hardly explicit in mathematical textbooks and we had to puzzle it out to fulfill the proof. We had to write it explicitly as it was a key point in our design choice for simple functions, see Sect. 9.4.

Ultimately, we end up with similar double summations, and we are able to prove the additivity of the integral in $\mathcal {SF}_+$.

As a break, we establish the compatibility of the integral in $\mathcal {SF}_+$ with nonnegative scaling.

This calls for a proof that a simple function multiplied by a scalar is indeed a simple function. Then, we only need to require that both the scalar and the function are nonnegative to ensure that $\int _{\mathcal {SF}_+}a\times f\,\mathrm{d}\mu =a\times \int _{\mathcal {SF}_+}f\,\mathrm{d}\mu $. Then, monotony of the integral in $\mathcal {SF}_+$ is a direct consequence of additivity, since the relation $g=f+(g-f)$ holds in $\mathcal {SF}_+$ when $f\leqslant g$.

7 Integration of Nonnegative Functions

Let us now consider functions of type that may take an infinite number of (possibly infinite) values. The set of measurable functions to ${\overline{\mathbb {R}}}$ is denoted by $\mathcal {M}$, and its subset of nonnegative functions by $\mathcal {M}_+$. The key idea for the definition of the integral in $\mathcal {M}_+$ is to use approximations from below by simple functions in $\mathcal {SF}_+$, and surprisingly, we benefit from the use of computer arithmetic.

The integral is presented in Sect. 7.1 together with some preliminary properties. Then, Sect. 7.2 is devoted to the crucial Beppo Levi (monotone convergence) theorem. Adapted sequences are defined in Sect. 7.3. Linearity and other properties of the integral are displayed in Sect. 7.4. Finally, Sect. 7.5 is devoted to Fatou’s lemma, the other major result on the integral for nonnegative functions.

7.1 Definition and First Properties

We now want to define the Lebesgue integral for nonnegative integrable functions. The mathematical definition is

$$\begin{aligned} \forall f \in \mathcal {M}_+,\quad \int _{\mathcal {M}_+}f \, \mathrm{d}\mu \,{\mathop {=}\limits ^{\mathrm {def.}}}\,\sup _{\begin{array}{c} \psi \in \mathcal {SF}_+\\ \psi \leqslant f \end{array}} \int _{\mathcal {SF}_+}\psi \, \mathrm{d}\mu \end{aligned}$$

where the supremum is taken for nonnegative measurable simple functions $\psi $ less than or equal to f pointwise, and where the integral in $\mathcal {SF}_+$ is defined in Sect. 6.2.

Keeping on with total functions, we prescribe a value whatever the input function f, with a Coq definition quite similar to the mathematical one.

The supremum is taken on a subset of extended reals z such that there exists a simple function

less than or equal to f having z for integral.

The first thing to prove is that this newly defined integral is the same as the already-defined integral when the function is a simple function, i.e., $\int _{\mathcal {M}_+}f\,\mathrm{d}\mu $ is equal to $\int _{\mathcal {SF}_+}f\,\mathrm{d}\mu $ for all f in $\mathcal {SF}_+$, and in Coq

Then comes the monotony of the integral.

$$\begin{aligned} \forall f, g \in \mathcal {M}_+,\quad f \leqslant g \quad \Longrightarrow \quad \int _{\mathcal {M}_+}f \, \mathrm{d}\mu \,\leqslant \,\int _{\mathcal {M}_+}g \, \mathrm{d}\mu . \end{aligned}$$

The Coq translation becomes

Indeed, the least upper bound (LUB) in the definition of the total function is enough to ensure monotony for any functions f and g, not only for the nonnegative and measurable ones as in the mathematical statement.

Another easy result is about the multiplication by a nonnegative scalar value.

As before, there is no assumption on the fact that f is nonnegative.

The following extensionality result instantiated for restricted functions has proved useful. It states that when functions are equal on a measurable subset, then the integral of their restriction to that subset are equal. This is hardly mentioned in mathematics. As before, there is no requirement on the properties of A, f and g. The total function gives something that is the same in both cases, even for nonmeasurable functions.

7.2 The Beppo Levi Theorem

The Beppo Levi theorem (see Textbook Theorem 1, page 3), also known as the monotone convergence theorem, is one of the most fundamental results in measure and integration theories. It states that for any sequence $(f_{n})_{n \in \mathbb {N}}$ of pointwise nondecreasing and nonnegative measurable functions (i.e., in $\mathcal {M}_+$), the pointwise limit $\lim _{n \rightarrow \infty } f_n$ (which actually equals $\sup _{n \in \mathbb {N}} f_n$, see Sect. 3.3) is also in $\mathcal {M}_+$. This property is proved using standard properties of measurable functions such as monotony and is not particularly challenging. The Beppo Levi theorem also states the following integral-limit exchange formula

$$\begin{aligned} \int _{\mathcal {M}_+}\sup _{n \in \mathbb {N}} f_n \, \mathrm{d}\mu = \sup _{n \in \mathbb {N}} \int _{\mathcal {M}_+}f_n \, \mathrm{d}\mu \end{aligned}$$

which is stated in Coq as

The proof of this equality is technical and relies on a wide variety of previously proved results. It can be divided into two inequalities. The easy one, $\sup _{n\in \mathbb {N}}\int _{\mathcal {M}_+}f_n\,\mathrm{d}\mu \leqslant \int _{\mathcal {M}_+}\sup _{n\in \mathbb {N}}f_n\,\mathrm{d}\mu $, is proved using monotony of the integral, and fundamental properties of the supremum. The other one,

$$\begin{aligned} \int _{\mathcal {M}_+}\sup _{n \in \mathbb {N}} f_n \, \mathrm{d}\mu \leqslant \sup _{n \in \mathbb {N}} \int _{\mathcal {M}_+}f_n \, \mathrm{d}\mu , \end{aligned}$$

(5)

is more intricate. The first step of the proof is to show that for any $\psi \in \mathcal {SF}_+$ less than or equal to $\sup _{n\in \mathbb {N}}f_n$, and any number $0<a<1$, we have the bound

$$\begin{aligned} \int _{\mathcal {M}_+}\psi \, \mathrm{d}\mu \leqslant \frac{1}{a} \sup _{n \in \mathbb {N}} \int _{\mathcal {M}_+}f_n \, \mathrm{d}\mu . \end{aligned}$$

(6)

For that purpose, the subsets $A_n=\{x\in E\,|\,a\psi \leqslant f_n\}$ are first shown to be nondecreasing and measurable, the latter coming from the measurability of functions $a\psi -f_n$. Then, they are shown to cover the full set E, which is stated in Coq as

, and the existential is exhibited as a rank N above which we have $a\psi (x)\leqslant f_n(x)$. Then, the proof of Eq. (6) relies on continuity from below of the measure (see Sect. 5.2), measurability of simple functions (see Sect. 6.2), linearity properties of the integral in $\mathcal {SF}_+$ (see Sect. 6.3), and monotony and compatibility with characteristic functions for the integral in $\mathcal {M}_+$ (see Sect. 7.1).

Finally, the inequality (5) is obtained by taking in (6) the limit as a goes up to 1, and from the definition of the integral in $\mathcal {M}_+$ (see Sect. 7.1) and properties of the supremum.

This concludes the proof of the Beppo Levi theorem.

7.3 Adapted Sequences

As for simple functions, a real difficulty is the additivity property of the integral in $\mathcal {M}_+$ : $\int _{\mathcal {M}_+}(f+g)\,\mathrm{d}\mu =\int _{\mathcal {M}_+}f\,\mathrm{d}\mu +\int _{\mathcal {M}_+}g\,\mathrm{d}\mu $. Given the definition of the Lebesgue integral (see Sect. 7.1), the common proof relies on adapted sequences in $\mathcal {SF}_+$.

An adapted sequence for a function f is a pointwise nondecreasing sequence of nonnegative functions that is pointwise converging from below toward f.

In our case, the adapted sequences of interest are the measurable simple functions of $\mathcal {SF}$. We then deduce that the sequence of integrals of such a sequence converges toward the integral of f from below.

Having this definition and its link to the integral is far from enough. We need to have an adapted sequence corresponding to any function in $\mathcal {M}_+$. This building is quite easy in mathematics: for f nonnegative, the chosen adapted sequence is

$$\begin{aligned} \varphi _n (x) {\mathop {=}\limits ^{\mathrm {def.}}}\left\{ \begin{array}{ll} \displaystyle \frac{\left\lfloor 2^n f (x) \right\rfloor }{2^n} &{}\quad \text{ when } f (x) < n,\\ n &{}\quad \text{ otherwise }. \end{array} \right. \end{aligned}$$

(7)

We began by translating literally this definition. Then, we tried to prove that the sequence is nondecreasing, and so on. One of the authors then noticed that such proofs were already done and available in the library Flocq [10] dedicated to computer arithmetic. Flocq is a formalization of floating-point arithmetic that provides a comprehensive library of theorems on a multi-radix multi-precision arithmetic. It also supports efficient numerical computations inside Coq. As it aims at genericity, even computer arithmetic formats are abstract to encompass floating-point and fixed-point arithmetics and many proved results also hold in fixed-point arithmetic. Seen from a computer science point of view, the definition of $\varphi _n$ in Eq. (7) indeed relies on a fixed-point rounding downward with a least significant bit (lsb) of $-n$. It is formalized in Coq as

Many proofs related to inequalities (such as $(\varphi _n)_{n\in \mathbb {N}}$ is bounded and nondecreasing) are really smooth, relying on the support library of Flocq. For instance, the theorem $\varphi _n(x)\le f(x)$ is a split whether $f(x)<n$, followed by a call to a property of the rounding downward.

A more involved example lies in the proof of $\varphi _n(x)\le \varphi _{n+1}(x)$. We first split depending whether $f(x)<n$ or $n+1$, and handle three simple cases. The most difficult case should be to prove that

$$\begin{aligned} \frac{\left\lfloor 2^n f (x) \right\rfloor }{2^n} \le \frac{\left\lfloor 2^{n+1} f (x) \right\rfloor }{2^{n+1}} \end{aligned}$$

when $f(x)<n$. Indeed, a direct proof does not seem so straightforward. Taking the computer arithmetic point of view, it becomes $\bigtriangledown _n(x)\le \bigtriangledown _{n+1}(x)$, with $\bigtriangledown _n$ the rounding down in fixed-point arithmetic with least significant bit (lsb) n. Then, we rely on the following floating-point theorem: if $u\le v$ and $u\in \mathbb {F}$, then $u\le \circ (v)$ for any reasonable format $\mathbb {F}$ and rounding $\circ $. Applying it, there is left to prove both that $\bigtriangledown _n(x)\le x$ (trivial by the properties of the rounding down) and that $\bigtriangledown _n(x)$ fits in the fixed-point format with least significant bit $-n-1$ (easy as it has a lsb of $-n$). The proof of this subcase has been done in 10 lines of standard Coq.

Even the convergence was eased by existing Flocq error lemmas. The main result states that the $\varphi _n$ are indeed an adapted sequence.

The part left to prove is that the $\varphi _n$ are in $\mathcal {SF}$. The first thing to prove is that the preimages of $\varphi _n$ are measurable subsets.

There are several proof paths. The chosen one is to prove that the subset is either empty (so measurable), or $y\le n$. We also prove that y is a fixed-point number with lsb n. Then, we have

$$\begin{aligned} \varphi _n (x) = y \quad \Longleftrightarrow \quad y \le f (x) < \mathrm {succ}(y) \end{aligned}$$

where $\mathrm {succ}$ is the successor in fixed-point arithmetic, meaning the next number in the format with lsb n. This subset is measurable as f is measurable, and from the properties of Borel subsets of Sect. 4.4. There are some special cases related to $\infty $ and to $y=n$ that are tedious but easy.

Last is to prove that the $\varphi _n$ only take a finite number of values, making them valid simple functions.

The mathematical definition is clear, but this is the first proof of this kind (the only previously proved simple function is the characteristic function). So we have to build by hand the list of all possible values of $\varphi _n$: first the list of integers i with $0\le i\le n$, and then the list of all the $i/2^n$ for $0\le i\le n2^n$. This kind of proof could clearly be automated, or simplified by dedicated lemmas if need be. From this very generic list, we only have to apply the canonizer defined in Sect. 6.1 to f. Then, we end up the proof, relying on the various properties of the canonizer, the fixed-point rounding, and the measurability above.

To conclude, we have defined explicitly an adapted sequence that we may give to the theorem , thus providing an explicit formula for the integral in $\mathcal {M}_+$ : $\int _{\mathcal {M}_+}f\,\mathrm{d}\mu =\sup _{n\in \mathbb {N}}\int _{\mathcal {M}_+}\varphi _n\,\mathrm{d}\mu $.

7.4 Linearity and Other Properties of the Integral

We present now some theorems about the integration of nonnegative measurable functions that we consider essential for a library user. They are all consequences of the Beppo Levi (monotone convergence) theorem (see Sect. 7.2). They are gathered in Table 1.

The first entry () is for additivity of the integral in $\mathcal {M}_+$. The proof is rather smooth, now that additivity of the integral in $\mathcal {SF}_+$ (see Sect. 6.3), the monotone convergence (see Sect. 7.2), and existence of an adapted sequence (see Sect. 7.3) are established. The second entry () generalizes the nonnegative scaling property (see Sect. 7.1) to the infinite case too. Both will be significant ingredients of the linearity of the Lebesgue integral for arbitrary sign functions, to build the structure of vector space of the integrable functions (out of the scope of this paper).

The remaining entries actually follow from the first two linearity results. The characterization of zero-integral functions () relies on the scaling property and the compatibility result with characteristic functions (see Sect. 7.1), while the decomposition on a partition () relies on the additivity property. Finally, the compatibility result with almost equality () relies on the last two. Note that this latter possesses a companion lemma about inequalities, and both share the same proof through the abstraction of their binary relation.

7.5 Fatou’s Lemma

Fatou’s lemma (see Textbook Theorem 2) is the other fundamental result in Lebesgue integration theory for nonnegative functions. It specifies how the situation deteriorates when the sequence in the Beppo Levi theorem is no longer monotone: the equality becomes an inequality and limits (i.e., suprema) become limits inferior. It states that for any sequence $(f_n)_{n\in \mathbb {N}}$ of nonnegative measurable functions (i.e., in $\mathcal {M}_+$), the pointwise limit $\liminf _{n\rightarrow \infty }f_n$ is also in $\mathcal {M}_+$, and the integral of the limit inferior is less than or equal to the limit inferior of the integrals,

$$\begin{aligned} \int _{\mathcal {M}_+}\liminf _{n \rightarrow \infty } f_n \, \mathrm{d}\mu \leqslant \liminf _{n \rightarrow \infty } \int _{\mathcal {M}_+}f_n \, \mathrm{d}\mu , \end{aligned}$$

which is stated in Coq as

The proof is rather short. The principle is to apply the Beppo Levi theorem (see Sect. 7.2) to the sequence $(\inf _{m\in \mathbb {N}}f_{n+m})_{n\in \mathbb {N}}$ which is shown nondecreasing, nonnegative, and measurable. We get

$$\begin{aligned} \int _{\mathcal {M}_+}\liminf _{n \rightarrow \infty } f_n \, \mathrm{d}\mu = \sup _{n \in \mathbb {N}} \int _{\mathcal {M}_+}\inf _{m \in \mathbb {N}} f_{n + m} \mathrm{d} \mu . \end{aligned}$$

Then, by the monotony result of Sect. 7.1 and the definitions of infimum and limit inferior, we have

$$\begin{aligned} \sup _{n \in \mathbb {N}} \int _{\mathcal {M}_+}\inf _{m \in \mathbb {N}} f_{n + m} \mathrm{d} \mu \leqslant \liminf _{n \rightarrow \infty } \int _{\mathcal {M}_+}f_{n} \, \mathrm{d}\mu , \end{aligned}$$

proving the result.

Note that a less common proof path is also possible: establish first Fatou’s lemma, and then the Beppo Levi theorem.

Fatou’s lemma is essential to establish other fundamental results such as the Fatou–Lebesgue theorem that collects a chain of inequalities involving both inferior and superior limits, and above all, Lebesgue’s dominated convergence theorem whose result is similar to that of the Beppo Levi theorem, but only through the dominance of the sequence (both left as future work).

8 A Simple Case Study: The Dirac Measure

It makes sense to exhibit an example of measure to test the specifications described in the previous sections and especially the axiomatic definition of Sect. 5.

For instance, the Lebesgue measure, which extends the notion of length of intervals in $\mathbb {R}$, is ubiquitous on Euclidean spaces $\mathbb {R}^n$. And the counting measure, which returns the cardinal, is pertinent on countable sets. But both present formalization issues, and their study is left for future works.

We present the construction and usage of a very simple measure, the Dirac measure. It is used, for instance, in physics to model a point mass.

The Dirac map associated with an element $a\in E$ is the function $\delta _a$ mapping any subset $A\subseteq E$ to ${\mathbb {1}}_{A}(a)$ (see Sect. 4.1).

As a measure, the total function defined above only makes sense for measurable subsets. But the Dirac measure has the salient property to be a measure for any $\sigma $-algebra on , even for the discrete $\sigma $-algebra (see Sect. 4.2).

To instantiate the Dirac map $\delta _a$ as a measure, we prove that it meets the specification of measures of Sect. 5.1. The first two properties, homogeneity () and nonnegativeness (), are obvious. The proof of $\sigma $-additivity is based on the following argument, that is valid for any pairwise disjoint sequence of subsets $(A_n)_{n\in \mathbb {N}}$ (there is at most one nonzero term in the sum).

$$\begin{aligned} \delta _a \left( \biguplus _{n \in \mathbb {N}} A_n \right) = {\mathbb {1}}_{\biguplus _{n \in \mathbb {N}} A_n} (a) = \sum _{n \in \mathbb {N}} {\mathbb {1}}_{A_n} (a) = \sum _{n \in \mathbb {N}} \delta _a (A_n). \end{aligned}$$

Then, the Dirac measure can be built for any generator .

The integral of any function with the Dirac measure is

$$\begin{aligned} \int f \, \mathrm{d}\delta _a = f (a). \end{aligned}$$

The formalization in $\mathcal {SF}_+$ for nonnegative measurable simple functions, and any generator , is (see Sect. 6.2)

The proof is a direct application of lemma of Sect. 6.1. The version in $\mathcal {M}_+$ for nonnegative measurable functions, and any generator , is (see Sect. 7.1)

Its proof is an application of lemma of Sect. 7.3.

9 Discussion on Proof-Engineering Concerns

During our development, we had to make several choices regarding logic and the formalization of mathematics.

9.1 Extended Real Numbers

Measure theory and integration of nonnegative functions that are investigated here only manipulate values in ${\overline{\mathbb {R}}}_+$. Thus arises the question of the most practical Coq formalization for nonnegative extended real numbers among the following three possible choices: either $[0,\infty ]$, $\mathbb {R}\cup \{\infty \}$, or $\mathbb {R}\cup \{-\infty ,\infty \}$.

From a mathematical point of view, all solutions are acceptable because, in addition, we either prove or require that values are nonnegative. But we also need to keep in mind that eventually we will have to deal with arbitrary sign functions. And despite the absence of algebraic structure, extended real numbers with both infinities are the usual framework often used by mathematicians to allow for simplified expressions of many statements. We have chosen to follow this practice and to use from Coquelicot (see Sect. 2.1). But let us review the other possibilities we considered.

First solution: $[0,\infty ]$. For example with a specific type . This would be very difficult to use in Coq because it would not be directly related to the type , so we would need a coercion or some subtyping to use this type in formulas with reals. Moreover, it would make $\infty $ appear explicitly a lot. This would lead to very verbose statements with few automation possibilities.

Second solution: $\mathbb {R}\cup \{\infty \}$. For example with a type with two constructors and . This would keep validity of usual algebraic properties such as associativity and distributivity without the need for additional hypotheses, and would favor a low number of cases in proofs. But it would still be a new type that would lead to Coq coercions. Moreover, when $-\infty $ will enter the picture for functions with arbitrary sign, it would make necessary coercions from/to the three types, which would be as difficult to handle as coercions from/to $\mathbb {N}$, $\mathbb {Z}$, and $\mathbb {R}$.

Chosen (third) solution: $\mathbb {R}\cup \{-\infty ,\infty \}$. Which is the type . This has the advantage of being already defined in Coquelicot with several lemmas proved, and is related to the type . Of course, for the present developments on nonnegative functions, we have to deal with meaningless negative cases and additional hypotheses. However, this drawback is balanced by the fact that we are ready to treat arbitrary sign functions.

9.2 Classical Logic Aspects

In our previous work, we tried to minimize the use of classical aspects. For example, in the formal proof of the Lax–Milgram theorem [15], we had a few decidability hypotheses and some statements relied on double negations to avoid using a stronger classical property. We consider here that it is no longer worth the effort compared to the theoretical gain. For this reason, we have decided to use the classical theorems listed in Sect. 2.2.

Moreover, as explained in Sect. 4.1, we choose that subsets have the type . We could have chosen and it would have simplified some proofs as we rely a lot on the excluded middle axiom for deciding whether a point is in a subset or not. Nevertheless, this use of would not have fully removed the excluded middle axiom. Indeed, when selecting in a list the elements that have a property (function of Sect. 6.1), we need to decide inside this function whether a property holds or not. And this is then applied to the following property:

We need, for each $x\in E$ to decide whether it belongs to the image of f and that requires the informative excluded middle axiom. So, as it has no logical impact, we have chosen for convenience as it fits better in the libraries we rely on.

9.3 Measurability of Subsets

Implementing the concept of (generated) $\sigma $-algebra, i.e., the measurability of subsets, as an inductive type allows to conduct proofs by induction. This is proposed in Isabelle/HOL^{Footnote 1} and Lean,^{Footnote 2} and it is useful in Coq too (see Sect. 4.2). But this design choice can also have an impact on how mathematics results are stated. Of course, there is a constructor for each basic property of $\sigma $-algebras. But it is also necessary to add a constructor, , that embodies the belonging to the collection of generators. Indeed, this is required to initiate the constructive process of specifying a measurable subset. In other words, our Coq definition also encompasses the mathematical concept of generated $\sigma $-algebra.

As a consequence, we cannot instantiate a $\sigma $-algebra without exhibiting a generator. But fortunately, nothing prevents from setting the whole $\sigma $-algebra as the generator, and specify, or prove, that it satisfies predicate. A notable effect is that the mathematical result stating that any generated $\sigma $-algebra is the smallest $\sigma $-algebra containing its generator is already structurally granted by the inductive type . Our definition of generated $\sigma $-algebra then precedes the definition of $\sigma $-algebra, which is not the common order in mathematics. It is, however, common to have several possible orders or equivalent definitions in mathematics. We found that this formalization of generated $\sigma $-algebra was easy to use. For example, the Coq proof of lemma (see Sect. 4.5) is done very simply by induction on the measurable subset defined inductively and applying directly the three properties , and .

Whatever the definitions (or the order of them), the most difficult point was related to the equivalence of generators. More precisely, the equivalence between the $\sigma $-algebra generated by compact intervals, , and the Borel $\sigma $-algebra generated by open subsets, , of Sect. 4.4 has proved tedious, long, and with harder ingredients than expected: many bijections, connected components, density of $\mathbb {Q}$, second-countability.

9.4 Simple Functions

About simple functions, we had difficulties designing them and we tried many Coq definitions for the same mathematical object before deciding for the one described in Sect. 6.

Note, for instance, that we have chosen the total function approach as much as possible in our development to ensure the simplicity in writing formulas. But valid simple functions come as a dependent type with the function, the list of values and the corresponding proof. This was needed as the list is required to compute the integral. To give the value of this integral, we need to sum over a finite list and therefore we need this list to be given. A solution would be to have an easy mechanism to sum over arbitrary sets (possibly infinite and possibly bigger than $\mathbb {N}$) like done in Lean.^{Footnote 3} This extension of total function would make a practical addition to Coq and may simplify some of our statements but is out of the scope of this paper. Note also that this dependent type is not supposed to appear to a library user, contrary to measurable functions and subsets, and to the integral.

Another design choice is about the type of simple functions: either , or . Mathematicians usually consider $\mathbb {R}$ (in fact $\mathbb {R}_+$) as codomain for $\mathcal {SF}_+$, and reserve ${\overline{\mathbb {R}}}$ (in fact ${\overline{\mathbb {R}}}_+$) for $\mathcal {M}_+$ (as limits of functions in $\mathcal {SF}_+$). We first tried to have simple functions of type for coherence and simplicity, but it failed in the proof of the difficult of Sect. 6.3. We could have kept the same type with an assumption that all values taken are finite, but we found it less convenient than using types for this requirement. As suggested in mathematics, we ended up by having simple functions of type .

A surprising successful choice is related to a particular type of simple functions: adapted sequences. Even if the mathematical definitions and proofs are given, we chose to take a computer scientist (or even a computer arithmetic) point of view. The use of Flocq has made trivial many proofs.

10 State of the Art

Interactive theorem provers may be classified into several distinct families with respect to their inherent logic. It may rely on set theory or type theory, on classical logic (e.g., with a built-in axiom of choice) or intuitionistic logic, and so on. Similarly to programming languages, the choice of a proof assistant is driven by the kind of formal proofs to be developed, the support libraries, the ease of the developer, and so on. It is currently either impractical or impossible to automatically “translate” developments done within some proof assistant into another, especially when they do not share the same inherent logic or theory.

This is the main reason for this work: providing to Coq users a practical formalization of Lebesgue integration. As explained below, this already exists in several other provers. We have taken inspiration (within the limits set by the various logics), we have made various design choices and proved the common lemmas and theorems. Our goal is to offer to Coq users (so that they may rely on it for their own Coq development) this library. This also allows this library to be eventually mixed it with Flocq for proving floating-point programs.

Lebesgue measure and integration is known to be an important chapter in mathematics. For instance, it belongs to a “top 100” of mathematical theorems established at the turn of the millennium for which F. Wiedijk is keeping track of the formalization within the main proof assistants.^{Footnote 4} This state of the art focuses on the formalization of the Lebesgue measure and integration; for a larger view about real analysis (in ACL2, Coq, HOL, HOL Light, Mizar, and PVS), we refer the reader to this survey [14].

Regarding intrinsically classical proof assistants, we may cite Mizar [53], Isabelle/HOL [58], and PVS [59].

Mizar libraries have been very advanced since the 1990s on the formalization of mathematics in general. The Lebesgue integral has been addressed continuously during the last two decades, for instance, the integral of simple functions [31], Fatou’s lemma [32], and Fubini’s theorem [30].

More recently, a large library of results, including many results about measure theory [43] and nonnegative Lebesgue integration,^{Footnote 5} Ordinary differential equations (ODEs) [44,45,46], and a formalization of Green’s theorem [1] have been developed in Isabelle/HOL, and Fourier transform [41] in HOL4. In HOL4 [55] and in PVS,^{Footnote 6} Lebesgue integration theory has also been developed in 2010.

In these provers, the definition of the Lebesgue integral is quite similar to ours. The main difference lies in the definition of simple functions. Leaving measurability aside, they consider the image set and the function is simple when this set is finite. For instance, in PVS, is_finite(image(f,fullset[T])).

Regarding intrinsically intuitionistic proof assistants, we may cite Lean [27], and Coq [25].

About Lean, we only found out in [66] that the Bochner integral is available. Note that the Bochner integral extends the Lebesgue integral to functions taking their values in a Banach space. There is no description on how all this is formalized and the available theorems. We investigated the source, and found the Beppo Levi (monotone convergence) theorem and Fatou’s lemma, with some formalization key points. First, Borel spaces^{Footnote 7} are generated from the open sets (only) with a kind of inductive definition, so this ends up being similar to us, even if our genericity in terms of generators has proved useful. Second, as in other provers, simple function definitions^{Footnote 8} rely on a predicate that says whether a set is finite or not (used on the image of the function), which relies on the logical underlying framework that is quite different from the one of Coq.

As mentioned in Introduction, our work relies on the Coq proof assistant. As a previous work about analysis in Coq we can cite formalization of Picard’s operator for ODEs [52], which uses the constructive CoRN and the Math Classes libraries. Regarding analysis with classical reals, our previous works in Coq include the full formalization of the discretization of the wave equation [11, 12], and the formalization of Lax–Milgram theorem [15]. For both of these works, we paid particular attention to the statements and their proof to avoid the use of classical axioms (we have now chosen the classical side, see Sect. 9.2). Another recent development is the formal proof of the Lax equivalence theorem for finite difference schemes [65], it is based on the classical standard real numbers, Coquelicot libraries, and our formalization of the Lax–Milgram theorem.

On the other hand, the math-comp /analysis library is currently being developed [3]. It aims at providing numerical analysis results in classical logic, building upon math-comp. This library has been developed in parallel to ours and is still in development, with few documentation and many branches, so it is hard to trace proved theorems.^{Footnote 9} We found out similar definitions for $\sigma $-algebra (by induction), connected components and the Lebesgue integral. In one branch, we found out a very experimental generic integral but in another one dedicated to the Lebesgue integral, we found some of the lemmas described in this article. As for the differences, they define simple functions by a dependent type including a unique list. Our adapted sequences are more smooth as based on Flocq, while they handle dyadic intervals by hand. They have a full definition of the Lebesgue measure using Carathéodory’s extension theorem.

Last, following the rebuilding of the standard real library [64], in which a constructive and classical basis was built up, constructive analysis lemmas were also introduced by Vincent Semeria, based on the constructive analysis of Bishop [7].

11 Conclusion and Perspectives

This work is a second stone that paves the way toward the formal correctness of the Finite Element Method, the first one being the formal proof of the Lax–Milgram theorem [15]. The contributions are the Coq formalizations and proofs of $\sigma $-algebras, measures, simple functions, and Lebesgue integration of nonnegative measurable functions, and the formal proofs of the Beppo Levi (monotone convergence) theorem and Fatou’s lemma.

The subset addressed in the present paper is more than 50-page long (6000 lines of code (LOC) of LaTeX, and weighs 220 kB) in its mathematics counterpart [23]. These Coq source files add up to about 11 kLOC, and weigh 370 kB. In total, the cumulative development including the Lax–Milgram theorem [15], finite-dimensional vector spaces [34], and this formalization is about 21 kLOC/650 kB.

As in [12], we observe here again that in-depth pen-and-paper proofs can be an order of magnitude longer than usual proofs from textbooks, and the lengths of formal and detailed pen-and-paper proofs are similar. In the present case, we can notice that the extra effort deployed on (in and , roughly 3 kLOC/90 kB) explains part of the gap between the respective sizes of the Coq formalization and the pen-and-paper proof, where ${\overline{\mathbb {R}}}$ received far less detail.

The hyperlinked dependency graph is detailed in Fig. 1 where our target development, the Lebesgue integral (built on simple functions), is represented in . For this purpose, we had to formalize $\sigma $-algebras, measurable functions, and measures (represented in ), as well as some preliminary developments on countability, topological bases in $\mathbb {R}$, and the handling of sums in ${\overline{\mathbb {R}}}$ (represented in ). We also had to develop results that were missing both in the standard libraries (in the subdirectories Logic, Lists, Sorting, and Reals) and Coquelicot (represented in ), including some tactics for . As usual, we can note that a large number of prerequisites are necessary to reach the desired formalization. In our case, this distributes roughly into one third for the (either in kLOC or in kB), one seventh for the , one third for , and one fifth for the target .

As usual, formalization is not just straightforward translation of mathematical texts and formulas. Some design choices have to be made and proof paths may differ, mainly to favor usability of Coq theorems and ease formal developments.

After both the Lax–Milgram theorem [15] and this work, the road is still long to be able to tackle the formal proof of scientific computation programs using the Finite Element Method (FEM).

The next step is to formalize the construction of the Lebesgue measure (for instance, using Carathéodory’s extension theorem [20, 29]), and the Lebesgue integral for measurable functions with arbitrary sign, with the proofs of Lebesgue’s dominated convergence theorem and of the Tonelli–Fubini theorems as the next milestones. Then comes the formalization of the $L^p$ Lebesgue spaces as complete normed vector spaces (a.k.a. Banach spaces), and in particular of $L^2$ as a complete inner product space (a.k.a. a Hilbert space). We expect the completeness to be the most challenging part of the proof. And finally, the formalization of simple Sobolev spaces $H^1$ and $H^1_0$ will need part of the distribution theory [63]. Further steps will include the formalization of parts of interpolation and approximation theories to end up with the FEM.

In parallel, we plan to merge with recent works on constructive reals [64] now included in the Coq standard library, and in particular with the constructive measure theory [8] based on the Daniell integral [26]. We also plan to formalize the Bochner integral [9, 56] that generalizes the Lebesgue integral to the case of functions taking their values in a Banach space, for instance, such as the Euclidean spaces $\mathbb {R}^n$ and the Hermitian spaces $\mathbb {C}^n$.

Notes

References

Abdulaziz, M., Paulson, L.C.: An Isabelle/HOL formalisation of Green’s theorem. In: Blanchette, J.C., Merz, S. (eds.) Proceedings of the 7th International Conference on Interactive Theorem Proving (ITP 2016), Volume 9807 of Lecture Notes in Computer Science, pp. 3–19. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-43144-4_1
Adams, R.A.: Sobolev Spaces. Pure and Applied Mathematics, vol. 65. Academic Press, New York–San Francisco–London (1975)
Google Scholar
Affeldt, R., Cohen, C., Mahboubi, A., Rouhling, D., Strub, P.-Y.: Classical analysis with Coq. In: The 9th Coq Workshop (2018). https://easychair.org/smart-slide/slide/n3pK
Bartle, R.G.: A Modern Theory of Integration, Volume 32 of Graduate Studies in Mathematics. American Mathematical Society, Providence (2001). https://doi.org/10.1090/gsm/032
Book Google Scholar
Bertot, Y., Gonthier, G., Biha, S.O., Pasca, I.: Canonical big operators. In: Mohamed, O.A., Muñoz, C., Tahar, S. (eds.) Proceedings of the 21st International Conference on Theorem Proving in Higher Order Logics (TPHOL 2008), Volume 5170 of Lecture Notes in Computer Science, pp. 86–101. Springer, Berlin (2008). https://doi.org/10.1007/978-3-540-71067-7_11
Billingsley, P.: Probability and Measure. Wiley Series in Probability and Mathematical Statistics, 3rd edn. Wiley, New York (1995)
MATH Google Scholar
Bishop, E.: Foundations of Constructive Analysis. McGraw-Hill Book Co., New York–Toronto–London (1967)
MATH Google Scholar
Bishop, E., Cheng, H.: Constructive Measure Theory. Number 116 in Memoirs of the American Mathematical Society. American Mathematical Society, Providence (1972)
Google Scholar
Bochner, S.: Integration von funktionen, deren werte die elemente eines vektorraumes sind. Fundam. Math. 20, 262–276 (1933). (In German)
Article Google Scholar
Boldo, S., Melquiond, G.: Flocq: a unified library for proving floating-point algorithms in Coq. In: Proceedings of the IEEE 20th Symposium on Computer Arithmetic (ARITH-20), pp. 243–252. IEEE (2011). https://doi.org/10.1109/ARITH17396.2011
Boldo, S., Clément, F., Filliâtre, J.-C., Mayero, M., Melquiond, G., Weis, P.: Wave equation numerical resolution: a comprehensive mechanized proof of a C program. J. Autom. Reason. 50(4), 423–456 (2013). https://hal.inria.fr/hal-00649240/
Boldo, S., Clément, F., Filliâtre, J.-C., Mayero, M., Melquiond, G., Weis, P.: Trusting computations: a mechanized proof from partial differential equations to actual program. Comput. Math. Appl. 68(3), 325–352 (2014). https://hal.inria.fr/hal-00769201/
Boldo, S., Lelay, C., Melquiond, G.: Coquelicot: a user-friendly library of real analysis for Coq. Math. Comput. Sci. 9(1), 41–62 (2015). https://hal.inria.fr/hal-00860648/
Boldo, S., Lelay, C., Melquiond, G.: Formalization of real analysis: a survey of proof assistants and libraries. Math. Struct. Comput. Sci. 26(7), 1196–1233 (2016). https://hal.inria.fr/hal-00806920/
Boldo, S., Clément, F., Faissole, F., Martin, V., Mayero, M.: A Coq formal proof of the Lax–Milgram theorem. In: Proceedings of the 6th ACM SIGPLAN International Conference on Certified Programs and Proofs (CPP 2017), pp. 79–89. Association for Computing Machinery, New York (2017). https://hal.inria.fr/hal-01391578/
Bourbaki, N.: Éléments de mathématiques. Livre VI : Intégration. Chapitres 1à 4, 2nd edn. Hermann, Paris (1965). (In French)
Bourbaki, N.: Éléments de mathématiques. Livre III : Topologie générale. Chapitres 1 à 4, 2nd edn. Hermann, Paris (1971). (In French)
Brezis, H.: Analyse fonctionnelle—Théorie et applications. Collection Mathématiques Appliquées pour la Maîtrise. Masson, Paris (1983). (In French)
Burk, F.E.: A Garden of Integrals, Volume 31 of The Dolciani Mathematical Expositions. Mathematical Association of America, Washington (2007)
Book Google Scholar
Carathéodory, C.: Algebraic Theory of Measure and Integration. Chelsea Publishing Co., New York (1963)
MATH Google Scholar
Cartan, H.: Théorie des filtres. C. R. Acad. Sci. 205, 595–598 (1937). (In French)
MATH Google Scholar
Ciarlet, P.G.: The Finite Element Method for Elliptic Problems, Volume 40 of Classics in Applied Mathematics. Society for Industrial and Applied Mathematics (SIAM), Philadelphia (2002). https://doi.org/10.1137/1.9780898719208. Reprint of the 1978 original (North-Holland, Amsterdam)
Clément, F., Martin, V.: Lebesgue integration. Detailed proofs to be formalized in Coq. Research Report RR-9386, Inria, Paris, Jan 2021. https://hal.inria.fr/hal-03105815v2. Version 2
Cohn, D.L.: Measure Theory. Birkhäuser Advanced Texts: Basler Lehrbücher, 2nd edn. Birkhäuser/Springer, New York (2013). https://doi.org/10.1007/978-1-4614-6956-8
Book Google Scholar
Coq-ref. The Coq reference manual. https://coq.inria.fr/refman/
Daniell, P.J.: A general form of integral. Ann. Math. (2) 19(4), 279–294 (1918). https://doi.org/10.2307/1967495
Article MathSciNet MATH Google Scholar
de Moura, L., Kong, S., Avigad, J., van Doorn, F., von Raumer, J.: The Lean theorem prover (system description). In: Felty, A.P., Middeldorp, A. (eds.) Proceedings of the 25th International Conference on Automated Deduction (CADE 2015), Volume 9195 of Lecture Notes in Computer Science, pp. 378–388. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-21401-6_26
Dieudonné, J.: Éléments d’analyse. Chapitres XII à XV. Cahiers Scientifiques, Fasc. XXXI. Gauthier-Villars, Paris, Tome II (1968). (In French)
Durrett, R.: Probability—Theory and Examples, Volume 49 of Cambridge Series in Statistical and Probabilistic Mathematics, 5th edn. Cambridge University Press, Cambridge (2019). https://doi.org/10.1017/9781108591034
Book Google Scholar
Endou, N.: Fubini’s theorem. Formaliz. Math. 27(1), 67–74 (2019). https://doi.org/10.2478/forma-2019-0007
Endou, N., Narita, K., Shidama, Y.: Lebesgue integral of simple valued function in Mizar. Formaliz. Math. 13(1), 67–71 (2005). https://fm.mizar.org/2005-13/pdf13-1/mesfunc3.pdf
Endou, N., Narita, K., Shidama, Y.: Fatou’s lemma and the Lebesgue’s convergence theorem. Formaliz. Math. 16(4), 305–309 (2008). https://doi.org/10.2478/v10037-008-0037-8
Article Google Scholar
Ern, A., Guermond, J.-L.: Theory and Practice of Finite Elements, Volume 159 of Applied Mathematical Sciences. Springer, New York (2004). https://doi.org/10.1007/978-1-4757-4355-5
Book MATH Google Scholar
Faissole, F.: Formalization and closedness of finite dimensional subspaces. In: Proceedings of the 19th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC 2017), pp. 121–128. IEEE (2017)
Feller, W.: An Introduction to Probability Theory and Its Applications, vol. I, 3rd edn. Wiley, New York–London–Sydney (1968)
MATH Google Scholar
Folland, G.B.: Real Analysis—Modern Techniques and Their Applications. Pure and Applied Mathematics (New York), 2nd edn. Wiley, New York (1999)
MATH Google Scholar
Gallouët, T., Herbin, R.: Mesure, intégration, probabilités. Ellipses Edition Marketing (2013). https://hal.archives-ouvertes.fr/hal-01283567/. (In French)
Ghosal, S., van der Vaart, A.: Fundamentals of Nonparametric Bayesian Inference, Volume 44 of Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, Cambridge (2017). https://doi.org/10.1017/9781139029834
Book Google Scholar
Gill, T.L., Zachary, W.W.: Functional Analysis and the Feynman Operator Calculus. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-27595-6
Book MATH Google Scholar
Gostiaux, B.: Cours de mathématiques spéciales—2. Topologie, analyse réelle. Mathématiques. Presses Universitaires de France, Paris (1993). (In French)
Guan, Y., Zhang, J., Shi, Z., Wang, Y., Li, Y.: Formalization of continuous Fourier transform in verifying applications for dependable cyber-physical systems. J. Syst. Archit. 106, 101707 (2020). https://doi.org/10.1016/j.sysarc.2020.101707
Article Google Scholar
Henstock, R.: Theory of Integration. Buttherworths, London (1963)
MATH Google Scholar
Hölzl, J., Heller, A.: Three chapters of measure theory in Isabelle/HOL. In: van Eekelen, M., Geuvers, H., Schmaltz, J., Wiedijk, F. (eds.) Proceedings of the 2nd International Conference on Interactive Theorem Proving (ITP 2011), Volume 6898 of Lecture Notes in Computer Science, pp. 135–151. Springer, Berlin (2011). https://doi.org/10.1007/978-3-642-22863-6_12
Immler, F.: Formally verified computation of enclosures of solutions of ordinary differential equations. In: Badger, J.M., Rozier, K.Y. (eds.) Proceedings of the 6th International Symposium NASA Formal Methods (NFM 2014), Volume 8430 of Lecture Notes in Computer Science, pp. 113–127. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-06200-6_9
Immler, F., Hölzl, J.: Numerical analysis of ordinary differential equations in Isabelle/HOL. In: Beringer, L., Felty, A.P. (eds.) Proceedings of the 3rd International Conference on Interactive Theorem Proving (ITP 2012), Volume 7406 of Lecture Notes in Computer Science, pp. 377–392. Springer, Berlin (2012). https://doi.org/10.1007/978-3-642-32347-8_26
Immler, F., Traut, C.: The flow of ODEs. In: Blanchette, C.J., Merz, S. (eds.) Proceedings of the 7th International Conference on Interactive Theorem Proving (ITP 2016), Volume 9807 of Lecture Notes in Computer Science, pp. 184–199. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-43144-4_12
Kurzweil, J.: Generalized ordinary differential equations and continuous dependence on a parameter. Czechoslov. Math. J. 7(3), 418–449 (1957). https://doi.org/10.21136/CMJ.1957.100258
Article MathSciNet MATH Google Scholar
Lebesgue, H.L.: Leçons sur l’intégration et la recherche des fonctions primitives professées au Collège de France. Cambridge Library Collection. Cambridge University Press, Cambridge (2009). https://doi.org/10.1017/CBO9780511701825. Reprint of the 1904 original [Gauthier-Villars, Paris]. (In French)
Lelay, C.: Repenser la bibliothèque réelle de Coq : vers une formalisation de l’analyse classique mieux adaptée. Thèse de doctorat, Université Paris-Sud, June (2015). https://tel.archives-ouvertes.fr/tel-01228517/. (In French)
Lelay, C.: How to express convergence for analysis in Coq. In: The 7th Coq Workshop, June 2015. https://hal.archives-ouvertes.fr/hal-01169321/
Maisonneuve, F.: Mathématiques 2: Intégration, transformations, int’egrales et applications—Cours et exercices. Presses de l’École des Mines (2014). (In French)
Makarov, E., Spitters, B.: The Picard algorithm for ordinary differential equations in Coq. In: Blazy, S., Paulin-Mohring, C., Pichardie, D. (eds.) Proceedings of the 4th International Conference on Interactive Theorem Proving (ITP 2013), Volume 7998 of Lecture Notes in Computer Science, pp. 463–468. Springer, Berlin (2013). https://doi.org/10.1007/978-3-642-39634-2_34
Matuszewski, R. (ed.): Formalized Mathematics. Sciendo, Poland. https://fm.mizar.org/
Mayero, M.: Formalisation et automatisation de preuves en analyses réelle et numérique. Thèse de doctorat, Université Paris VI, December 2001. http://www-lipn.univ-paris13.fr/~mayero/publis/these-mayero.ps.gz. (In French)
Mhamdi, T., Hasan, O., Tahar, S.: On the formalization of the lebesgue integration theory in HOL. In: Kaufmann, M., Paulson, L.C. (eds.) Proceedings of the 1st International Conference on Interactive Theorem Proving (ITP 2010), Volume 6172 of Lecture Notes in Computer Science, pp. 387–402. Springer, Berlin (2010). https://doi.org/10.1007/978-3-642-14052-5_27
Mikusiński, J.: The Bochner Integral. Academic Press, New York–San Francisco (1978)
Book Google Scholar
Musial, P., Tulone, F.: Dual of the class of HK$_r$ integrable functions. Minimax Theory Appl. 4(1), 151–160 (2019)
MathSciNet MATH Google Scholar
Nipkow, T., Paulson, L.C., Wenzel, M.: Isabelle/HOL—A Proof Assistant for Higher-Order Logic, Volume 2283 of Lecture Notes in Computer Science. Springer, Berlin (2002). https://doi.org/10.1007/3-540-45949-9
Book MATH Google Scholar
Owre, S., Rushby, J.M., Shankar, N.: PVS: a prototype verification system. In: Kapur, D. (ed.) Proceedings of the 11th International Conference on Automated Deduction (CADE 1992), Volume 607 of Lecture Notes in Computer Science, pp. 748–752. Springer, Berlin (1992). https://doi.org/10.1007/3-540-55602-8_217
Owre, S., Shankar, N., Rushby, J.M., Stringer-Calvert, D.W.J.: PVS system guide. In: SRI International, Computer Science Laboratory, Menlo Park, CA, August 2020. http://pvs.csl.sri.com/doc/pvs-system-guide.pdf. Version 7.1 [1st version in 1999]
Quarteroni, A., Valli, A.: Numerical Approximation of Partial Differential Equations. Springer Series in Computational Mathematics, vol. 23. Springer, Berlin (1994)
Book Google Scholar
Rudin, W.: Real and Complex Analysis, 3rd edn. McGraw-Hill Book Co., New York (1987)
MATH Google Scholar
Schwartz, L.: Théorie des distributions, 2nd edn. Hermann, Paris (1966). 1st edition in 1950–1951. (In French)
Semeria, V.: Nombres réels dans Coq. In: Dargaye, Z., Regis-Gianas, Y. (eds.) Actes des 31es Journées Francophones des Langages Applicatifs (JFLA), pp. 104–111. IRIF (2020). https://hal.inria.fr/hal-02427360/. (In French)
Tekriwal, M., Duraisamy, K., Jeannin, J.-B.: A formal proof of the Lax equivalence theorem for finite difference schemes. In: 13th International Symposium on NASA Formal Methods (NFM 2021) (2021). (To appear)
The mathlib Community. The Lean mathematical library. In: Blanchette, J., Hritcu, C. (eds.) Proceedings of the 9th ACM SIGPLAN International Conference on Certified Programs and Proofs (CPP 2020), pp. 367–381. ACM (2020). https://doi.org/10.1145/3372885.3373824
Tsybakov, A.B.: Introduction to Nonparametric Estimation. Springer Series in Statistics. Springer, New York (2009). https://doi.org/10.1007/b13794. Revised and extended from the 2004 French original [Springer, Berlin], Zaiats, V. (Trans.)
Weil, A.: Sur les espaces à structure uniforme et sur la topologie générale. Hermann, Paris (1937). (In French)
Yosida, K.: Functional Analysis. Classics in Mathematics. Springer, Berlin (1995). https://doi.org/10.1007/978-3-642-61859-8. Reprint of the 6th (1980) edition (Springer, Berlin)
Zienkiewicz, O.C., Taylor, R.L., Zhu, J.Z.: The Finite Element Method: Its Basis and Fundamentals, 7th edn. Elsevier/Butterworth Heinemann, Amsterdam (2013). https://doi.org/10.1016/B978-1-85617-633-0.00001-0
Book MATH Google Scholar

Download references

Acknowledgements

We are deeply grateful to Stéphane Aubry for some proofs and tactics on properties, and to Assia Mahboubi, Cyril Cohen, Guillaume Melquiond, and Vincent Semeria for fruitful discussions about and in . This work was partly supported by the Paris Île-de-France Region (DIM RFSI MILC). This work was partly supported by Labex DigiCosme (project ANR-11-LABEX-0045-DIGICOSME) operated by ANR as part of the program “Investissement d’Avenir” Idex Paris-Saclay (ANR-11-IDEX-0003-02).

This work was partly supported by the European Research Council (ERC) under the European Union’s Horizon 2020 Research and Innovation Programme—Grant Agreement n$^\circ $810367.

Author information

Authors and Affiliations

Université Paris-Saclay, CNRS, ENS Paris-Saclay, Inria, Laboratoire Méthodes Formelles, 91190, Gif-sur-Yvette, France
Sylvie Boldo & Florian Faissole
Inria, 2 rue Simone Iff, 75589, Paris, France
François Clément
CERMICS, École des Ponts, 77455, Marne-la-Vallée, France
François Clément
LMAC (Laboratory of Applied Mathematics of Compiègne), CS 60319, Université de technologie de Compiègne, 60203, Compiègne Cedex, France
Vincent Martin
LIPN, CNRS UMR 7030, Université Paris 13, 93430, Villetaneuse, France
Micaela Mayero

Authors

Sylvie Boldo
View author publications
You can also search for this author in PubMed Google Scholar
François Clément
View author publications
You can also search for this author in PubMed Google Scholar
Florian Faissole
View author publications
You can also search for this author in PubMed Google Scholar
Vincent Martin
View author publications
You can also search for this author in PubMed Google Scholar
Micaela Mayero
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sylvie Boldo.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Boldo, S., Clément, F., Faissole, F. et al. A Coq Formalization of Lebesgue Integration of Nonnegative Functions. J Autom Reasoning 66, 175–213 (2022). https://doi.org/10.1007/s10817-021-09612-0

Download citation

Received: 15 April 2021
Accepted: 09 October 2021
Published: 23 November 2021
Issue Date: May 2022
DOI: https://doi.org/10.1007/s10817-021-09612-0

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A Coq Formalization of Lebesgue Integration of Nonnegative Functions

Abstract

Similar content being viewed by others

A Coq Formalization of Lebesgue Induction Principle and Tonelli’s Theorem

Nested Integrals and Rationalizing Transformations

Measure Construction by Extension in Dependent Type Theory with Application to Integration

Explore related subjects

1 Introduction

2 Coquelicot Library and Other Basic Coq Choices

2.1 The Coquelicot Library and \({\overline{\mathbb {R}}}\)

2.2 Axioms

3 Auxiliary Results About the Reals

3.1 Second-Countability of Real Numbers

3.2 About Sums of Extended Real Numbers

3.3 About Limits

4 Measurability

4.1 Subsets and Characteristic Functions

4.2 Measurability of Subsets

4.3 Cartesian Product and Measurability

4.4 Borel Subsets of Real Numbers

4.5 Measurability of Functions

5 Measure

5.1 Specification and Basic Properties

5.2 Boole’s Inequality and Continuity from Below

5.3 Negligible Subsets

6 Simple Functions

6.1 Canonical Representation

6.2 Integration of Nonnegative Simple Functions

6.3 Linearity of the Integral of Simple Functions

7 Integration of Nonnegative Functions

7.1 Definition and First Properties

7.2 The Beppo Levi Theorem

7.3 Adapted Sequences

7.4 Linearity and Other Properties of the Integral

7.5 Fatou’s Lemma

8 A Simple Case Study: The Dirac Measure

9 Discussion on Proof-Engineering Concerns

9.1 Extended Real Numbers

9.2 Classical Logic Aspects

9.3 Measurability of Subsets

9.4 Simple Functions

10 State of the Art

11 Conclusion and Perspectives

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation