1 Introduction

In this paper we present our results on a class of proof problems arising in elementary analysis, namely those involving formulae with alternating quantifiers. We implement the following heuristic techniques, which extend our previous work [5, 6, 9]: the S-decomposition method for formulae with alternating quantifiers [8], use of Quantifier Elimination by Cylindrical Algebraic Decomposition [4], analysis of the behavior of terms at zero, bounding the \(\epsilon \)-bounds, semantic simplification of expressions involving absolute value, polynomial arithmetic, use of equal arguments under unknown functions, and automatic reordering of proof steps in order to check the admissibility of solutions to the metavariables.

Our prover, implemented in the frame of the Theorema system [3], aims at producing natural-style proofs for simple theorems involving convergence of sequences and of functions, continuity, uniform continuity, etc. The prover does not need to access a large collection of formulae (expressing the properties of the domains involved). Rather, the prover uses techniques from computer algebra in order to discover relevant terms and to check necessary conditions, and only needs as starting knowledge the definitions of the main notions involved. The size of this short paper does not allow an overview of the relevant literature, so we only mention [2], which in contrast to our prover is based mostly on rewriting of logical terms and does not handle alternating quantifiers.

2 Application of Special Techniques

Example: Product of Convergent Sequences. We illustrate the heuristics by the proof of the theorem “The product of two convergent sequences is convergent”, which is presented in detail, together with other examples and explanations of the techniques, in [7]. The proof starts from the definitions of the product of two functions and of the convergence of a function \({f:\mathbb {N}\longrightarrow \mathbb {R}}\):

$${\underset{a\in \mathbb {R}}{\exists }\ \underset{e>0}{\underset{e\in \mathbb {R}}{\forall }}\ \underset{M\in \mathbb {N}}{\exists }\underset{n\ge M}{\underset{n\in \mathbb {N}}{\forall }}|f[n]-a|<e}$$

After introducing Skolem constants \(f_1, f_2\) for the arbitrary convergent sequences and expansion of the goal and of the assumptions by the definitions of convergence and of product of functions, the prover is left with two main assumptions and one goal (instances of the formula above), which have parallel alternating quantifiers.

The S-decomposition Method. The main structure of the proof (see [8]) is as follows: the quantifiers are removed from the 3 statements in parallel, using a combination of inference steps which decompose the proof into several branches. When the 3 formulae are existential, first introduce the Skolem constants for the assumptions, and then introduce a witness for the goal. The proof branches into: a main branch with the new goal, and secondary branches for proving the subgoals stating that the type and the condition of the existential variable hold for the witness. When the 3 formulae are universal, first introduce Skolem constants for the goal, and then introduce the instantiation terms for the assumptions. Similarly to above, separate secondary branches are created for the type and condition checking of the instantiation terms.

Thus in this proof the prover produces, in order: Skolem constants \(a_1, a_2\) and the witness \(a_1 * a_2\); the Skolem constant \(e_0\) and the instantiation term \(\text {Min}\left[ 1,\frac{e_0}{|a_2|+|a_1|+1}\right] \); Skolem constants \(M_1, M_2\) and the witness \(\text {Max}[M_1, M_2]\); the Skolem constant \(n_0\) and the instantiation term \(n_0\). (The names are similar to those of the corresponding variables in the definition.)

At every iteration of the proof cycle one needs a witness for the existential goal and one or more instantiation terms for the universal assumptions: these are the difficult steps in the proof, for which we use special proof techniques based on computer algebra.

Reasoning About the Behavior of Terms at Zero: by polynomial arithmetic the prover infers the value of the witness for \(a\) by equating all the expressions under the absolute value to zero.
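
As an illustration, the following Wolfram Language fragment (our own sketch, not the prover's internal code; the fresh symbols x1, x2 stand for \(f_1[n]\) and \(f_2[n]\)) performs this step:

```mathematica
(* equate the arguments of the absolute values to zero and solve for the witness a *)
Solve[{x1 - a1 == 0, x2 - a2 == 0, x1*x2 - a == 0}, {a, x1, x2}]
(* expected output: {{a -> a1 a2, x1 -> a1, x2 -> a2}} *)
```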

Use of Metavariables: the existential variable in the goal (or the universal variable in an assumption) is replaced by a new symbol (metavariable), which is a name for the term (solution of the metavariable) which we need to find. This term is determined later in the proof, and the subgoals stating the type and the condition are checked on the secondary branches. Also, one must ensure that the solution to the metavariable does not contain Skolem constants which are introduced later in the proof. If this condition is not fulfilled, the prover tries to reorder the steps of the proof.

Quantifier Elimination is used to find the solution of the metavariable in relatively simple situations – as, for instance, in this proof for \(\text {Max}[M_1, M_2]\), as described in [1].
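
As a hedged illustration (this is not the construction described in [1], only a plausibility check), quantifier elimination over the reals can confirm that the candidate \(\text {Max}[M_1, M_2]\) satisfies the instantiation conditions of both assumptions:

```mathematica
(* check that n >= Max[m1, m2] implies the conditions n >= m1 and n >= m2 *)
Resolve[ForAll[{m1, m2, n},
  Implies[n >= Max[m1, m2], n >= m1 && n >= m2]], Reals]
(* expected output: True *)
```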

Identification of Equal Terms Under Unknown Functions. This is used for finding the instantiation term \(n_0\).

Since \(f_1\) and \(f_2\) are arbitrary, we do not know anything about their behaviour. In the goal, \(f_1\) and \(f_2\) have the argument \(n_0\), therefore the prover uses the same argument in the assumptions; otherwise it would be impossible to use the assumptions in the proof of the goal.
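
A minimal sketch of this heuristic in Wolfram Language, assuming the goal already has the form of formula (1) below; the pattern-matching code is our own illustration:

```mathematica
goal = Abs[f1[n0]*f2[n0] - a1*a2];
(* collect the arguments occurring under the unknown functions f1, f2 in the goal *)
args = Union[Cases[goal, (f1 | f2)[arg_] :> arg, Infinity]]
(* expected output: {n0}; the universal variable n of each assumption is then instantiated with n0 *)
```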

Algebraic Manipulations. The most challenging part of this proof is the automatic generation of the instantiation term \(\text {Min}\left[ 1,\frac{e_0}{|a_2|+|a_1|+1}\right] \), which is performed by a heuristic combination of solving, substitution, and simplification, as well as rewriting of expressions under the absolute value function, and is carried out at the end of the proof. The goal at this point is:

$$\begin{aligned} |\left( f_1\left[ n_0\right] *f_2\left[ n_0\right] \right) -\left( a_1*a_2\right) |<e_0 \end{aligned}$$
(1)

and the main assumptions are:

$$\begin{aligned} |f_1\left[ n_0\right] -a_1|<e \end{aligned}$$
(2)
$$\begin{aligned} |f_2\left[ n_0\right] -a_2|<e \end{aligned}$$
(3)

Internally the prover replaces \(f_1[n_0]\) and \(f_2[n_0]\) by \(x_1\) and \(x_2\), respectively, both in the goal and in the assumptions. The argument of the absolute value in the transformed goal is \(E_0 = x_1*x_2 - a_1*a_2\) and in the transformed assumptions \(E_1 = x_1 - a_1\) and \(E_2 = x_2 - a_2\).

First we use the following heuristic principle: transform the goal expression \(E_0\) such that it uses \(E_1\) and \(E_2\) as much as possible, because we know that these are small. In order to do this we take new variables \(y_1, y_2\), solve the equations \(y_1 = E_1\) and \(y_2 = E_2\) for \(x_1, x_2\), and substitute the solutions into \(E_0\); the result simplifies to \(a_1*y_2 + a_2*y_1 + y_1*y_2\). This is the internal representation of the absolute value argument in the goal (4).

$$\begin{aligned} |a_1*(f_2[n_0]-a_2)+a_2*(f_1[n_0]-a_1) +\ (f_1[n_0]-a_1)*(f_2[n_0]-a_2)|<e_0 \end{aligned}$$
(4)

Note that the transformation from (1) to (4) is relatively challenging even for a human prover.
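
The following Wolfram Language sketch reproduces this solve-substitute-expand step outside the prover, with x1, x2 standing for \(f_1[n_0]\), \(f_2[n_0]\) as above:

```mathematica
goalArg = x1*x2 - a1*a2;    (* argument of the absolute value in the goal (1) *)
(* solve the renamed assumption arguments y1 = x1 - a1, y2 = x2 - a2 for x1, x2 *)
sol = Solve[{y1 == x1 - a1, y2 == x2 - a2}, {x1, x2}][[1]];
(* substitute the solutions into the goal expression and expand *)
Expand[goalArg /. sol]
(* expected output: a2 y1 + a1 y2 + y1 y2 *)
```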

$$\begin{aligned}&|a_1*(f_2[n_0]-a_2)+a_2*(f_1[n_0]-a_1) +\ (f_1[n_0]-a_1)*(f_2[n_0]-a_2)|\\&\le |a_1*(f_2[n_0]-a_2)|+|a_2*(f_1[n_0]-a_1)| +\ |(f_1[n_0]-a_1)*(f_2[n_0]-a_2)|\nonumber \\&= |a_1|*|f_2[n_0]-a_2|+|a_2|*|f_1[n_0]-a_1| +\ |f_1[n_0]-a_1|*|f_2[n_0]-a_2|\nonumber \\&<|a_1|*e+|a_2|*e+e*e\le |a_1|*e+|a_2|*e+e=e*(|a_1|+|a_2|+1)\nonumber \\&=\frac{e_0}{|a_2|+|a_1|+1}*(|a_1|+|a_2|+1)=e_0\nonumber \end{aligned}$$
(5)

Formula (5) is obtained by rewriting the absolute value expressions. Namely, we apply certain rewrite rules to expressions of the form |E| and their combinations, as well as to the metavariable e. Every rewrite rule transforms a (sub)term into one which is not smaller, so we are sure to obtain a term which is greater than or equal to the original. The final purpose of these transformations is to obtain a strictly positive ground term t multiplied by the target metavariable (here e). Since we need a value for e which fulfils \(t * e \le e_0\), we can set e to \(e_0/t\). The rewrite rules come from the elementary properties of the absolute value function (e.g. \(|u + v| \le |u| + |v|\)) and from the principle of bounding the \(\epsilon \)-bounds: since we are interested in the behaviour of the expressions in the immediate vicinity of zero, the bound e can be bounded from above by any positive value. In the case of the product (presented here), we also use the rule \(e * e \le e\), that is, we bound e by 1. This is why the final expression for e is the minimum of 1 and the term t found as above.
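
The following sketch mimics this rewriting chain on the internal representation; the rule set is our own illustration of the principle, not the prover's actual rule base:

```mathematica
expr = Abs[a1*y2 + a2*y1 + y1*y2];
(* upward rewriting with |u + v| <= |u| + |v| and |u * v| = |u| * |v| *)
split = expr //. {Abs[u_ + v_] :> Abs[u] + Abs[v], Abs[u_*v_] :> Abs[u]*Abs[v]};
(* replace |y1|, |y2| by their bound e; since e is bounded by 1, e*e rewrites to e *)
bounded = split /. {Abs[y1] -> e, Abs[y2] -> e} /. e^2 -> e;
(* the strictly positive ground term t such that the obtained bound is t * e *)
t = Simplify[bounded/e];    (* expected: 1 + Abs[a1] + Abs[a2] *)
(* solution for the metavariable e: the minimum of 1 and e0/t *)
eSolution = Min[1, e0/t]
```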

Proving Simple Conditions. At certain places in the proof, the conditions upon certain quantified variables have to be proven. The prover does not display a proof of these simple statements, but just declares them to be consequences of “elementary properties of \(\mathbb {R}\)”. (Such elementary properties are also invoked when developing formulae (4) and (5).) In the background, however, the prover uses Mathematica functions in order to check that these statements are correct. This happens, for instance, for the subgoal \(\underset{e_0}{\forall }(e_0>0 \Rightarrow e>0)\), which is treated after the instantiation term for e is found, by applying QE to this formula (where e has the found value \(\text {Min}[\ldots ]\)), which returns True in Mathematica.
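
In plain Mathematica (outside Theorema), such a check could look roughly as follows; the exact call issued by the prover may differ:

```mathematica
(* QE check of the subgoal e0 > 0 => e > 0, with e replaced by its found value *)
Resolve[ForAll[{e0, a1, a2},
  Implies[e0 > 0, Min[1, e0/(Abs[a2] + Abs[a1] + 1)] > 0]], Reals]
(* expected output: True *)
```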

3 Conclusion and Further Work

When applied to problems over the reals, Satisfiability Modulo Theories (SMT) solving combines techniques from automated reasoning and from computer algebra. From the point of view of automated reasoning, proving unsatisfiability of a set of clauses appears to be quite different from producing natural-style proofs. Indeed, the proof systems are different (resolution on clauses vs. some version of sequent calculus), but they are essentially equivalent, relying on equivalent transformations of formulae. Moreover, the most important steps in first-order proving, namely the instantiations of universally quantified formulae (which in natural-style proofs are also present as the equivalent operation of finding witnesses for existentially quantified goals), are actually the same or very similar.

The full automation of proofs in elementary analysis constitutes a very interesting application for the combination of logic and algebraic techniques, which is essentially equivalent to SMT solving (combining satisfiability checking and symbolic computation). Our experiments show that complete and efficient automation is possible by using certain heuristics in combination with complex algebraic algorithms.

Further work includes a systematic treatment of various formulae which appear in textbooks, and extension of the heuristics to more general types of formulae. In this way we hope to address the class of problems which are usually subject to SMT solving.