HiFrog: SMT-based Function Summarization for Software Verification

Alt, Leonardo; Asadi, Sepideh; Chockler, Hana; Even Mendoza, Karine; Fedyukovich, Grigory; Hyvärinen, Antti E. J.; Sharygina, Natasha

doi:10.1007/978-3-662-54580-5_12

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 10206)

Included in the following conference series:

International Conference on Tools and Algorithms for the Construction and Analysis of Systems

Abstract

Function summarization can be used as a means of incremental verification based on the structure of the program. HiFrog is a fully featured function-summarization-based model checker that uses SMT as the modeling and summarization language. The tool supports three encoding precisions through SMT: uninterpreted functions, linear real arithmetic, and propositional logic. In addition, the tool allows optimized traversal of reachability properties, counter-example-guided summary refinement, summary compression, and user-provided summaries. We describe the use of the tool through the description of its architecture and a rich set of features. The description is complemented by an experimental evaluation of the practical impact the different SMT precisions have on model checking.

This work was supported by the SNF projects 153402 and 163001.

1 Introduction

Incremental verification addresses the unique opportunities and challenges that arise when a verification task can be performed in an incremental way, as a sequence of smaller closely related tasks. We present an implementation of the incremental verification of software with assertions that uses the insights obtained from a successful verification of earlier assertions. As a fundamental building block in storing the insights we use function summaries known to provide speed-up through localizing and modularizing verification [12, 13].

In this paper we describe the HiFrog verification tool that uses Craig interpolation [6] in the context of Bounded Model Checking (BMC) [4] for constructing function summaries. The novelty of the tool is in the unique way it combines function summaries with the expressiveness of satisfiability modulo theories (SMT). The system currently supports verification based on the quantifier-free theories of linear real arithmetic (\(\mathrm {QF\_LRA}\)) and uninterpreted functions (\(\mathrm {QF\_UF}\)), in addition to propositional logic (\(\mathrm {QF\_BOOL}\)). Compared to our earlier propositional tool FunFrog [13], the SMT summaries are smaller and more efficient in verification. They are also often significantly more human-readable, which makes them easier to reuse and allows the user to inject summaries directly. The difference arises because propositional summaries are based on correctness proofs over a circuit-level representation of arithmetic operations, whereas theory encodings use arithmetic symbols directly in the summaries. In addition, the tool offers a rich set of features such as verification of recursive programs, different ways of optimizing the summaries with respect to both size and strength, efficient heuristics for removing redundant safety properties, and easy-to-understand witnesses of property violations that can be directly mapped to bugs in the source code.

The paper provides an architectural description of the tool, an introduction to its use, and experimental evidence of its performance. The tool together with a comprehensive demo is available at http://verify.inf.usi.ch/hifrog.

Related Work. Incremental verification is extensively researched in domains such as hardware verification, deductive verification, and model checking. Due to space constraints we provide only a brief review of recent related work. The CPAchecker tool is able to migrate predicates across program versions [3]. Deductive verification tools such as Viper and Dafny offer modular verification [11] and caching of intermediate verification results [9], respectively. CBMC is a symbolic bounded model checker for C that to a limited extent exploits the incremental capabilities of a SAT solver^{Footnote 1}, but does not use or output any reusable information such as function summaries. Like HiFrog, ESBMC builds on the CProver infrastructure and is based on an SMT solver. To the best of our knowledge, it does not support incremental verification [5].

2 Tool Overview

HiFrog consists of two main components, the SMT encoder and the interpolating SMT solver, together with the function summaries (see Fig. 1). The components are initially configured with the theory and the interpolation algorithms. The tool then processes assertions sequentially, using function summaries when possible. The results of a successful assertion verification are stored as interpolated function summaries, and failed verifications trigger a refinement phase or the printing of an error trace. This section details the tool features.

Preprocessing. The source code is parsed and transformed into an intermediate goto-program using the goto-cc symbolic compiler. Loops are unwound to a pre-determined number of iterations. HiFrog identifies the set of assertions from the source code, reads the user-defined function summaries (if any) in the SMT-LIB2 format, and makes them available for the subsequent analysis.

SMT Encoding and Function Summarization. For a given assertion, the goto-program is symbolically executed function by function, resulting in the “modular” Static Single Assignment (SSA) form of the unwound program, i.e., a form where each function has its own isolated SSA representation. To reduce the size of the SSA form, HiFrog performs slicing that keeps only the variables in the SSA form that are syntactically dependent on the variables in the assertion.
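The slicing step can be illustrated with a small sketch. It is not HiFrog's implementation: the representation of an SSA assignment as a pair (defined variable, set of variables read) and the name `slice_ssa` are assumptions made for this example only. The sketch computes the backward closure of syntactic dependencies starting from the assertion variables.

```python
from typing import Dict, List, Set, Tuple

# Assumed toy representation: (defined variable, variables read on the right-hand side).
Assignment = Tuple[str, Set[str]]


def slice_ssa(assignments: List[Assignment], assertion_vars: Set[str]) -> List[Assignment]:
    """Keep only the assignments the assertion variables syntactically depend on."""
    defs: Dict[str, Set[str]] = {lhs: rhs for lhs, rhs in assignments}
    relevant: Set[str] = set()
    worklist = list(assertion_vars)
    while worklist:
        var = worklist.pop()
        if var in relevant:
            continue
        relevant.add(var)
        # In SSA form each variable has at most one definition, so one lookup suffices.
        worklist.extend(defs.get(var, set()))
    # Preserve the original program order of the surviving assignments.
    return [(lhs, rhs) for lhs, rhs in assignments if lhs in relevant]


# Hypothetical SSA form of:  x = in + 1;  y = x * 2;  z = in - 3;  assert(y > 0);
ssa = [
    ("x1", {"in0"}),
    ("y1", {"x1"}),
    ("z1", {"in0"}),
]
sliced = slice_ssa(ssa, {"y1"})
print([lhs for lhs, _ in sliced])  # the definition of z1 is dropped
```

In this toy input the definition of `z1` does not influence the asserted variable `y1`, so it is removed before any formula would be built.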

When the SSA form is pruned, HiFrog creates the SMT formula in the pre-determined logic (\(\mathrm {QF\_BOOL}\), \(\mathrm {QF\_UF}\) or \(\mathrm {QF\_LRA}\)). The modularity of the SSA form comes in handy when the function summaries of the chosen logic (either user-defined, interpolation-based, or treated nondeterministically) are available. If this is the case, the call to a function with the available summary is replaced by the summary. The final SMT formula is pushed to an SMT solver to decide its satisfiability.

Due to the over-approximating nature of function summaries, the program encoded with the summaries may contain spurious errors. The summary refiner identifies and marks the summaries directly involved in the detected error, and HiFrog returns to the encoding stage to replace the marked summaries by the precise (up to the pre-determined logic) function representations. Note that refinement may reveal nested function calls (including recursive ones), which are again replaced by available summaries. For an unsatisfiable SMT formula, HiFrog extracts function summaries using interpolation. The extracted summaries are serialized to persistent storage so that they are available for other HiFrog runs. For a more detailed description we refer to [13].
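The interplay between over-approximating summaries and refinement can be sketched on a toy scale. The following is only an illustration under strong assumptions: a brute-force search over a small integer range stands in for the SMT solver, and the function `abs_val`, its summary, and all helper names are hypothetical rather than taken from HiFrog.

```python
from typing import Callable, Optional, Tuple

DOMAIN = range(-8, 9)  # small finite domain standing in for the solver's search space

Relation = Callable[[int, int], bool]  # relation over (input x, return value ret)


def precise_abs(x: int, ret: int) -> bool:
    """Precise input/output relation of a hypothetical function abs_val."""
    return ret == (x if x >= 0 else -x)


def summary_abs(x: int, ret: int) -> bool:
    """Over-approximating summary: it only records that the result is non-negative."""
    return ret >= 0


def find_violation(relation: Relation, prop: Relation) -> Optional[Tuple[int, int]]:
    """Return some (x, ret) allowed by the relation that violates the property."""
    for x in DOMAIN:
        for ret in DOMAIN:
            if relation(x, ret) and not prop(x, ret):
                return (x, ret)
    return None


def verify(prop: Relation) -> str:
    # First try the cheap over-approximating summary.
    if find_violation(summary_abs, prop) is None:
        return "safe (summary sufficed)"
    # The reported error may be spurious: refine by using the precise relation.
    if find_violation(precise_abs, prop) is None:
        return "safe (after refinement)"
    return "unsafe"


print(verify(lambda x, ret: ret >= 0))  # provable from the summary alone
print(verify(lambda x, ret: ret >= x))  # summary too weak, refinement needed
print(verify(lambda x, ret: ret > 0))   # genuinely violated for x == 0
```

The second property shows the spurious case: the summary admits the pair `x = 5, ret = 0`, which the precise function body rules out.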

Theories. HiFrog supports three different quantifier-free theories in which the program can be modelled: bit-precise \(\mathrm {QF\_BOOL}\), \(\mathrm {QF\_UF}\) and \(\mathrm {QF\_LRA}\). The use of theories beyond \(\mathrm {QF\_BOOL}\) allows the system to scale to larger problems, since a bit-precise encoding, in particular of arithmetic operations, can be very expensive. As precise arithmetic often does not play a role in the correctness of the program, substituting it with linear arithmetic, uninterpreted functions, or even nondeterministic behavior might result in a significant reduction in model-checking time (see Sect. 3). If a property is proved using one of the light-weight theories \(\mathrm {QF\_UF}\) and \(\mathrm {QF\_LRA}\), the proof also holds for the exact BMC encoding of the program. However, the loss of precision can sometimes produce spurious counterexamples due to the over-approximating encoding. The light-weight theories therefore need to be refined (i.e., using the theory refiner) to \(\mathrm {QF\_BOOL}\) if the reported counterexample does not correspond to a concrete counterexample.
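Why a proof in a light-weight theory carries over to the precise encoding, while a failed proof may be spurious, can be seen in a toy model of uninterpreted functions. The sketch below is an assumption-laden illustration, not the tool's encoding: an "uninterpreted" unary symbol is modelled by enumerating every function table from a four-element domain into a small range, and the precise meaning "multiply by two" is just one of those tables.

```python
from itertools import product
from typing import Callable, Dict, Iterator

DOMAIN = (0, 1, 2, 3)
CODOMAIN = range(0, 7)  # large enough to contain 2 * x for every x in DOMAIN

Table = Dict[int, int]
Property = Callable[[Table, int, int], bool]


def all_interpretations() -> Iterator[Table]:
    """Every function DOMAIN -> CODOMAIN; stands in for an uninterpreted symbol f."""
    for outputs in product(CODOMAIN, repeat=len(DOMAIN)):
        yield dict(zip(DOMAIN, outputs))


def holds_uninterpreted(prop: Property) -> bool:
    """The property must hold for every interpretation of f."""
    return all(prop(f, x, y) for f in all_interpretations() for x in DOMAIN for y in DOMAIN)


def holds_precisely(prop: Property) -> bool:
    """The property is checked only for the intended meaning f(x) = 2 * x."""
    double = {x: 2 * x for x in DOMAIN}
    return all(prop(double, x, y) for x in DOMAIN for y in DOMAIN)


# Depends only on congruence: equal arguments give equal results.
congruence: Property = lambda f, x, y: x != y or f[x] == f[y]
# Depends on the arithmetic meaning of f.
arithmetic: Property = lambda f, x, y: f[x] == x + x

print(holds_uninterpreted(congruence), holds_precisely(congruence))
print(holds_uninterpreted(arithmetic), holds_precisely(arithmetic))
```

The first property is proved for all interpretations and therefore also for the precise one. The second fails under the abstraction only because some interpretation differs from doubling, which is the kind of counterexample that calls for refinement to a more precise theory.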

Obtaining Summaries by Interpolation. HiFrog relies on different interpolation frameworks for the different theories it supports. As a result the generation of propositional, \(\mathrm {QF\_UF}\) and \(\mathrm {QF\_LRA}\) interpolants can be controlled with respect to strength and size by specifying an interpolation algorithm for a theory. For propositional logic we provide the Labeled Interpolation Systems [7] including the Proof-Sensitive interpolation algorithms [1]. Interpolation for \(\mathrm {QF\_UF}\) is implemented with duality-based interpolation [2], and a similar extension is applied to the interpolation algorithm for \(\mathrm {QF\_LRA}\) based on [10]. HiFrog also provides a range of techniques to reduce the size of the generated interpolants through removing redundancies in propositional proofs [12]: the algorithms RecyclePivotsWithIntersection and LowerUnits, structural hashing, and a set of local rewriting rules.
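As a reminder of what an interpolant is and why its strength can vary, the sketch below checks the defining conditions of a Craig interpolant I for an unsatisfiable pair (A, B) by truth-table enumeration: A implies I, I and B are jointly unsatisfiable, and I mentions only symbols shared by A and B. The formulas and all names are hypothetical examples; no interpolation algorithm from the cited frameworks is implemented here.

```python
from itertools import product
from typing import Callable, Dict, Iterator, Sequence

Model = Dict[str, bool]
Formula = Callable[[Model], bool]


def models(variables: Sequence[str]) -> Iterator[Model]:
    """Enumerate all truth assignments to the given variables."""
    for values in product([False, True], repeat=len(variables)):
        yield dict(zip(variables, values))


def is_interpolant(a: Formula, vars_a: Sequence[str],
                   b: Formula, vars_b: Sequence[str],
                   itp: Formula, vars_itp: Sequence[str]) -> bool:
    # Condition 3: the interpolant may use only symbols common to A and B.
    if not set(vars_itp) <= (set(vars_a) & set(vars_b)):
        return False
    for m in models(sorted(set(vars_a) | set(vars_b))):
        if a(m) and not itp(m):  # condition 1: A implies I
            return False
        if itp(m) and b(m):      # condition 2: I and B is unsatisfiable
            return False
    return True


# A = p & q & s,  B = !q & !s & r;  the shared symbols are q and s.
A: Formula = lambda m: m["p"] and m["q"] and m["s"]
B: Formula = lambda m: (not m["q"]) and (not m["s"]) and m["r"]
VARS_A, VARS_B = ["p", "q", "s"], ["q", "s", "r"]

stronger: Formula = lambda m: m["q"] and m["s"]
weaker: Formula = lambda m: m["q"] or m["s"]

print(is_interpolant(A, VARS_A, B, VARS_B, stronger, ["q", "s"]))
print(is_interpolant(A, VARS_A, B, VARS_B, weaker, ["q", "s"]))
```

Both candidates pass the check, and the first logically implies the second. Choosing between such interpolants is what controlling the strength of a summary amounts to in this simplified propositional setting.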

Assertion Optimizer. In addition to incremental verification of a set of assertions, HiFrog supports the basic functionality of classical model checkers to verify all assertions at once. When the set of assertions is too large, it can be optimized by constructing an assertion implication relation and exploiting it to remove redundant assertions [8]. In a nutshell, the assertion optimizer considers pairs of spatially close assertions \(a_i\) and \(a_j\) and uses the SMT solver to check whether \(a_i\) conjoined with the code between \(a_i\) and \(a_j\) implies \(a_j\) (if there is any other assertion between \(a_i\) and \(a_j\), it is treated as an assumption). If the check succeeds, then \(a_j\) is proven redundant and its verification can be safely skipped.
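The implication check behind the assertion optimizer can be sketched as follows. This is a simplified illustration with several assumptions: the code between the two assertions is modelled as a deterministic Python function of a single integer, a finite range replaces the SMT query, and the name `is_redundant` is invented for the example.

```python
from typing import Callable, Iterable

Predicate = Callable[[int], bool]
Code = Callable[[int], int]


def is_redundant(a_i: Predicate, code: Code, a_j: Predicate, domain: Iterable[int]) -> bool:
    """True if a_i together with the code between the assertions implies a_j."""
    for x in domain:
        # Only states that satisfy the earlier assertion need to be considered.
        if a_i(x) and not a_j(code(x)):
            return False
    return True


DOMAIN = range(-10, 11)

# Hypothetical fragment:  assert(x > 0);  y = x + 1;  assert(y > 1);
print(is_redundant(lambda x: x > 0, lambda x: x + 1, lambda y: y > 1, DOMAIN))

# Hypothetical fragment:  assert(x > 0);  y = x + 1;  assert(y > 5);
print(is_redundant(lambda x: x > 0, lambda x: x + 1, lambda y: y > 5, DOMAIN))
```

In the first fragment the second assertion follows from the first and could be skipped. In the second fragment it does not follow (for example `x = 1` gives `y = 2`), so it would still have to be verified.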

3 HiFrog Usage

We provide a Linux binary of HiFrog that takes as input a C program, the assertions to be verified, a set of parameters, and the interpolated or user-defined function summaries in the SMT-LIB2 format. HiFrog builds on the CProver framework and inherits some of its options (e.g., --unwind for loop unrolling, and --show-claims and --claim for managing the assertion checks). It also inherits the ability for the user to declare and use a nondet_TYPE() function of a specific numerical type (e.g., int, long, double, unsigned, in \(\mathrm {QF\_LRA}\) only) or to add a __CPROVER_assume() statement to limit the domain to a specific range of values.

HiFrog uses \(\mathrm {QF\_LRA}\) by default but can be switched to \(\mathrm {QF\_UF}\) via the --logic option.^{Footnote 2} HiFrog uses a variety of interpolation and proof compression algorithms to control the precision (with the --itp-uf-algorithm option for \(\mathrm {QF\_UF}\), the --itp-lra-algorithm option for \(\mathrm {QF\_LRA}\), and the --itp-algorithm option for propositional interpolation) and the size (with --reduce-proof) of summaries. The summary storage is controlled using the --save-summaries and --load-summaries options. Between verification runs, the summaries contained in the corresponding files for \(\mathrm {QF\_UF}\) and \(\mathrm {QF\_LRA}\) may be edited manually. Note that due to the SMT encoding constraints HiFrog does not allow interchanging summaries between the theories. Finally, HiFrog supports the identification and reporting of redundant assertions with --claims-opt, a useful feature for some automatically generated assertions [8].

At the end of each verification run, HiFrog reports either VERIFICATION SUCCESSFUL or VERIFICATION FAILED, the latter accompanied by an error trace. An error trace presents a sequence of steps with a direct reference to the code and the values of variables in these steps. In most cases when \(\mathrm {QF\_UF}\) or \(\mathrm {QF\_LRA}\) introduces a spurious error, HiFrog outputs a warning advising the user to rerun HiFrog with a more precise theory. HiFrog also reports statistics on the running time and the number of summary refinements performed.

Experimental Results. We evaluated HiFrog on a large set of C programs coming from both academic and industrial sources such as SV-COMP. All benchmarks contained multiple assertions to be checked. To demonstrate the advantages of the SMT-based summarization, here we provide data for the analysis of benchmarks containing 1086 assertions, of which 474 were proven to hold using \(\mathrm {QF\_BOOL}\) (meaning that those properties satisfy the system specifications). Despite the over-approximating nature of \(\mathrm {QF\_UF}\) and \(\mathrm {QF\_LRA}\), a large number of these properties were also proven correct using the light-weight theories of HiFrog (namely, 50.65% and 69.2% of the 474 validated properties for \(\mathrm {QF\_UF}\) and \(\mathrm {QF\_LRA}\), respectively).

Furthermore, those experiments revealed that model checking using the \(\mathrm {QF\_UF}\) and \(\mathrm {QF\_LRA}\)-based summarization was extremely efficient. Figure 2 presents two logarithmic plots for comparison of running times^{Footnote 3} of HiFrog with \(\mathrm {QF\_BOOL}\) to respectively \(\mathrm {QF\_UF}\) and \(\mathrm {QF\_LRA}\). Each point represents a pair of verification runs of a holding assertion with the two corresponding theories using the interpolation-based summaries. Note that for most of the assertions, the verification with \(\mathrm {QF\_UF}\) and \(\mathrm {QF\_LRA}\) is an order of magnitude faster than the verification with \(\mathrm {QF\_BOOL}\).

Notes

1. http://www.cprover.org.
2. Currently the support for \(\mathrm {QF\_BOOL}\) needs to be specified at compile time.
3. The timing results were obtained on an Ubuntu 14.04.1 LTS server running two Intel(R) Xeon(R) E5620 CPUs @ 2.40 GHz and 16 GB RAM. We prepared a pre-compiled Linux binary available in the Virtual Machine at http://verify.inf.usi.ch/hifrog/binary; our benchmark set is available at http://verify.inf.usi.ch/hifrog/bench and can facilitate property verification for other researchers.

References

1. Alt, L., Fedyukovich, G., Hyvärinen, A.E.J., Sharygina, N.: A proof-sensitive approach for small propositional interpolants. In: Gurfinkel, A., Seshia, S.A. (eds.) VSTTE 2015. LNCS, vol. 9593, pp. 1–18. Springer, Heidelberg (2016). doi:10.1007/978-3-319-29613-5_1
2. Alt, L., Hyvärinen, A.E.J., Sharygina, N.: Duality-based interpolation for quantifier-free equalities and uninterpreted functions (2016). http://www.inf.usi.ch/postdoc/hyvarinen/euf-interpolation.pdf
3. Beyer, D., Löwe, S., Novikov, E., Stahlbauer, A., Wendler, P.: Precision reuse for efficient regression verification. In: ESEC/FSE, pp. 389–399. ACM (2013)
4. Biere, A., Cimatti, A., Clarke, E., Zhu, Y.: Symbolic model checking without BDDs. In: Cleaveland, W.R. (ed.) TACAS 1999. LNCS, vol. 1579, pp. 193–207. Springer, Heidelberg (1999). doi:10.1007/3-540-49059-0_14
5. Cordeiro, L.C., de Lima Filho, E.B.: SMT-based context-bounded model checking for embedded systems: challenges and future trends. ACM SIGSOFT Softw. Eng. Notes 41(3), 1–6 (2016)
6. Craig, W.: Three uses of the Herbrand-Gentzen theorem in relating model theory and proof theory. J. Symb. Log. 22(3), 269–285 (1957)
7. D’Silva, V., Kroening, D., Purandare, M., Weissenbacher, G.: Interpolant strength. In: Barthe, G., Hermenegildo, M. (eds.) VMCAI 2010. LNCS, vol. 5944, pp. 129–145. Springer, Heidelberg (2010). doi:10.1007/978-3-642-11319-2_12
8. Fedyukovich, G., D’Iddio, A.C., Hyvärinen, A.E.J., Sharygina, N.: Symbolic detection of assertion dependencies for bounded model checking. In: Egyed, A., Schaefer, I. (eds.) FASE 2015. LNCS, vol. 9033, pp. 186–201. Springer, Heidelberg (2015). doi:10.1007/978-3-662-46675-9_13
9. Leino, K.R.M., Wüstholz, V.: Fine-grained caching of verification results. In: Kroening, D., Păsăreanu, C.S. (eds.) CAV 2015. LNCS, vol. 9206, pp. 380–397. Springer, Heidelberg (2015). doi:10.1007/978-3-319-21690-4_22
10. McMillan, K.L.: An interpolating theorem prover. Theor. Comput. Sci. 345(1), 101–121 (2005)
11. Müller, P., Schwerhoff, M., Summers, A.J.: Viper: a verification infrastructure for permission-based reasoning. In: Jobstmann, B., Leino, K.R.M. (eds.) VMCAI 2016. LNCS, vol. 9583, pp. 41–62. Springer, Heidelberg (2016). doi:10.1007/978-3-662-49122-5_2
12. Rollini, S.F., Alt, L., Fedyukovich, G., Hyvärinen, A.E.J., Sharygina, N.: PeRIPLO: a framework for producing effective interpolants in SAT-based software verification. In: McMillan, K., Middeldorp, A., Voronkov, A. (eds.) LPAR 2013. LNCS, vol. 8312, pp. 683–693. Springer, Heidelberg (2013). doi:10.1007/978-3-642-45221-5_45
13. Sery, O., Fedyukovich, G., Sharygina, N.: FunFrog: bounded model checking with interpolation-based function summarization. In: Chakraborty, S., Mukund, M. (eds.) ATVA 2012. LNCS, vol. 7561, pp. 203–207. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33386-6_17

Author information

Authors and Affiliations

Università della Svizzera italiana, Lugano, Switzerland
Leonardo Alt, Sepideh Asadi, Antti E. J. Hyvärinen & Natasha Sharygina
King’s College London, London, UK
Hana Chockler & Karine Even Mendoza
University of Washington, Seattle, USA
Grigory Fedyukovich

Corresponding authors

Correspondence to Leonardo Alt, Sepideh Asadi, Hana Chockler, Karine Even Mendoza, Grigory Fedyukovich, Antti E. J. Hyvärinen or Natasha Sharygina.

Editor information

Editors and Affiliations

Inria, Rennes Cedex, France
Axel Legay
University of Limerick and Lero - The Irish Software Research Center, Limerick, Ireland
Tiziana Margaria

About this paper

Cite this paper

Alt, L. et al. (2017). HiFrog: SMT-based Function Summarization for Software Verification. In: Legay, A., Margaria, T. (eds) Tools and Algorithms for the Construction and Analysis of Systems. TACAS 2017. Lecture Notes in Computer Science, vol 10206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-54580-5_12

DOI: https://doi.org/10.1007/978-3-662-54580-5_12
Published: 31 March 2017
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-54579-9
Online ISBN: 978-3-662-54580-5
eBook Packages: Computer Science, Computer Science (R0)
