Rahft: A Tool for Verifying Horn Clauses Using Abstract Interpretation and Finite Tree Automata

Kafle, Bishoksan; Gallagher, John P.; Morales, José F.

doi:10.1007/978-3-319-41528-4_14

Conference paper
First Online: 13 July 2016

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 9779)

Abstract

We present Rahft (Refinement of Abstraction in Horn clauses using Finite Tree automata), an abstraction refinement tool for verifying safety properties of programs expressed as Horn clauses. The paper describes the architecture, strengths and weaknesses, implementation and usage aspects of the tool. Rahft loosely combines three powerful techniques for program verification: (i) program specialisation, (ii) abstract interpretation, and (iii) trace abstraction refinement in a non-trivial way, with the aim of exploiting their strengths and mitigating their weaknesses through the complementary techniques. It is interfaced with an abstract domain, a tool for manipulating finite tree automata and various solvers for reasoning about constraints. Its modular design and customizable components allow for experimenting with new verification techniques and tools developed for Horn clauses.

B. Kafle—Funded by the EU FP7 project 318337, ENTRA.

J.P. Gallagher—Funded by the EU FP7 project 611004, coordination and support action ICT-Energy.

J.F. Morales—Work partially funded by Comunidad de Madrid project S2013/ICE-2731 N-Greens Software, MINECO Projects TIN2012-39391-C04-03 (StrongSoft) and TIN2015-67522-C3-1-R (TRACES), and EU FP7-ICT-2013.3.4 project 610686 POLCA.

1 Constrained Horn Clause Verification and Our Approach

A constrained Horn clause (CHC) is a first order predicate logic formula usually written in the form \(p(X) \leftarrow \phi , p_1(X_1), \dots , p_k(X_k)\) (\(k \ge 0\)) using Constraint Logic Programming (CLP) syntax, where \(\phi \) is a first order logic formula (constraint) with respect to some background theory, \(X_i, X\) are (possibly empty) tuples of distinct variables, and \(p_1,\ldots ,p_k, p\) are predicate symbols. There is a distinguished predicate symbol \( false \) which is interpreted as \(\textsc {False}\). Clauses with \( false \) head are called integrity constraints. A set of CHCs is called a (CLP) program.

An interpretation of a set of CHCs P is a set of constrained facts of the form \(A \leftarrow \phi \) where A is an atom and \( \phi \) is a formula with respect to some background theory. An interpretation that satisfies each clause is called a model (a solution in some works [6, 34]). In Horn clause verification, integrity constraints represent the safety properties to be verified; other clauses represent the program’s behaviours. The CHC verification problem is to check whether there exists a model of P.

Several verification tools have been developed for CHCs, including SeaHorn [24], QARMC [21], VeriMap [16], Convex polyhedral analyser [31], TRACER [29], ELDARICA [27], and Trace abstraction refinement tool [37]. They exploit either Formulation I or Formulation II for Horn clause verification.

Formulation I (deductive): P has a model if and only if \(P \not \vdash false \) (that is, \( false \) is not derivable from P). In CLP terminology, \(P \vdash A\) if and only if the query \(\leftarrow A\) succeeds in P. In this formulation it is sufficient to show that the query \(\leftarrow false \) fails finitely or infinitely. Formulation I forms the basis of the tools described in [25, 37]. As the minimal model of P contains exactly the set of atoms that succeed [28], we have another formulation of the CHC verification problem [20].

Formulation II (model-based): P has a model if and only if \( false \not \in M[\![P ]\!]\), where \(M[\![P ]\!]\) is the minimal model of P. In Formulation II it is sufficient to find a model \(M' \supseteq M[\![P ]\!]\), where \( false \not \in M'\). It forms the basis of tools based on abstract interpretation, interpolation or predicate abstraction [21, 24, 31].

The program in Fig. 1(a) is a simple but challenging problem for many verification tools. \(\mathtt {l(X,Y) \equiv X \ge Y \wedge Y \ge 0}\) is a model of the program, whose solution requires the discovery of the invariants \(\mathtt {X \ge Y}\) and \(\mathtt {Y \ge 0}\). For example, neither QARMC [21] nor SeaHorn [24] (using only the PDR engine [7]) terminates on this program. However, SeaHorn (with PDR and the abstract interpreter IKOS [8]) solves it. Rahft solves it with the pre-processing step alone.
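To make Formulation II concrete, the following minimal sketch checks whether a candidate interpretation satisfies every clause of a small program. The four-clause program in the comments is a hypothetical example chosen for illustration (it is not the program of Fig. 1(a)), and the check enumerates a bounded integer grid, so it is a sanity test rather than a proof. The names `candidate` and `is_model_on_grid` are likewise assumptions.

```python
from itertools import product

# Hypothetical CHC program over the integers (an assumption for illustration):
#   l(X, Y)   <- X = 0, Y = 0.
#   l(X1, Y1) <- l(X, Y), X1 = X + 1, Y1 = Y.
#   l(X1, Y1) <- l(X, Y), X1 = X + 1, Y1 = Y + 1.
#   false     <- l(X, Y), X < Y.

def candidate(x, y):
    """Candidate interpretation of l: X >= Y and Y >= 0."""
    return x >= y and y >= 0

def is_model_on_grid(interp, bound):
    """Check every clause for all integer points in [-bound, bound]^2."""
    points = range(-bound, bound + 1)
    # Fact clause: the head must hold at (0, 0).
    if not interp(0, 0):
        return False
    for x, y in product(points, repeat=2):
        if interp(x, y):
            # Recursive clauses: both successors must satisfy the interpretation.
            if not interp(x + 1, y) or not interp(x + 1, y + 1):
                return False
            # Integrity constraint: its body must be unsatisfiable.
            if x < y:
                return False
    return True
```

On this hypothetical program, the interpretation \(X \ge Y \wedge Y \ge 0\) passes the check, whereas the trivial interpretation \( true \) violates the integrity constraint and the exact initial state \(X = 0 \wedge Y = 0\) is not closed under the recursive clauses.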

Rahft exploits both of the above formulations using techniques based on abstract interpretation over the domain of convex polyhedra, trace abstraction-refinement using finite tree automata (FTAs) and program specialisation using constraint specialisation [30]. The motivations behind this combination are: (i) to benefit from a powerful and scalable technique such as abstract interpretation [13] for verifying properties of programs, (ii) to refine abstract interpretation through automata-theoretic operations, which offer the advantages of simplicity and generality [31], and (iii) to construct highly parametric and configurable verification tools through program transformation [16].

2 Rahft Architecture and Interface

Figure 1(b) gives an overview of Rahft. It compiles to a standalone command line utility that accepts a set of CHCs as input. It consists of two modules, namely Abstraction (green box) and Refinement (red box). Rahft takes a file containing a set of CHCs P as input and returns safe if P has a model and unsafe if it does not.

2.1 Abstraction

The Abstraction module takes a set of CHCs P as input and returns safe, unsafe or a trace representing the abstract derivation of \( false \) together with the set of all derivations (traces) (both represented as FTAs) used while applying abstract interpretation to P. It consists of the following components:

Pre-processor (PP): Pre-processing is a model-preserving source-to-source program transformation of Horn clauses. In principle, any such transformation can be used as a pre-processor, but we use constraint specialisation [30]. The specialisation consists of strengthening the constraints in the clauses using abstract interpretation [13] and query-answer transformation [3, 17] of the original program. The specialisation is independent of the abstract domain and the background theory underlying the clauses and does not unfold the clauses at all. This has been proven to be an effective transformation [30] for verifying Horn clauses [15] and as a pre-processor to other Horn clause verification tools such as [21].

Abstract Interpreter (AI): The AI implements a fixed point algorithm over the domain of convex polyhedra [12] based on abstract interpretation [13]. It constructs an over-approximation M of the minimal model of a program P, where M contains at most one constrained fact \(p(X) \leftarrow \phi \) for each predicate p. The constraint \(\phi \) is a conjunction of linear inequalities, representing a convex polyhedron. The set of traces used during abstract interpretation of P can be captured by an FTA, say \(\mathcal {A}_P\), using M as shown in [32]. An FTA is a mathematical model capable of capturing tree-structured computations (Horn clause derivations) (see [31] for the correspondence between a program and an FTA).
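The shape of such a fixed point computation can be sketched on a much simpler domain. The example below uses intervals instead of convex polyhedra (a deliberate simplification, not what Rahft implements) and a hypothetical one-predicate program; the function names and the `delay` parameter are assumptions. It iterates an abstract immediate-consequence operator, applies widening to force termination, and finishes with one narrowing step.

```python
import math

# Hypothetical program (an assumption for illustration):
#   p(X)  <- X = 0.
#   p(X1) <- p(X), X < 10, X1 = X + 1.

BOTTOM = None  # the empty interval

def join(a, b):
    """Least upper bound of two intervals."""
    if a is BOTTOM:
        return b
    if b is BOTTOM:
        return a
    return (min(a[0], b[0]), max(a[1], b[1]))

def widen(old, new):
    """Standard interval widening: unstable bounds jump to infinity."""
    if old is BOTTOM:
        return new
    if new is BOTTOM:
        return old
    lo = old[0] if new[0] >= old[0] else -math.inf
    hi = old[1] if new[1] <= old[1] else math.inf
    return (lo, hi)

def step(p):
    """Abstract immediate-consequence operator for the two clauses."""
    result = (0, 0)  # p(X) <- X = 0
    if p is not BOTTOM:
        lo, hi = p[0], min(p[1], 9)  # guard X < 10 over the integers
        if lo <= hi:
            result = join(result, (lo + 1, hi + 1))  # X1 = X + 1
    return result

def analyse(delay=3):
    """Iterate to a post-fixpoint, widening after `delay` plain iterations."""
    p, i = BOTTOM, 0
    while True:
        nxt = join(p, step(p))
        if i >= delay:
            nxt = widen(p, nxt)
        if nxt == p:
            break
        p, i = nxt, i + 1
    # One narrowing step; sound because p is a post-fixpoint of step.
    return step(p)
```

Here widening first over-shoots to the interval from 0 to infinity, and the narrowing step recovers the bound 10 from the guard. A relational domain such as convex polyhedra follows the same iteration scheme but can additionally express constraints between variables, such as \(X \ge Y\).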

The approximation M and the pre-processed clauses can be used by other Horn clause tools, for example [21]. These tools can strengthen M (which may contain some useful invariants) incrementally to construct a model of P rather than starting from a coarse abstraction (\(p(X) \leftarrow true \) for each predicate p of P).

Verifier: The verifier receives M and \(\mathcal {A}_P\) and checks the safety of the clauses based on some simple condition. The clauses are safe if there is no constrained fact for \(\textit{false}\) in M (M is then called a safe inductive invariant or a model of P) or there are no error traces rooted at \( false \). Otherwise we do not know whether the clauses are unsafe or whether the approximation was too imprecise. In this case, the verifier picks a trace, say \(t \in \mathcal {A}_P\), representing the abstract derivation of \( false \) (if any) from the set of traces. If t is feasible (when simulated in P), then P is unsafe and t is a counterexample; otherwise we refine P.

2.2 Refinement

The Refinement module takes as input a program P and two FTAs: (i) one recognising the set of all possible traces of P; and (ii) one recognising a set of infeasible traces. A difference automaton is computed from these automata, which recognises all traces except the infeasible ones. A refined program is obtained as output using the difference automaton and P. Rather than eliminating a single infeasible trace in each refinement iteration, we generalise it using an interpolant automaton [25, 32, 37], thereby eliminating a possibly infinite number of infeasible traces. The refinement is simple and general, since it is independent of the abstract domain and background theory underlying the clauses. The Refinement module consists of the following components:

Finite tree automata manipulator (FTAM): FTAM takes as input two FTAs and outputs their difference automaton. The FTA difference construction needs determinisation; we built upon an optimised determinisation algorithm by Gallagher, Ajspur and Kafle [19] which scales well in practice, generating transitions of the determinised automaton in a very compact form called product form.
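The language recognised by a difference automaton can be illustrated without building that automaton. The sketch below runs two bottom-up FTAs on a single trace tree, tracking the set of reachable states at each node (an on-the-fly subset construction for one tree), and tests membership in the difference. It is not the optimised product-form determinisation of [19]; the encoding of transitions and the two example automata are assumptions made for illustration.

```python
from itertools import product

def run(transitions, tree):
    """Return the set of states reachable at the root of `tree`.

    `transitions` maps (symbol, tuple_of_child_states) to a set of states;
    `tree` is a nested tuple (symbol, child_1, ..., child_n).
    """
    symbol, *children = tree
    child_sets = [run(transitions, child) for child in children]
    states = set()
    # For a leaf, product() yields exactly one empty combination.
    for combo in product(*child_sets):
        states |= transitions.get((symbol, combo), set())
    return states

def accepts(fta, tree):
    """An FTA is a pair (transitions, final_states)."""
    transitions, final = fta
    return bool(run(transitions, tree) & final)

def in_difference(fta_a, fta_b, tree):
    """True if `tree` is accepted by fta_a but not by fta_b."""
    return accepts(fta_a, tree) and not accepts(fta_b, tree)

# Hypothetical automata over clause identifiers c1, c2, c3.
ALL_TRACES = (
    {("c1", ()): {"p"}, ("c2", ("p",)): {"p"}, ("c3", ("p",)): {"false"}},
    {"false"},
)
INFEASIBLE = (
    {("c1", ()): {"q1"}, ("c3", ("q1",)): {"qf"}},
    {"qf"},
)
```

With these hypothetical automata, the trace `("c3", ("c1",))` is accepted by both and is therefore excluded from the difference, while `("c3", ("c2", ("c1",)))` is accepted only by `ALL_TRACES` and remains.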

Clause generator (CG): Given a set of clauses P, and an automaton recognising an over-approximation of all feasible traces of P, CG produces a set of clauses which is equisatisfiable with P. For this purpose, we exploit a correspondence between the traces using the clauses and the language of FTAs to generate a new set of clauses.

The refinement offers two advantages: (i) the refinement is manifested in the clauses generated, so we do not need to keep track of the previous refinements; and (ii) the original predicates get split in the refined clauses, which helps improve the precision of the analysis [20].
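The correspondence used by the clause generator can be sketched as follows, assuming that each transition of the refined automaton is labelled by a clause identifier and that automaton states serve as new predicate names. The data layout, the state names `p_1` and `p_2`, and the example program are all hypothetical; Rahft's actual representation may differ.

```python
def atom(name, args):
    """Render an atom; zero-arity predicates such as false take no brackets."""
    return f"{name}({', '.join(args)})" if args else name

def generate_clauses(clauses, transitions):
    """Rename predicates clause by clause according to the FTA transitions."""
    refined = []
    for clause_id, child_states, target in transitions:
        head_args, constraint, body = clauses[clause_id]
        # One child state per body atom, in the same left-to-right order.
        assert len(child_states) == len(body)
        parts = [constraint]
        parts += [atom(state, args)
                  for state, (_, args) in zip(child_states, body)]
        refined.append(f"{atom(target, head_args)} :- {', '.join(parts)}.")
    return refined

# Hypothetical program: clause id -> (head arguments, constraint, body atoms).
CLAUSES = {
    "c1": (("X",), "X = 0", []),                     # p(X) :- X = 0.
    "c2": (("X1",), "X1 = X + 1", [("p", ("X",))]),  # p(X1) :- X1 = X + 1, p(X).
    "c3": ((), "X < 0", [("p", ("X",))]),            # false :- X < 0, p(X).
}

# Hypothetical difference automaton that excludes the infeasible trace c3(c1).
TRANSITIONS = [
    ("c1", (), "p_1"),
    ("c2", ("p_1",), "p_2"),
    ("c2", ("p_2",), "p_2"),
    ("c3", ("p_2",), "false"),
]
```

In the generated clauses the predicate p is split into `p_1` and `p_2`, and the integrity constraint refers only to `p_2`, so the infeasible derivation of \( false \) through clause `c1` is no longer expressible in the refined program.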

2.3 Implementation

Rahft is implemented in Ciao [26] and is available from https://github.com/bishoksan/RAHFT. It consists of a collection of reusable Prolog modules which rely on state-of-the-art specialised external libraries written in C and C++ for handling constraints. We use the Yices SMT solver [18] and the Parma Polyhedra Library [2] for handling the constraints and the FTA library [19] for manipulating FTAs. The construction of an interpolant tree automaton uses an algorithm presented in [36] for computing an interpolant of two formulas. The code consists of over 7,000 lines of Ciao Prolog code split over 42 modules, interfaced to the above-mentioned external libraries. The implementation of iterative fixpoint algorithms is inspired by the approach to the abstract interpretation of logic programs described by Codish and Søndergaard [10]. Data structures for manipulating Horn clauses are based on terms and the internal Prolog database, reusing the optimizations of the underlying machine (e.g., clause indexing) rather than reimplementing them in our tool. The glue code that ties together the general purpose Prolog engine and the specialised solvers written in C and C++ is generated via the Ciao foreign interface [26].

2.4 Strength and Weakness

Rahft is a verification tool for safety properties of programs expressed as Horn clauses; it can be used as a back end solver by different front end tools outputting in CLP form. It handles clauses whose underlying theory is linear arithmetic; other theories are not supported currently. It accepts input in CLP form.

Since different components of Rahft are loosely coupled, the tool can be reconfigured (with very little effort) to produce verification tools solely based on (i) program transformation, as in the iterated specialisation approach [15], by iterating the pre-processing component, (ii) abstract interpretation, only with the AI component, (iii) trace abstraction refinement [25, 37], by iterating the FTAM component, and (iv) a sensible combination thereof, all followed by a lightweight verifier which checks the safety of the clauses based on some condition. Since our tool uses both state abstraction and trace abstraction, it allows application of a wide range of tools and techniques.

We have evaluated Rahft on software verification benchmarks from a variety of sources [4, 5, 22, 23, 27, 29] and the results show that it compares favourably (in time and the number of instances solved) with the other state-of-the-art Horn clause verification tools (see [30–32] for the details).

The domain of convex polyhedra is expensive and is a potential bottleneck for the verification of large code bases. Instead, we can use cheaper domains supported by the Parma Polyhedra Library, such as octagons or intervals, at the cost of precision. Rahft is also limited by the hard-coded limits of the libraries and the Prolog implementation used (e.g. the arity limit on predicates), which may be too restrictive for some verification problems; we intend to lift this restriction through a more suitable data representation. We are aware of some examples from SV-COMP, though not many, that exceed this limit.

We can leverage state-of-the-art interpolating SMT solvers [9, 33] for the tree interpolant generation which can be used for constructing an interpolant tree automaton; our current implementation does not scale well. Furthermore we aim to handle more advanced data structures such as arrays, maps and sets, requiring more expressive theories than linear arithmetic. One way to achieve this is by composing abstract domains as described in [11, 14]; we are also aware of the support for the reduced product of domains in the PPL library.

Rahft is able to generate a model (a counterexample) if it proves the safety (unsafety) of a program. We need bookkeeping to generate these witnesses with respect to the original program, and this sometimes becomes rather challenging because of the use of external libraries, tools or the transformations applied.

3 Future Work

Future work will involve making Rahft a more flexible tool so that the user can configure other parameters such as abstract domains and pre-processors. We are also planning a detailed performance measurement of the tool to detect bottlenecks, and work on language-based optimisations to minimize them. Generating a model or a counterexample with respect to the original program and handling clauses with richer background theories (arrays, uninterpreted functions) are on our to-do list. In addition, we are extending Rahft to consider Horn clauses in SMT-LIB format [1], though several Horn clause verification tools use standard CLP notation [16, 21, 31].

References

1. SMT-LIB format. http://smtlib.cs.uiowa.edu. Accessed 5 May 2016
2. Bagnara, R., Hill, P.M., Zaffanella, E.: The Parma Polyhedra Library: toward a complete set of numerical abstractions for the analysis and verification of hardware and software systems. SCP 72(1–2), 3–21 (2008)
3. Bancilhon, F., Maier, D., Sagiv, Y., Ullman, J.: Magic sets and other strange ways to implement logic programs. In: Proceedings of the 5th ACM SIGMOD-SIGACT Symposium on Principles of Database Systems (1986)
4. Beyer, D.: Second competition on software verification - (summary of SV-COMP 2013). In: Piterman and Smolka [35], pp. 594–609
5. Beyer, D.: Software verification and verifiable witnesses. In: Baier, C., Tinelli, C. (eds.) TACAS 2015. LNCS, vol. 9035, pp. 401–416. Springer, Heidelberg (2015)
6. Bjørner, N., McMillan, K., Rybalchenko, A.: On solving universally quantified Horn clauses. In: Logozzo, F., Fähndrich, M. (eds.) Static Analysis. LNCS, vol. 7935, pp. 105–125. Springer, Heidelberg (2013)
7. Bradley, A.R., Manna, Z.: Property-directed incremental invariant generation. Formal Asp. Comput. 20(4–5), 379–405 (2008)
8. Brat, G., Navas, J.A., Shi, N., Venet, A.: IKOS: a framework for static analysis based on abstract interpretation. In: Giannakopoulou, D., Salaün, G. (eds.) SEFM 2014. LNCS, vol. 8702, pp. 271–277. Springer, Heidelberg (2014)
9. Cimatti, A., Griggio, A., Schaafsma, B.J., Sebastiani, R.: The MathSAT5 SMT solver. In: Piterman and Smolka [35], pp. 93–107
10. Codish, M., Søndergaard, H.: Meta-circular abstract interpretation in Prolog. In: Mogensen, T.Æ., Schmidt, D.A., Sudborough, I.H. (eds.) The Essence of Computation. LNCS, vol. 2566, pp. 109–134. Springer, Heidelberg (2002)
11. Cortesi, A., Costantini, G., Ferrara, P.: A survey on product operators in abstract interpretation. In: Banerjee, A., Danvy, O., Doh, K., Hatcliff, J. (eds.) Semantics, Abstract Interpretation, and Reasoning About Programs. EPTCS, vol. 129, pp. 325–336 (2013)
12. Cousot, P., Halbwachs, N.: Automatic discovery of linear restraints among variables of a program. In: Proceedings of the 5th Annual ACM Symposium on Principles of Programming Languages, pp. 84–96 (1978)
13. Cousot, P., Cousot, R.: Abstract interpretation: a unified lattice model for static analysis of programs by construction or approximation of fixpoints. In: Graham, R.M., Harrison, M.A., Sethi, R. (eds.) POPL, pp. 238–252. ACM (1977)
14. Cousot, P., Cousot, R., Mauborgne, L.: The reduced product of abstract domains and the combination of decision procedures. In: Hofmann, M. (ed.) FOSSACS 2011. LNCS, vol. 6604, pp. 456–472. Springer, Heidelberg (2011)
15. De Angelis, E., Fioravanti, F., Pettorossi, A., Proietti, M.: Program verification via iterated specialization. SCP 95, 149–175 (2014)
16. De Angelis, E., Fioravanti, F., Pettorossi, A., Proietti, M.: VeriMAP: a tool for verifying programs through transformations. In: Ábrahám, E., Havelund, K. (eds.) TACAS 2014 (ETAPS). LNCS, vol. 8413, pp. 568–574. Springer, Heidelberg (2014)
17. Debray, S., Ramakrishnan, R.: Abstract interpretation of logic programs using magic transformations. J. Logic Program. 18, 149–176 (1994)
18. Dutertre, B.: Yices 2.2. In: Biere, A., Bloem, R. (eds.) CAV 2014. LNCS, vol. 8559, pp. 737–744. Springer, Heidelberg (2014)
19. Gallagher, J.P., Ajspur, M., Kafle, B.: An optimised algorithm for determinisation and completion of finite tree automata. CoRR, abs/1511.03595 (2015)
20. Gallagher, J.P., Kafle, B.: Analysis and transformation tools for constrained Horn clause verification. TPLP 14(4–5 (additional materials in online edition)), 90–101 (2014)
21. Grebenshchikov, S., Lopes, N.P., Popeea, C., Rybalchenko, A.: Synthesizing software verifiers from proof rules. In: Vitek, J., Lin, H., Tip, F. (eds.) PLDI, pp. 405–416. ACM (2012)
22. Gulavani, B.S., Chakraborty, S., Nori, A.V., Rajamani, S.K.: Automatically refining abstract interpretations. In: Ramakrishnan, C.R., Rehof, J. (eds.) TACAS 2008. LNCS, vol. 4963, pp. 443–458. Springer, Heidelberg (2008)
23. Gupta, A., Rybalchenko, A.: InvGen: an efficient invariant generator. In: Bouajjani, A., Maler, O. (eds.) CAV 2009. LNCS, vol. 5643, pp. 634–640. Springer, Heidelberg (2009)
24. Gurfinkel, A., Kahsai, T., Komuravelli, A., Navas, J.A.: The SeaHorn verification framework. In: Kroening, D., Păsăreanu, C.S. (eds.) CAV 2015. LNCS, vol. 9206, pp. 343–361. Springer, Heidelberg (2015)
25. Heizmann, M., Hoenicke, J., Podelski, A.: Refinement of trace abstraction. In: Palsberg, J., Su, Z. (eds.) SAS 2009. LNCS, vol. 5673, pp. 69–85. Springer, Heidelberg (2009)
26. Hermenegildo, M.V., Bueno, F., Carro, M., López-García, P., Mera, E., Morales, J.F., Puebla, G.: An overview of Ciao and its design philosophy. TPLP 12(1–2), 219–252 (2012)
27. Hojjat, H., Konečný, F., Garnier, F., Iosif, R., Kuncak, V., Rümmer, P.: A verification toolkit for numerical transition systems - tool paper. In: Giannakopoulou, D., Méry, D. (eds.) FM 2012. LNCS, vol. 7436, pp. 247–251. Springer, Heidelberg (2012)
28. Jaffar, J., Maher, M.: Constraint logic programming: a survey. J. Logic Program. 19/20, 503–581 (1994)
29. Jaffar, J., Murali, V., Navas, J.A., Santosa, A.E.: TRACER: a symbolic execution tool for verification. In: Madhusudan, P., Seshia, S.A. (eds.) CAV 2012. LNCS, vol. 7358, pp. 758–766. Springer, Heidelberg (2012)
30. Kafle, B., Gallagher, J.P.: Constraint specialisation in Horn clause verification. In: Asai, K., Sagonas, K. (eds.) Proceedings Workshop on PEPM, PEPM, Mumbai, India, 15–17 January 2015, pp. 85–90. ACM (2015)
31. Kafle, B., Gallagher, J.P.: Horn clause verification with convex polyhedral abstraction and tree automata-based refinement. Comput. Lang. Syst. Struct. (2015, In press). http://www.sciencedirect.com/science/article/pii/S1477842415000822, doi:10.1016/j.cl.2015.11.001
32. Kafle, B., Gallagher, J.P.: Interpolant tree automata and their application in Horn clause verification. CoRR, abs/1601.06521 (2016)
33. McMillan, K.L.: Interpolants from Z3 proofs. In: Bjesse, P., Slobodová, A. (eds.) FMCAD 2011, Austin, TX, USA, 30 October–02 November 2011, pp. 19–27. FMCAD Inc. (2011)
34. McMillan, K.L., Rybalchenko, A.: Solving constrained Horn clauses using interpolation. Technical report, Microsoft Research (2013)
35. Piterman, N., Smolka, S.A. (eds.): TACAS 2013 (ETAPS 2013). LNCS, vol. 7795. Springer, Heidelberg (2013)
36. Rybalchenko, A., Sofronie-Stokkermans, V.: Constraint solving for interpolation. J. Symb. Comput. 45(11), 1212–1233 (2010)
37. Wang, W., Jiao, L.: Trace abstraction refinement for solving Horn clauses. Technical report ISCAS-SKLCS-15-19, SCAS-SKLCS, December 2015. http://lcs.ios.ac.cn/wangwf/TechReportISCAS-SKLCS-15-19.pdf

Author information

Authors and Affiliations

Roskilde University, Roskilde, Denmark
Bishoksan Kafle & John P. Gallagher
IMDEA Software Institute, Madrid, Spain
John P. Gallagher & José F. Morales

Corresponding author

Correspondence to Bishoksan Kafle.

Editor information

Editors and Affiliations

Rice University, Houston, Texas, USA
Swarat Chaudhuri
University of Toronto, Toronto, Ontario, Canada
Azadeh Farzan

About this paper

Cite this paper

Kafle, B., Gallagher, J.P., Morales, J.F. (2016). Rahft: A Tool for Verifying Horn Clauses Using Abstract Interpretation and Finite Tree Automata. In: Chaudhuri, S., Farzan, A. (eds) Computer Aided Verification. CAV 2016. Lecture Notes in Computer Science, vol 9779. Springer, Cham. https://doi.org/10.1007/978-3-319-41528-4_14

DOI: https://doi.org/10.1007/978-3-319-41528-4_14
Published: 13 July 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-41527-7
Online ISBN: 978-3-319-41528-4
eBook Packages: Computer Science, Computer Science (R0)
