CVC3 works with a version of first-order logic with polymorphic types and
has a wide variety of features including:
EP 1.4pre is just a combination of E 1.4pre in verbose mode
and a proof analysis tool extracting the used inference steps. For the
LTB division, a control program uses a SInE-like analysis to extract
reduced axiomatizations that are handed to several instances of E.
The automatic mode is based on a static partition of the set of all
clausal problems based on a number of simple features (number of
clauses, maximal symbol arity, presence of equality, presence of
non-unit and non-Horn clauses,...). Each class of clauses is
automatically assigned a heuristic that performs well on problems from
this class in test runs. About 100 different strategies have been
evaluated on all untyped first-order problems from TPTP 4.1.0.
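As a rough illustration of such a class-based automatic mode (a minimal sketch, not E's actual code; the features, classes, and strategy names below are invented for the example):

    # Sketch: assign a search strategy to a problem via a static partition
    # on simple syntactic features, as in a class-based automatic mode.
    def features(clauses):
        return (
            len(clauses) > 100,                                  # "many clauses"
            any(len(c) > 1 for c in clauses),                    # non-unit clauses present
            any(sum(lit > 0 for lit in c) > 1 for c in clauses)  # non-Horn clauses present
        )

    # Table filled in advance from test runs: best-performing strategy per class.
    STRATEGY_TABLE = {
        (False, False, False): "unit-equational-heuristic",
        (False, True,  False): "horn-heuristic",
    }

    def pick_strategy(clauses, default="generic-symbol-counting"):
        return STRATEGY_TABLE.get(features(clauses), default)

    # Clauses as lists of signed literal ids: positive = positive literal.
    print(pick_strategy([[1], [2], [-1, 3]]))   # -> horn-heuristic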
An E-hyper tableau is a tree whose nodes are labeled with clauses and which
is built up by the application of the inference rules of the E-hyper tableau
calculus.
The calculus rules are designed such that most of the reasoning is performed
using positive unit clauses.
Splitting is done without rigid variables.
Instead, variables which would be shared between branches are prevented by
ground substitutions, which are guessed from the Herbrand universe and
constrained by rewrite rules.
Redundancy rules allow the detection and removal of clauses that are redundant
with respect to a branch.
The hyper extension inference from the original hyper tableau calculus is
equivalent to a series of E-hyper tableau calculus inference applications.
Therefore the implementation of the hyper extension in KRHyper by a variant
of semi-naive evaluation [Ull89] is retained in E-KRHyper, where it serves
as a shortcut inference for the resolution of non-equational literals.
In the LTB division E-KRHyper sequentially tries three axiom selection
strategies:
an implementation of Krystof Hoder's SInE algorithm,
another incomplete selection based on the CNF representations of the axioms,
and finally the complete axiom set.
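A minimal sketch of this kind of sequential fall-back over axiom selections, with hypothetical function names rather than E-KRHyper's actual interfaces:

    # Sketch: try increasingly complete axiom selections until one proof attempt succeeds
    # (the real selections are SInE, a CNF-based selection, and the full axiom set).
    def prove_with_selections(conjecture, axioms, selections, prove, time_slice):
        for select in selections:
            result = prove(conjecture, select(conjecture, axioms), timeout=time_slice)
            if result == "Theorem":
                return result
        return "GaveUp"

    # Dummy demonstration: the first selection is too aggressive, the second suffices.
    selections = [lambda c, ax: ax[:1], lambda c, ax: ax]
    prove = lambda c, ax, timeout: "Theorem" if len(ax) >= 2 else "GaveUp"
    print(prove_with_selections("goal", ["a1", "a2"], selections, prove, 60))  # -> Theorem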
Major additions in the current version are:
iProver is available from:
iProver-Eq consists of three core components: i) ground reasoning
by an SMT solver, ii) first-order equational reasoning on literals
in a candidate model by a labelled unit superposition calculus
[KS10,KS10] and iii) instantiation of clauses with substitutions
obtained by ii).
Given a set of first-order clauses, iProver-Eq first abstracts it
to a set of ground clauses which are then passed to the ground solver.
If the ground abstraction is unsatisfiable, then the set of first-order
clauses is also unsatisfiable.
Otherwise, literals are selected from the first-order clauses based on
the model of the ground solver.
The labelled unit superposition calculus checks whether selected literals
are conflicting.
If they are conflicting, then clauses are instantiated such that the
ground solver has to refine its model in order to resolve the conflict.
Otherwise, satisfiability of the initial first-order clause set is shown.
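The loop just described can be rendered abstractly as follows (a sketch with placeholder functions, not iProver-Eq's actual code or data structures):

    # Sketch of the abstraction-refinement loop described above.
    def inst_gen_eq(clauses, ground_abstraction, sat_solve, select_literals,
                    find_conflict, instantiate, max_rounds=1000):
        for _ in range(max_rounds):
            model = sat_solve(ground_abstraction(clauses))
            if model is None:                       # ground abstraction unsatisfiable
                return "Unsatisfiable"
            selected = select_literals(clauses, model)
            conflict = find_conflict(selected)      # unit superposition on selected literals
            if conflict is None:                    # no conflict: the model lifts
                return "Satisfiable"
            clauses = clauses | instantiate(clauses, conflict)  # refine by new instances
        return "Unknown"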
Clause selection and literal selection in the unit superposition
calculus are implemented in separate given clause algorithms.
Relevant substitutions are accumulated in labels during unit superposition
inferences and then used to instantiate clauses.
For redundancy elimination iProver-Eq uses demodulation, dismatching
constraints and global subsumption.
In order to efficiently propagate redundancy elimination from instantiation
into unit superposition, we implemented different representations of labels
based on sets, AND/OR-trees and OBDDs.
Non-equational resolution and equational superposition inferences provide
further simplifications.
For the LTB division, iProver-Eq-SInE uses axiom selection based on the
SInE algorithm [HV11] as implemented in Vampire [HKV10], i.e., axiom
selection is done by Vampire and proof attempts are done by iProver-Eq.
If no equational literals occur in the input, iProver-Eq falls back
to the inference rules of iProver, otherwise the latter are disabled
and only unit superposition is used.
If all clauses are unit equations, no instantiations need to be generated
and the calculus is run without the otherwise necessary bookkeeping.
Although Isabelle is designed for interactive proof development,
it is a little-known fact that Isabelle can be run from
the command line, passing in a theory file with a formula to solve.
Isabelle theory files can include Standard ML code to be executed when the
file is processed.
The TPTP2X Isabelle format module outputs a THF problem in Isabelle/HOL
syntax, augmented with ML code that (1) runs the ten tactics in sequence,
each with a CPU time limit, until one succeeds or all fail, and
(2) reports the result and proof (if found) using the SZS
standards.
A Perl script is used to insert the CPU time limit (equally divided
over the ten tactics) into TPTP2X's Isabelle format output, and then
run the command line isabelle-process on the resulting theory file.
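A minimal sketch of this scheduling scheme, written in Python for illustration rather than the Perl/ML actually used, with dummy tactics standing in for the real ones:

    # Sketch: divide the overall CPU limit equally over the tactics and
    # stop at the first success, reporting an SZS status.
    def run_tactics(goal, tactics, total_limit):
        per_tactic = total_limit / len(tactics)
        for name, tactic in tactics:
            if tactic(goal, timeout=per_tactic):
                return "% SZS status Theorem ({})".format(name)
        return "% SZS status GaveUp"

    # Dummy tactics for illustration.
    tactics = [("simp", lambda g, timeout: False), ("blast", lambda g, timeout: True)]
    print(run_tactics("goal", tactics, 300))   # -> % SZS status Theorem (blast)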
The implementation of Isabelle relies on a small LCF-style kernel, meaning that
inferences are implemented as operations on an abstract theorem
datatype. Assuming the kernel is correct, all values of type theorem
are correct by construction.
Most of the code for Isabelle was written by the Isabelle teams at the
University of Cambridge and the Technische Universität München.
A notable exception is the metis proof method, which was taken from
the HOL4 theorem prover (also implemented in ML).
Isabelle/HOL is available for all major platforms under a BSD-style license
from
leanCoP can read formulae using the leanCoP syntax as well as the
(raw) TPTP syntax format.
Equality axioms are automatically added if required.
The core leanCoP prover returns a very compact connection proof, which is
translated into a readable proof.
Several output formats are available.
As the main enhancement leanCoP 2.2 now supports the output of proofs in
an unofficial TPTP syntax format for representing derivations in connection
(tableau) calculi [OS10].
Furthermore, besides the Linux/Unix and MacOS platforms, most Windows
platforms are now supported as well.
The source code of leanCoP 2.2 is available under the GNU general
public license. Together with more information it can be found on the
leanCoP website at
The improved performance of LEO-II in comparison to its predecessor LEO
(implemented in LISP) is due to several novel features including the
exploitation of term sharing and term indexing techniques [TB06], support
for primitive equality reasoning (extensional higher-order RUE resolution),
and improved heuristics at the calculus level.
One recent development is LEO-II's new parser: in addition to the TPTP
THF language, this parser now also supports the TPTP FOF and CNF
languages.
Hence, LEO-II can now also be used for FOF and CNF problems.
Unfortunately the LEO-II system still uses only a very simple
sequential collaboration model with first-order ATPs instead of using
the more advanced, concurrent and resource-adaptive OANTS architecture
[BS+08] as exploited by its predecessor LEO.
The LEO-II system is distributed under a BSD style license, and it is
available from:
The improved performance of LEO-II in comparison to its predecessor LEO
(implemented in LISP) is due to several novel features including the
exploitation of term sharing and term indexing techniques [TB06], support
for primitive equality reasoning (extensional higher-order RUE resolution),
and improved heuristics at the calculus level.
LEO-II's parser supports the TPTP THF0 language and also the TPTP languages
FOF and CNF.
Unfortunately the LEO-II system still uses only a very simple
sequential collaboration model with first-order ATPs instead of using
the more advanced, concurrent and resource-adaptive OANTS architecture
[BS+08] as exploited by its predecessor LEO.
The LEO-II system is distributed under a BSD style license, and it is
available from:
LEO-II will again participate in the FOF and CNF categories in
order to evaluate its performance for these fragments.
For this, note that LEO-II still employs its own input processing and
normalization techniques, and that calls to the prover E are applied only
modulo Hurd's fully typed translation.
MELIA accepts formulas in the TFF format (typed TPTP formulas, see
the TPTP technical
report).
It includes a pre-processor for transforming such formulas into (sorted)
clausal logic over foreground sorts specified in the input files and
built-in linear integer arithmetic.
Expectations are not high.
Experiments with various first order calculi [Hur03]
have shown a given clause algorithm and ordered resolution to best
suit this application, and that is what Metis 2.3 implements.
Since equality often appears in interactive theorem prover goals, Metis 2.3
also implements the ordered paramodulation calculus.
In addition to standard age and size measures, Metis 2.3 uses
finite models to weight clauses in the Passive set.
When integrated with higher order logic, an interpretation of known
functions and relations is manually constructed to make many of their
standard properties valid in the finite model. For example, the domain
of the model is the set {0,...,7}, and the higher order logic
arithmetic functions are interpreted in the model modulo 8.
Unknown functions and relations are interpreted randomly, but with a
bias towards making supporting theorems valid in the model.
The finite model strategy carries over to TPTP problems, by manually
interpreting a collection of functions and relations that appear in TPTP axiom
files in such a way as to make the axioms valid in the model.
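The following sketch illustrates the idea of weighting a clause by its validity in the 8-element model; it is not Metis' actual code, and the term representation and weighting scheme are invented for the example:

    # Sketch: score a clause by how often it holds under random variable
    # assignments into the 8-element model (arithmetic interpreted mod 8).
    import random

    DOMAIN = range(8)
    FUNCS = {"plus": lambda x, y: (x + y) % 8, "times": lambda x, y: (x * y) % 8}
    PREDS = {"leq": lambda x, y: x <= y, "eq": lambda x, y: x == y}

    def eval_term(t, env):
        if isinstance(t, str):                        # variable
            return env[t]
        f, args = t
        return FUNCS[f](*[eval_term(a, env) for a in args])

    def eval_literal(lit, env):
        positive, p, args = lit
        value = PREDS[p](*[eval_term(a, env) for a in args])
        return value if positive else not value

    def model_score(clause, variables, samples=100):
        hits = 0
        for _ in range(samples):
            env = {v: random.choice(list(DOMAIN)) for v in variables}
            hits += any(eval_literal(lit, env) for lit in clause)
        return hits / samples                         # 1.0 = looks valid in the model

    # The clause  plus(X,Y) = plus(Y,X)  scores 1.0; a false clause scores lower.
    commutativity = [(True, "eq", [("plus", ["X", "Y"]), ("plus", ["Y", "X"])])]
    print(model_score(commutativity, ["X", "Y"]))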
Metis 2.3 reads problems in TPTP format and outputs detailed proofs
in TSTP format, where each refutation step is one of 6 simple
inference rules.
Metis 2.3 implements a complete calculus, so when the set of clauses is
saturated it can soundly declare the input problem to be unprovable (and
outputs the saturation set).
Metis 2.3 is free software, released under the MIT license.
It can be downloaded from
The system is also able to work with second order statements. It
may also receive knowledge and know-how for a specific domain from a
human user; see [Pas89] and [Pas93]. These two possibilities
are not used while working with the TPTP Library.
Muscadet is available from:
SAT solvers are particularly sensitive to the encoding of problems, so special
care is needed when translating HOL formulas.
As a rule, HOL scalars are mapped to FORL singletons and functions are mapped to
FORL relations accompanied by a constraint.
An n-ary first-order function (curried or not) can be coded as an
(n + 1)-ary relation accompanied by a constraint. However, if the return
type is the type of Booleans, the function is more efficiently coded as an
unconstrained n-ary relation.
Higher-order quantification and functions bring complications of their own. A
function from σ to τ cannot be directly passed as an argument in FORL;
Nitpick's workaround is to pass |σ| arguments of type τ that encode a
function table.
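A small sketch of the function-as-relation encoding over a finite scope (illustrative only, not Nitpick's actual FORL translation):

    from itertools import product

    def function_table(f, domain, arity):
        # Graph of an n-ary function as an (n+1)-ary relation.
        return {args + (f(*args),) for args in product(domain, repeat=arity)}

    def is_functional(relation, domain, arity):
        # The constraint accompanying the relation: exactly one result per argument tuple.
        return all(sum(1 for t in relation if t[:arity] == args) == 1
                   for args in product(domain, repeat=arity))

    dom = range(3)
    r = function_table(lambda x, y: (x + y) % 3, dom, 2)
    print(is_functional(r, dom, 2))    # -> True

    # A Boolean-valued function needs no such constraint: it is just the set of
    # argument tuples on which it returns True.
    leq = {args for args in product(dom, repeat=2) if args[0] <= args[1]}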
Nitpick is available as part of Isabelle/HOL for all major platforms under a
BSD-style license from
The translation from HOL to Kodkod's first-order relational logic (FORL) is
parameterized by the cardinalities of the atomic types occurring in it. Nitrox
enumerates the possible cardinalities for the universe. If a formula has a
finite counterexample, the tool eventually finds it, unless it runs out of
resources.
Nitpick is optimized to work with higher-order logic (HOL) and its definitional
principles (e.g., (co)inductive predicates, (co)inductive datatypes,
(co)recursive functions). When invoked on an untyped first-order problem, few of
its optimizations come into play, and the problem handed to Kodkod is
essentially a first-order relational logic (FORL) rendering of the TPTP FOF
problem. One exception is nested quantifiers, which
Nitpick optimizes before Kodkod gets a chance to look at them [BN10].
Acknowledgments: Ross Overbeek, Larry Wos, Bob Veroff, and
Rusty Lusk contributed to the development of Otter.
Paradox incorporates the following features: Polynomial-time
clause splitting heuristics, the use of incremental SAT,
static symmetry reduction techniques, and the use of sort
inference.
Refute is available as part of Isabelle/HOL for all major platforms under a
BSD-style license from
The theoretical basis of search is a complete ground tableau calculus for
higher-order logic [BS10] with a choice operator [BB11].
A problem is given in the THF format.
A branch is formed from the axioms of the problem and the negation of the
conjecture (if any is given).
From this point on, Satallax tries to determine unsatisfiability or
satisfiability of this branch.
Satallax progressively generates higher-order formulae and corresponding
propositional clauses [Bro11].
These formulae and propositional clauses correspond to instances of the
tableau rules.
Satallax uses the SAT solver MiniSat as an engine to test the current set
of propositional clauses for unsatisfiability.
If the clauses are unsatisfiable, then the original branch is unsatisfiable.
If there are no quantifiers at function types, the generation of higher-order
formulae and corresponding clauses may terminate [BS09,BS09].
In such a case, if MiniSat reports the final set of clauses as satisfiable,
then the original set of higher-order formulae is satisfiable (by a standard
model in which all types are interpreted as finite sets).
A strategy schedule is an ordered collection of modes with information about
how much time the mode should be allotted.
Satallax tries each of the modes for a certain amount of time sequentially.
Satallax 2.1 has eight strategy schedules which were determined through
experimentation using the THF problems in version 5.1.0 of the TPTP library.
One of these eight strategy schedules is chosen based on the amount of time
Satallax is given to solve the problem.
For example, if Satallax is given 180 seconds to solve the problem, then
a schedule with 38 modes is chosen.
For the TFA division, the TFF formulae are converted to FOF using the
standard approach [Wal83], with type predicates to check the types of
numeric variables.
Numbers are represented internally as special constant symbols that carry
both the type and the value.
The basic mathematical functionality is provided by a new inference rule
called ground arithmetic rewriting.
Given a clause as a premise, it traverses the term structure of all its
literals in a bottom up fashion, and performs the symbolically represented
arithmetic operations whenever all arguments are numbers (thus even terms
below uninterpreted symbols may get simplified).
Arithmetic predicates (including equality) may get evaluated, which simplifies
the clause, or might show its redundancy.
The GMP arithmetic library is used for arbitrary precision computation.
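A minimal sketch of the bottom-up evaluation underlying ground arithmetic rewriting (illustrative only; the real rule also evaluates arithmetic predicates and uses GMP numbers rather than machine integers):

    from numbers import Number

    ARITH = {"$sum": lambda x, y: x + y, "$product": lambda x, y: x * y}

    def rewrite(term):
        if isinstance(term, (Number, str)):               # number or variable
            return term
        f, args = term
        args = [rewrite(a) for a in args]                 # bottom-up traversal
        if f in ARITH and all(isinstance(a, Number) for a in args):
            return ARITH[f](*args)                        # all arguments are numbers: compute
        return (f, args)                                  # keep uninterpreted structure

    # f($sum(2,3), X)  rewrites to  f(5, X), even though f is uninterpreted.
    print(rewrite(("f", [("$sum", [2, 3]), "X"])))        # -> ('f', [5, 'X'])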
To extend the mathematical capabilities beyond ground arithmetic rewriting,
Mathematica is used as an external source of axioms.
SPASS-XDB generates requests from negative arithmetic literals, taking
advantage of the fact that Mathematica understands all arithmetic and
logical symbols.
When SPASS-XDB selects a negative literal that contains arithmetic symbols,
all negative literals in the clause are scanned to check whether they can
be conjoined with the selected literal into a single request.
The S2M2S mediator translates the request into the Mathematica
language, and calls the FindInstance function of Mathematica.
FindInstance inputs an expression and a set of variables that
occur in the expression, and finds instances of the variables that make
the expression true.
Axioms that are instances of the first literal in the request are returned.
The mediator reads the answer, and creates new axioms that are passed back
to SPASS-XDB.
When searching for a proof, TPS first searches for an expansion proof
[Mil87] or an extensional expansion proof [Bro07] of the theorem.
Part of this process involves searching for acceptable matings [And81].
Using higher-order unification, a pair of occurrences of subformulae (which
are usually literals) is mated appropriately on each vertical path through
an expanded form of the theorem to be proved.
The expansion proof thus obtained is then translated [Pfe87] without further
search into a proof of the theorem in natural deduction style.
TPS is implemented in Common Lisp, and is available from
Substitution tree and code tree indexes are used to implement all major
operations on sets of terms, literals and clauses.
Although the kernel of the system works only with clausal normal form, the
shell accepts a problem in the full first-order logic syntax, clausifies
it and performs a number of useful transformations before passing the result
to the kernel.
Also the axiom selection algorithm SInE [HV11] can be enabled as part of
the preprocessing.
When a theorem is proved, the system produces a verifiable proof, which
validates both the clausification phase and the refutation of the CNF.
CVC3 2.4
Clark Barrett1, Cesare Tinelli2
1New York University, 2University of Iowa
Architecture
CVC3 [BT07] is a DPLL-based theorem prover for Satisfiability Modulo
Theories (SMT) problems.
It can be used to prove the validity (or, dually, the satisfiability) of
first-order formulas in a large number of built-in logical theories and
their combination.
CVC3 is the last offspring of a series of popular SMT provers, which
originated at Stanford University with the SVC system. In particular,
it builds on the code base of CVC Lite, its most recent predecessor.
Its high level design follows that of the Sammy prover.
Strategies
CVC3 uses congruence closure for equality and uninterpreted functions,
and Fourier-Motzkin for arithmetic.
Perhaps most relevant to CASC are the strategies for quantifiers.
CVC3 uses E-matching and instantiation heuristics to search for quantifier
instantiations that can close search branches.
In addition, some heuristics for complete instantiation are available.
Implementation
CVC3 is implemented in C++.
For details of the implementation, downloads, additional publications, and
a user's guide, please refer to the CVC3 web site
http://www.cs.nyu.edu/acsys/cvc3
Expected Competition Performance
CVC3 is being entered as an experimental joint venture of the SMT and
ATP community.
We expect its performance to be somewhere in the middle of the pack as we
have made no specific effort to tune it for CASC.
We hope that its performance will shed light on areas where SMT solvers
are strong as opposed to ATP systems.
E(P/LTB) 1.4pre
Stephan Schulz
Technische Universität München, Germany
Architecture
E 1.4pre [Sch02,Sch04] is described in this section.
E is a purely equational theorem prover for full first-order logic with
equality.
It consists of an (optional) clausifier for pre-processing full
first-order formulae into clausal from, and a saturation algorithm
implementing an instance of the superposition calculus with negative
literal selection and a number of redundancy elimination techniques.
E is based on the DISCOUNT-loop variant of the given-clause
algorithm, i.e., a strict separation of active and passive facts.
No special rules for non-equational literals have been implemented.
Resolution is effectively simulated by paramodulation and equality
resolution.
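The given-clause skeleton that E instantiates can be sketched abstractly as follows (placeholder functions; E's real loop interleaves simplification, literal selection, and indexing):

    # Abstract sketch of a DISCOUNT-style given-clause loop.
    def discount_loop(axioms, pick, generate, is_empty_clause, max_iter=100000):
        active, passive = set(), set(axioms)
        for _ in range(max_iter):
            if not passive:
                return "Saturated"                 # no proof: clause set is saturated
            given = pick(passive)                  # clause evaluation heuristic
            passive.discard(given)
            if is_empty_clause(given):
                return "Proof found"
            active.add(given)
            passive |= generate(given, active)     # all inferences between given and active
        return "ResourceOut"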
Strategies
Proof search in E is primarily controlled by a literal selection
strategy, a clause evaluation heuristic, and a simplification
ordering. The prover supports a large number of pre-programmed literal
selection strategies. Clause evaluation heuristics can be constructed
on the fly by combining various parametrized primitive evaluation
functions, or can be selected from a set of predefined
heuristics. Clause evaluation heuristics are based on symbol-counting,
but also take other clause properties into account. In particular, the
search can prefer clauses from the set of support, or containing many
symbols also present in the goal. Supported term orderings are several
parametrized instances of Knuth-Bendix-Ordering (KBO) and
Lexicographic Path Ordering (LPO).
Implementation
E is built around perfectly shared terms, i.e., each distinct term is
only represented once in a term bank. The whole set of terms thus
consists of a number of interconnected directed acyclic graphs.
Term memory is managed by a simple mark-and-sweep garbage collector.
Unconditional (forward) rewriting using unit clauses is implemented
using perfect discrimination trees with size and age constraints.
Whenever a possible simplification is detected, it is added as a rewrite
link in the term bank.
As a result, not only terms, but also rewrite steps are shared.
Subsumption and contextual literal cutting (also known as subsumption
resolution) is supported using feature vector indexing [Sch04].
Superposition and backward rewriting use fingerprint indexing, a new
technique combining ideas from feature vector indexing and path
indexing.
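As an illustration of the feature-vector idea: a clause C can subsume a clause D only if every feature of C is at most the corresponding feature of D, which yields a cheap pre-filter. A sketch with only two features follows (E's index uses many more features and stores the vectors in a trie):

    def feature_vector(clause):
        pos = [lit for lit in clause if lit > 0]
        neg = [lit for lit in clause if lit < 0]
        return (len(pos), len(neg))        # counts of positive/negative literals

    def may_subsume(c, d):
        # Necessary condition: every feature of c is <= the corresponding feature of d.
        return all(fc <= fd for fc, fd in zip(feature_vector(c), feature_vector(d)))

    # Only clauses passing this cheap test are handed to the full subsumption check.
    print(may_subsume([1], [1, -2]))       # -> True  (candidate)
    print(may_subsume([1, -2], [1]))       # -> False (cannot subsume)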
Finally, LPO and KBO are implemented using the elegant and efficient
algorithms developed by Bernd Löchner in [Loe06,Loe06].
The prover and additional information are available at
http://www.eprover.org
Expected Competition Performance
E 1.4pre is relatively little changed from last year's entry.
The system is expected to perform well in most proof classes, but will at
best complement top systems in the disproof classes.
E-Darwin 1.4
Björn Pelzer
University Koblenz-Landau, Germany
Architecture
E-Darwin 1.4 [BFT04,BPT11] is an automated theorem prover for first order
clausal logic with equality.
It is a modified version of the Darwin prover [BFT04],
intended as a testbed for variants of the Model Evolution calculus [BT03].
Among other things it implements several different approaches [BT05,BPT11]
to incorporating equality reasoning into Model Evolution.
Three principal data structures are used:
the context (a set of rewrite literals),
the set of constrained clauses,
and the set of derived candidates.
The prover always selects one candidate,
which may be a new clause or a new context literal,
and exhaustively computes inferences with this candidate and
the context and clause set, moving the results to the candidate set.
Afterwards the candidate is inserted into one of the context or the clause set,
respectively, and the next candidate is selected.
The inferences are superposition-based.
Demodulation and various means of redundancy detection are used as well.
Strategies
The uniform search strategy is identical to the one employed in the original
Darwin, slightly adapted to account for derived clauses.
Implementation
E-Darwin is implemented in the functional/imperative language OCaml.
Darwin's method of storing partial unifiers
has been adapted to equations and subterm positions
for the superposition inferences in E-Darwin.
A combination of perfect and non-perfect discrimination tree indexes
is used to store the context and the clauses.
The system has been tested on Unix and is available under the GNU Public
License from the E-Darwin website at
http://www.uni-koblenz.de/~bpelzer/edarwin
Expected Competition Performance
There have been some calculus changes since last year,
but the performance should remain similar.
While the original Darwin performs strongly in EPR, E-Darwin is more of
a generalist, less effective in EPR, yet stronger in the other divisions.
E-KRHyper 1.2
Björn Pelzer
University Koblenz-Landau, Germany
Architecture
E-KRHyper [PW07] is a theorem proving and model generation system for
first-order logic with equality.
It is an implementation of the E-hyper tableau calculus [BFP07], which
integrates a superposition-based handling of equality [BG98] into the hyper
tableau calculus [BFN96].
The system is an extension of the KRHyper theorem prover [Wer03],
which implements the original hyper tableau calculus.
Strategies
E-KRHyper uses a uniform search strategy for all problems.
The E-hyper tableau is generated depth-first, with E-KRHyper always working
on a single branch.
Refutational completeness and a fair search control are ensured by an
iterative deepening strategy with a limit on the maximum term weight of
generated clauses.
Implementation
E-KRHyper is implemented in the functional/imperative language OCaml.
The system accepts input in the TPTP-format and in the TPTP-supported
Protein-format.
The calculus implemented by E-KRHyper works on clauses, so first order
formula input is converted into CNF by an algorithm similar to the one
used by Otter [McC03], with some additional connector literals to prevent
explosive clause growth when dealing with DNF-like structures.
E-KRHyper operates on an E-hyper tableau which is represented by linked
node records.
Several layered discrimination-tree based indexes (both perfect and
non-perfect) provide access to the clauses in the tableau and support
backtracking.
The system runs on Unix and MS-Windows platforms, and is available under
the GNU Public License from the E-KRHyper
website at
http://www.uni-koblenz.de/~bpelzer/ekrhyper
Expected Competition Performance
There have been minor improvements since the last version,
but overall E-KRHyper will remain in the middle ground.
E-MaLeS 1.0
Daniel Kuehlwein1, Josef Urban1, Stephan Schulz2
1Radboud Universiteit Nijmegen, The Netherlands,
2Technische Universität München, Germany
Architecture
E-MaLeS is a meta system for E 1.3.
It uses kernel methods to learn which of E's strategies are most likely
to solve a problem.
Furthermore, E-MaLeS runs several strategies, each for a shorter time, instead
of one strategy for the whole time.
Strategies
Since E-MaLeS is based on E, please refer to E's description for its
internal procedures.
The performance of E's strategies was evaluated over the TPTP problems.
Each problem was characterised by E's problem features.
Then kernel methods were used to learn which strategy is most likely
to solve a problem given its features.
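A minimal sketch of this kind of kernel-based prediction (illustrative only; the features and the simple weighted vote below are stand-ins for E-MaLeS' actual learning setup):

    import numpy as np

    def gaussian_kernel(x, y, sigma=1.0):
        return np.exp(-np.linalg.norm(x - y) ** 2 / (2 * sigma ** 2))

    def predict_strategy(problem_features, train_features, train_best_strategy):
        # Kernel-weighted vote over previously evaluated problems.
        weights = [gaussian_kernel(problem_features, f) for f in train_features]
        scores = {}
        for w, s in zip(weights, train_best_strategy):
            scores[s] = scores.get(s, 0.0) + w
        return max(scores, key=scores.get)

    train = [np.array([10.0, 2.0]), np.array([200.0, 5.0])]   # e.g., (#clauses, max arity)
    best = ["strategy_A", "strategy_B"]
    print(predict_strategy(np.array([12.0, 2.0]), train, best))   # -> strategy_A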
Implementation
E-MaLeS is implemented in Python using the Numpy/Scipy library.
Expected Competition Performance
Since E-MaLeS is based on E, we expect it to perform at least as well as E.
FIMO 0.2
Orkunt Sabuncu
University of Potsdam, Germany
Architecture
FIMO is a system for computing finite models of first-order formulas by
incremental Answer Set Programming (iASP).
The input theory is transformed to an incremental logic program.
The answer sets of this program, if any, represent finite models of the input
theory.
iClingo is used for computing answer sets of iASP programs.
Strategies
FIMO is the successor of the system fmc2iasp [GSS10].
Unlike fmc2iasp, FIMO does not rely on flattening for translating the
input theory to a satisfiability problem.
FIMO features symmetry breaking and incremental answer set solving
provided by the underlying iASP system iClingo.
Implementation
FIMO is developed in Python.
Being the successor of fmc2iasp, it will be available from
http://potassco.sourceforge.net
Expected Competition Performance
fmc2iasp performed well against Paradox on the 2009 FNT division problems
(not in the competition, but as published in [GSS10]).
However, FIMO is based on a different strategy.
H2WO4 11.07
David Stanovsky
Charles University in Prague, Czech Republic
Architecture
Tungstic acid is a substance reacting with problems in first-order
logic with special arithmetical functions (as defined in
the TPTP library), producing code in Wolfram's Mathematica
that attempts to find a solution.
The first version, H2WO4 11.07, is a simple script
translating between the two languages and calling built-in functions
of Mathematica to solve the problem.
Strategies
The current version uses various combinations of three functions:
Implementation
This is a pair of Perl scripts: one for parsing TPTP problems into
Mathematica, the other for processing Mathematica's output.
The scripts are available at
http://www.karlin.mff.cuni.cz/~stanovsk/h2wo4/
They were tested with the text-based interface of Mathematica 7.0.
Expected Competition Performance
My motivation is to compare the other entrants with a commercial,
state-of-the-art computer algebra system (Mathematica, in my case).
I hope it does not do well compared to the specially developed systems :-)
Mathematica is strong in pure arithmetic and weak in logical reasoning.
It does well on last year's set of problems from the TPTP.
iProver 0.8
Konstantin Korovin
University of Manchester, United Kingdom
Architecture
iProver is an automated theorem prover based on an instantiation calculus
Inst-Gen [GK03,Kor09] which is complete for first-order logic.
One of the distinctive features of iProver is a modular combination of
first-order reasoning with ground reasoning.
In particular, iProver currently integrates MiniSat [ES04] for reasoning
with ground abstractions of first-order clauses.
In addition to instantiation, iProver implements ordered resolution calculus
and a combination of instantiation and ordered resolution; see
[Kor08] for the implementation details.
The saturation process is implemented as a modification of a given
clause algorithm.
We use non-perfect discrimination trees for the unification indexes,
priority queues for passive clauses, and a compressed vector index for
subsumption and subsumption resolution (both forward and backward).
The following redundancy eliminations are implemented: blocking non-proper
instantiations; dismatching constraints [GK04,Kor08]; global subsumption
[Kor08]; resolution-based simplifications and propositional-based
simplifications.
We implemented a compressed feature vector index
for efficient forward/backward subsumption and subsumption resolution.
Equality is dealt with (internally) by adding the necessary axioms of equality.
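The ground abstraction can be sketched as follows, assuming the standard Inst-Gen approach of mapping every variable to one distinguished constant (illustrative only, not iProver's data structures):

    # Sketch: abstract first-order clauses to ground clauses by replacing every
    # variable with one distinguished constant; the result is treated propositionally.
    BOTTOM = "c0"

    def abstract_term(t):
        if isinstance(t, str):
            return BOTTOM if t[0].isupper() else t      # TPTP convention: variables are capitalised
        f, args = t
        return (f, tuple(abstract_term(a) for a in args))

    def abstract_clause(clause):
        # literal = (sign, predicate, argument terms)
        return frozenset((sign, p, tuple(abstract_term(a) for a in args))
                         for sign, p, args in clause)

    # p(X, f(a)) abstracts to p(c0, f(a)); ~p(a, Y) abstracts to ~p(a, c0).
    print(abstract_clause([(True, "p", ["X", ("f", ["a"])])]))
    print(abstract_clause([(False, "p", ["a", "Y"])]))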
Major additions in the current version are:
Strategies
iProver has around 40 options to control the proof
search, including options for literal selection,
passive clause selection, the frequency of calling the SAT solver,
simplifications, and the combination of instantiation with resolution.
At CASC iProver will execute a small number of fixed schedules of selected
options depending on general syntactic properties such as Horn/non-Horn,
equational/non-equational, and maximal term depth.
Implementation
iProver is implemented in OCaml and for the ground reasoning uses MiniSat.
iProver accepts FOF and CNF formats, where either Vampire [RV02] or E prover
[Sch04] is used for clausification of FOF problems.
iProver is available from:
http://www.cs.man.ac.uk/~korovink/iprover/
Expected Competition Performance
iProver 0.8 is the CASC-J5 EPR division winner.
iProver(-SInE) 0.9
Konstantin Korovin
The University of Manchester, United Kingdom
Architecture
iProver is an automated theorem prover based on an instantiation calculus
Inst-Gen [GK03,Kor09] which is complete for first-order logic.
One of the distinctive features of iProver is a modular combination of
first-order reasoning with ground reasoning.
In particular, iProver currently integrates MiniSat [ES04] for reasoning
with ground abstractions of first-order clauses.
In addition to instantiation, iProver implements ordered resolution calculus
and a combination of instantiation and ordered resolution; see
[Kor08] for the implementation details.
The saturation process is implemented as a modification of a given
clause algorithm.
iProver uses non-perfect discrimination trees for the unification indexes,
priority queues for passive clauses, and a compressed vector index for
subsumption and subsumption resolution (both forward and backward).
The following redundancy eliminations are implemented: blocking non-proper
instantiations; dismatching constraints [GK04,Kor08]; global subsumption
[Kor08]; resolution-based simplifications and propositional-based
simplifications.
A compressed feature vector index is used
for efficient forward/backward subsumption and subsumption resolution.
Equality is dealt with (internally) by adding the necessary axioms of equality
with an option of using Brand's transformation.
In the LTB division, iProver-SInE uses axiom selection based
on the SInE algorithm [HV11] as implemented in Vampire [HKV10],
i.e., axiom selection is done by Vampire and
proof attempts are done by iProver.
Strategies
iProver has around 40 options to control the proof
search, including options for literal selection,
passive clause selection, the frequency of calling the SAT solver,
simplifications, and the combination of instantiation with resolution.
At CASC iProver will execute a small number of fixed schedules of selected
options depending on general syntactic properties such as Horn/non-Horn,
equational/non-equational, and maximal term depth.
Implementation
iProver is implemented in OCaml and for the ground reasoning uses MiniSat.
iProver accepts FOF and CNF formats,
where Vampire [HKV10] is used for clausification of FOF problems.
http://www.cs.man.ac.uk/~korovink/iprover/
Expected Competition Performance
iProver 0.9 is expected to perform slightly better than the previous version.
iProver-Eq(-SInE) 0.7
Christoph Sticksel, Konstantin Korovin
The University of Manchester, United Kingdom
Architecture
iProver-Eq [KS10] extends the iProver system [Kor08] with built-in
equational reasoning, along the lines of [GK04].
As in the iProver system, first-order reasoning is combined with ground
satisfiability checking where the latter is delegated to an off-the-shelf
ground solver.
Strategies
Proof search options in iProver-Eq control clause and literal
selection in the respective given clause algorithms.
Equally important is the global distribution of time between the inference
engines and the ground solver.
At CASC, iProver-Eq will execute a fixed schedule of selected options.
Implementation
iProver-Eq is implemented in OCaml and uses CVC3 [BT07] for the
ground reasoning in the equational case and MiniSat [ES04] in the
non-equational case.
iProver-Eq accepts FOF and CNF formats, where Vampire [HKV10] is used for
clausification of FOF problems.
iProver-Eq is available at
http://www.cs.man.ac.uk/~sticksec/iprover-eq
Expected Competition Performance
iProver-Eq has seen many optimisations since the versions entered in previous CASCs.
We expect reasonably good performance in all divisions, including the EPR
divisions where instantiation-based methods are particularly strong.
Isabelle/HOL (a.k.a. IsabelleP) 2011
Jasmin C. Blanchette1, Lawrence C. Paulson2,
Tobias Nipkow1, Makarius Wenzel1,
Stefan Berghofer1
1Technische Universität München, Germany
2University of Cambridge, United Kingdom
Architecture
Isabelle/HOL 2011 [NPW09] is the higher-order logic incarnation of the
generic proof assistant
Isabelle2011.
Isabelle/HOL provides several automatic proof tactics, notably an equational
reasoner [Nip89], a classical reasoner [PN94], a tableau prover [Pau99], and
a first-order resolution-based prover [Hur03].
Strategies
The IsabelleP tactic submitted to the competition simply tries the
following tactics sequentially:
Implementation
Isabelle is a generic theorem prover written in Standard ML. Its meta-logic,
Isabelle/Pure, provides an intuitionistic fragment of higher-order logic. The
HOL object logic extends Pure with a more elaborate version of higher-order
logic, complete with the familiar connectives and quantifiers. Other object
logics are available, notably FOL (first-order logic) and ZF
(Zermelo–Fraenkel set theory).
http://www.cl.cam.ac.uk/research/hvg/Isabelle
Expected Competition Performance
Results from last year would suggest that Isabelle will finish third
in the THF category, after Satallax and LEO-II. However, since last year,
we have added the sledgehammer proof methods, which we expect will
improve our chances.
leanCoP 2.2
Jens Otten
University of Potsdam, Germany
Architecture
leanCoP [OB03,Ott08] is an automated theorem prover for classical
first-order logic with equality.
It is a very compact implementation of the connection (tableau) calculus
[Bib87,LS01].
Strategies
The reduction rule of the connection calculus is applied before the extension
rule.
Open branches are selected in a depth-first way.
Iterative deepening on the proof depth is used to achieve completeness.
Additional inference rules and strategies include regularity, lemmata, and
restricted backtracking [Ott10].
leanCoP uses an optimized structure-preserving transformation into clausal
form [Ott10] and a fixed strategy scheduling.
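A minimal sketch of the iterative-deepening driver, written in Python for illustration although leanCoP itself is a few lines of Prolog:

    # Sketch: run the depth-first connection search under an increasing depth bound.
    def iterative_deepening(prove_with_depth, max_depth=50):
        for depth in range(1, max_depth + 1):
            result = prove_with_depth(depth)      # bounded depth-first connection search
            if result is not None:
                return result                     # compact connection proof
        return None

    # Dummy search that "finds" a proof at depth 3.
    print(iterative_deepening(lambda d: "proof" if d >= 3 else None))   # -> proof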
Implementation
leanCoP is implemented in Prolog (ECLiPSe, SICStus and SWI Prolog
are currently supported).
The source code of the core prover is only a few lines long and fits on
half a page.
Prolog's built-in indexing mechanism is used to quickly find connections.
http://www.leancop.de
Expected Competition Performance
As the core prover has not changed, we expect the performance of leanCoP 2.2
to be similar to the performance of leanCoP 2.1.
LEO-II 1.2
Christoph Benzmüller1,
Frank Theiss2
1Articulate Software, USA,
2Saarland University, Germany
Architecture
LEO-II [BP+08], the successor of LEO [BK98], is a higher-order ATP system
based on extensional higher-order resolution.
More precisely, LEO-II employs a refinement of extensional higher-order
RUE resolution [Ben99].
LEO-II is designed to cooperate with specialist systems for fragments of
higher-order logic.
By default, LEO-II cooperates with the first-order ATP system E [Sch02].
LEO-II is often too weak to find a refutation amongst the steadily growing
set of clauses on its own.
However, some of the clauses in LEO-II's search space attain a special
status: they are first-order clauses modulo the application of an
appropriate transformation function.
The default transformation is Hurd's fully typed translation [Hur03].
Therefore, LEO-II launches a cooperating first-order ATP system every n
iterations of its (standard) resolution proof search loop (e.g.,
n = 10).
If the first-order ATP system finds a refutation, it communicates its success
to LEO-II in the standard SZS format.
Communication between LEO-II and the cooperating first-order ATP system
uses the TPTP language and standards.
Strategies
LEO-II employs an adapted "Otter loop".
In contrast to its competitor systems (such as Satallax, TPS, and IsabelleP)
LEO-II so far only employs a monolithic search strategy, that is, it does not
use strategy scheduling to try different search strategies or flag settings.
However, LEO-II version 1.2 for the first time includes some very
naive relevance filtering and selectively applies some simple scheduling for
different relevance filters.
Implementation
LEO-II is implemented in Objective Caml version 3.10, and its problem
representation language is the new TPTP THF language [BRS08].
In fact, the development of LEO-II has largely paralleled the development
of the TPTP THF language and related infrastructure [SB10].
http://leoprover.org
Expected Competition Performance
LEO-II 1.2 is the CASC-J5 THF division winner.
LEO-II 1.2.8
Christoph Benzmüller1,
Frank Theiss2
1Freie Universität Berlin, Germany,
2Saarland University, Germany
Architecture
LEO-II [BP+08], the successor of LEO [BK98], is a higher-order ATP system
based on extensional higher-order resolution.
More precisely, LEO-II employs a refinement of extensional higher-order
RUE resolution [Ben99].
LEO-II is designed to cooperate with specialist systems for fragments of
higher-order logic.
By default, LEO-II cooperates with the first-order ATP system E [Sch02].
LEO-II is often too weak to find a refutation amongst the steadily growing
set of clauses on its own.
However, some of the clauses in LEO-II's search space attain a special
status: they are first-order clauses modulo the application of an
appropriate transformation function.
The default transformation is Hurd's fully typed translation [Hur03].
Therefore, LEO-II launches a cooperating first-order ATP system every n
iterations of its (standard) resolution proof search loop (e.g., 10).
If the first-order ATP system finds a refutation, it communicates its success
to LEO-II in the standard SZS format.
Communication between LEO-II and the cooperating first-order ATP system
uses the TPTP language and standards.
Strategies
LEO-II employs an adapted "Otter loop".
Moreover, LEO-II now also uses some very basic strategy scheduling to try
different search strategies or flag settings.
These search strategies also include some different relevance filters.
Implementation
LEO-II is implemented in Objective Caml version 3.12, and its problem
representation language is the TPTP THF language [BRS08].
In fact, the development of LEO-II has largely paralleled the development
of the TPTP THF language and related infrastructure [SB10].
http://leoprover.org
Expected Competition Performance
LEO-II has not improved much over the last year.
The main modifications concern proof output in order to enable proof
reconstruction/verification of LEO-II proofs in the Isabelle system.
I doubt that LEO-II will be able to defend its championship at this
year's CASC, since some of its competitor systems, such as Satallax,
have changed significantly over the last year.
However, it is great to have such strong dynamics in the THF category.
MELIA 0.1.2
Peter Baumgartner
NICTA and ANU, Australia
Architecture
MELIA is a theorem prover for the Model Evolution Calculus with Equality
and Linear Integer Arithmetic [BT11,BPT11].
It also integrates most of the theoretical developments of the Model
Evolution calculus, in particular superposition-like inference rules for
equality handling and built-in inference rules for linear integer arithmetic.
Strategies
MELIA features a variety of flag settings to control its search, e.g., for
selecting literals to focus inferences on.
In the competition, MELIA uses the same search strategy for all problems.
It includes heuristics to select the next literal to split on and to
select the next superposition inference.
Implementation
MELIA has been written (from scratch) in Scala, and runs on the Java virtual
machine.
Expected Competition Performance
MELIA participates in the TFA division only.
It is in a very early stage and has not been tuned for performance (or for
the competition).
It lacks term indexing techniques to make it more efficient.
When integer arithmetic is involved, MELIA is incomplete even where this is
(theoretically) unnecessary, and it is likely to miss proving some
theorems because of that.
Metis 2.3
Joe Hurd
Galois Inc., USA
Architecture
Metis 2.3 [Hur03] is a proof tactic used in the
HOL4 interactive theorem prover.
It works by converting a higher order logic goal to a set of clauses in
first order logic, with the property that a refutation of the clause set
can be translated to a higher order logic proof of the original goal.
Strategies
Metis 2.3 uses a fixed strategy for every input problem. Negative
literals are always chosen over positive literals, and terms are
ordered using the Knuth-Bendix ordering with uniform symbol weight and
precedence favouring reduced arity.
Implementation
Metis 2.3 is written in Standard ML, for ease of integration with
HOL4. It uses indexes for resolution, paramodulation, (forward)
subsumption and demodulation. It keeps the Active clause set
reduced with respect to all the unit equalities so far derived.
http://www.gilith.com/software/metis
Expected Competition Performance
There have been only minor changes to Metis 2.3 since CASC-J5, so
it is expected to perform at approximately the same level in CASC-23
and end up in the lower third of the table.
MetiTarski 1.8
Lawrence C. Paulson
University of Cambridge, United Kingdom
Architecture
MetiTarski [AP10,DA+09] is an automatic theorem prover based on a combination
of resolution and QEPCAD-B [Bro03], a decision procedure for the theory of
real closed fields.
It is designed to prove theorems involving real-valued special functions such
as log, exp, sin, cos, atan and sqrt.
In particular, it is designed to prove universally quantified inequalities
involving such functions.
Support for existentially quantified inequalities is very limited.
MetiTarski is a modified version of Joe Hurd's theorem prover, Metis [Hur03].
Strategies
MetiTarski employs resolution, augmented with axiom files that specify upper
and lower bounds of the special functions mentioned in the problem.
MetiTarski also has code to simplify polynomials and put them into canonical
form.
The resolution calculus is extended with a literal deletion rule: if the
decision procedure finds a literal to be inconsistent with its context
(which consists of known facts and the negation of the other literals in
the clause), then it is deleted.
From 2011, MetiTarski also implements case-splitting with backtracking.
MetiTarski is incomplete, and nothing can be inferred if it fails to prove
a conjecture.
Implementation
MetiTarski, like Metis, is implemented in Standard ML.
QEPCAD is implemented in C and C++.
The latest version of MetiTarski can be downloaded from
http://www.cl.cam.ac.uk/~lp15/papers/Arith/
Expected Competition Performance
No expectation provided.
Muscadet 4.1
Dominique Pastre
University Paris Descartes, France
Architecture
The Muscadet theorem prover is a knowledge-based system.
It is based on Natural Deduction, following the terminology of
[Ble71] and [Pas78],
and uses methods which resemble those used by humans.
It is composed of an inference engine, which interprets and executes rules,
and of one or several bases of facts,
which are the internal representation of "theorems to be proved".
Rules are either universal and put into the system, or built by the system
itself by metarules from data (definitions and lemmas).
Rules may add new hypotheses, modify the conclusion, create objects,
split theorems into two or more subtheorems
or build new rules which are local for a (sub-)theorem.
Strategies
There are specific strategies for existential, universal, conjunctive or
disjunctive hypotheses and conclusions, and equalities.
Functional symbols may be used, but an automatic creation of intermediate
objects allows deep subformulae to be flattened and treated as if the
concepts were defined by predicate symbols.
The successive steps of a proof may be forward deduction (deduce new hypotheses
from old ones), backward deduction (replace the conclusion by a new one),
refutation (only if the conclusion is a negation), search for objects
satisfying the conclusion or dynamic building of new rules.
Implementation
Muscadet [Pas01] is implemented in SWI-Prolog.
Rules are written as more or less declarative Prolog clauses.
Metarules are written as sets of Prolog clauses.
The inference engine includes the Prolog interpreter and some procedural Prolog
clauses.
A theorem may be split into several subtheorems, structured as a tree with "and" and "or" nodes.
All the proof search steps are recorded as facts, including all the elements
that will later be necessary to extract the useful steps
(the name of the executed action or applied rule, the new facts added or the
rule dynamically built, the antecedents, and a brief explanation).
http://www.math-info.univ-
Expected Competition Performance
The best performances of Muscadet will be for problems
manipulating many concepts in which all statements (conjectures,
definitions, axioms) are expressed in a manner similar to the
practice of humans, especially of mathematicians [Pas02,Pas07].
It will perform poorly on problems using few concepts but large
and deep formulas leading to many splittings.
Its best results will be in set theory, especially for functions and relations.
Its originality is that proofs are given in natural style.
Nitpick (a.k.a. IsabelleN) 2011
Jasmin C. Blanchette
Technische Universität München, Germany
Architecture
Nitpick [BN10] is an open source counterexample generator for Isabelle/HOL
[NPW09].
It builds on Kodkod [TJ07], a highly optimized first-order relational model
finder based on SAT.
The name Nitpick is appropriated from a now retired Alloy precursor.
Strategies
Nitpick employs Kodkod to find a finite model of the negated conjecture. The
translation from HOL to Kodkod's first-order relational logic (FORL) is
parameterized by the cardinalities of the atomic types occurring in it. Nitpick
enumerates the possible cardinalities for each atomic type, exploiting
monotonicity to prune the search space [BK11].
If a formula has a finite counterexample, the tool eventually finds it,
unless it runs out of resources.
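A minimal sketch of this scope-enumeration loop (illustrative only; Nitpick additionally exploits monotonicity to prune scopes, and Kodkod does the actual model finding):

    from itertools import product

    def find_counterexample(atomic_types, negated_conjecture_has_model, max_card=8):
        # Enumerate cardinality assignments (scopes) for the atomic types.
        for cards in product(range(1, max_card + 1), repeat=len(atomic_types)):
            scope = dict(zip(atomic_types, cards))
            model = negated_conjecture_has_model(scope)   # delegated to the model finder
            if model is not None:
                return scope, model                       # finite counterexample found
        return None

    # Dummy example: a "counterexample" appears once type 'a' has at least 3 elements.
    print(find_counterexample(["a"], lambda s: "model" if s["a"] >= 3 else None))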
Implementation
Nitpick, like most of Isabelle/HOL, is written in Standard ML. Unlike Isabelle
itself, which adheres to the LCF small-kernel discipline, Nitpick does not
certify its results and must be trusted.
http://www.cl.cam.ac.uk/research/hvg/Isabelle
Expected Competition Performance
Thanks to Kodkod's amazing power, we expect that Nitpick will beat both
Satallax and Refute with its hands tied behind its back in the TNT category.
Nitrox 0.2
Jasmin C. Blanchette1, Emina Torlak2
1Technische Universität München, Germany
2IBM Research, USA
Architecture
Nitrox is the first-order version of Nitpick [BN10], an open source
counterexample generator for Isabelle/HOL [NPW09].
It builds on Kodkod [TJ07], a highly optimized first-order relational
model finder based on SAT.
The name Nitrox is a portmanteau of Nitpick
and Paradox (clever, eh?).
Strategies
Nitrox employs Kodkod to find a finite model of the negated conjecture. It
performs a few transformations on the input, such as pushing quantifiers inside,
but 99% of the solving logic is in Kodkod and the underlying SAT solver.
Implementation
Nitrox, like most of Isabelle/HOL, is written in Standard ML. Unlike Isabelle
itself, which adheres to the LCF small-kernel discipline, Nitrox does not
certify its results and must be trusted. Kodkod is written in Java. MiniSat 1.14
is used as the SAT solver.
Expected Competition Performance
Since Nitpick was designed for HOL, it doesn't have any type inference
à la Paradox.
It also doesn't use the SAT solver incrementally, which penalizes it
a bit (but not as much as the missing type inference).
Kodkod itself is known to perform less well on FOF than Paradox, because
it is designed and optimized for a somewhat different logic, FORL.
On the other hand, Kodkod's symmetry breaking seems better calibrated
than Paradox's.
Hence, we expect Nitrox to end up in second place at best in the TNT category.
Otter 3.3
William McCune
Argonne National Laboratory, USA
Architecture
Otter 3.3 [McC03] is an ATP system for statements in first-order (unsorted)
logic with equality.
Otter is based on resolution and paramodulation applied to clauses.
An Otter search uses the "given clause algorithm", and typically involves
a large database of clauses; subsumption and demodulation play an important
role.
Strategies
Otter's original automatic mode, which reflects no tuning to the TPTP
problems, will be used.
Implementation
Otter is written in C.
Otter uses shared data structures for clauses and terms, and it uses
indexing for resolution, paramodulation, forward and backward subsumption,
forward and backward demodulation, and unit conflict.
Otter is available from:
http://www.cs.unm.edu/~mccune/otter/
Expected Competition Performance
Otter has been entered into CASC as a stable benchmark against which
progress can be judged (there have been only minor changes to Otter since
1996 [MW97], nothing that really affects its performance in CASC).
This is not an ordinary entry, and we do not hope for Otter to do well
in the competition.
Paradox 3.0
Koen Claessen, Niklas Sörensson
Chalmers University of Technology, Sweden
Architecture
Paradox [CS03] is a finite-domain model generator.
It is based on a MACE-style [McC03] flattening and instantiating of the
first-order clauses into propositional clauses, and then the use of
a SAT solver to solve the resulting problem.
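A minimal sketch of the MACE-style instantiation step (illustrative only; Paradox adds clause splitting, sort inference, symmetry reduction, and incremental SAT on top of this idea):

    from itertools import product

    def ground_instances(clause, variables, domain_size):
        # All ground instances of a (flattened) clause over domain {0,..,n-1}.
        for values in product(range(domain_size), repeat=len(variables)):
            env = dict(zip(variables, values))
            yield [(sign, pred, tuple(env.get(a, a) for a in args))
                   for sign, pred, args in clause]

    # p(X) | ~q(X, Y)  over a 2-element domain gives 4 propositional clauses,
    # each ground atom becoming one SAT variable.
    clause = [(True, "p", ("X",)), (False, "q", ("X", "Y"))]
    for inst in ground_instances(clause, ["X", "Y"], 2):
        print(inst)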
Strategies
There is only one strategy in Paradox:
In this way, Paradox can be used both as a model finder and as an EPR solver.
Implementation
The main part of Paradox is implemented in Haskell using the GHC compiler.
Paradox also has a built-in incremental SAT solver which is written in C++.
The two parts are linked together on the object level using Haskell's Foreign
Function Interface.
Expected Competition Performance
Paradox 3.0 is the CASC-J5 FNT division winner.
Refute (a.k.a. IsabelleM) 2011
Jasmin C. Blanchette1, Tjark Weber2
1Technische Universität München, Germany
2University of Cambridge, United Kingdom
Architecture
Refute [Web08] is an open source counterexample generator for Isabelle/HOL
[NPW09] based on a SAT solver, and Nitpick's [BN10] precursor.
Strategies
Refute employs a SAT solver to find a finite model of the negated conjecture.
The translation from HOL to propositional logic is parameterized by the
cardinalities of the atomic types occurring in the conjecture. Refute enumerates
the possible cardinalities for each atomic type. If a formula has a finite
counterexample, the tool eventually finds it, unless it runs out of resources.
Implementation
Refute, like most of Isabelle/HOL, is written in Standard ML. Unlike Isabelle
itself, which adheres to the LCF small-kernel discipline, Refute does not
certify its results and must be trusted.
http://www.cl.cam.ac.uk/research/hvg/Isabelle
Expected Competition Performance
We expect that Refute will solve about 75% of the problems solved by Nitpick in
the TNT category, and perhaps a few problems that Nitpick cannot solve.
Satallax 2.1
Chad E. Brown
Saarland University, Germany
Architecture
Satallax [Bro11] is an automated theorem prover for higher-order logic.
The particular form of higher-order logic supported by Satallax is Church's
simple type theory with extensionality and choice operators.
The SAT solver MiniSat [ES04] is responsible for much of the search for a
proof.
Strategies
There are a number of flags that control the order in which formulas and
instantiation terms are considered and propositional clauses are generated.
Other flags activate some optional extensions to the basic proof procedure.
A collection of flag settings is called a mode.
Approximately 250 modes have been tried so far.
Regardless of the mode, the search procedure is sound and complete for
higher-order logic with choice.
This implies that if search terminates with a particular mode, then we can
conclude that the original set of formulae is unsatisfiable or satisfiable.
Implementation
Satallax 2.1 is implemented in OCaml.
A foreign function interface is used to interact with MiniSat.
Satallax is available from
http://satallax.com
Expected Competition Performance
Satallax 1.4 proved to be competitive in the THF division of CASC last year,
coming in second out of four systems.
Since last year, Satallax has been reimplemented in OCaml (instead of Lisp)
and the integration with MiniSat has been improved.
In addition, a number of new heuristics and flags to control these heuristics
have been added.
Based on these improvements, Satallax is expected to perform well in the
THF division of CASC this year.
On the other hand, Satallax is typically weak on problems requiring equality
reasoning or nontrivial higher-order instantiations.
Since Satallax can sometimes be used to determine satisfiability of a set
of formulas, it will also compete in the TNT demonstration division.
SPASS+T 2.2.14
Uwe Waldmann1,
Stephan Zimmer2
1Max-Planck-Institut für Informatik, Germany,
2AbsInt GmbH, Germany
Architecture
SPASS+T is an extension of the superposition-based theorem prover SPASS
that integrates algebraic knowledge into SPASS in three complementary ways:
by passing derived formulas to an external SMT procedure
(currently Yices or CVC3), by adding standard axioms,
and by built-in arithmetic simplification and inference rules.
A first version of the system has been described in [PW06].
In the current version, a much more sophisticated coupling of
the SMT procedure has been added [Zim07].
Strategies
Standard axioms and built-in arithmetic simplification and inference rules
are integrated into the standard main loop of SPASS.
Inferences between standard axioms are excluded, so the
user-supplied formulas are taken as the set of support.
The external SMT procedure runs in parallel in a separate process,
leading occasionally to non-deterministic behaviour.
Implementation
SPASS+T is implemented in C. The system is available from
http://www.mpi-inf.mpg.de/~uwe/software/#TSPASS
Expected Competition Performance
SPASS+T came a close second in the TFA division of last CASC,
and it has improved noticeably since then.
SPASS-XDB 3.01X0.6
Geoff Sutcliffe1,
Martin Suda2,3
1University of Miami, USA,
2Max-Planck-Institut für Informatik and Saarland University, Germany,
3Charles University in Prague, Czech Republic
Architecture
SPASS-XDB [SS+09,SS+10] is an extended version of the well-known,
state-of-the-art, SPASS automated theorem proving system [WF+09].
The original SPASS reads a problem, consisting of axioms and a conjecture,
in TPTP format from a file, and searches for a proof by refutation of the
negated conjecture.
SPASS-XDB adds the capability of retrieving extra positive unit axioms
(facts) from external sources during the proof search (hence the "XDB",
standing for eXternal DataBases).
The axioms are retrieved asynchronously, on-demand, based on an expectation
that they will contribute to completing the proof.
The axioms are retrieved from a range of external sources, including SQL
databases, SPARQL endpoints, WWW services, computation sources (e.g.,
computer algebra systems), etc., using a TPTP standard protocol.
Strategies
Generally, SPASS-XDB follows SPASS' strategies.
However, SPASS, like most (all?) ATP systems, was designed under the
assumption that all formulae are in the problem file, i.e., it is ignorant
that external axioms might be delivered.
To regain completeness, constraints on SPASS’ search are relaxed in SPASS-XDB.
This increases the search space, so the constraints are relaxed
in a controlled, incremental fashion [SS+10].
The search space is also affected by the number of external axioms that can
be delivered, and mechanisms to control the delivery and focus the consequent
search are used [SS+10].
Implementation
SPASS-XDB, as an extension of SPASS, is written in C.
The internal arithmetic is done using the GMP multiple precision arithmetic
library.
Reals are converted to rationals for computation, but results are presented
in real format.
SPASS-XDB is available for use online in the SystemOnTPTP interface:
http://www.tptp.org/cgi-bin/SystemOnTPTP
Expected Competition Performance
SPASS-XDB should do well, particularly on problems that use uncommon
parts of the TPTP TFA syntax, e.g., the evaleq predicate.
TPS 3.110228S1a
Chad E. Brown1,
Peter B. Andrews2
1Saarland University, Germany,
2Carnegie Mellon University, USA
Architecture
TPS is a higher-order theorem proving system that has been developed over
several decades under the supervision of Peter B. Andrews with substantial
work by Eve Longini Cohen, Dale A. Miller, Frank Pfenning, Sunil Issar,
Carl Klapper, Dan Nesmith, Hongwei Xi, Matthew Bishop, Chad E. Brown,
Mark Kaminski, Rémy Chrétien, and Cris Perdue.
TPS can be used to prove theorems of Church's type theory automatically,
interactively, or semi-automatically [AB+96,AB06].
Strategies
Strategies used by TPS in the search process include:
Implementation
TPS has been developed as a research tool for developing, investigating, and
refining a variety of methods of searching for expansion proofs, and
variations of these methods.
Its behavior is controlled by hundreds of flags.
A set of flags, with values for them, is called a mode.
When searching for a proof during the competition, TPS tries each of 80
selected modes in turn for a specified amount of time.
If TPS succeeds in finding an expansion proof, it translates the expansion
proof to a natural deduction proof.
This final step ensures that TPS will not incorrectly report that a formula
has been proven.
http://gtps.math.cmu.edu/tps.html
Expected Competition Performance
TPS 3.080227G1d was the CASC-22 THF division winner.
The main difference between the older version and the newer version is that
more modes are included during the search and the final translation to
natural deduction has been enabled.
Vampire 1.8
Krystof Hoder, Andrei Voronkov
The University of Manchester, United Kingdom
Architecture
Vampire 1.8 is an automatic theorem prover for first-order
classical logic.
It consists of a shell and a kernel.
The kernel implements the calculi of ordered binary resolution and
superposition for handling equality.
The splitting rule in the kernel adds propositional parts to clauses, which are
manipulated using binary decision diagrams and a SAT solver.
A number of standard redundancy criteria and simplification techniques are
used for pruning the search space: subsumption, tautology deletion,
subsumption resolution and rewriting by ordered unit equalities.
The reduction ordering is the Knuth-Bendix Ordering.
Implementation
Vampire 1.8 is implemented in C++.
Strategies
The Vampire 1.8 kernel provides a fairly large number of options for
strategy selection. The most important ones are:
Expected Competition Performance
We expect Vampire 1.8 to slightly outperform last year's Vampire 0.6.
Waldmeister 710
Thomas Hillenbrand
Max-Planck-Institut für Informatik, Germany
Architecture
Waldmeister 710 [Hil03] is a system for unit equational deduction.
Its theoretical basis is unfailing completion in the sense of
[BDP89] with refinements towards ordered completion (cf. [AHL03]).
The system saturates the input axiomatization, distinguishing active facts,
which induce a rewrite relation, and passive facts, which are the one-step
conclusions of the active ones up to redundancy.
The saturation process is parameterized by a reduction ordering and a heuristic
assessment of passive facts [HJL99].
This year's version is the result of polishing and fixing a few things in
last year's.
Strategies
The approach taken to control the proof search is to choose the search
parameters – reduction ordering and heuristic assessment –
according to the algebraic structure given in the problem specification [HJL99].
This is based on the observation that proof tasks sharing major parts of
their axiomatization often behave similarly.
Implementation
The prover is coded in ANSI-C. It runs on Solaris, Linux,
MacOS X, and Windows/Cygwin. The central data structures are: perfect
discrimination trees for the active facts; group-wise compressions for
the passive ones; and sets of rewrite successors for the conjectures.
Visit the Waldmeister web pages at:
http://www.waldmeister.org
Expected Competition Performance
Waldmeister 710 is the CASC-J5 UEQ division winner.
Z3 2./20
Nikolaj Bjorner, Leonardo de Moura
Microsoft Research, USA
System description not supplied.