200 6. Implementation
of answers (Def. 3.2.5), regularity (Def. 3.3.2), factorization (Section 3.3.3),
ground reduction steps (Section 3.3.4) and the combined connection calcu-
lus - model elimination calculus (Note 4.3.2) are straightforward, and hence
omitted.
Finally, I want to refer to the work of
[
Neugebauer and Petermann, 1995
]
where a language is proposed to specify inference rules (e.g. extension, reduc-
tion, factorization, equality handling etc.) for model elimination-based theo-
rem proving. By this, the translation process just explained can be described
in a more declarative way, which facilitates the construction of respective
provers.
6.2 Practical Experiments
Running practical experiments and comparing runtime results is a widely
used technique to evaluate the power of theorem proving systems. Our in-
vestigations are biased towards assessing the potential of theory reasoning
according to the linearizing completion approach. I will first describe the
general setup taken for all experiments, and then comment on the results.
As the theory reasoning prover we used PROTEIN as described above. In
order to see the relative advantages of theory reasoning vs. non-theory rea-
soning PROTEIN was run in the following calculi settings: model elimination
(ME, Def. 3.2.3), restart model elimination (RME, Def. 4.6.3 and Note 4.6.4),
partial theory model elimination (PTME-I, Def. 4.5.4) and partial restart the-
ory model elimination (PRTME-I, Def. 4.6.3). Further, to get an impression
of the difficulty of the investigated problems we also run SETHEO (version
3.2.5) and OTTER (version 3.0.4). All provers were run in the default mode.
PROTEIN in its default mode employs regularity (cf. Def. 3.3.2 for
PTME-I and Section 4.6.3 PRTME-I) and the ground version of factorization
(Section 3.3.3).
SETHEO is a highly developed model elimination prover, featuring in
its default mode subgoal reordering, purity deletion, anti-lemmas, folding-
up, regularity, tautology and subsumption constraints (in
[
Letz et al., 1994;
Christoph Goller and Schumann, 1994
]
some of these are described).
OTTER is a state-of-the-art resolution prover. Its numerous flags are set
automatically in the “autonomous mode” according to built-in heuristics. In
the experiments below OTTER chooses hyper resolution as the primary infer-
ence rule, possibly augmented by special inference rules for equality handling
such as paramodulation and demodulation.
The columns in Tables 6.2, 6.2 and 6.3 below are labeled with these provers
respectively. The entries are to be read as follows: the timing results are given
in seconds and are obtained on a SUN SPARCstation 20/712 (2 SuperSparc
processores, 70 Mhz, with 1 MB cache, Solaris 2.5.).
The entries “#Inf .” give the total number of inferences carried out in the
proof search of the model elimination based provers. Since both PROTEIN