TSTP Solution File: LCL506+1 by Princess---230619
View Problem
- Process Solution
%------------------------------------------------------------------------------
% File : Princess---230619
% Problem : LCL506+1 : TPTP v8.1.2. Released v3.3.0.
% Transfm : none
% Format : tptp
% Command : princess -inputFormat=tptp +threads -portfolio=casc +printProof -timeoutSec=%d %s
% Computer : n026.cluster.edu
% Model : x86_64 x86_64
% CPU : Intel(R) Xeon(R) CPU E5-2620 v4 2.10GHz
% Memory : 8042.1875MB
% OS : Linux 3.10.0-693.el7.x86_64
% CPULimit : 300s
% WCLimit : 300s
% DateTime : Thu Aug 31 08:11:29 EDT 2023
% Result : Theorem 8.71s 1.97s
% Output : Proof 10.66s
% Verified :
% SZS Type : -
% Comments :
%------------------------------------------------------------------------------
%----WARNING: Could not form TPTP format derivation
%------------------------------------------------------------------------------
%----ORIGINAL SYSTEM OUTPUT
% 0.00/0.12 % Problem : LCL506+1 : TPTP v8.1.2. Released v3.3.0.
% 0.00/0.12 % Command : princess -inputFormat=tptp +threads -portfolio=casc +printProof -timeoutSec=%d %s
% 0.12/0.34 % Computer : n026.cluster.edu
% 0.12/0.34 % Model : x86_64 x86_64
% 0.12/0.34 % CPU : Intel(R) Xeon(R) CPU E5-2620 v4 @ 2.10GHz
% 0.12/0.34 % Memory : 8042.1875MB
% 0.12/0.34 % OS : Linux 3.10.0-693.el7.x86_64
% 0.12/0.34 % CPULimit : 300
% 0.12/0.34 % WCLimit : 300
% 0.12/0.34 % DateTime : Fri Aug 25 07:17:32 EDT 2023
% 0.12/0.34 % CPUTime :
% 0.19/0.62 ________ _____
% 0.19/0.62 ___ __ \_________(_)________________________________
% 0.19/0.62 __ /_/ /_ ___/_ /__ __ \ ___/ _ \_ ___/_ ___/
% 0.19/0.62 _ ____/_ / _ / _ / / / /__ / __/(__ )_(__ )
% 0.19/0.62 /_/ /_/ /_/ /_/ /_/\___/ \___//____/ /____/
% 0.19/0.62
% 0.19/0.62 A Theorem Prover for First-Order Logic modulo Linear Integer Arithmetic
% 0.19/0.62 (2023-06-19)
% 0.19/0.62
% 0.19/0.62 (c) Philipp Rümmer, 2009-2023
% 0.19/0.62 Contributors: Peter Backeman, Peter Baumgartner, Angelo Brillout, Zafer Esen,
% 0.19/0.62 Amanda Stjerna.
% 0.19/0.62 Free software under BSD-3-Clause.
% 0.19/0.62
% 0.19/0.62 For more information, visit http://www.philipp.ruemmer.org/princess.shtml
% 0.19/0.62
% 0.19/0.62 Loading /export/starexec/sandbox/benchmark/theBenchmark.p ...
% 0.19/0.63 Running up to 7 provers in parallel.
% 0.19/0.64 Prover 1: Options: +triggersInConjecture -genTotalityAxioms -tightFunctionScopes -clausifier=none -reverseFunctionalityPropagation -boolFunsAsPreds -triggerStrategy=maximal -realRatSaturationRounds=0 +ignoreQuantifiers -constructProofs=always -generateTriggers=all -randomSeed=-1571432423
% 0.19/0.64 Prover 0: Options: +triggersInConjecture +genTotalityAxioms +tightFunctionScopes -clausifier=simple -reverseFunctionalityPropagation -boolFunsAsPreds -triggerStrategy=allUni -realRatSaturationRounds=0 -ignoreQuantifiers -constructProofs=never -generateTriggers=all -randomSeed=1042961893
% 0.19/0.64 Prover 3: Options: +triggersInConjecture -genTotalityAxioms -tightFunctionScopes -clausifier=none -reverseFunctionalityPropagation -boolFunsAsPreds -triggerStrategy=maximal -realRatSaturationRounds=1 +ignoreQuantifiers -constructProofs=never -generateTriggers=all -randomSeed=1922548996
% 0.19/0.64 Prover 2: Options: +triggersInConjecture +genTotalityAxioms -tightFunctionScopes -clausifier=simple +reverseFunctionalityPropagation +boolFunsAsPreds -triggerStrategy=allMinimalAndEmpty -realRatSaturationRounds=1 -ignoreQuantifiers -constructProofs=never -generateTriggers=all -randomSeed=-1065072994
% 0.19/0.64 Prover 4: Options: +triggersInConjecture -genTotalityAxioms -tightFunctionScopes -clausifier=simple -reverseFunctionalityPropagation -boolFunsAsPreds -triggerStrategy=allUni -realRatSaturationRounds=0 +ignoreQuantifiers -constructProofs=always -generateTriggers=all -randomSeed=1868514696
% 0.19/0.64 Prover 6: Options: -triggersInConjecture -genTotalityAxioms +tightFunctionScopes -clausifier=none +reverseFunctionalityPropagation -boolFunsAsPreds -triggerStrategy=maximalOutermost -realRatSaturationRounds=0 -ignoreQuantifiers -constructProofs=never -generateTriggers=all -randomSeed=-1399714365
% 0.19/0.65 Prover 5: Options: +triggersInConjecture -genTotalityAxioms +tightFunctionScopes -clausifier=none +reverseFunctionalityPropagation +boolFunsAsPreds -triggerStrategy=allMaximal -realRatSaturationRounds=1 -ignoreQuantifiers -constructProofs=never -generateTriggers=complete -randomSeed=1259561288
% 3.28/1.19 Prover 4: Preprocessing ...
% 3.28/1.19 Prover 1: Preprocessing ...
% 3.28/1.22 Prover 0: Preprocessing ...
% 3.28/1.22 Prover 5: Preprocessing ...
% 3.28/1.23 Prover 2: Preprocessing ...
% 3.28/1.23 Prover 6: Preprocessing ...
% 3.28/1.23 Prover 3: Preprocessing ...
% 7.62/1.81 Prover 4: Constructing countermodel ...
% 7.62/1.81 Prover 5: Proving ...
% 7.62/1.82 Prover 6: Constructing countermodel ...
% 7.62/1.85 Prover 3: Constructing countermodel ...
% 8.03/1.85 Prover 1: Constructing countermodel ...
% 8.38/1.92 Prover 0: Proving ...
% 8.71/1.97 Prover 6: proved (1322ms)
% 8.71/1.97
% 8.71/1.97 % SZS status Theorem for /export/starexec/sandbox/benchmark/theBenchmark.p
% 8.71/1.97
% 8.71/1.97 Prover 3: proved (1328ms)
% 8.71/1.97
% 8.71/1.97 % SZS status Theorem for /export/starexec/sandbox/benchmark/theBenchmark.p
% 8.71/1.97
% 8.71/1.97 Prover 7: Options: +triggersInConjecture -genTotalityAxioms +tightFunctionScopes -clausifier=simple +reverseFunctionalityPropagation +boolFunsAsPreds -triggerStrategy=allUni -realRatSaturationRounds=1 +ignoreQuantifiers -constructProofs=always -generateTriggers=all -randomSeed=-236303470
% 8.71/1.97 Prover 5: proved (1327ms)
% 8.71/1.97
% 8.71/1.97 % SZS status Theorem for /export/starexec/sandbox/benchmark/theBenchmark.p
% 8.71/1.97
% 8.71/1.98 Prover 8: Options: +triggersInConjecture +genTotalityAxioms -tightFunctionScopes -clausifier=none -reverseFunctionalityPropagation -boolFunsAsPreds -triggerStrategy=maximal -realRatSaturationRounds=0 +ignoreQuantifiers -constructProofs=always -generateTriggers=all -randomSeed=-200781089
% 8.71/1.98 Prover 10: Options: +triggersInConjecture -genTotalityAxioms +tightFunctionScopes -clausifier=simple -reverseFunctionalityPropagation +boolFunsAsPreds -triggerStrategy=maximal -realRatSaturationRounds=1 +ignoreQuantifiers -constructProofs=always -generateTriggers=all -randomSeed=919308125
% 8.71/2.02 Prover 0: stopped
% 8.71/2.03 Prover 2: Proving ...
% 8.71/2.03 Prover 2: stopped
% 8.71/2.03 Prover 11: Options: +triggersInConjecture -genTotalityAxioms +tightFunctionScopes -clausifier=simple -reverseFunctionalityPropagation -boolFunsAsPreds -triggerStrategy=allUni -realRatSaturationRounds=1 +ignoreQuantifiers -constructProofs=always -generateTriggers=all -randomSeed=-1509710984
% 8.71/2.03 Prover 13: Options: +triggersInConjecture -genTotalityAxioms -tightFunctionScopes -clausifier=simple -reverseFunctionalityPropagation +boolFunsAsPreds -triggerStrategy=maximal -realRatSaturationRounds=0 +ignoreQuantifiers -constructProofs=always -generateTriggers=complete -randomSeed=1138197443
% 9.34/2.04 Prover 10: Preprocessing ...
% 9.34/2.04 Prover 1: Found proof (size 19)
% 9.34/2.04 Prover 1: proved (1407ms)
% 9.34/2.07 Prover 7: Preprocessing ...
% 9.34/2.07 Prover 4: stopped
% 9.34/2.08 Prover 8: Preprocessing ...
% 9.34/2.08 Prover 11: Preprocessing ...
% 9.34/2.09 Prover 10: stopped
% 9.34/2.09 Prover 13: Preprocessing ...
% 9.77/2.10 Prover 7: stopped
% 9.77/2.12 Prover 11: stopped
% 9.77/2.13 Prover 13: stopped
% 9.77/2.20 Prover 8: Warning: ignoring some quantifiers
% 9.77/2.21 Prover 8: Constructing countermodel ...
% 9.77/2.22 Prover 8: stopped
% 9.77/2.22
% 9.77/2.22 % SZS status Theorem for /export/starexec/sandbox/benchmark/theBenchmark.p
% 9.77/2.22
% 9.77/2.22 % SZS output start Proof for theBenchmark
% 9.77/2.23 Assumptions after simplification:
% 9.77/2.23 ---------------------------------
% 9.77/2.23
% 9.77/2.23 (and_1)
% 9.77/2.25 (and_1 & ! [v0: $i] : ! [v1: $i] : ! [v2: $i] : ! [v3: $i] : ( ~ (and(v0,
% 9.77/2.25 v1) = v2) | ~ (implies(v2, v0) = v3) | ~ $i(v1) | ~ $i(v0) |
% 9.77/2.25 is_a_theorem(v3) = 0)) | ( ~ and_1 & ? [v0: $i] : ? [v1: $i] : ? [v2:
% 9.77/2.25 $i] : ? [v3: $i] : ? [v4: int] : ( ~ (v4 = 0) & and(v0, v1) = v2 &
% 9.77/2.25 implies(v2, v0) = v3 & is_a_theorem(v3) = v4 & $i(v3) & $i(v2) & $i(v1) &
% 9.77/2.25 $i(v0)))
% 9.77/2.26
% 9.77/2.26 (hilbert_and_1)
% 9.77/2.26 ~ and_1
% 9.77/2.26
% 9.77/2.26 (kn2)
% 9.77/2.26 (kn2 & ! [v0: $i] : ! [v1: $i] : ! [v2: $i] : ! [v3: $i] : ( ~ (and(v0,
% 9.77/2.26 v1) = v2) | ~ (implies(v2, v0) = v3) | ~ $i(v1) | ~ $i(v0) |
% 9.77/2.26 is_a_theorem(v3) = 0)) | ( ~ kn2 & ? [v0: $i] : ? [v1: $i] : ? [v2: $i]
% 9.77/2.26 : ? [v3: $i] : ? [v4: int] : ( ~ (v4 = 0) & and(v0, v1) = v2 & implies(v2,
% 9.77/2.26 v0) = v3 & is_a_theorem(v3) = v4 & $i(v3) & $i(v2) & $i(v1) & $i(v0)))
% 10.51/2.26
% 10.51/2.26 (rosser_kn2)
% 10.51/2.26 kn2
% 10.51/2.26
% 10.51/2.26 (function-axioms)
% 10.66/2.26 ! [v0: $i] : ! [v1: $i] : ! [v2: $i] : ! [v3: $i] : (v1 = v0 | ~ (or(v3,
% 10.66/2.26 v2) = v1) | ~ (or(v3, v2) = v0)) & ! [v0: $i] : ! [v1: $i] : ! [v2:
% 10.66/2.26 $i] : ! [v3: $i] : (v1 = v0 | ~ (and(v3, v2) = v1) | ~ (and(v3, v2) =
% 10.66/2.26 v0)) & ! [v0: $i] : ! [v1: $i] : ! [v2: $i] : ! [v3: $i] : (v1 = v0 |
% 10.66/2.26 ~ (equiv(v3, v2) = v1) | ~ (equiv(v3, v2) = v0)) & ! [v0: $i] : ! [v1:
% 10.66/2.26 $i] : ! [v2: $i] : ! [v3: $i] : (v1 = v0 | ~ (implies(v3, v2) = v1) | ~
% 10.66/2.26 (implies(v3, v2) = v0)) & ! [v0: $i] : ! [v1: $i] : ! [v2: $i] : (v1 = v0
% 10.66/2.26 | ~ (not(v2) = v1) | ~ (not(v2) = v0)) & ! [v0: MultipleValueBool] : !
% 10.66/2.26 [v1: MultipleValueBool] : ! [v2: $i] : (v1 = v0 | ~ (is_a_theorem(v2) = v1)
% 10.66/2.26 | ~ (is_a_theorem(v2) = v0))
% 10.66/2.26
% 10.66/2.26 Further assumptions not needed in the proof:
% 10.66/2.26 --------------------------------------------
% 10.66/2.26 and_2, and_3, cn1, cn2, cn3, equivalence_1, equivalence_2, equivalence_3,
% 10.66/2.27 hilbert_op_equiv, hilbert_op_implies_and, hilbert_op_or, implies_1, implies_2,
% 10.66/2.27 implies_3, kn1, kn3, modus_ponens, modus_tollens, op_and, op_equiv,
% 10.66/2.27 op_implies_and, op_implies_or, op_or, or_1, or_2, or_3, r1, r2, r3, r4, r5,
% 10.66/2.27 rosser_kn1, rosser_kn3, rosser_modus_ponens, rosser_op_equiv,
% 10.66/2.27 rosser_op_implies_and, rosser_op_or, substitution_of_equivalents
% 10.66/2.27
% 10.66/2.27 Those formulas are unsatisfiable:
% 10.66/2.27 ---------------------------------
% 10.66/2.27
% 10.66/2.27 Begin of proof
% 10.66/2.27 |
% 10.66/2.27 | ALPHA: (function-axioms) implies:
% 10.66/2.27 | (1) ! [v0: MultipleValueBool] : ! [v1: MultipleValueBool] : ! [v2: $i] :
% 10.66/2.27 | (v1 = v0 | ~ (is_a_theorem(v2) = v1) | ~ (is_a_theorem(v2) = v0))
% 10.66/2.27 |
% 10.66/2.27 | BETA: splitting (kn2) gives:
% 10.66/2.27 |
% 10.66/2.27 | Case 1:
% 10.66/2.27 | |
% 10.66/2.27 | | (2) kn2 & ! [v0: $i] : ! [v1: $i] : ! [v2: $i] : ! [v3: $i] : ( ~
% 10.66/2.27 | | (and(v0, v1) = v2) | ~ (implies(v2, v0) = v3) | ~ $i(v1) | ~
% 10.66/2.27 | | $i(v0) | is_a_theorem(v3) = 0)
% 10.66/2.27 | |
% 10.66/2.27 | | ALPHA: (2) implies:
% 10.66/2.27 | | (3) ! [v0: $i] : ! [v1: $i] : ! [v2: $i] : ! [v3: $i] : ( ~ (and(v0,
% 10.66/2.27 | | v1) = v2) | ~ (implies(v2, v0) = v3) | ~ $i(v1) | ~ $i(v0) |
% 10.66/2.27 | | is_a_theorem(v3) = 0)
% 10.66/2.27 | |
% 10.66/2.27 | | BETA: splitting (and_1) gives:
% 10.66/2.27 | |
% 10.66/2.27 | | Case 1:
% 10.66/2.27 | | |
% 10.66/2.27 | | | (4) and_1 & ! [v0: $i] : ! [v1: $i] : ! [v2: $i] : ! [v3: $i] : ( ~
% 10.66/2.27 | | | (and(v0, v1) = v2) | ~ (implies(v2, v0) = v3) | ~ $i(v1) | ~
% 10.66/2.27 | | | $i(v0) | is_a_theorem(v3) = 0)
% 10.66/2.27 | | |
% 10.66/2.27 | | | ALPHA: (4) implies:
% 10.66/2.27 | | | (5) and_1
% 10.66/2.27 | | |
% 10.66/2.27 | | | PRED_UNIFY: (5), (hilbert_and_1) imply:
% 10.66/2.27 | | | (6) $false
% 10.66/2.27 | | |
% 10.66/2.27 | | | CLOSE: (6) is inconsistent.
% 10.66/2.27 | | |
% 10.66/2.27 | | Case 2:
% 10.66/2.27 | | |
% 10.66/2.28 | | | (7) ~ and_1 & ? [v0: $i] : ? [v1: $i] : ? [v2: $i] : ? [v3: $i] :
% 10.66/2.28 | | | ? [v4: int] : ( ~ (v4 = 0) & and(v0, v1) = v2 & implies(v2, v0) =
% 10.66/2.28 | | | v3 & is_a_theorem(v3) = v4 & $i(v3) & $i(v2) & $i(v1) & $i(v0))
% 10.66/2.28 | | |
% 10.66/2.28 | | | ALPHA: (7) implies:
% 10.66/2.28 | | | (8) ? [v0: $i] : ? [v1: $i] : ? [v2: $i] : ? [v3: $i] : ? [v4:
% 10.66/2.28 | | | int] : ( ~ (v4 = 0) & and(v0, v1) = v2 & implies(v2, v0) = v3 &
% 10.66/2.28 | | | is_a_theorem(v3) = v4 & $i(v3) & $i(v2) & $i(v1) & $i(v0))
% 10.66/2.28 | | |
% 10.66/2.28 | | | DELTA: instantiating (8) with fresh symbols all_37_0, all_37_1, all_37_2,
% 10.66/2.28 | | | all_37_3, all_37_4 gives:
% 10.66/2.28 | | | (9) ~ (all_37_0 = 0) & and(all_37_4, all_37_3) = all_37_2 &
% 10.66/2.28 | | | implies(all_37_2, all_37_4) = all_37_1 & is_a_theorem(all_37_1) =
% 10.66/2.28 | | | all_37_0 & $i(all_37_1) & $i(all_37_2) & $i(all_37_3) &
% 10.66/2.28 | | | $i(all_37_4)
% 10.66/2.28 | | |
% 10.66/2.28 | | | ALPHA: (9) implies:
% 10.66/2.28 | | | (10) ~ (all_37_0 = 0)
% 10.66/2.28 | | | (11) $i(all_37_4)
% 10.66/2.28 | | | (12) $i(all_37_3)
% 10.66/2.28 | | | (13) is_a_theorem(all_37_1) = all_37_0
% 10.66/2.28 | | | (14) implies(all_37_2, all_37_4) = all_37_1
% 10.66/2.28 | | | (15) and(all_37_4, all_37_3) = all_37_2
% 10.66/2.28 | | |
% 10.66/2.28 | | | GROUND_INST: instantiating (3) with all_37_4, all_37_3, all_37_2,
% 10.66/2.28 | | | all_37_1, simplifying with (11), (12), (14), (15) gives:
% 10.66/2.28 | | | (16) is_a_theorem(all_37_1) = 0
% 10.66/2.28 | | |
% 10.66/2.28 | | | GROUND_INST: instantiating (1) with all_37_0, 0, all_37_1, simplifying
% 10.66/2.28 | | | with (13), (16) gives:
% 10.66/2.28 | | | (17) all_37_0 = 0
% 10.66/2.28 | | |
% 10.66/2.28 | | | REDUCE: (10), (17) imply:
% 10.66/2.28 | | | (18) $false
% 10.66/2.28 | | |
% 10.66/2.28 | | | CLOSE: (18) is inconsistent.
% 10.66/2.28 | | |
% 10.66/2.28 | | End of split
% 10.66/2.28 | |
% 10.66/2.28 | Case 2:
% 10.66/2.28 | |
% 10.66/2.28 | | (19) ~ kn2 & ? [v0: $i] : ? [v1: $i] : ? [v2: $i] : ? [v3: $i] : ?
% 10.66/2.28 | | [v4: int] : ( ~ (v4 = 0) & and(v0, v1) = v2 & implies(v2, v0) = v3 &
% 10.66/2.28 | | is_a_theorem(v3) = v4 & $i(v3) & $i(v2) & $i(v1) & $i(v0))
% 10.66/2.28 | |
% 10.66/2.28 | | ALPHA: (19) implies:
% 10.66/2.28 | | (20) ~ kn2
% 10.66/2.28 | |
% 10.66/2.28 | | PRED_UNIFY: (20), (rosser_kn2) imply:
% 10.66/2.28 | | (21) $false
% 10.66/2.28 | |
% 10.66/2.28 | | CLOSE: (21) is inconsistent.
% 10.66/2.28 | |
% 10.66/2.28 | End of split
% 10.66/2.28 |
% 10.66/2.28 End of proof
% 10.66/2.28 % SZS output end Proof for theBenchmark
% 10.66/2.28
% 10.66/2.28 1666ms
%------------------------------------------------------------------------------