0.00/0.09 % Problem : theBenchmark.p : TPTP v0.0.0. Released v0.0.0. 0.00/0.09 % Command : run_vampire %s %d THM 0.08/0.29 % Computer : n014.cluster.edu 0.08/0.29 % Model : x86_64 x86_64 0.08/0.29 % CPU : Intel(R) Xeon(R) CPU E5-2620 v4 @ 2.10GHz 0.08/0.29 % Memory : 8042.1875MB 0.08/0.29 % OS : Linux 3.10.0-693.el7.x86_64 0.08/0.29 % CPULimit : 960 0.08/0.29 % WCLimit : 120 0.08/0.30 % DateTime : Wed Jul 30 05:49:49 EDT 2025 0.08/0.30 % CPUTime : 0.17/0.31 This is a TFF_ problem 0.17/0.31 Running first-order theorem proving 0.17/0.31 Running /export/starexec/sandbox2/solver/bin/vampire --mode casc -m 16384 --cores 7 -t 120 /export/starexec/sandbox2/benchmark/theBenchmark.p 0.17/0.39 % (8670)Running in auto input_syntax mode. Trying TPTP 0.17/0.39 % (8674)dis+10_3_slsqr=1,4:to=lpo:sil=128000:thi=strong:si=on:uwa=off:s2agt=20:slsqc=1:slsq=on:random_seed=1040885471:i=201:slsql=off:asg=cautious:rtra=on:gtg=all:ss=axioms:sgt=16_1199 on theBenchmark for (1199ds/201Mi) 0.17/0.39 % (8670)Running in auto input_syntax mode. Trying TPTP 0.17/0.39 % (8672)dis+1002_16:1_to=lpo:sil=64000:norm_ineq=on:sas=z3:si=on:gve=force:uwa=one_side_constant:random_seed=1036087918:i=12:doe=on:rtra=on:gtg=exists_top:ss=axioms_1199 on theBenchmark for (1199ds/12Mi) 0.17/0.39 % (8670)Running in auto input_syntax mode. Trying TPTP 0.17/0.39 % (8673)dis+1002_1_to=kbo:sil=128000:tgt=ground:sas=z3:si=on:spb=units:tha=off:random_seed=3516238366:i=307:kws=precedence:nm=0:rtra=on_1199 on theBenchmark for (1199ds/307Mi) 0.17/0.39 % (8670)Running in auto input_syntax mode. Trying TPTP 0.17/0.39 % (8678)lrs+10_1_tgt=ground:sas=z3:si=on:random_seed=17033430:i=33:rtra=on_1199 on theBenchmark for (1199ds/33Mi) 0.17/0.39 % (8670)Running in auto input_syntax mode. Trying TPTP 0.17/0.39 % (8677)lrs+10_1_to=lpo:sas=z3:si=on:tha=off:random_seed=893588942:i=46:rtra=on_1199 on theBenchmark for (1199ds/46Mi) 0.17/0.39 % (8670)Running in auto input_syntax mode. Trying TPTP 0.17/0.39 % (8676)dis+21_64_to=kbo:sil=128000:si=on:sp=weighted_frequency:uwa=alasca_can_abstract:random_seed=45989091:i=4:rtra=on_1199 on theBenchmark for (1199ds/4Mi) 0.17/0.39 % (8670)Running in auto input_syntax mode. Trying TPTP 0.17/0.39 % (8675)lrs+1002_4:1_to=lpo:sil=64000:si=on:br=off:random_seed=1081636721:s2a=on:i=7:rtra=on:inst=on_1199 on theBenchmark for (1199ds/7Mi) 0.17/0.39 % (8676)Instruction limit reached! 0.17/0.39 % (8676)------------------------------ 0.17/0.39 % (8676)Version: Vampire 5.0.0 (Release build, commit 3ce9b74f2 on 2025-07-14 12:22:21 +0200) 0.17/0.39 % (8676)Linked with Z3 4.14.0.0 3c47fd96cf5645d0c42b2c819d9e9a84380aa721 z3-4.8.4-9178-g3c47fd96c 0.17/0.39 % (8676)Termination reason: Instruction limit 0.17/0.39 % (8676)Termination phase: Saturation 0.17/0.39 0.17/0.39 % (8676)Time elapsed: 0.003 s 0.17/0.39 % (8676)Peak memory usage: 8 MB 0.17/0.39 % (8676)Instructions burned: 4 (million) 0.17/0.39 % (8675)Instruction limit reached! 0.17/0.39 % (8675)------------------------------ 0.17/0.39 % (8675)Version: Vampire 5.0.0 (Release build, commit 3ce9b74f2 on 2025-07-14 12:22:21 +0200) 0.17/0.39 % (8675)Linked with Z3 4.14.0.0 3c47fd96cf5645d0c42b2c819d9e9a84380aa721 z3-4.8.4-9178-g3c47fd96c 0.17/0.39 % (8675)Termination reason: Instruction limit 0.17/0.39 % (8675)Termination phase: Saturation 0.17/0.39 0.17/0.39 % (8675)Time elapsed: 0.004 s 0.17/0.39 % (8675)Peak memory usage: 8 MB 0.17/0.39 % (8675)Instructions burned: 8 (million) 0.17/0.41 % (8672)Instruction limit reached! 0.17/0.41 % (8672)------------------------------ 0.17/0.41 % (8672)Version: Vampire 5.0.0 (Release build, commit 3ce9b74f2 on 2025-07-14 12:22:21 +0200) 0.17/0.41 % (8672)Linked with Z3 4.14.0.0 3c47fd96cf5645d0c42b2c819d9e9a84380aa721 z3-4.8.4-9178-g3c47fd96c 0.17/0.41 % (8672)Termination reason: Instruction limit 0.17/0.41 % (8672)Termination phase: Saturation 0.17/0.41 0.17/0.41 % (8672)Time elapsed: 0.016 s 0.17/0.41 % (8672)Peak memory usage: 28 MB 0.17/0.41 % (8672)Instructions burned: 13 (million) 0.17/0.41 % (8677)First to succeed. 0.17/0.41 % (8677)Solution written to "/export/starexec/sandbox2/tmp/vampire-proof-8670" 0.17/0.41 % (8670)Running in auto input_syntax mode. Trying TPTP 0.17/0.41 % (8677)Refutation found. Thanks to Tanya! 0.17/0.41 % SZS status Theorem for theBenchmark 0.17/0.41 % SZS output start Proof for theBenchmark 0.17/0.41 tff(type_def_5, type, list: $tType). 0.17/0.41 tff(func_def_0, type, nil: list). 0.17/0.41 tff(func_def_1, type, cons: ($int * list) > list). 0.17/0.41 tff(func_def_2, type, head: list > $int). 0.17/0.41 tff(func_def_3, type, tail: list > list). 0.17/0.41 tff(func_def_4, type, length: list > $int). 0.17/0.41 tff(func_def_5, type, count: ($int * list) > $int). 0.17/0.41 tff(func_def_6, type, append: (list * list) > list). 0.17/0.41 tff(func_def_11, type, sK0: ($int * list) > $int). 0.17/0.41 tff(func_def_12, type, sK1: ($int * list) > list). 0.17/0.41 tff(func_def_13, type, sK2: (list * $int) > list). 0.17/0.41 tff(func_def_14, type, sK3: (list * $int) > $int). 0.17/0.41 tff(func_def_15, type, sK4: (list * $int) > $int). 0.17/0.41 tff(func_def_16, type, sK5: (list * $int) > list). 0.17/0.41 tff(pred_def_1, type, in: ($int * list) > $o). 0.17/0.41 tff(pred_def_2, type, inRange: ($int * list) > $o). 0.17/0.41 tff(f136,plain,( 0.17/0.41 $false), 0.17/0.41 inference(avatar_smt_refutation,[],[f92,f135])). 0.17/0.41 tff(f135,plain,( 0.17/0.41 ~spl6_1), 0.17/0.41 inference(avatar_contradiction_clause,[],[f134])). 0.17/0.41 tff(f134,plain,( 0.17/0.41 $false | ~spl6_1), 0.17/0.41 inference(evaluation,[],[f132])). 0.17/0.41 tff(f132,plain,( 0.17/0.41 ~$less(1,$sum(1,$sum(1,0))) | $less(4,$sum(1,$sum(1,$sum(1,$sum(1,0))))) | ~spl6_1), 0.17/0.41 inference(superposition,[],[f128,f91])). 0.17/0.41 tff(f91,plain,( 0.17/0.41 0 = length(nil) | ~spl6_1), 0.17/0.41 inference(avatar_component_clause,[],[f89])). 0.17/0.41 tff(f89,definition,( 0.17/0.41 spl6_1 <=> 0 = length(nil)), 0.17/0.41 introduced(definition,[new_symbols(naming,[spl6_1])],[avatar_definition])). 0.17/0.41 tff(f128,plain,( 0.17/0.41 ( ! [X0 : list] : ($less(4,$sum(1,$sum(1,$sum(1,$sum(1,length(X0)))))) | ~$less(1,$sum(1,$sum(1,length(X0))))) ) | ~spl6_1), 0.17/0.41 inference(superposition,[],[f124,f56])). 0.17/0.41 tff(f56,plain,( 0.17/0.41 ( ! [X0 : list,X1 : $int] : (length(cons(X1,X0)) = $sum(1,length(X0))) )), 0.17/0.41 inference(cnf_transformation,[],[f23])). 0.17/0.41 tff(f23,plain,( 0.17/0.41 ! [X0 : list,X1 : $int] : length(cons(X1,X0)) = $sum(1,length(X0))), 0.17/0.41 inference(rectify,[],[f3])). 0.17/0.41 tff(f3,axiom,( 0.17/0.41 ! [X3 : list,X4 : $int] : length(cons(X4,X3)) = $sum(1,length(X3))), 0.17/0.41 file('/export/starexec/sandbox2/benchmark/theBenchmark.p',unknown)). 0.17/0.41 tff(f124,plain,( 0.17/0.41 ( ! [X0 : list] : ($less(4,$sum(1,$sum(1,$sum(1,length(X0))))) | ~$less(1,$sum(1,length(X0)))) ) | ~spl6_1), 0.17/0.41 inference(superposition,[],[f120,f56])). 0.17/0.41 tff(f120,plain,( 0.17/0.41 ( ! [X0 : list] : ($less(4,$sum(1,$sum(1,length(X0)))) | ~$less(1,length(X0))) ) | ~spl6_1), 0.17/0.41 inference(evaluation,[],[f119])). 0.17/0.41 tff(f119,plain,( 0.17/0.41 ( ! [X0 : list] : ($less(4,$sum(1,$sum(1,length(X0)))) | ~$less(1,length(X0)) | ~$less(1,$sum(1,$sum(1,0)))) ) | ~spl6_1), 0.17/0.41 inference(forward_demodulation,[],[f117,f91])). 0.17/0.41 tff(f117,plain,( 0.17/0.41 ( ! [X0 : list] : (~$less(1,length(X0)) | ~$less(1,$sum(1,$sum(1,length(nil)))) | $less(4,$sum(1,$sum(1,length(X0))))) )), 0.17/0.41 inference(superposition,[],[f116,f58])). 0.17/0.41 tff(f58,plain,( 0.17/0.41 ( ! [X0 : list] : (append(nil,X0) = X0) )), 0.17/0.41 inference(cnf_transformation,[],[f31])). 0.17/0.41 tff(f31,plain,( 0.17/0.41 ! [X0 : list] : append(nil,X0) = X0), 0.17/0.41 inference(rectify,[],[f7])). 0.17/0.41 tff(f7,axiom,( 0.17/0.41 ! [X1 : list] : append(nil,X1) = X1), 0.17/0.41 file('/export/starexec/sandbox2/benchmark/theBenchmark.p',unknown)). 0.17/0.41 tff(f116,plain,( 0.17/0.41 ( ! [X2 : list,X1 : list] : ($less(4,$sum(1,$sum(1,length(append(X1,X2))))) | ~$less(1,$sum(1,$sum(1,length(X1)))) | ~$less(1,length(X2))) )), 0.17/0.41 inference(forward_demodulation,[],[f114,f56])). 0.17/0.41 tff(f114,plain,( 0.17/0.41 ( ! [X2 : list,X0 : $int,X1 : list] : ($less(4,$sum(1,$sum(1,length(append(X1,X2))))) | ~$less(1,$sum(1,length(cons(X0,X1)))) | ~$less(1,length(X2))) )), 0.17/0.41 inference(forward_demodulation,[],[f113,f56])). 0.17/0.41 tff(f113,plain,( 0.17/0.41 ( ! [X2 : list,X0 : $int,X1 : list] : ($less(4,$sum(1,length(cons(X0,append(X1,X2))))) | ~$less(1,length(X2)) | ~$less(1,$sum(1,length(cons(X0,X1))))) )), 0.17/0.41 inference(superposition,[],[f111,f57])). 0.17/0.41 tff(f57,plain,( 0.17/0.41 ( ! [X2 : list,X0 : list,X1 : $int] : (append(cons(X1,X0),X2) = cons(X1,append(X0,X2))) )), 0.17/0.41 inference(cnf_transformation,[],[f37])). 0.17/0.41 tff(f37,plain,( 0.17/0.41 ! [X0 : list,X1 : $int,X2 : list] : append(cons(X1,X0),X2) = cons(X1,append(X0,X2))), 0.17/0.41 inference(rectify,[],[f24])). 0.17/0.41 tff(f24,plain,( 0.17/0.41 ! [X1 : list,X0 : $int,X2 : list] : cons(X0,append(X1,X2)) = append(cons(X0,X1),X2)), 0.17/0.41 inference(rectify,[],[f14])). 0.17/0.41 tff(f14,axiom,( 0.17/0.41 ! [X6 : $int,X2 : list,X1 : list] : append(cons(X6,X2),X1) = cons(X6,append(X2,X1))), 0.17/0.41 file('/export/starexec/sandbox2/benchmark/theBenchmark.p',unknown)). 0.17/0.41 tff(f111,plain,( 0.17/0.41 ( ! [X2 : list,X1 : list] : ($less(4,$sum(1,length(append(X1,X2)))) | ~$less(1,$sum(1,length(X1))) | ~$less(1,length(X2))) )), 0.17/0.41 inference(forward_demodulation,[],[f110,f56])). 0.17/0.41 tff(f110,plain,( 0.17/0.41 ( ! [X2 : list,X0 : $int,X1 : list] : (~$less(1,length(X2)) | $less(4,$sum(1,length(append(X1,X2)))) | ~$less(1,length(cons(X0,X1)))) )), 0.17/0.41 inference(forward_demodulation,[],[f109,f56])). 0.17/0.41 tff(f109,plain,( 0.17/0.41 ( ! [X2 : list,X0 : $int,X1 : list] : ($less(4,length(cons(X0,append(X1,X2)))) | ~$less(1,length(X2)) | ~$less(1,length(cons(X0,X1)))) )), 0.17/0.41 inference(superposition,[],[f59,f57])). 0.17/0.41 tff(f59,plain,( 0.17/0.41 ( ! [X0 : list,X1 : list] : ($less(4,length(append(X1,X0))) | ~$less(1,length(X0)) | ~$less(1,length(X1))) )), 0.17/0.41 inference(cnf_transformation,[],[f38])). 0.17/0.41 tff(f38,plain,( 0.17/0.41 ! [X0 : list,X1 : list] : ($less(4,length(append(X1,X0))) | ~$less(1,length(X1)) | ~$less(1,length(X0)))), 0.17/0.41 inference(rectify,[],[f35])). 0.17/0.41 tff(f35,plain,( 0.17/0.41 ! [X1 : list,X0 : list] : ($less(4,length(append(X0,X1))) | ~$less(1,length(X0)) | ~$less(1,length(X1)))), 0.17/0.41 inference(flattening,[],[f34])). 0.17/0.41 tff(f34,plain,( 0.17/0.41 ! [X0 : list,X1 : list] : ($less(4,length(append(X0,X1))) | (~$less(1,length(X1)) | ~$less(1,length(X0))))), 0.17/0.41 inference(ennf_transformation,[],[f25])). 0.17/0.41 tff(f25,plain,( 0.17/0.41 ! [X0 : list,X1 : list] : (($less(1,length(X1)) & $less(1,length(X0))) => $less(4,length(append(X0,X1))))), 0.17/0.41 inference(rectify,[],[f18])). 0.17/0.41 tff(f18,plain,( 0.17/0.41 ! [X2 : list,X1 : list] : (($less(1,length(X1)) & $less(1,length(X2))) => $less(4,length(append(X2,X1))))), 0.17/0.41 inference(theory_normalization,[],[f16])). 0.17/0.41 tff(f16,negated_conjecture,( 0.17/0.41 ~~! [X2 : list,X1 : list] : (($greater(length(X1),1) & $greater(length(X2),1)) => $greater(length(append(X2,X1)),4))), 0.17/0.41 inference(negated_conjecture,[status(cth)],[f15])). 0.17/0.41 tff(f15,conjecture,( 0.17/0.41 ~! [X2 : list,X1 : list] : (($greater(length(X1),1) & $greater(length(X2),1)) => $greater(length(append(X2,X1)),4))), 0.17/0.41 file('/export/starexec/sandbox2/benchmark/theBenchmark.p',unknown)). 0.17/0.41 tff(f92,plain,( 0.17/0.41 spl6_1), 0.17/0.41 inference(avatar_split_clause,[],[f61,f89])). 0.17/0.41 tff(f61,plain,( 0.17/0.41 0 = length(nil)), 0.17/0.41 inference(cnf_transformation,[],[f4])). 0.17/0.41 tff(f4,axiom,( 0.17/0.41 0 = length(nil)), 0.17/0.41 file('/export/starexec/sandbox2/benchmark/theBenchmark.p',unknown)). 0.17/0.41 % SZS output end Proof for theBenchmark 0.17/0.41 % (8677)------------------------------ 0.17/0.41 % (8677)Version: Vampire 5.0.0 (Release build, commit 3ce9b74f2 on 2025-07-14 12:22:21 +0200) 0.17/0.41 % (8677)Linked with Z3 4.14.0.0 3c47fd96cf5645d0c42b2c819d9e9a84380aa721 z3-4.8.4-9178-g3c47fd96c 0.17/0.41 % (8677)Termination reason: Refutation 0.17/0.41 0.17/0.41 % (8677)Time elapsed: 0.018 s 0.17/0.41 % (8677)Peak memory usage: 28 MB 0.17/0.41 % (8677)Instructions burned: 15 (million) 0.17/0.41 % (8677)------------------------------ 0.17/0.41 % (8677)------------------------------ 0.17/0.41 % (8670)Success in time 0.093 s 0.17/0.41 EOF