Higher-order Theorems | Zipperpin 2.1 | Zipperpin 2.0 | Vampire 4.6 | Leo‑III 1.6 | Ehoh 2.7 | cvc5 1.0 | LEO‑II 1.7.0 |
---|---|---|---|---|---|---|---|
Solved/500 | 467/500 | 450/500 | 386/500 | 357/500 | 300/500 | 239/500 | 95/500 |
Av. WC Time | 9.37 | 11.30 | 4.65 | 11.80 | 15.24 | 7.62 | 6.69 |
Av. CPU Time | 65.52 | 71.90 | 34.90 | 34.00 | 15.20 | 7.57 | 6.65 |
Solutions | 467 93% | 450 90% | 386 77% | 357 71% | 300 60% | 239 47% | 94 18% |
μWCEfficiency | 476 | 428 | 566 | 142 | 410 | 344 | 144 |
μEfficiency | 358 | 327 | 371 | 79 | 410 | 346 | 144 |
SotAC | 0.35 | - | 0.24 | 0.20 | 0.17 | 0.10 | 0.03 |
Core Usage | 4.55 | 4.32 | 4.22 | 2.51 | 0.94 | 0.91 | 0.90 |
New Solved | 161/170 | 159/170 | 153/170 | 136/170 | 147/170 | 108/170 | 6/170 |

First-order Theorems | Vampire 4.6 | Vampire 4.5 | iProver 3.5 | CSE_E 1.3 | E 2.6 | GKC 0.7 | Zipperpin 2.1 | Etableau 0.67 | cvc5 1.0 | SATCoP 0.1 | Drodi 3.1.5 | Prover9 1109a | CSE 1.4 | CSE‑F 1.0 | Twee 2.4 | JavaRes 1.3.0 | RPx 1.0 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Solved/500 | 448/500 | 439/500 | 373/500 | 373/500 | 368/500 | 342/500 | 310/500 | 291/500 | 281/500 | 217/500 | 204/500 | 138/500 | 128/500 | 112/500 | 98/500 | 16/500 | 16/500 |
Av. WC Time | 6.52 | 3.44 | 12.15 | 13.43 | 13.50 | 7.27 | 10.51 | 4.82 | 21.99 | 7.09 | 11.64 | 13.63 | 23.56 | 37.86 | 11.16 | 35.88 | 19.01 |
Av. CPU Time | 32.80 | 18.67 | 84.28 | 13.47 | 13.44 | 52.57 | 68.00 | 26.49 | 21.92 | 53.08 | 87.19 | 13.49 | 23.66 | 37.94 | 86.91 | 45.93 | 20.07 |
Solutions | 448 89% | 439 87% | 373 74% | 373 74% | 368 73% | 342 68% | 310 62% | 291 58% | 280 56% | 217 43% | 204 40% | 138 27% | 128 25% | 112 22% | 98 19% | 16 3% | 0 0% |
μWCEfficiency | 706 | 702 | 219 | 407 | 402 | 460 | 338 | 410 | 233 | 305 | 236 | 112 | 68 | 66 | 84 | 10 | 10 |
μEfficiency | 546 | 498 | 101 | 399 | 403 | 304 | 254 | 313 | 232 | 221 | 149 | 125 | 67 | 63 | 43 | 7 | 10 |
SotAC | 0.44 | - | 0.32 | 0.32 | 0.31 | 0.28 | 0.24 | 0.22 | 0.23 | 0.16 | 0.13 | 0.09 | 0.07 | 0.06 | 0.07 | 0.00 | 0.01 |
Core Usage | 2.95 | 3.27 | 4.71 | 1.03 | 0.90 | 4.18 | 3.73 | 3.16 | 0.94 | 3.99 | 4.90 | 0.92 | 1.02 | 1.02 | 6.20 | 1.69 | 1.06 |
New Solved | 44/50 | 38/50 | 30/50 | 27/50 | 29/50 | 32/50 | 24/50 | 24/50 | 16/50 | 20/50 | 7/50 | 13/50 | 9/50 | 11/50 | 14/50 | 0/50 | 0/50 |

First-order Non-theorems | Vampire SAT‑4.5 | Vampire SAT‑4.6 | iProver SAT‑3.5 | cvc5 SAT‑1.0 | E FNT‑2.6 | Etableau 0.67 | RPx 1.0 |
---|---|---|---|---|---|---|---|
Solved/250 | 230/250 | 229/250 | 145/250 | 79/250 | 65/250 | 58/250 | 0/250 |
Av. WC Time | 14.47 | 14.85 | 8.87 | 19.36 | 4.38 | 3.08 | - |
Av. CPU Time | 48.40 | 47.12 | 58.00 | 19.30 | 4.31 | 19.44 | - |
Solutions | 230 92% | 229 91% | 145 58% | 79 31% | 65 26% | 58 23% | 0 0% |
μWCEfficiency | 534 | 536 | 192 | 129 | 189 | 171 | - |
μEfficiency | 408 | 422 | 93 | 129 | 189 | 143 | - |
SotAC | - | 0.53 | 0.26 | 0.11 | 0.07 | 0.06 | - |
Core Usage | 3.46 | 3.29 | 4.52 | 0.94 | 0.81 | 3.06 | - |
New Solved | 3/3 | 3/3 | 3/3 | 1/3 | 0/3 | 0/3 | 0/3 |

Unit Equality CNF | Twee 2.4 | E 2.5 | E 2.6 | Vampire 4.6 | Etableau 0.67 | iProver 3.5 | GKC 0.7 | Drodi 3.1.5 |
---|---|---|---|---|---|---|---|---|
Solved/250 | 227/250 | 197/250 | 195/250 | 157/250 | 152/250 | 138/250 | 122/250 | 118/250 |
Av. WC Time | 8.17 | 13.91 | 12.15 | 10.47 | 5.61 | 17.41 | 13.69 | 8.98 |
Av. CPU Time | 62.95 | 13.88 | 12.10 | 32.53 | 36.50 | 125.77 | 104.97 | 67.05 |
Solutions | 227 90% | 197 78% | 195 78% | 157 62% | 152 60% | 138 55% | 122 48% | 118 47% |
μWCEfficiency | 572 | 503 | 498 | 334 | 444 | 146 | 202 | 258 |
μEfficiency | 343 | 502 | 501 | 213 | 355 | 58 | 111 | 155 |
SotAC | 0.44 | - | 0.28 | 0.15 | 0.15 | 0.11 | 0.07 | 0.05 |
Core Usage | 5.06 | 0.93 | 0.87 | 2.92 | 3.39 | 5.22 | 5.95 | 5.15 |
New Solved | 29/42 | 25/42 | 24/42 | 16/42 | 16/42 | 18/42 | 14/42 | 14/42 |

SLedgeHammer Theorems | Zipperpin SLH‑2.1 | Ehoh SLH‑2.7 | Vampire SLH‑4.6 | Leo‑III 1.6 | cvc5 SLH‑1.0 |
---|---|---|---|---|---|
Solved/720 | 675/720 | 655/720 | 626/720 | 499/720 | 310/720 |
Av. WC Time | 2.67 | 2.09 | 3.48 | 5.88 | 2.11 |
Av. CPU Time | 2.44 | 2.03 | 3.45 | 17.83 | 2.01 |
Solutions | 675 93% | 655 90% | 626 86% | 499 69% | 310 43% |
μWCEfficiency | 574 | 742 | 496 | 42 | 326 |
μEfficiency | 550 | 742 | 493 | 127 | 324 |
SotAC | 0.27 | 0.25 | 0.23 | 0.15 | 0.05 |
Core Usage | 0.90 | 0.83 | 0.93 | 3.18 | 0.83 |
New Solved | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |

Large Theory Batch | Vampire LTB‑4.6 | iProver LTB‑3.5 | Zipperpin LTB‑2.1 | E LTB‑2.5 | E LTB‑2.6 | GKC LTB‑0.7 | Leo‑III LTB‑1.6 |
---|---|---|---|---|---|---|---|
Solved/10000 | 7891/10000 | 7013/9993 | 6952/10000 | 6921/10000 | 6833/10000 | 6545/10000 | 2481/10000 |
Av. WC Time | 21.89 | 19.74 | 9.57 | 24.95 | 15.75 | 26.40 | 69.65 |
Av. CPU Time | 309.51 | 18.51 | 0.17 | 13.73 | 10.45 | 15.64 | 170.62 |
Solutions | 7891 78% | 7013 70% | 6952 69% | 6921 69% | 6833 68% | 6545 65% | 2481 24% |
μWCEfficiency | 428 | 402 | 695 | 532 | 548 | 544 | 23 |
μEfficiency | 408 | 115 | 695 | 473 | 470 | 413 | 4 |
SotAC | 0.21 | 0.13 | 0.13 | 0.12 | 0.12 | 0.11 | 0.02 |
Core Usage | 4.38 | 6.43 | 1.01 | 2.16 | 2.30 | 3.23 | 6.10 |
New Solved | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |

THF without Equality | Zipperpin 2.1 | Zipperpin 2.0 | Vampire 4.6 | Leo‑III 1.6 | cvc5 1.0 | LEO‑II 1.7.0 | Ehoh 2.7 |
---|---|---|---|---|---|---|---|
Solved/100 | 88/100 | 83/100 | 70/100 | 61/100 | 40/100 | 36/100 | 30/100 |
Av. WC Time | 12.79 | 15.00 | 3.05 | 6.05 | 2.01 | 1.05 | 4.20 |
Av. CPU Time | 95.64 | 100.33 | 22.56 | 14.54 | 1.96 | 1.01 | 4.16 |
Solutions | 88 88% | 83 83% | 70 70% | 61 61% | 40 40% | 36 36% | 30 30% |
μWCEfficiency | 433 | 422 | 479 | 126 | 340 | 330 | 274 |
μEfficiency | 499 | 464 | 556 | 195 | 340 | 330 | 274 |
SotAC | 0.43 | - | 0.26 | 0.19 | 0.08 | 0.06 | 0.04 |
Core Usage | 4.06 | 3.76 | 2.97 | 2.01 | 0.84 | 0.88 | 0.90 |
New Solved | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |

THF with Equality | Zipperpin 2.1 | Zipperpin 2.0 | Vampire 4.6 | Leo‑III 1.6 | Ehoh 2.7 | cvc5 1.0 | LEO‑II 1.7.0 |
---|---|---|---|---|---|---|---|
Solved/400 | 379/400 | 367/400 | 316/400 | 296/400 | 270/400 | 199/400 | 59/400 |
Av. WC Time | 8.58 | 10.46 | 5.01 | 12.99 | 16.47 | 8.75 | 10.14 |
Av. CPU Time | 58.52 | 65.47 | 37.63 | 38.01 | 16.43 | 8.70 | 10.09 |
Solutions | 379 94% | 367 91% | 316 79% | 296 74% | 270 67% | 199 49% | 58 14% |
μWCEfficiency | 339 | 303 | 343 | 68 | 444 | 348 | 97 |
μEfficiency | 470 | 419 | 568 | 129 | 444 | 346 | 97 |
SotAC | 0.34 | - | 0.23 | 0.21 | 0.19 | 0.11 | 0.02 |
Core Usage | 4.66 | 4.44 | 4.49 | 2.62 | 0.94 | 0.93 | 0.92 |
New Solved | 161/170 | 159/170 | 153/170 | 136/170 | 147/170 | 108/170 | 6/170 |

FOF Theorems without Equality | Vampire 4.5 | Vampire 4.6 | iProver 3.5 | CSE_E 1.3 | E 2.6 | GKC 0.7 | Zipperpin 2.1 | SATCoP 0.1 | Etableau 0.67 | Drodi 3.1.5 | cvc5 1.0 | CSE 1.4 | CSE‑F 1.0 | Prover9 1109a | Twee 2.4 | JavaRes 1.3.0 | RPx 1.0 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Solved/100 | 93/100 | 92/100 | 90/100 | 85/100 | 83/100 | 80/100 | 73/100 | 69/100 | 68/100 | 64/100 | 56/100 | 44/100 | 39/100 | 25/100 | 18/100 | 12/100 | 9/100 |
Av. WC Time | 1.39 | 2.73 | 6.37 | 8.83 | 9.97 | 3.32 | 8.84 | 2.75 | 3.48 | 7.79 | 40.21 | 22.44 | 45.70 | 8.97 | 14.16 | 44.60 | 16.56 |
Av. CPU Time | 5.47 | 10.70 | 40.39 | 8.87 | 9.91 | 21.08 | 56.14 | 19.44 | 16.24 | 57.87 | 39.90 | 22.51 | 45.77 | 8.80 | 110.40 | 56.22 | 17.37 |
Solutions | 93 93% | 92 92% | 90 90% | 85 85% | 83 83% | 80 80% | 73 73% | 69 69% | 68 68% | 64 64% | 56 56% | 44 44% | 39 39% | 25 25% | 18 18% | 12 12% | 0 0% |
μWCEfficiency | 682 | 690 | 167 | 560 | 431 | 581 | 318 | 522 | 387 | 250 | 153 | 134 | 123 | 150 | 38 | 23 | 34 |
μEfficiency | 824 | 816 | 312 | 561 | 428 | 687 | 414 | 584 | 461 | 396 | 158 | 134 | 129 | 120 | 58 | 24 | 34 |
SotAC | - | 0.37 | 0.35 | 0.31 | 0.29 | 0.28 | 0.23 | 0.23 | 0.21 | 0.19 | 0.19 | 0.10 | 0.09 | 0.06 | 0.03 | 0.01 | 0.02 |
Core Usage | 2.34 | 2.16 | 3.99 | 1.04 | 0.90 | 2.36 | 3.23 | 2.26 | 2.77 | 4.55 | 0.93 | 1.03 | 1.03 | 0.90 | 6.40 | 1.38 | 1.02 |
New Solved | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |

FOF Theorems with Equality | Vampire 4.6 | Vampire 4.5 | CSE_E 1.3 | E 2.6 | iProver 3.5 | GKC 0.7 | Zipperpin 2.1 | cvc5 1.0 | Etableau 0.67 | SATCoP 0.1 | Drodi 3.1.5 | Prover9 1109a | CSE 1.4 | Twee 2.4 | CSE‑F 1.0 | JavaRes 1.3.0 | RPx 1.0 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Solved/400 | 356/400 | 346/400 | 288/400 | 285/400 | 283/400 | 262/400 | 237/400 | 225/400 | 223/400 | 148/400 | 140/400 | 113/400 | 84/400 | 80/400 | 73/400 | 4/400 | 7/400 |
Av. WC Time | 7.50 | 3.99 | 14.79 | 14.52 | 13.99 | 8.47 | 11.03 | 17.45 | 5.23 | 9.11 | 13.40 | 14.65 | 24.14 | 10.48 | 33.66 | 9.73 | 22.16 |
Av. CPU Time | 38.52 | 22.22 | 14.83 | 14.47 | 98.24 | 62.18 | 71.66 | 17.45 | 29.61 | 68.76 | 100.59 | 14.53 | 24.27 | 81.62 | 33.76 | 15.06 | 23.54 |
Solutions | 356 89% | 346 86% | 288 72% | 285 71% | 283 70% | 262 65% | 237 59% | 224 56% | 223 55% | 148 37% | 140 35% | 113 28% | 84 21% | 80 20% | 73 18% | 4 1% | 0 0% |
μWCEfficiency | 511 | 452 | 359 | 396 | 84 | 234 | 238 | 252 | 294 | 146 | 124 | 119 | 50 | 45 | 49 | 3 | 4 |
μEfficiency | 678 | 672 | 369 | 396 | 196 | 403 | 319 | 252 | 398 | 236 | 196 | 110 | 52 | 90 | 51 | 6 | 4 |
SotAC | 0.46 | - | 0.32 | 0.32 | 0.32 | 0.28 | 0.24 | 0.24 | 0.22 | 0.14 | 0.12 | 0.10 | 0.06 | 0.07 | 0.05 | 0.00 | 0.00 |
Core Usage | 3.15 | 3.52 | 1.03 | 0.90 | 4.94 | 4.74 | 3.89 | 0.94 | 3.27 | 4.79 | 5.05 | 0.93 | 1.02 | 6.16 | 1.02 | 2.63 | 1.12 |
New Solved | 44/50 | 38/50 | 27/50 | 29/50 | 30/50 | 32/50 | 24/50 | 16/50 | 24/50 | 20/50 | 7/50 | 13/50 | 9/50 | 14/50 | 11/50 | 0/50 | 0/50 |

FOF Non-theorems without Equality | Vampire SAT‑4.5 | Vampire SAT‑4.6 | E FNT‑2.6 | iProver SAT‑3.5 | Etableau 0.67 | cvc5 SAT‑1.0 | RPx 1.0 |
---|---|---|---|---|---|---|---|
Solved/75 | 69/75 | 68/75 | 20/75 | 19/75 | 17/75 | 8/75 | 0/75 |
Av. WC Time | 39.15 | 39.63 | 5.00 | 14.49 | 8.80 | 1.31 | - |
Av. CPU Time | 104.72 | 105.77 | 4.97 | 93.01 | 58.19 | 1.24 | - |
Solutions | 69 92% | 68 90% | 20 26% | 19 25% | 17 22% | 8 10% | 0 0% |
μWCEfficiency | 167 | 173 | 112 | 53 | 49 | 95 | - |
μEfficiency | 226 | 224 | 112 | 80 | 88 | 95 | - |
SotAC | - | 0.61 | 0.10 | 0.10 | 0.08 | 0.04 | - |
Core Usage | 2.89 | 2.80 | 0.92 | 3.76 | 5.48 | 0.77 | - |
New Solved | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |

FOF Non-theorems with Equality | Vampire SAT‑4.5 | Vampire SAT‑4.6 | iProver SAT‑3.5 | cvc5 SAT‑1.0 | E FNT‑2.6 | Etableau 0.67 | RPx 1.0 |
---|---|---|---|---|---|---|---|
Solved/175 | 161/175 | 161/175 | 126/175 | 71/175 | 45/175 | 41/175 | 0/175 |
Av. WC Time | 3.89 | 4.38 | 8.02 | 21.39 | 4.10 | 0.71 | - |
Av. CPU Time | 24.26 | 22.35 | 52.72 | 21.33 | 4.02 | 3.38 | - |
Solutions | 161 92% | 161 92% | 126 72% | 71 40% | 45 25% | 41 23% | 0 0% |
μWCEfficiency | 512 | 529 | 110 | 144 | 221 | 183 | - |
μEfficiency | 666 | 670 | 240 | 144 | 221 | 207 | - |
SotAC | - | 0.50 | 0.33 | 0.15 | 0.06 | 0.05 | - |
Core Usage | 3.71 | 3.49 | 4.63 | 0.96 | 0.76 | 2.06 | - |
New Solved | 3/3 | 3/3 | 3/3 | 1/3 | 0/3 | 0/3 | 0/3 |

LTB JinjaThreads Theorems | Vampire LTB‑4.6 | iProver LTB‑3.5 | Zipperpin LTB‑2.1 | E LTB‑2.5 | E LTB‑2.6 | GKC LTB‑0.7 | Leo‑III LTB‑1.6 |
---|---|---|---|---|---|---|---|
Solved/10000 | 7891/10000 | 7013/9993 | 6952/10000 | 6921/10000 | 6833/10000 | 6545/10000 | 2481/10000 |
Av. WC Time | 21.89 | 19.74 | 9.57 | 24.95 | 15.75 | 26.40 | 69.65 |
Av. CPU Time | 309.51 | 18.51 | 0.17 | 13.73 | 10.45 | 15.64 | 170.62 |
Solutions | 7891 78% | 7013 70% | 6952 69% | 6921 69% | 6833 68% | 6545 65% | 2481 24% |
μWCEfficiency | 428 | 402 | 695 | 532 | 548 | 544 | 23 |
μEfficiency | 408 | 115 | 695 | 473 | 470 | 413 | 4 |
SotAC | 0.21 | 0.13 | 0.13 | 0.12 | 0.12 | 0.11 | 0.02 |
Core Usage | 4.38 | 6.43 | 1.01 | 2.16 | 2.30 | 3.23 | 6.10 |
New Solved | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |