First-order Theorems | Vampire 2.6 | E‑MaLeS 1.1 | EP 1.6pre | Vampire 0.6 | EP 1.4pre | iProver 0.99 | LEO‑II 1.4.0 | iProver‑Eq 0.8 | E‑Darwin 1.5 | E‑KRHyper 1.3 | leanCoP 2.2 | Princess 120604 | STP 1.0 | SuperZenon 0.0.1 | Zenon 0.7.1 | Muscadet 4.2 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Solved/500 | 469/500 | 401/500 | 378/500 | 368/500 | 344/500 | 286/500 | 263/500 | 206/500 | 183/500 | 171/500 | 167/500 | 155/500 | 148/500 | 134/500 | 110/500 | 63/500 |
Av. CPU Time | 20.26 | 20.81 | 14.49 | 16.40 | 23.97 | 12.36 | 23.19 | 17.26 | 18.15 | 31.56 | 29.30 | 50.56 | 6.33 | 16.13 | 18.82 | 21.14 |
Solutions | 469/500 | 397/500 | 378/500 | 367/500 | 344/500 | 268/500 | 263/500 | 0/500 | 0/500 | 0/500 | 167/500 | 0/500 | 148/500 | 134/500 | 107/500 | 58/500 |
μEfficiency | 572 | 509 | 453 | 456 | 344 | 326 | 238 | 291 | 226 | 197 | 172 | 25 | 133 | 196 | 133 | 88 |
SOTAC | 0.19 | 0.14 | 0.13 | 0.14 | 0.13 | 0.12 | 0.10 | 0.09 | 0.09 | 0.10 | 0.09 | 0.11 | 0.17 | 0.12 | 0.09 | 0.11 |
New Solved | 90/97 | 87/97 | 73/97 | 29/97 | 66/97 | 42/97 | 21/97 | 30/97 | 18/97 | 21/97 | 23/97 | 19/97 | 29/97 | 2/97 | 14/97 | 15/97 |
First-order Non-theorems | iProver‑SAT 0.99 | Vampire‑SAT 2.6 | Paradox 3.0 | FIMO 0.3 | Nitrox 2012 | EP‑SAT 1.6pre | E‑KRHyper 1.3 | CVC4 0.0 | E‑Darwin 1.5 | iProver‑Eq 0.8 |
---|---|---|---|---|---|---|---|---|---|---|
Solved/200 | 160/200 | 124/200 | 123/200 | 123/200 | 96/200 | 82/200 | 65/200 | 63/200 | 52/200 | 42/200 |
Av. CPU Time | 57.22 | 17.58 | 4.40 | 18.42 | 28.42 | 2.92 | 4.71 | 34.42 | 12.17 | 19.53 |
Solutions | 159/200 | 124/200 | 123/200 | 123/200 | 96/200 | 82/200 | 0/200 | 0/200 | 52/200 | 0/200 |
μEfficiency | 153 | 376 | 507 | 314 | 35 | 312 | 223 | 162 | 196 | 100 |
SOTAC | 0.37 | 0.19 | 0.19 | 0.19 | 0.19 | 0.21 | 0.17 | 0.16 | 0.15 | 0.14 |
New Solved | 34/34 | 0/34 | 0/34 | 0/34 | 0/34 | 0/34 | 0/34 | 0/34 | 0/34 | 0/34 |
Mizar Theorems in Batch Mode | MaLARea 0.4 | Vampire‑TUR 2.6 | PS‑E 1.0 | EP‑LTB 1.6pre | iProver‑LTB 0.99 | iProver‑Eq‑LTB 0.8 | E‑KRHyper‑LTB 1.3 | leanCoP‑ARDE 2.2 |
---|---|---|---|---|---|---|---|---|
Solved/400 | 257/400 | 248/400 | 200/400 | 165/400 | 160/400 | 51/400 | 26/400 | 7/400 |
Av. CPU Time | 339.82 | 383.46 | 587.86 | 737.08 | 549.44 | 1755.40 | 612.09 | 7408.40 |
Av. WC Time | 62.26 | 64.52 | 80.00 | 93.26 | 100.00 | 313.73 | 615.24 | 2285.71 |
Solutions | 240/400 | 248/400 | 200/400 | 157/400 | 160/400 | 0/400 | 0/400 | 7/400 |
SOTAC | 0.31 | 0.30 | 0.24 | 0.21 | 0.22 | 0.16 | 0.15 | 0.86 |
Core Usage | 5.41 | 5.94 | 7.58 | 7.90 | 5.50 | 5.60 | 0.99 | 3.02 |
New Solved | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
Theorems without Equality | Vampire 2.6 | Vampire 0.6 | iProver 0.99 | E‑MaLeS 1.1 | EP 1.6pre | EP 1.4pre | iProver‑Eq 0.8 | LEO‑II 1.4.0 | leanCoP 2.2 | E‑Darwin 1.5 | E‑KRHyper 1.3 | Zenon 0.7.1 | SuperZenon 0.0.1 | Princess 120604 | STP 1.0 | Muscadet 4.2 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Solved/150 | 146/150 | 145/150 | 138/150 | 136/150 | 132/150 | 130/150 | 130/150 | 130/150 | 108/150 | 100/150 | 93/150 | 62/150 | 61/150 | 60/150 | 29/150 | 19/150 |
Av. CPU Time | 4.23 | 6.36 | 5.71 | 12.47 | 11.38 | 11.43 | 11.65 | 22.37 | 21.12 | 8.76 | 15.65 | 8.72 | 3.56 | 53.77 | 0.74 | 27.37 |
Solutions | 146/150 | 145/150 | 137/150 | 134/150 | 132/150 | 130/150 | 0/150 | 130/150 | 108/150 | 0/150 | 0/150 | 60/150 | 61/150 | 0/150 | 29/150 | 19/150 |
μEfficiency | 850 | 741 | 694 | 732 | 674 | 537 | 638 | 416 | 423 | 441 | 412 | 320 | 364 | 34 | 186 | 70 |
SOTAC | 0.10 | 0.10 | 0.09 | 0.09 | 0.09 | 0.09 | 0.09 | 0.09 | 0.08 | 0.08 | 0.09 | 0.08 | 0.11 | 0.08 | 0.09 | 0.09 |
New Solved | 21/21 | 21/21 | 21/21 | 20/21 | 16/21 | 16/21 | 21/21 | 16/21 | 21/21 | 15/21 | 4/21 | 11/21 | 1/21 | 13/21 | 21/21 | 7/21 |
Theorems with Equality | Vampire 2.6 | E‑MaLeS 1.1 | EP 1.6pre | Vampire 0.6 | EP 1.4pre | iProver 0.99 | LEO‑II 1.4.0 | STP 1.0 | Princess 120604 | E‑KRHyper 1.3 | leanCoP 2.2 | E‑Darwin 1.5 | SuperZenon 0.0.1 | iProver‑Eq 0.8 | Muscadet 4.2 | Zenon 0.7.1 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Solved/300 | 274/300 | 240/300 | 218/300 | 191/300 | 189/300 | 112/300 | 108/300 | 102/300 | 76/300 | 61/300 | 56/300 | 54/300 | 46/300 | 45/300 | 37/300 | 18/300 |
Av. CPU Time | 16.62 | 26.14 | 14.17 | 24.89 | 32.29 | 19.76 | 20.75 | 8.74 | 55.50 | 49.86 | 46.17 | 40.04 | 13.76 | 44.82 | 21.29 | 13.28 |
Solutions | 274/300 | 239/300 | 218/300 | 191/300 | 189/300 | 111/300 | 108/300 | 102/300 | 0/300 | 0/300 | 56/300 | 0/300 | 46/300 | 0/300 | 35/300 | 18/300 |
μEfficiency | 455 | 441 | 382 | 325 | 268 | 140 | 175 | 87 | 15 | 103 | 68 | 110 | 121 | 64 | 97 | 32 |
SOTAC | 0.23 | 0.17 | 0.17 | 0.18 | 0.16 | 0.13 | 0.12 | 0.17 | 0.13 | 0.11 | 0.11 | 0.10 | 0.15 | 0.11 | 0.12 | 0.09 |
New Solved | 69/76 | 67/76 | 57/76 | 8/76 | 50/76 | 21/76 | 5/76 | 8/76 | 6/76 | 17/76 | 2/76 | 3/76 | 1/76 | 9/76 | 8/76 | 3/76 |
Effectively Propositional Theorems | Vampire 2.6 | iProver 0.99 | Vampire 0.6 | iProver‑Eq 0.8 | Zenon 0.7.1 | E‑Darwin 1.5 | EP 1.6pre | SuperZenon 0.0.1 | E‑MaLeS 1.1 | EP 1.4pre | LEO‑II 1.4.0 | Princess 120604 | STP 1.0 | E‑KRHyper 1.3 | Muscadet 4.2 | leanCoP 2.2 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Solved/50 | 49/50 | 36/50 | 32/50 | 31/50 | 30/50 | 29/50 | 28/50 | 27/50 | 25/50 | 25/50 | 25/50 | 19/50 | 17/50 | 17/50 | 7/50 | 3/50 |
Av. CPU Time | 88.33 | 14.84 | 11.23 | 0.77 | 43.03 | 9.76 | 31.63 | 48.54 | 15.05 | 26.34 | 37.93 | 20.70 | 1.41 | 52.90 | 3.47 | 9.10 |
Solutions | 49/50 | 20/50 | 31/50 | 0/50 | 29/50 | 0/50 | 28/50 | 27/50 | 24/50 | 25/50 | 25/50 | 0/50 | 17/50 | 0/50 | 4/50 | 3/50 |
μEfficiency | 434 | 338 | 391 | 612 | 174 | 274 | 213 | 136 | 247 | 222 | 79 | 55 | 249 | 110 | 89 | 41 |
SOTAC | 0.18 | 0.20 | 0.10 | 0.10 | 0.10 | 0.10 | 0.09 | 0.10 | 0.09 | 0.09 | 0.09 | 0.10 | 0.33 | 0.10 | 0.08 | 0.07 |
New Solved | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |
Non-theorems without Equality | iProver‑SAT 0.99 | Paradox 3.0 | Nitrox 2012 | FIMO 0.3 | Vampire‑SAT 2.6 | CVC4 0.0 | iProver‑Eq 0.8 | EP‑SAT 1.6pre | E‑Darwin 1.5 | E‑KRHyper 1.3 |
---|---|---|---|---|---|---|---|---|---|---|
Solved/100 | 81/100 | 60/100 | 51/100 | 51/100 | 46/100 | 30/100 | 28/100 | 26/100 | 19/100 | 11/100 |
Av. CPU Time | 52.35 | 8.61 | 23.07 | 30.43 | 8.96 | 17.57 | 16.92 | 7.46 | 7.96 | 0.49 |
Solutions | 80/100 | 60/100 | 51/100 | 51/100 | 46/100 | 0/100 | 0/100 | 26/100 | 19/100 | 0/100 |
μEfficiency | 241 | 412 | 35 | 230 | 264 | 182 | 156 | 104 | 129 | 104 |
SOTAC | 0.51 | 0.19 | 0.18 | 0.17 | 0.20 | 0.15 | 0.14 | 0.27 | 0.13 | 0.13 |
New Solved | 34/34 | 0/34 | 0/34 | 0/34 | 0/34 | 0/34 | 0/34 | 0/34 | 0/34 | 0/34 |
Non-theorems with Equality | iProver‑SAT 0.99 | Vampire‑SAT 2.6 | FIMO 0.3 | Paradox 3.0 | EP‑SAT 1.6pre | E‑KRHyper 1.3 | Nitrox 2012 | E‑Darwin 1.5 | CVC4 0.0 | iProver‑Eq 0.8 |
---|---|---|---|---|---|---|---|---|---|---|
Solved/100 | 79/100 | 78/100 | 72/100 | 63/100 | 56/100 | 54/100 | 45/100 | 33/100 | 33/100 | 14/100 |
Av. CPU Time | 62.22 | 22.66 | 9.92 | 0.38 | 0.81 | 5.56 | 34.48 | 14.59 | 49.74 | 24.76 |
Solutions | 79/100 | 78/100 | 72/100 | 63/100 | 56/100 | 0/100 | 45/100 | 33/100 | 0/100 | 0/100 |
μEfficiency | 65 | 489 | 399 | 603 | 520 | 342 | 34 | 262 | 143 | 44 |
SOTAC | 0.23 | 0.18 | 0.20 | 0.18 | 0.19 | 0.18 | 0.20 | 0.16 | 0.16 | 0.13 |
New Solved | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 |