University of CincinnatiCRILINTELLRI

Results of the SAT 2005 competition



Download

You can download the industrial and crafted benhcmarks from our site.
Concerning the industrial benchmarks, you need to obtain Miroslav Velev and IBM benchmarks directly from their site (Miroslav Velev benchmarks are too big and IBM benchmarks are only available for researchers from academia).

You can also download the raw results of the first stage or the raw results of the second stage.

First stage

In the first stage, the timeout was 20 minutes per benchmarks.

Benchmarks view


Just click on the name of the benchmarks to see the series submitted.
Clicking on a series will show/hide the summary of the results for each benchmark of the series. Clicking on a benchmark name will show the detailed results per solver.

Solvers view

You can grad the 2 pages description of the solvers by clicking on the (PDF) link near the name of the solver. (www) links are appearing on the site, to allow you to download the source code of the solvers. We do prefer to let the aauthors manager the way they distribute their work to the community.

Figures


The following figures were presented during the SAT 2005 conference.

(click on a figure to enlarge)

Benchmarks size


The following diagrams (called "box-plots") do represent the size (in number of literals occuring in the benchmarks) of the series of benchmarks in each category.
While this is not very useful for the random category (since the size was always the same in a given category, they are interesting in the crafted and industrial categories.

Size of the benchmarks in the random category Size of the benchmarks in the crafted categorySize of the benchmarks in the industrial category

Behavior of the solvers


Those curves are used to show the behavior of the solvers when the timeout is increasing.
Robust solvers should be able to solve more benchmarks (in abscisse) when the timeout (in ordinate) is growing.

All the solvers on all the benchmarks All the solvers on random benchmarksAll the solvers on crafted benchmarksAll the solvers on industrial benchmarks

Clustering of solvers


The solvers are grouped together if they behave similarly on a given set of benchmarks.
The figures also show for each cluster of solver the union and the intersection of the benchmarks solved.

Cluistering of all solversClustering of solvers on random benchmarks Clustering of solvers on industrial benchmarksClustering of the solvers on industrial benchmarks

Second stage

In the second stage, the timeout was 200 minutes per benchmarks in the industrial category, and 100 minutes per benchmarks in the random and crafted categories.

Benchmarks view

Note that only the solvers launched under the second stage conditions do appear.
We plan to run most of the competitors under the second stage conditions.

Solvers view

Note that here only the solvers selected for the second stage in a category are appearing, and that most solvers donot appear in all categories.

Some figures

Behavior of the solvers after the extended timeout.

The curves related to the timeout are very important, especially in the industrial category because the timeout in the second stage was one order of magnitude bigger than in the first stage.
The timeout of the first stage is shown as a dashed line. One can note that in many cases, the order of the solvers (by considering the number of benchmarks solved) does change when the timeout increases.

All the solvers on random benchmarks (2nd stage)All the solvers on crafted benchmarks (2nd stage)All the solvers on industrial benchmarks (2nd stage)