rEproducible sofTware peRformance analysIs in perfeCt Simplicity

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
License
- OSI Approved :: GNU Lesser General Public License v3 or later (LGPLv3+)
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

mETRICS - rEproducible sofTware peRformance analysIs in perfeCt Simplicity

PyPI - Python Version PyPI - Status Travis (.org) Sonar Quality Gate Sonar Coverage

Authors

Thibault Falque - Exakis Nelite
Romain Wallon - CRIL, Univ Artois & CNRS
Hugues Wattez - CRIL, Univ Artois & CNRS

Why Metrics?

When developing a SAT solver, one of the most important parts is to perform experiments so as to evaluate its performance. Most of the time, this process remains the same, so that everybody collects almost the same statistics about the solver execution. However, how many scripts are there to retrieve experimental data and draw scatter or cactus plots? Probably as many as researchers in the domain. Based on this observation, this repository provides Metrics, a Python library, aiming to unify and make easier the analysis of solver experiments. The ambition of Metrics is to provide a complete toolchain from the execution of the solver to the analysis of its performance. In particular, this library simplifies the retrieval of experimental data from many different inputs (including the solver’s output), and provides a nice interface for drawing commonly used plots, computing statistics about the execution of the solver, and effortlessly organizing them (e.g., in Jupyter notebooks). In the end, the main purpose of Metrics is to favor the sharing and reproducibility of experimental results and their analysis.

Installation

To execute Metrics on your computer, you first need to install Python on your computer (at least version 3.8).

As the metrics library is available on PyPI, you install it using pip.

pip install crillab-metrics

Note that, depending on your Python installation, you may need to use pip3 to install it, or to execute pip as a module, as follows.

python3 -m pip install crillab-metrics

Using mETRICS

To present how to use metrics, let us consider an example, based on the results of the SAT Race 2019, in which 51 solvers have been run on 400 instances. Each experiment (corresponding to the execution of a solver on a particular instance) has a timeout set to 5000 seconds and a memory limit set to 128GB.

from metrics.wallet.dataframe.builder import CampaignDataFrameBuilder
campaign_df = CampaignDataFrameBuilder(campaign).build_from_campaign()

Extracting Data with metrics-scalpel

Experimental data can be retrieved with metrics-scalpel. To do so, a YAML configuration file has to be given to the program to allow it to retrieve the required data. A sample configuration is given below.

name: SAT Race 2019
date: July 12th, 2019
setup:
    timeout: 5000
    memout: 128000
experiment-wares:
    - CCAnrSim default
    - ...
    - smallsat default
input-set:
    name: sat-race-2019
    type: hierarchy
    path-list:
    - /path/to/the/benchmarks/of/sat/race/2019/
source:
  path: /path/to/the/results/of/sat-2019.csv
data:
  mapping:
    input: benchmark
    experiment_ware:
    - solver
    - configuration
    cpu_time: solver time

The first elements of this configuration give informations about the campaign: name , date, timeout and memout.

Observe that the different solvers are listed in this file. This is quite a strong requirement (and we plan to automatically discover the solvers in future version of Metrics), but this approach has been designed to allow, when needed, to specify more informations about the solvers (such as their compilation date, their command line, etc.).

Regarding the input-set, note that it is considered as a hierarchy. Whenever this is the case, metrics-scalpel explore the file hierarchy rooted at the given directory to discover each file it contains. It is also possible to give directly the list of the file, or to give a path to a file that contains this list.

The last part, concerning the mapping, allow to retrieve from the CSV file (in this case) which columns corresponds to the data expected by Scalpel.

Now, from this configuration, we can now load the whole campaign corresponding to the SAT competition.

from metrics.scalpel import read_yaml
campaign = read_yaml("/path/to/configuration.yml")

Exploiting Data with metrics-wallet

Now that we have extracted relevant data from our campaign, we can start building figures. The first step consists in extracting a data-frame from the read campaign.

from metrics.wallet.dataframe.builder import CampaignDataFrameBuilder
campaign_df = CampaignDataFrameBuilder(campaign).build_from_campaign()

Cactus Plot

from metrics.wallet.figure.dynamic_figure import CactusPlotly
cactus = CactusPlotly(campaign_df)
cactus.get_figure()

Comparison of all competition solvers

subset = {
    'CaDiCaL default',
    'MapleLCMDistChronoBT-DL-v2.2 default',
    'MapleLCMDistChronoBT-DL-v2.1 default',
    'MapleLCMDiscChronoBT-DL-v3 default',
    'cmsatv56-walksat-chronobt default'
}
campaign_df_best = campaign_df.sub_data_frame('experiment_ware', subset)

cactus = CactusPlotly(campaign_df_best, show_marker=True, min_solved_inputs=200)
cactus.get_figure()

Comparison of best competition solvers

Table

Create VBS:

vbs1 = {
    'CaDiCaL default',
    'MapleLCMDistChronoBT-DL-v2.2 default'
}
vbs2 = {
    'CaDiCaL default',
    'MapleLCMDiscChronoBT-DL-v3 default'
}

campaign_df_best_plus_vbs = campaign_df_best\
    .add_vbew(vbs1, 'cpu_time', vbew_name='vbs1')\
    .add_vbew(vbs2, 'cpu_time', vbew_name='vbs2')

from metrics.wallet.figureure.static_figure import StatTable
stat = StatTable(campaign_df_best_plus_vbs)
stat.get_figure()

Scatter

stat = ScatterPlotly(campaign_df, 
    'CaDiCaL default', 'MapleLCMDistChronoBT-DL-v2.2 default')
stat.get_figure()

Comparison of best competition solvers

Box plot

from metrics.wallet.figure.dynamic_figure import BoxPlotly
box = BoxPlotly(campaign_df_best)
box.get_figure()

Comparison of best competition solvers

You can see the complete notebook example here

Citing mETRICS

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
License
- OSI Approved :: GNU Lesser General Public License v3 or later (LGPLv3+)
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

1.3.0

Jun 6, 2023

1.2.6

Sep 2, 2022

1.2.5

Jul 12, 2022

1.2.4

Jul 11, 2022

1.2.3

Jun 30, 2022

1.2.2

Jun 30, 2022

1.2.1

Jun 30, 2022

1.2.0

Jun 29, 2022

1.1.3

Apr 27, 2022

1.1.2

Jan 26, 2022

1.1.1

Jan 25, 2022

1.1

Dec 22, 2021

1.0.7

May 4, 2021

1.0.6

May 4, 2021

1.0.5

May 4, 2021

1.0.4

Apr 19, 2021

1.0.3

Apr 2, 2021

1.0.2

Mar 15, 2021

1.0.1

Dec 15, 2020

1.0.0

Dec 4, 2020

0.3.0

Sep 24, 2020

0.2.5

Sep 24, 2020

0.2.4

Sep 17, 2020

0.2.3

Sep 17, 2020

0.2.2

Sep 17, 2020

0.2.1

Sep 17, 2020

0.2.0

Sep 17, 2020

This version

0.1.1

Jul 3, 2020

0.1.0

May 25, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

crillab-metrics-0.1.1.tar.gz (3.7 MB view hashes)

Uploaded Jul 3, 2020 Source

Hashes for crillab-metrics-0.1.1.tar.gz

Hashes for crillab-metrics-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`909c1577e68a38991fd3d4baf1acc85475ee76921baac6f3319bb83d44f3c1b7`
MD5	`3155b07c6be9431011e000accc33dc9e`
BLAKE2b-256	`b5eae69beb1bc3424285dde9e7f989a307a86f0822ce86f9da0661a87a7b9517`