CompStats

CompStats implements an evaluation methodology for statistically analyzing competition results and competition

Project description

Collaborative competitions have gained popularity in the scientific and technological fields. These competitions involve defining tasks, selecting evaluation scores, and devising result verification methods. In the standard scenario, participants receive a training set and are expected to provide a solution for a held-out dataset kept by the organizers. An essential challenge for organizers arises when comparing algorithms’ performance, assessing multiple participants, and ranking them. Statistical tools are often used for this purpose; however, traditional statistical methods often fail to capture decisive differences between systems’ performance. CompStats implements an evaluation methodology for statistically analyzing competition results and competition. CompStats offers several advantages, including off-the-shelf comparisons with correction mechanisms and the inclusion of confidence intervals.

To illustrate the use of CompStats, the following snippets show an example. The instructions load the necessary libraries: the function that provides the problem (e.g., digits), four different classifiers, and, in the last line, the score used to measure performance and compare the algorithms.

>>> from sklearn.svm import LinearSVC
>>> from sklearn.naive_bayes import GaussianNB
>>> from sklearn.ensemble import HistGradientBoostingClassifier, RandomForestClassifier
>>> from sklearn.datasets import load_digits
>>> from sklearn.model_selection import train_test_split
>>> from sklearn.base import clone
>>> from CompStats.metrics import f1_score

The first step is to load the digits problem and split the dataset into training and validation sets. The second step is to estimate the parameters of a linear Support Vector Machine and predict the validation set’s classes. The predictions are stored in the variable hy.

>>> X, y = load_digits(return_X_y=True)
>>> _ = train_test_split(X, y, test_size=0.3)
>>> X_train, X_val, y_train, y_val = _
>>> m = LinearSVC().fit(X_train, y_train)
>>> hy = m.predict(X_val)

Once the predictions are available, it is time to measure the algorithm’s performance, as seen in the following code. It is essential to note that the API used in sklearn.metrics is followed; the difference is that the function returns an instance with different methods that can be used to estimate different performance statistics and compare algorithms.

>>> score = f1_score(y_val, hy, average='macro')
>>> score
<Perf(score_func=f1_score, statistic=0.9521, se=0.0097)>

The previous code shows the macro-F1 score and its standard error. The performance value and its standard error are stored in the attributes statistic and se, respectively.

>>> score.statistic, score.se
(0.9521479775366307, 0.009717884979482313)
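
The text does not say how the standard error is obtained. One common way to attach a standard error to a score such as macro-F1 is the bootstrap: resample the validation examples with replacement, recompute the score on each resample, and take the standard deviation of the resulting values. The sketch below illustrates that idea with the standard library only; macro_f1 and bootstrap_se are hypothetical helper names, not part of CompStats, and the procedure CompStats actually uses may differ.

```python
import random
import statistics


def macro_f1(y_true, y_pred):
    """Macro-averaged F1: unweighted mean of the per-class F1 scores."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        # A class with no true or predicted examples contributes 0.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


def bootstrap_se(y_true, y_pred, score=macro_f1, num_samples=500, seed=0):
    """Return (mean, standard deviation) of the score over bootstrap resamples.

    Hypothetical helper for illustration; not the CompStats implementation.
    """
    rng = random.Random(seed)
    n = len(y_true)
    values = []
    for _ in range(num_samples):
        # Resample example indices with replacement.
        idx = [rng.randrange(n) for _ in range(n)]
        values.append(score([y_true[i] for i in idx],
                            [y_pred[i] for i in idx]))
    return statistics.mean(values), statistics.stdev(values)


if __name__ == "__main__":
    # Toy labels, made up for this sketch (not the digits data).
    truth = [0, 0, 1, 1, 2, 2] * 20
    preds = [0, 0, 1, 2, 2, 2] * 20
    mean, se = bootstrap_se(truth, preds)
    print(f"macro-F1 = {macro_f1(truth, preds):.4f}, bootstrap se = {se:.4f}")
```

The standard deviation of the resampled scores plays the role of the se value reported above; more resamples give a more stable estimate at a higher cost.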

Continuing with the example, let us assume that one wants to test another classifier on the same problem, in this case a random forest, as shown in the following two lines. The second line predicts the validation set and adds the predictions to the analysis.

>>> ens = RandomForestClassifier().fit(X_train, y_train)
>>> score(ens.predict(X_val), name='Random Forest')
<Perf(score_func=f1_score)>
Statistic with its standard error (se)
statistic (se)
0.9720 (0.0076) <= Random Forest
0.9521 (0.0097) <= alg-1

Let us incorporate further predictions, now from a Naive Bayes classifier and a Histogram Gradient Boosting classifier, as seen below.

>>> nb = GaussianNB().fit(X_train, y_train)
>>> score(nb.predict(X_val), name='Naive Bayes')
>>> hist = HistGradientBoostingClassifier().fit(X_train, y_train)
>>> score(hist.predict(X_val), name='Hist. Grad. Boost. Tree')
<Perf(score_func=f1_score)>
Statistic with its standard error (se)
statistic (se)
0.9759 (0.0068) <= Hist. Grad. Boost. Tree
0.9720 (0.0076) <= Random Forest
0.9521 (0.0097) <= alg-1
0.8266 (0.0159) <= Naive Bayes

The performance, its confidence interval (5%), and a statistical comparison (5%) between the best-performing system and the rest of the algorithms are depicted in the following figure.

>>> score.plot()

https://github.com/INGEOTEC/CompStats/raw/docs/docs/source/digits_perf.png
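
To relate the 5% level to the numbers printed earlier, the following sketch builds a two-sided interval from a statistic and its standard error under a normal approximation. This is an assumption made for illustration: normal_ci is a hypothetical helper, and CompStats may compute its intervals differently (for instance, from bootstrap percentiles).

```python
from statistics import NormalDist


def normal_ci(statistic, se, alpha=0.05):
    """Two-sided (1 - alpha) interval: statistic +/- z * se.

    Hypothetical helper; assumes the estimator is approximately normal.
    """
    z = NormalDist().inv_cdf(1 - alpha / 2)  # about 1.96 for alpha = 0.05
    return statistic - z * se, statistic + z * se


if __name__ == "__main__":
    # Values of score.statistic and score.se shown in the example above.
    low, high = normal_ci(0.9521479775366307, 0.009717884979482313)
    print(f"[{low:.4f}, {high:.4f}]")
```

A smaller alpha widens the interval, so fewer pairs of systems have non-overlapping intervals.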

The final step is to compare the performance of the four classifiers, which can be done with the difference method, as seen next.

>>> diff = score.difference()
>>> diff
<Difference>
difference p-values  w.r.t Hist. Grad. Boost. Tree
0.0000 <= Naive Bayes
0.0100 <= alg-1
0.3240 <= Random Forest
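
One plausible way to obtain a p-value of this kind is a paired bootstrap: both systems are scored on the same resampled validation examples, and the p-value is the fraction of resamples in which the best system does not beat the other one. The sketch below is an illustration under that assumption, not the CompStats implementation; paired_bootstrap_p_value and accuracy are hypothetical helpers, and accuracy stands in for macro-F1 only to keep the code short.

```python
import random


def accuracy(y_true, y_pred):
    """Fraction of examples predicted correctly."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def paired_bootstrap_p_value(y_true, best_pred, other_pred,
                             score=accuracy, num_samples=1000, seed=0):
    """Fraction of resamples where the best system is not strictly better.

    Hypothetical helper for illustration only.
    """
    rng = random.Random(seed)
    n = len(y_true)
    not_better = 0
    for _ in range(num_samples):
        # The same resampled indices are used for both systems (paired).
        idx = [rng.randrange(n) for _ in range(n)]
        truth = [y_true[i] for i in idx]
        diff = (score(truth, [best_pred[i] for i in idx])
                - score(truth, [other_pred[i] for i in idx]))
        if diff <= 0:
            not_better += 1
    return not_better / num_samples


if __name__ == "__main__":
    # Toy data, made up for this sketch.
    truth = [0, 1] * 50
    best = list(truth)                              # perfect predictions
    other = [1 - t for t in truth[:30]] + truth[30:]  # 30 mistakes
    print(paired_bootstrap_p_value(truth, best, other))
```

Pairing the resamples matters: scoring each system on independent resamples would add noise from the data split to the comparison.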

The Difference class has a plot method that depicts the difference with respect to the best-performing system.

>>> diff.plot()

https://github.com/INGEOTEC/CompStats/raw/docs/docs/source/digits_difference.png
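
The introduction mentions correction mechanisms for multiple comparisons. As an illustration of what such a correction does, the sketch below applies Holm's step-down adjustment to the three p-values printed earlier. Holm's method is chosen here only as a familiar example; which corrections CompStats offers, and how they are invoked, is not stated in this text, and holm_adjust is a hypothetical helper.

```python
def holm_adjust(p_values):
    """Holm step-down adjusted p-values, returned in the input order.

    Hypothetical helper for illustration only.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, i in enumerate(order):
        # The k-th smallest p-value (0-based rank) is multiplied by m - rank.
        value = min(1.0, (m - rank) * p_values[i])
        # Enforce monotonicity of the adjusted values.
        running_max = max(running_max, value)
        adjusted[i] = running_max
    return adjusted


if __name__ == "__main__":
    names = ["Naive Bayes", "alg-1", "Random Forest"]
    p_values = [0.0000, 0.0100, 0.3240]  # values printed by diff above
    for name, p in zip(names, holm_adjust(p_values)):
        print(f"{p:.4f} <= {name}")
```

With these inputs, the adjustment doubles the second-smallest p-value and leaves the largest unchanged, so the conclusions at the 5% level stay the same.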

Project details

Release history

Version	Release date
0.1.15 (this version)	Jun 19, 2026
0.1.14	Jan 27, 2026
0.1.13	Apr 22, 2025
0.1.12	Mar 24, 2025
0.1.11	Feb 28, 2025
0.1.10	Feb 27, 2025
0.1.9	Feb 26, 2025
0.1.8	Feb 24, 2025
0.1.7	Feb 21, 2025
0.1.6	Feb 20, 2025
0.1.5	Feb 13, 2025
0.1.4	Feb 12, 2025
0.1.3	Jan 31, 2025
0.1.2	Oct 28, 2024
0.1.1	Oct 12, 2024
0.1.0	Aug 21, 2024
0.0.6	Feb 28, 2024
0.0.5	Feb 27, 2024
0.0.4	Feb 22, 2024
0.0.3	Feb 20, 2024
0.0.2	Feb 20, 2024
0.0.1	Feb 17, 2024

Download files

Source Distribution

compstats-0.1.15.tar.gz (37.6 kB)

Uploaded Jun 19, 2026 Source

Built Distribution

compstats-0.1.15-py3-none-any.whl (43.6 kB)

Uploaded Jun 19, 2026 Python 3

File details

Details for the file compstats-0.1.15.tar.gz.

File metadata

Download URL: compstats-0.1.15.tar.gz
Upload date: Jun 19, 2026
Size: 37.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for compstats-0.1.15.tar.gz
Algorithm	Hash digest
SHA256	`6cd68f5e8ff794ee48fc5e3a22724babc7ac586c6c2eae9a641ad811de320d68`
MD5	`248c610d816949d487409cf466a65d82`
BLAKE2b-256	`ac0fdd26d91d72f3ddeea8766a4ee22b7f6db6b67c8e03a0bab31a06841a03da`

File details

Details for the file compstats-0.1.15-py3-none-any.whl.

File metadata

Download URL: compstats-0.1.15-py3-none-any.whl
Upload date: Jun 19, 2026
Size: 43.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.23

File hashes

Hashes for compstats-0.1.15-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e9ec266631bc1df430e829db88948568f2373a547345792cbfa464efaacd465d`
MD5	`a7436e355136ef41c8f8dedc1e686d2e`
BLAKE2b-256	`c1b5a1236a5a6f1d48a62f56c16d59791ba9d5dba4ea4f03cf2dc3ad78d38e05`
