Skip to main content

Easily benchmark Machine Learning models on selected tasks and datasets

Project description


PyPI version Generic badge

sotabencheval is a framework-agnostic library that contains a collection of deep learning benchmarks you can use to benchmark your models. It can be used in conjunction with the sotabench service to record results for models, so the community can compare model performance on different tasks, as well as a continuous integration style service for your repository to benchmark your models on each commit.

Benchmarks Supported

PRs welcome for further benchmarks!

Installation

Requires Python 3.6+.

pip install sotabencheval

Get Benching! 🏋️

You should read the full documentation here, which contains guidance on getting started and connecting to sotabench.

Integration is lightweight. For example, if you are evaluating an ImageNet model, you initialize an Evaluator object and (optionally) link to any linked paper:

from sotabencheval.image_classification import ImageNetEvaluator
evaluator = ImageNetEvaluator(
             model_name='FixResNeXt-101 32x48d',
             paper_arxiv_id='1906.06423')

Then for each batch of predictions your model makes on ImageNet, pass a dictionary of keys as image IDs and values as a np.ndarrays of logits to the evaluator.add method:

evaluator.add(output_dict=dict(zip(image_ids, batch_output)))

The evaluation logic just needs to be written in a sotabench.py file and sotabench will run it on each commit and record the results:

Contributing

All contributions welcome!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sotabencheval-0.0.38.tar.gz (35.9 kB view details)

Uploaded Source

Built Distribution

sotabencheval-0.0.38-py3-none-any.whl (55.4 kB view details)

Uploaded Python 3

File details

Details for the file sotabencheval-0.0.38.tar.gz.

File metadata

  • Download URL: sotabencheval-0.0.38.tar.gz
  • Upload date:
  • Size: 35.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/44.0.0.post20200106 requests-toolbelt/0.9.1 tqdm/4.40.0 CPython/3.7.6

File hashes

Hashes for sotabencheval-0.0.38.tar.gz
Algorithm Hash digest
SHA256 64d6b93ffb72feaef301748c2b35d987b23c7242ee96b9d38ec270447b3f789e
MD5 6381e87b6645156ab97089c0832da1d8
BLAKE2b-256 d169c18b2b925f8e1d8b1dfd9bac1c3338a7e06f2054003a470f1b161f5d4bb0

See more details on using hashes here.

File details

Details for the file sotabencheval-0.0.38-py3-none-any.whl.

File metadata

  • Download URL: sotabencheval-0.0.38-py3-none-any.whl
  • Upload date:
  • Size: 55.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/44.0.0.post20200106 requests-toolbelt/0.9.1 tqdm/4.40.0 CPython/3.7.6

File hashes

Hashes for sotabencheval-0.0.38-py3-none-any.whl
Algorithm Hash digest
SHA256 3608f722646b46663e01fdb1189d61e84ddcf6e60d10837b7b21dda8e2cea04a
MD5 e9b722d3ea70e41dbcfa26f9dcc66af3
BLAKE2b-256 cf9dde6c4f490932b4018a04897011e34e99a6cc000f81c9fc202933bdec6448

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page