Skip to main content

DataEval provides a simple interface to characterize image data and its impact on model performance across classification and object-detection tasks

Project description

DataEval

About DataEval

DataEval focuses on characterizing image data and its impact on model performance across Classification and object-detection tasks.

Model-agnostic metrics that bound real-world performance

  • relevance/completeness/coverage
  • metafeatures (data complexity)

Model-specific metrics that guide model selection and training

  • dataset sufficiency
  • data/model complexity mismatch

Metrics for post-deployment monitoring of data with bounds on model performance to guide retraining

  • dataset-shift metrics
  • model performance bounds under covariate shift
  • guidance on sampling to assess model error and model retraining

Getting Started

Requirements

  • Python 3.9-3.11

Installing DataEval

You can install DataEval directly from pypi.org using the following command. The optional dependencies of DataEval are torch, tensorflow and all. Using torch enables Sufficiency metrics, and tensorflow enables OOD Detection.

pip install dataeval[all]

Installing DataEval in Conda/Mamba

DataEval can be installed in a Conda/Mamba environment using the provided environment.yaml file. As some dependencies are installed from the pytorch channel, the channel is specified in the below example.

micromamba create -f environment\environment.yaml -c pytorch

Installing DataEval from GitHub

To install DataEval from source locally on Ubuntu, you will need git-lfs to download larger, binary source files and poetry for project dependency management.

sudo apt-get install git-lfs
pip install poetry

Pull the source down and change to the DataEval project directory.

git clone https://github.com/aria-ml/dataeval.git
cd dataeval

Install DataEval with optional dependencies for development.

poetry install --all-extras --with dev

Now that DataEval is installed, you can run commands in the poetry virtual environment by prefixing shell commands with poetry run, or activate the virtual environment directly in the shell.

poetry shell

Documentation and Tutorials

For more ideas on getting started using DataEval in your workflow, additional information and tutorials are in our Sphinx documentation hosted on Read the Docs.

Attribution

This project uses code from the Alibi-Detect python library developed by SeldonIO. Additional documentation from the developers are also available here.

POCs

  • POC: Scott Swan @scott.swan
  • DPOC: Andrew Weng @aweng

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dataeval-0.73.0.tar.gz (105.5 kB view details)

Uploaded Source

Built Distribution

dataeval-0.73.0-py3-none-any.whl (141.3 kB view details)

Uploaded Python 3

File details

Details for the file dataeval-0.73.0.tar.gz.

File metadata

  • Download URL: dataeval-0.73.0.tar.gz
  • Upload date:
  • Size: 105.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for dataeval-0.73.0.tar.gz
Algorithm Hash digest
SHA256 3e0e961c537ecc9adc372d4f637249715500e466c33165a48e69f7163d9fcf9d
MD5 6a9d3387f3a3780b3d6b57989df854e5
BLAKE2b-256 be70a24179c99b35221d8330e612e85f01db3c8e68aae4c7fcef67d628fb9491

See more details on using hashes here.

Provenance

The following attestation bundles were made for dataeval-0.73.0.tar.gz:

Publisher: publish.yml on aria-ml/dataeval

Attestations:

File details

Details for the file dataeval-0.73.0-py3-none-any.whl.

File metadata

  • Download URL: dataeval-0.73.0-py3-none-any.whl
  • Upload date:
  • Size: 141.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for dataeval-0.73.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c4161c385699460b3a3c2ef0f1dad0c12887ec36bb39a97b6e3420de3be9b1ea
MD5 49e778358ff9b71ffd532c5026064b16
BLAKE2b-256 1fab511f24d3bb35971f9115a27af8b9c7bf5b569e92d02b85ea91d0d56d9e60

See more details on using hashes here.

Provenance

The following attestation bundles were made for dataeval-0.73.0-py3-none-any.whl:

Publisher: publish.yml on aria-ml/dataeval

Attestations:

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page