Software for sustainable and trustworthy reporting (STREP) in ML and AI
Project description
STREP - Sustainable and Trustworthy Reporting for ML and AI
Software for sustainable and trustworthy reporting in machine learning and artificial intelligence, as proposed in my PhD thesis and original research paper. With the publicly available Exploration tool, you can investigate evaluation databases - no code needs to run on your machine!
Note that this software is under active development. It reflects work in progress and is subject to change, so you might encounter delays, downtime, and differences from earlier publications. Check out the paper branch and dissertation branch for frozen repository states at the respective time of publication.
News and Release History
- 23/01/2026 - Now on PyPI, so you can finally run pip install strep (use strep[frontend] to include labeling and exploration tool)
- 22/09/2025 - Published my PhD thesis based on STREP
- 17/04/2025 - Some changes and lots of new figures, soon to be found in my PhD thesis
- 13/01/2025 - Many fixes, updated Papers with Code and EdgeAccUSB databases
- 02/10/2024 - Improved scaling code (x15 speed), updated MetaQuRe and AutoXPCR databases
- 11/09/2024 - Improved functionality and presented the work at ECML-PKDD '24
- 30/04/2024 - Paper published in Data Mining and Knowledge Discovery, alongside initial repository
Explore your own databases
Instead of exploring the databases assembled during my work, you can also investigate your own custom results by following these steps:
- Prepare your database as a pandas DataFrame (each row lists one model performance evaluation on some data set).
- Store it in a directory, optionally add some JSON meta information (check our databases for examples).
- Install all STREP functionality by running pip install strep[frontend] (Python 3.8 - 3.12).
- Run the following code snippet:
from strep.index_scale import load_database, scale_and_rate
from strep.elex.app import Visualization
fname = 'path/to/your/database.pkl'
# load database and meta information (if available)
database, meta = load_database(fname)
# index-scale and rate database
rated_database = scale_and_rate(database, meta)
# start the interactive exploration tool
app = Visualization(rated_database)
app.run_server()
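The first two steps above (building the DataFrame and storing it in a directory) can be sketched roughly as follows. This is a minimal illustration, not the schema STREP requires: the column names, values, and file name are hypothetical, and the optional JSON meta information is left out because its format is not described here (see the bundled databases for real examples).

```python
import tempfile
from pathlib import Path

import pandas as pd

# Hypothetical evaluation records: each row is one model evaluated on one data set.
# The column names are placeholders chosen for illustration only.
records = [
    {"model": "model_a", "dataset": "dataset_x", "accuracy": 0.91, "power_draw": 12.5},
    {"model": "model_b", "dataset": "dataset_x", "accuracy": 0.88, "power_draw": 7.1},
    {"model": "model_a", "dataset": "dataset_y", "accuracy": 0.79, "power_draw": 13.0},
]
database = pd.DataFrame.from_records(records)

# Store the database as a pickle file inside a directory.
# A temporary directory is used here; replace it with your own path.
out_dir = Path(tempfile.mkdtemp())
fname = out_dir / "database.pkl"
database.to_pickle(fname)

# Reading the file back confirms that it round-trips unchanged.
restored = pd.read_pickle(fname)
print(restored)
```

The resulting path (here `fname`) is what you would pass to `load_database` in the snippet above.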
Contributing
I firmly believe that sustainable and trustworthy reporting is a community effort. If you perform large-scale benchmark experiments, stress-test models, or have any other important evaluations to report, get in touch! I would love to showcase other resource-aware evaluation databases and highlight your work.
Currently available databases:
- ImageNetEff22 (Fischer et al. 2022): Efficiency information of popular ImageNet models
- EdgeAccUSB (Staay et al. 2024): Efficiency results of stress-tested USB accelerators for edge inference with computer vision models
- XPCR / Forecasting (Fischer et al. 2024): Efficiency information of DNNs for time series forecasting tasks
- MetaQuRe (Fischer et al. 2024): Resource and quality information of ML algorithm performance on tabular data
- RobustBench (Croce et al. 2020): Robustness and quality information of image classification models
- Papers With Code: The most popular benchmarks from this public database (code for re-assembling can be found here)
Citing
If you appreciate our work and code, please cite my PhD thesis and original research paper:
Fischer, R. Advancing the Sustainability of Machine Learning and Artificial Intelligence via Labeling and Meta-Learning.
Ph.D. Dissertation, TU Dortmund University (2025). https://doi.org/10.17877/DE290R-25716
Fischer, R., Liebig, T. & Morik, K. Towards More Sustainable and Trustworthy Reporting in Machine Learning. Data Mining and Knowledge Discovery 38, 1909–1928 (2024). https://doi.org/10.1007/s10618-024-01020-3
You can also use these BibTeX entries:
@phdthesis{fischer_diss,
title={Advancing the Sustainability of Machine Learning and Artificial Intelligence via Labeling and Meta-Learning},
author={Fischer, Raphael},
school={TU Dortmund University},
url={http://doi.org/10.17877/DE290R-25716},
doi={10.17877/DE290R-25716},
year={2025}
}
@article{fischer_dami,
title = {Towards More Sustainable and Trustworthy Reporting in Machine Learning},
volume = {38},
issn = {1573-756X},
url = {https://doi.org/10.1007/s10618-024-01020-3},
doi = {10.1007/s10618-024-01020-3},
number = {4},
journal = {Data Mining and Knowledge Discovery},
author = {Fischer, Raphael and Liebig, Thomas and Morik, Katharina},
year = {2024},
pages = {1909--1928},
}
Repository Structure
- databases contains different gathered evaluation databases of ML reports, including scripts to assemble some of them.
- strep contains software that processes the databases, calculates index values and compound scores, and visualizes them.
- materials contains some additional data, scripts, and figures used in papers and my thesis.
- The top-level scripts are used to deploy the exploration tool on Render, alongside a main script for running it locally.
Terms of Use
Copyright (c) 2025 Raphael Fischer
Download files
File details
Details for the file strep-0.0.2.tar.gz.
File metadata
- Download URL: strep-0.0.2.tar.gz
- Upload date:
- Size: 685.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 650f0d2f2b9f2fa96043ecb879028871d240af01f52ff6d211b24228e3c32e96 |
| MD5 | 3502438c47d318cfc463c74f3d49aea4 |
| BLAKE2b-256 | d1d92a4e46db282df35a384a98fb3a8627f5981415cfe267ce2bd1a8d3592815 |
File details
Details for the file strep-0.0.2-py3-none-any.whl.
File metadata
- Download URL: strep-0.0.2-py3-none-any.whl
- Upload date:
- Size: 685.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | c16aa4549b3f399c7063c4e9d33a1a7f77f3a070765391f84b58abbba3b37237 |
| MD5 | b6987a15f1bdf6c88c2d382939df6a39 |
| BLAKE2b-256 | 56560397e1282759af64a83687e8f1a5c46118dbeb759ec9e419073d451f536f |
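If you download one of the files manually, you may want to compare it against the SHA256 digests listed above. The following is a minimal sketch using only the Python standard library. The helper names and the local file path are assumptions for illustration, not part of STREP.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """Compare a file's SHA256 digest with an expected hex string."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Digest of the source distribution as listed in the file details above.
EXPECTED_SDIST_SHA256 = "650f0d2f2b9f2fa96043ecb879028871d240af01f52ff6d211b24228e3c32e96"

if __name__ == "__main__":
    # Hypothetical location of the downloaded file; adjust to your setup.
    local_file = Path("strep-0.0.2.tar.gz")
    if local_file.exists():
        print("digest matches:", matches_expected(local_file, EXPECTED_SDIST_SHA256))
    else:
        print("file not found:", local_file)
```

Reading in chunks keeps memory use constant regardless of file size; the same pattern works for the wheel file with its own digest.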