Provides utilities for the training and evaluation of machine learning algorithms

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

mrapp

These details have not been verified by PyPI

Project links

Project description

MLRL-Testbed

🔗 Important links: Documentation | Issue Tracker | Changelog | License

This software package provides mlrl-testbed - a command line utility for running machine learning experiments. It implements a straightforward, easily configurable, and extensible workflow for conducting experiments, including steps such as (but not restricted to) the following:

loading a dataset
splitting it into training and test sets
training one or several models
evaluating the models' performance
saving experimental results to output files

MLRL-Testbed

On its own, this package is not very powerful. It is intended as a basis for other packages that build functionality upon it. In fact, it does not make any assumptions about the problem domain or type of machine learning algorithm that should be used in an experiment. Instead, implementations of domain- or algorithm-specific functionality are provided by the extensions discussed below.

Tabular Machine Learning

The package mlrl-testbed-sklearn adds support for tabular machine learning problems by making use of the scikit-learn framework. It can easily be installed via the following command (and will pull mlrl-testbed as a dependency):

pip install mlrl-testbed-sklearn

Optionally, support for the Slurm Workload Manager can be installed via the package mlrl-testbed-slurm.

💡 Example

By writing just a small amount of code, any scikit-learn compatible estimator can be integrated with MLRL-Testbed and used in experiments. For example, the following code integrates scikit-learn's RandomForestClassifier:

from argparse import Namespace
from mlrl.testbed_sklearn.runnables import SkLearnRunnable
from mlrl.util.cli import Argument, IntArgument
from sklearn.ensemble import RandomForestClassifier
from sklearn.base import ClassifierMixin, RegressorMixin
from typing import Optional, Set


class Runnable(SkLearnRunnable):

    N_ESTIMATORS = IntArgument(
        '--n-estimators',
        description='The number of trees in the forest',
        default=100,
    )

    def get_algorithmic_arguments(self, known_args: Namespace) -> Set[Argument]:
        return { self.N_ESTIMATORS }

    def create_classifier(self, args: Namespace) -> Optional[ClassifierMixin]:
        return RandomForestClassifier()

    def create_regressor(self, args: Namespace) -> Optional[RegressorMixin]:
        return None  # Not needed in this case

The previously integrated algorithm can then be used in experiments controlled via a command line API. Assuming that the source code shown above is saved to a file named custom_runnable.py in the working directory, we are now capable of fitting a RandomForestClassifier to a dataset by using the command below.

mlrl-testbed custom_runnable.py \
    --data-dir path/to/datasets/ \
    --dataset dataset-name \
    --n-estimators 50

The above command does not only train a model, but also evaluates it according to common measures and prints the evaluation results. It does also demonstrate how algorithmic parameters can be controlled via command line arguments.

It is also possible to run multiple experiments at once by defining the datasets and algorithmic parameters to be used in the different runs in a YAML file:

mlrl-testbed custom_runnable.py --mode batch --config path/to/config.yaml

An exemplary YAML file is shown below. Each combination of the specified parameter values is applied to each dataset defined in the file.

datasets:
  - directory: path/to/datasets/
    names:
      - first-dataset
      - second-dataset
parameters:
  - name: --n-estimators
    values:
      - 50
      - 100

🏁 Advantages

Making use of MLRL-Testbed does not only help with the burdens of training and evaluating machine learning models, it can also help making your own methods and algorithms more accessible to users. This is demonstrated by the rule learning algorithms mlrl-boomer and mlrl-seco that can easily be run via the command line API described above and even extend it with rule-specific functionalities.

🔧 Functionalities

The package mlrl-testbed-sklearn provides a command line API that allows configuring and running machine learning algorithms. It allows to apply machine learning algorithms to different datasets and can evaluate their predictive performance in terms of commonly used measures. In detail, it supports the following functionalities:

Single- and multi-output datasets in the Mulan and MEKA format are supported (with the help of the package mlrl-testbed-arff).
Datasets can automatically be split into training and test data, including the possibility to use cross validation. Alternatively, predefined splits can be provided as separate files.
One-hot-encoding can be applied to nominal or binary features.
Binary predictions, scores, or probability estimates can be obtained from machine learning algorithms, if supported. Evaluation measures that are suited for the respective type of predictions are picked automatically.

Furthermore, the command line API provides many options for controlling the experimental results to be gathered during an experiment. Depending on the configuration, the following experimental results can be saved to output files or printed on the console:

Evaluation scores according to commonly used measures
Characteristics, i.e., statistical properties, of datasets
Predictions and their characteristics
Unique label vectors contained in a classification dataset

If the following are written to output files, they can be loaded and reused in future experiments:

The machine learning models that have been learned
Algorithmic parameters used for training

📚 Documentation

Our documentation provides an extensive user guide, as well as an API reference for developers.

Examples of how to save experimental results to output files.
Instructions for using your own algorithms with the command line API.
An overview of available command line arguments for controlling experiments.

For an overview of changes and new features that have been included in past releases, please refer to the changelog.

📜 License

This project is open source software licensed under the terms of the MIT license. We welcome contributions to the project to enhance its functionality and make it more accessible to a broader audience. A frequently updated list of contributors is available here.

All contributions to the project and discussions on the issue tracker are expected to follow the code of conduct.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

mrapp

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.15.4

May 18, 2026

0.15.3

Apr 9, 2026

0.15.1

Jan 4, 2026

0.15.0

Dec 14, 2025

0.14.2

Nov 15, 2025

0.14.1

Oct 14, 2025

This version

0.14.0

Aug 22, 2025

0.13.1

Aug 4, 2025

0.12.3

Aug 2, 2025

0.12.2

Jul 31, 2025

0.12.1

Jul 6, 2025

0.12.0

Jun 29, 2025

0.11.4

Feb 27, 2025

0.11.3

Jan 30, 2025

0.11.2

Jan 22, 2025

0.11.1

Sep 24, 2024

0.11.0

Aug 9, 2024

0.10.2

Aug 9, 2024

0.10.1

Aug 1, 2024

0.10.0

May 5, 2024

0.9.0

Jul 2, 2023

0.8.2

Apr 11, 2022

0.8.1

Mar 3, 2022

0.8.0

Jan 31, 2022

0.7.1

Dec 16, 2021

0.7.0

Dec 5, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mlrl_testbed-0.14.0-py3-none-any.whl (80.1 kB view details)

Uploaded Aug 22, 2025 Python 3

File details

Details for the file mlrl_testbed-0.14.0-py3-none-any.whl.

File metadata

Download URL: mlrl_testbed-0.14.0-py3-none-any.whl
Upload date: Aug 22, 2025
Size: 80.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for mlrl_testbed-0.14.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ca2a5882acbc36f331a79ae2813721f11e3c2997238a18d6aacbec046e054915`
MD5	`91cb5f2c362c78d6b58fcfa443655609`
BLAKE2b-256	`7663cc492a306dd308cf7caac9f3ba2383ed55518402241070ee153d9b49297f`

See more details on using hashes here.

Provenance

The following attestation bundles were made for mlrl_testbed-0.14.0-py3-none-any.whl:

Publisher: publish.yml on mrapp-ke/MLRL-Boomer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: mlrl_testbed-0.14.0-py3-none-any.whl
- Subject digest: ca2a5882acbc36f331a79ae2813721f11e3c2997238a18d6aacbec046e054915
- Sigstore transparency entry: 423627196
- Sigstore integration time: Aug 22, 2025
Source repository:
- Permalink: mrapp-ke/MLRL-Boomer@178ea7ece9cd77ed4991720ec6adbc79bd574daf
- Branch / Tag: refs/tags/0.14.0
- Owner: https://github.com/mrapp-ke
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@178ea7ece9cd77ed4991720ec6adbc79bd574daf
- Trigger Event: release

mlrl-testbed 0.14.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

MLRL-Testbed

Tabular Machine Learning

💡 Example

🏁 Advantages

🔧 Functionalities

📚 Documentation

📜 License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes

Provenance