Machine Learning Research Wizard

MLWiz

Machine Learning Research Wizard — reproducible experiments from YAML (model selection + risk assessment) for vectors, images, time-series, and graphs.

🔗 Quick Links

📘 Docs: https://mlwiz.readthedocs.io/en/stable/
🧪 Tutorial (recommended): https://mlwiz.readthedocs.io/en/stable/tutorial.html
📦 PyPI: https://pypi.org/project/mlwiz/
📝 Changelog: CHANGELOG.md
🤝 Contributing: CONTRIBUTING.md

✨ What It Does

MLWiz helps you run end-to-end research experiments with minimal boilerplate:

🧱 Build/prepare datasets and generate splits (hold-out or nested CV)
🎛️ Expand a hyperparameter search space (grid, random, or Bayesian search)
⚡ Run model selection + risk assessment in parallel with Ray (CPU/GPU or cluster)
📈 Log metrics, checkpoints, and TensorBoard traces in a consistent folder structure

Inspired by (and a generalized version of) PyDGN.

✅ Key Features

Area	What you get
Research-Oriented Framework	Everything is customizable, which makes prototyping models and setups easy
Reproducibility	Ensure your results are reproducible across multiple runs
Automatic Split Generation	Dataset preparation + `.splits` generation for hold-out / (nested) CV
Automatic and Robust Evaluation	Nested model selection (inner folds) + risk assessment (outer folds)
Parallelism	Ray-based execution across CPU/GPU (or a Ray cluster)

🚀 Getting Started

📦 Installation

MLWiz supports Python 3.10+.

pip install mlwiz

Tip: for GPU / graph workloads, install PyTorch and PyG following their official instructions first, then pip install mlwiz.

⚡ Quickstart

Step	Command	Notes
1) Prepare dataset + splits	`mlwiz-data --config-file examples/DATA_CONFIGS/config_MNIST.yml`	Creates processed data + a `.splits` file
2) Run an experiment (grid search)	`mlwiz-exp --config-file examples/MODEL_CONFIGS/config_MLP.yml`	Add `--debug` to run sequentially and print logs
3) Inspect results	`cat RESULTS/mlp_MNIST/MODEL_ASSESSMENT/assessment_results.json`	Aggregated results live under `RESULTS/`
4) Visualize in TensorBoard	`tensorboard --logdir RESULTS/mlp_MNIST`	Per-run logs are written automatically
5) Stop a running experiment	Press `Ctrl-C`
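
Steps 1 and 2 of the table can also be driven from Python. The following is a minimal sketch, assuming the `mlwiz-data` and `mlwiz-exp` commands are on your PATH and the example config files exist. `build_commands` and `run_pipeline` are hypothetical helper names, not part of MLWiz.

```python
import subprocess


def build_commands(data_config, exp_config, debug=False):
    """Return the CLI invocations for steps 1 and 2 as argument lists."""
    data_cmd = ["mlwiz-data", "--config-file", data_config]
    exp_cmd = ["mlwiz-exp", "--config-file", exp_config]
    if debug:
        # --debug runs sequentially and prints logs (see the Notes column).
        exp_cmd.append("--debug")
    return [data_cmd, exp_cmd]


def run_pipeline(commands):
    """Run each command in order and stop at the first failure."""
    for cmd in commands:
        subprocess.run(cmd, check=True)


commands = build_commands(
    "examples/DATA_CONFIGS/config_MNIST.yml",
    "examples/MODEL_CONFIGS/config_MLP.yml",
    debug=True,
)
```

Calling `run_pipeline(commands)` then prepares the dataset and launches the experiment. `check=True` makes a failed data step abort before the experiment starts.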

🧭 Navigating the CLI (non-debug mode)

Example of the global view CLI:

[Screenshot: MLWiz terminal progress UI]

Specific views can be accessed, e.g. to visualize a specific model run:

:<outer_fold> <inner_fold> <config_id> <run_id>

…or, analogously, a risk assessment run:

:<outer_fold> <run_id>

Here is what it looks like:

[Screenshot: MLWiz terminal specific view]

Handy commands:

:  # or :g or :global (back to global view)
:r # or :refresh (refresh the screen)

Use the left/right arrow keys to move across configurations, and the up/down arrow keys to switch between model selection and risk assessment runs.
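
To make the command grammar concrete, here is a small illustrative parser for the view commands listed above. It is only a sketch of how the commands are structured: `parse_view_command` and the tuple labels it returns are hypothetical and say nothing about how MLWiz handles input internally.

```python
def parse_view_command(text):
    """Classify a view command such as ':1 2 3 1' (hypothetical helper)."""
    if not text.startswith(":"):
        raise ValueError("view commands start with ':'")
    body = text[1:].strip()
    if body in ("", "g", "global"):
        return ("global",)
    if body in ("r", "refresh"):
        return ("refresh",)
    # Remaining commands are whitespace-separated integer identifiers.
    numbers = [int(token) for token in body.split()]
    if len(numbers) == 4:
        # :<outer_fold> <inner_fold> <config_id> <run_id>
        return ("model_selection_run", *numbers)
    if len(numbers) == 2:
        # :<outer_fold> <run_id>
        return ("risk_assessment_run", *numbers)
    raise ValueError(f"unrecognized view command: {text!r}")
```

For example, `parse_view_command(":1 2 3 1")` describes a model selection run, while `parse_view_command(":1 2")` describes a risk assessment run.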

🧩 Architecture (High-Level)

MLWiz is built around two YAML files and a small set of composable components:

data.yml ──► mlwiz-data ──► processed dataset + .splits
exp.yml  ──► mlwiz-exp  ──► Ray workers
                      ├─ inner folds: model selection (best hyperparams)
                      └─ outer folds: risk assessment (final scores)

🧰 Data pipeline: mlwiz-data instantiates your dataset class and writes a .splits file for hold-out / (nested) CV.
🧪 Search space: grid: and random: sections expand into concrete hyperparameter configurations.
🛰️ Orchestration: the evaluator schedules training runs with Ray across CPU/GPU (or a Ray cluster).
🏗️ Execution: each run builds a model + training engine from dotted paths, then logs artifacts and returns structured results.
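
As an illustration of the search-space step, the sketch below expands a nested grid into concrete configurations. It assumes that every YAML list enumerates alternatives and that every mapping combines its keys as a Cartesian product. MLWiz's actual expansion rules may differ, and `expand` is a hypothetical name.

```python
from itertools import product


def expand(node):
    """Return every concrete value a search-space node can take."""
    if isinstance(node, dict):
        keys = list(node)
        choices = [expand(node[key]) for key in keys]
        # One concrete dict per combination of the per-key choices.
        return [dict(zip(keys, combo)) for combo in product(*choices)]
    if isinstance(node, list):
        # A list enumerates alternatives; each one may itself be nested.
        return [value for item in node for value in expand(item)]
    return [node]


grid = {
    "batch_size": [256, 512],
    "optimizer": [{"class_name": "torch.optim.Adam", "lr": [0.01, 0.03]}],
}
configs = expand(grid)
print(len(configs))  # 4 (2 batch sizes x 2 learning rates)
```

Each entry of `configs` is a plain dictionary with scalar leaves, which is the shape a single training run would consume.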

⚙️ Configuration At A Glance

MLWiz expects:

🗂️ one YAML for data + splits
🧾 one YAML for experiment + search space

Minimal data config:

splitter:
  splits_folder: DATA_SPLITS/
  class_name: mlwiz.data.splitter.Splitter
  args:
    n_outer_folds: 3
    n_inner_folds: 2
    seed: 42

dataset:
  class_name: mlwiz.data.dataset.MNIST
  args:
    storage_folder: DATA/

Minimal experiment config (grid search):

storage_folder: DATA
dataset_class: mlwiz.data.dataset.MNIST
data_splits_file: DATA_SPLITS/MNIST/MNIST_outer3_inner2.splits

device: cpu
max_cpus: 8

dataset_getter: mlwiz.data.provider.DataProvider
data_loader:
  class_name: torch.utils.data.DataLoader
  args:
    num_workers: 0
    pin_memory: False

result_folder: RESULTS
exp_name: mlp
experiment: mlwiz.experiment.Experiment
model_selection_criteria:
  - metric: main_score
    direction: max
evaluate_every: 1
risk_assessment_training_runs: 3
model_selection_training_runs: 2

grid:
  model: mlwiz.model.MLP
  epochs: 400
  batch_size: 512
  dim_embedding: 5
  mlwiz_tests: True  # patch: allow reshaping of MNIST dataset
  optimizer:
    - class_name: mlwiz.training.callback.optimizer.Optimizer
      args:
        optimizer_class_name: torch.optim.Adam
        lr:
          - 0.01
          - 0.03
        weight_decay: 0.
  loss: mlwiz.training.callback.metric.MulticlassClassification
  scorer: mlwiz.training.callback.metric.MulticlassAccuracy
  engine:
    class_name: mlwiz.training.engine.TrainingEngine
    args:
      mixed_precision: false
      mixed_precision_dtype: torch.float16

When mixed_precision: true is used on CPU, a requested mixed_precision_dtype: torch.float16 is automatically converted to torch.bfloat16.
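
The fallback rule can be summarized in a few lines. This is a sketch of the behavior described above, written with dtype names as strings so it runs without PyTorch. `resolve_autocast_dtype` is a hypothetical name, not an MLWiz function.

```python
def resolve_autocast_dtype(device, mixed_precision, requested_dtype):
    """Return the dtype name to use for mixed precision, or None if disabled."""
    if not mixed_precision:
        return None
    if device == "cpu" and requested_dtype == "torch.float16":
        # float16 is swapped for bfloat16 when running on CPU.
        return "torch.bfloat16"
    return requested_dtype


print(resolve_autocast_dtype("cpu", True, "torch.float16"))  # torch.bfloat16
```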

higher_results_are_better remains available as a legacy shortcut for main_score, but it cannot be set together with model_selection_criteria.
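
The idea behind model_selection_criteria can be sketched as follows: average each configuration's validation metric over its runs, then pick the best one according to the stated direction. The data layout and the in-order tie-breaking between multiple criteria are assumptions made for illustration, and `select_best` is a hypothetical name.

```python
from statistics import mean


def select_best(results, criteria):
    """Pick the config whose averaged validation metrics rank first.

    `results` maps a config id to a list of per-run metric dicts.
    Criteria are compared in order (an assumed tie-breaking rule).
    """

    def rank_key(config_id):
        runs = results[config_id]
        key = []
        for criterion in criteria:
            avg = mean(run[criterion["metric"]] for run in runs)
            # Negate "min" metrics so that larger is always better.
            key.append(avg if criterion["direction"] == "max" else -avg)
        return tuple(key)

    return max(results, key=rank_key)


results = {
    "config_1": [{"main_score": 0.90}, {"main_score": 0.92}],
    "config_2": [{"main_score": 0.95}, {"main_score": 0.93}],
}
criteria = [{"metric": "main_score", "direction": "max"}]
print(select_best(results, criteria))  # config_2
```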

See examples/ for complete configs (including random/Bayesian search, schedulers, early stopping, and more).

🧩 Custom Code Via Dotted Paths

Point YAML entries to your own classes (in your project). mlwiz-data and mlwiz-exp add the current working directory to sys.path, so this works out of the box:

grid:
  model: my_project.models.MyModel

dataset:
  class_name: my_project.data.MyDataset
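
Resolving a dotted path generally comes down to importing the module part and looking up the final attribute. The sketch below shows that pattern with the standard library only. It is not MLWiz's own loader, and `load_from_dotted_path` is a hypothetical name.

```python
import importlib


def load_from_dotted_path(dotted_path):
    """Import `package.module.Name` and return the `Name` attribute."""
    module_path, _, attribute = dotted_path.rpartition(".")
    if not module_path:
        raise ValueError(f"expected 'module.Name', got {dotted_path!r}")
    module = importlib.import_module(module_path)
    return getattr(module, attribute)


# Standard-library target used here so the example runs anywhere.
ordered_dict_class = load_from_dotted_path("collections.OrderedDict")
```

A path such as `my_project.models.MyModel` resolves the same way, provided `my_project` is importable from the directory where the command is launched.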

📦 Outputs

Runs are written under RESULTS/:

Output	Location
Aggregated outer-fold results	`RESULTS/<exp_name>_<dataset>/MODEL_ASSESSMENT/assessment_results.json`
Per-fold summaries	`RESULTS/<exp_name>_<dataset>/MODEL_ASSESSMENT/OUTER_FOLD_k/outer_results.json`
Model selection (inner folds + winner config)	`.../MODEL_SELECTION/...`
Final retrains with selected hyperparams	`.../final_run*/`

Each training run also writes TensorBoard logs under <run_dir>/tensorboard/.
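
A small helper can build these locations for a given experiment. This sketch only assembles the paths listed in the table and loads a JSON file if it exists. It makes no assumption about the keys stored inside the JSON files, and both function names are hypothetical.

```python
import json
from pathlib import Path


def assessment_paths(result_folder, exp_name, dataset, outer_fold):
    """Build the documented result paths (the files may not exist yet)."""
    root = Path(result_folder) / f"{exp_name}_{dataset}" / "MODEL_ASSESSMENT"
    fold_dir = root / f"OUTER_FOLD_{outer_fold}"
    return {
        "aggregated": root / "assessment_results.json",
        "fold_summary": fold_dir / "outer_results.json",
        "model_selection": fold_dir / "MODEL_SELECTION",
    }


def load_json_if_present(path):
    """Return the parsed JSON content, or None when the file is missing."""
    path = Path(path)
    if not path.is_file():
        return None
    with path.open() as handle:
        return json.load(handle)


paths = assessment_paths("RESULTS", "mlp", "MNIST", outer_fold=1)
aggregated = load_json_if_present(paths["aggregated"])
```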

🛠️ Utilities

🗂️ Config Management (CLI)

Duplicate a base experiment config across multiple datasets:

mlwiz-config-duplicator --base-exp-config base.yml --data-config-files data1.yml data2.yml

📊 Post-process Results (Python)

Filter configurations from a MODEL_SELECTION/ folder and convert them to a DataFrame:

from mlwiz.evaluation.util import retrieve_experiments, filter_experiments, create_dataframe

configs = retrieve_experiments(
    "RESULTS/mlp_MNIST/MODEL_ASSESSMENT/OUTER_FOLD_1/MODEL_SELECTION/"
)
# Keep configurations with lr == 0.01, one of the values in the example grid.
filtered = filter_experiments(configs, logic="OR", parameters={"lr": 0.01})
df = create_dataframe(
    config_list=filtered,
    key_mappings=[("lr", float), ("avg_validation_score", float)],
)

Export aggregated assessment results to LaTeX:

from mlwiz.evaluation.util import create_latex_table_from_assessment_results

experiments = [
    ("RESULTS/mlp_MNIST", "MLP", "MNIST"),
    ("RESULTS/dgn_PROTEINS", "DGN", "PROTEINS"),
]

latex_table = create_latex_table_from_assessment_results(
    experiments,
    metric_key="main_score",
    no_decimals=3,
    model_as_row=True,
    use_single_outer_fold=False,
)
print(latex_table)

Compare statistical significance between models (Welch's t-test):

from mlwiz.evaluation.util import statistical_significance

reference = ("RESULTS/mlp_MNIST", "MLP", "MNIST")
competitors = [
    ("RESULTS/baseline1_MNIST", "B1", "MNIST"),
    ("RESULTS/baseline2_MNIST", "B2", "MNIST"),
]

df = statistical_significance(
    highlighted_exp_metadata=reference,
    other_exp_metadata=competitors,
    metric_key="main_score",
    set_key="test",
    confidence_level=0.95,
)
print(df)
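
For reference, the statistic behind Welch's t-test can be computed with the standard library. This is a textbook sketch on made-up scores, not the code inside `statistical_significance`. Turning the statistic into a p-value additionally needs the Student t distribution, which is omitted here.

```python
import math
from statistics import mean, variance


def welch_t_test(sample_a, sample_b):
    """Return Welch's t statistic and Welch-Satterthwaite degrees of freedom."""
    n_a, n_b = len(sample_a), len(sample_b)
    # Squared standard error of each mean (sample variance uses n - 1).
    se_a = variance(sample_a) / n_a
    se_b = variance(sample_b) / n_b
    t_stat = (mean(sample_a) - mean(sample_b)) / math.sqrt(se_a + se_b)
    dof = (se_a + se_b) ** 2 / (se_a**2 / (n_a - 1) + se_b**2 / (n_b - 1))
    return t_stat, dof


# Illustrative test scores for two models (not real results).
model_scores = [0.90, 0.92, 0.94]
baseline_scores = [0.80, 0.82, 0.84]
t_stat, dof = welch_t_test(model_scores, baseline_scores)
```

Unlike Student's t-test, this variant does not assume that the two groups of runs have equal variance, which suits comparisons between different models.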

🔍 Load a Trained Model (Notebook-friendly)

Load the best configuration for a fold, instantiate dataset/model, and restore a checkpoint:

from mlwiz.evaluation.util import (
    retrieve_best_configuration,
    instantiate_dataset_from_config,
    instantiate_model_from_config,
    load_checkpoint,
)

config = retrieve_best_configuration(
    "RESULTS/mlp_MNIST/MODEL_ASSESSMENT/OUTER_FOLD_1/MODEL_SELECTION/"
)
dataset = instantiate_dataset_from_config(config)
model = instantiate_model_from_config(config, dataset)
load_checkpoint(
    "RESULTS/mlp_MNIST/MODEL_ASSESSMENT/OUTER_FOLD_1/final_run1/best_checkpoint.pth",
    model,
    device="cpu",
)

For more post-processing helpers, see the tutorial: https://mlwiz.readthedocs.io/en/stable/tutorial.html

🤝 Contributing

See CONTRIBUTING.md.

📄 License

BSD-3-Clause. See LICENSE.

Release history

1.5.3: May 6, 2026 (this version)
1.5.2: Apr 29, 2026
1.5.1: Apr 19, 2026
1.5.0: Apr 12, 2026
1.4.2: Feb 18, 2026
1.4.1: Feb 13, 2026
1.4.0: Dec 16, 2025
1.3.3: Dec 13, 2025
1.3.2 (yanked): Dec 13, 2025. Reason: minor bug when restarting final runs
1.3.1: Dec 12, 2025
1.3.0: Dec 5, 2025
1.2.7: Nov 18, 2025
1.2.6: Nov 16, 2025
1.2.5: Aug 23, 2025
1.2.4: May 9, 2025
1.2.3: Apr 26, 2025
1.2.2: Apr 1, 2025
1.2.1: Mar 30, 2025
1.2.0 (yanked): Mar 16, 2025. Reason: bug in pyproject.toml
1.1.1: Jan 3, 2025
1.1.0: Oct 11, 2024
1.0.1: Sep 12, 2024
1.0.1a0 (pre-release): Sep 10, 2024
1.0.0: Sep 9, 2024

Download files

Source Distribution

mlwiz-1.5.2.tar.gz (126.3 kB)

Uploaded Apr 29, 2026 Source

Built Distribution

mlwiz-1.5.2-py3-none-any.whl (139.0 kB)

Uploaded Apr 29, 2026 Python 3

File details

Details for the file mlwiz-1.5.2.tar.gz.

File metadata

Download URL: mlwiz-1.5.2.tar.gz
Upload date: Apr 29, 2026
Size: 126.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for mlwiz-1.5.2.tar.gz
Algorithm	Hash digest
SHA256	`3048dae3c6bdfa4730c2223cc24df0d435a31598cf60274a1b7fa7ec6991e419`
MD5	`c96d4dfe0741e3258bd2e7787f8c383b`
BLAKE2b-256	`1944194d7b1f4e5732cda7eb2f85fa612103b5d06e5a1e6562936cae53f81751`

File details

Details for the file mlwiz-1.5.2-py3-none-any.whl.

File metadata

Download URL: mlwiz-1.5.2-py3-none-any.whl
Upload date: Apr 29, 2026
Size: 139.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for mlwiz-1.5.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`981f5a4b3b6fc4c1c28fd4affe24ca5b6809702acb5f006bee7223f0c9904aec`
MD5	`d34f73f42bc97b72ccafca4f573679c6`
BLAKE2b-256	`11b7dcb1edadfa8b31fb458ff1658a8717581b1ed26901bf21b9bd1a5bbca6fd`
