Lightweight framework for structured and repeatable model validation

These details have not been verified by PyPI

Project links

Project description

kotsu: lightweight framework for structuring model validation

lint-test status

What is it?

kotsu is Python package that provides a lightweight and flexible framework to structure validating and comparing machine learning models. It aims to provide the skeleton on which to develop models and to validate them in a robust and repeatable way, minimizing bloat or overhead. Its flexibility allows usage with any model interface and any validation technique, no matter how complex. The structure it provides avoids common pitfalls that occur when attempting to make fair comparisons between models.

Main Features

Register a model with hyperparameters to a unique ID
Register validations to a unique ID
Run all registered models through all registered validations, and have the results compiled and stored as a CSV
Optionally passes an artefacts_store_dir to your validations, for storing of outputs for further analysis, e.g. trained models or model predictions on test data sets
Doesn't enforce any constraints or requirements on your models' interfaces
Pure Python package, with no other setup or configuration of other systems required

Where to get it

The source code is currently hosted on GitHub at: https://github.com/datavaluepeople/kotsu

The latest released version of the package can be installed from PyPI with:

pip install kotsu

The package is also published on conda-forge:

conda install -c conda-forge kotsu

Usage

The following demonstrates a simple usage of kotsu to register and validate multiple models over multiple validations.

Import kotsu and your packages for modelling:

import kotsu
from sklearn import datasets, svm
from sklearn.model_selection import cross_val_score

Register your competing models:

Here we register two Support Vector Classifiers with different hyper-parameters.

model_registry = kotsu.registration.ModelRegistry()

model_registry.register(
    id="SVC-v1",
    entry_point=svm.SVC,
    kwargs={"kernel": "linear", "C": 1, "random_state": 1},
)

model_registry.register(
    id="SVC-v2",
    entry_point=svm.SVC,
    kwargs={"kernel": "linear", "C": 0.5, "random_state": 1},
)

Instantiate a registered model:

If you want to use your registries outside of kotsu.run, you can create model instances directly from their registered IDs. This is useful if you want to run an ad-hoc validation or interact with a model in an interactive environment without running the full benchmarking loop.

model_instance = model_registry.make("SVC-v1")

Register your validations:

You can register multiple validations if you want to compare models in different scenarios, e.g. on different datasets. Your validations should take an instance of your models as an argument, then return a dictionary containing the results from validation of that model. Here we register two Cross-Validation validations with different numbers of folds.

validation_registry = kotsu.registration.ValidationRegistry()


def factory_iris_cross_validation(folds: int):
    """Factory for iris cross validation."""

    def iris_cross_validation(model) -> dict:
        """Iris classification cross validation."""
        X, y = datasets.load_iris(return_X_y=True)
        scores = cross_val_score(model, X, y, cv=folds)
        results = {f"fold_{i}_score": score for i, score in enumerate(scores)}
        results["mean_score"] = scores.mean()
        results["std_score"] = scores.std()
        return results

    return iris_cross_validation


validation_registry.register(
    id="iris_cross_validation-v1",
    entry_point=factory_iris_cross_validation,
    kwargs={"folds": 5},
)

validation_registry.register(
    id="iris_cross_validation-v2",
    entry_point=factory_iris_cross_validation,
    kwargs={"folds": 10},
)

Run the models through the validations:

We choose the current directory as the location for writing the results.

kotsu.run(model_registry, validation_registry)

Then find the results from each model-validation combination in a CSV written to the current directory.

Documentation on interfaces

See kotsu.typing for documentation on the main entities; Models, Validations, and Results, and their interfaces.

Comprehensive example

See the end to end test for a more comprehensive example usage of kotsu, which includes storing the trained models from each model-validation run.

Releasing

Updating the conda-forge feedstock

kotsu is distributed on conda-forge through the kotsu-feedstock repository. Maintainers (@DBCerigo, @ali-tny, and @alex-hh) can publish new releases by:

Releasing the desired version to PyPI (push a new version tag) so that the source distribution is available.
Creating a branch on conda-forge/kotsu-feedstock and updating recipe/meta.yaml:
- set the version variable to the new package version and reset the number field to 0.
- update the sha256 checksum for the source archive. You can obtain it with:
```
curl -L https://pypi.io/packages/source/k/kotsu/kotsu-<version>.tar.gz | shasum -a 256
```
If dependencies change, ask the conda-forge bots to rerender by commenting @conda-forge-admin, please rerender on the pull request (or run conda smithy rerender locally).
Open a pull request to the feedstock and merge it once CI passes and the maintainers approve.

Once merged, conda-forge will publish the new build automatically.

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.4.0

Jan 8, 2026

0.3.3

Aug 3, 2023

0.3.2

Sep 11, 2022

0.3.1

Jul 7, 2022

0.3.0

Jun 7, 2022

0.2.1

Oct 12, 2021

0.2.0

Oct 1, 2021

0.1.0

Sep 9, 2021

0.0.1a1 pre-release

Jul 14, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kotsu-0.4.0.tar.gz (19.1 kB view details)

Uploaded Jan 8, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

kotsu-0.4.0-py3-none-any.whl (10.4 kB view details)

Uploaded Jan 8, 2026 Python 3

File details

Details for the file kotsu-0.4.0.tar.gz.

File metadata

Download URL: kotsu-0.4.0.tar.gz
Upload date: Jan 8, 2026
Size: 19.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for kotsu-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`686b2f1441e6e026638b1e4050e045d5fed5daa741de797e086182c36eb67e69`
MD5	`a66de6252f71bf3d08a22c373f1b1a61`
BLAKE2b-256	`a60ed6f76569245ee5a09587fba803fd02fd5113b9d837fe355cc0e4ef644631`

See more details on using hashes here.

File details

Details for the file kotsu-0.4.0-py3-none-any.whl.

File metadata

Download URL: kotsu-0.4.0-py3-none-any.whl
Upload date: Jan 8, 2026
Size: 10.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for kotsu-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a9b92bc00f4df84aee4ea30b92c3174838f3ba1364db9548fe23b50b0c7a28b2`
MD5	`ff9156e042138bc9b955d66dc4ff944b`
BLAKE2b-256	`9ee2a5a9e90d190bff5bfef1f35065ce4b773b686d1bfd72e598758cc900a81d`

See more details on using hashes here.

kotsu 0.4.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

kotsu: lightweight framework for structuring model validation

What is it?

Main Features

Where to get it

Usage

Documentation on interfaces

Comprehensive example

Releasing

Updating the conda-forge feedstock

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes