NannyML, Your library for monitoring model performance.

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- OSI Approved :: Apache Software License
Natural Language
- English
Programming Language

Project description

PyPI - License

Website • Docs • Community Slack

animated

💡 What is NannyML?

NannyML is an open-source python library that allows you to estimate post-deployment model performance (without access to targets), detect data drift, and intelligently link data drift alerts back to changes in model performance. Built for data scientists, NannyML has an easy-to-use interface, interactive visualizations, is completely model-agnostic and currently supports all tabular use cases, classification and regression.

The core contributors of NannyML have researched and developed multiple novel algorithms for estimating model performance: confidence-based performance estimation (CBPE) and direct loss estimation (DLE). The nansters also invented a new approach to detect multivariate data drift using PCA-based data reconstruction.

If you like what we are working on, be sure to become a Nanster yourself, join our community slack and support us with a GitHub star ⭐.

☔ Why use NannyML?

NannyML closes the loop with performance monitoring and post deployment data science, empowering data scientist to quickly understand and automatically detect silent model failure. By using NannyML, data scientists can finally maintain complete visibility and trust in their deployed machine learning models. Allowing you to have the following benefits:

End sleepless nights caused by not knowing your model performance 😴
Analyse data drift and model performance over time
Discover the root cause to why your models are not performing as expected
No alert fatigue! React only when necessary if model performance is impacted
Painless setup in any environment

🧠 GO DEEP

NannyML Resources	Description
☎️ NannyML 101	New to NannyML? Start here!
🔮 Performance estimation	How the magic works.
🌍 Real world example	Take a look at a real-world example of NannyML.
🔑 Key concepts	Glossary of key concepts we use.
🔬 Technical reference	Monitor the performance of your ML models.
🔎 Blog	Thoughts on post-deployment data science from the NannyML team.
📬 Newsletter	All things post-deployment data science. Subscribe to see the latest papers and blogs.
💎 New in v0.13.1	New features, bug fixes.
🧑‍💻 Contribute	How to contribute to the NannyML project and codebase.
Join slack	Need help with your specific use case? Say hi on slack!

🔱 Features

1. Performance estimation and monitoring

When the actual outcome of your deployed prediction models is delayed, or even when post-deployment target labels are completely absent, you can use NannyML's CBPE-algorithm to estimate model performance for classification or NannyML's DLE-algorithm for regression. These algorithms provide you with any estimated metric you would like, i.e. ROC AUC or RSME. Rather than estimating the performance of future model predictions, CBPE and DLE estimate the expected model performance of the predictions made at inference time.

NannyML can also track the realised performance of your machine learning model once targets are available.

2. Data drift detection

To detect multivariate feature drift NannyML uses PCA-based data reconstruction. Changes in the resulting reconstruction error are monitored over time and data drift alerts are logged when the reconstruction error in a certain period exceeds a threshold. This threshold is calculated based on the reconstruction error observed in the reference period.

NannyML utilises statistical tests to detect univariate feature drift. We have just added a bunch of new univariate tests including Jensen-Shannon Distance and L-Infinity Distance, check out the comprehensive list. The results of these tests are tracked over time, properly corrected to counteract multiplicity and overlayed on the temporal feature distributions. (It is also possible to visualise the test-statistics over time, to get a notion of the drift magnitude.)

NannyML uses the same statistical tests to detected model output drift.

Target distribution drift can also be monitored using the same statistical tests. Bear in mind that this operation requires the presence of actuals.

3. Intelligent alerting

Because NannyML can estimate performance, it is possible to weed out data drift alerts that do not impact expected performance, combatting alert fatigue. Besides linking data drift issues to drops in performance it is also possible to prioritise alerts according to other criteria using NannyML's Ranker.

🚀 Getting started

Install NannyML

NannyML depends on LightGBM. This might require you to set install additional OS-specific binaries. You can follow the official installation guide.

From PyPI:

pip install nannyml

From Conda:

 conda install -c conda-forge nannyml

Running via Docker:

docker -v /local/config/dir/:/config/ run nannyml/nannyml nml run

Here be dragons! Use the latest development version of NannyML at your own risk:

python -m pip install git+https://github.com/NannyML/nannyml

Extras

If you're using database connections to read model inputs/outputs or you're exporting monitoring results to a database, you'll need to include the optional db dependency. For example using pip:

pip install nannyml[db]

or using poetry

poetry install nannyml --all-extras

Quick Start

The following snippet is based on our latest release.

import nannyml as nml
import pandas as pd
from IPython.display import display

# Load real-world data:
reference_df, analysis_df, _ = nml.load_us_census_ma_employment_data()
display(reference_df.head())
display(analysis_df.head())

# Choose a chunker or set a chunk size:
chunk_size = 5000

# initialize, specify required data columns, fit estimator and estimate:
estimator = nml.CBPE(
    problem_type='classification_binary',
    y_pred_proba='predicted_probability',
    y_pred='prediction',
    y_true='employed',
    metrics=['roc_auc'],
    chunk_size=chunk_size,
)
estimator = estimator.fit(reference_df)
estimated_performance = estimator.estimate(analysis_df)

# Show results:
figure = estimated_performance.plot()
figure.show()

# Define feature columns:
features = ['AGEP', 'SCHL', 'MAR', 'RELP', 'DIS', 'ESP', 'CIT', 'MIG', 'MIL', 'ANC',
       'NATIVITY', 'DEAR', 'DEYE', 'DREM', 'SEX', 'RAC1P']

# Initialize the object that will perform the Univariate Drift calculations:
univariate_calculator = nml.UnivariateDriftCalculator(
    column_names=features,
    chunk_size=chunk_size
)

univariate_calculator.fit(reference_df)
univariate_drift = univariate_calculator.calculate(analysis_df)

# Get features that drift the most with count-based ranker:
alert_count_ranker = nml.AlertCountRanker()
alert_count_ranked_features = alert_count_ranker.rank(univariate_drift)
display(alert_count_ranked_features.head())

# Plot drift results for top 3 features:
figure = univariate_drift.filter(column_names=['RELP','AGEP', 'SCHL']).plot()
figure.show()

# Compare drift of a selected feature with estimated performance
uni_drift_AGEP_analysis = univariate_drift.filter(column_names=['AGEP'], period='analysis')
figure = estimated_performance.compare(uni_drift_AGEP_analysis).plot()
figure.show()

# Plot distribution changes of the selected features:
figure = univariate_drift.filter(period='analysis', column_names=['RELP','AGEP', 'SCHL']).plot(kind='distribution')
figure.show()

# Get target data, calculate, plot and compare realized performance with estimated performance:
_, _, analysis_targets_df = nml.load_us_census_ma_employment_data()

analysis_with_targets_df = pd.concat([analysis_df, analysis_targets_df], axis=1)
display(analysis_with_targets_df.head())

performance_calculator = nml.PerformanceCalculator(
    problem_type='classification_binary',
    y_pred_proba='predicted_probability',
    y_pred='prediction',
    y_true='employed',
    metrics=['roc_auc'],
    chunk_size=chunk_size)

performance_calculator.fit(reference_df)
calculated_performance = performance_calculator.calculate(analysis_with_targets_df)

figure = estimated_performance.filter(period='analysis').compare(calculated_performance).plot()
figure.show()

📖 Documentation

Performance monitoring
- Estimated performance
- Realized performance
Drift detection
- Multivariate feature drift
- Univariate feature drift

🦸 Contributing and Community

We want to build NannyML together with the community! The easiest to contribute at the moment is to propose new features or log bugs under issues. For more information, have a look at how to contribute.

Thanks to all of our contributors!

🙋 Get help

The best place to ask for help is in the community slack. Feel free to join and ask questions or raise issues. Someone will definitely respond to you.

🥷 Stay updated

If you want to stay up to date with recent changes to the NannyML library, you can subscribe to our release notes. For thoughts on post-deployment data science from the NannyML team, feel free to visit our blog. You can also sing up for our newsletter, which brings together the best papers, articles, news, and open-source libraries highlighting the ML challenges after deployment.

📍 Roadmap

Curious what we are working on next? Have a look at our roadmap. If you have any questions or if you would like to see things prioritised in a different way, let us know!

📝 Citing NannyML

To cite NannyML in academic papers, please use the following BibTeX entry.

Version 0.13.1

    @misc{nannyml,
        title = {{N}anny{ML} (release 0.13.1)},
        howpublished = {\url{https://github.com/NannyML/nannyml}},
        month = mar,
        year = 2023,
        note = {NannyML, Belgium, OHL.},
        key = {NannyML}
    }

📄 License

NannyML is distributed under an Apache License Version 2.0. A complete version can be found here. All contributions will be distributed under this license.

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- OSI Approved :: Apache Software License
Natural Language
- English
Programming Language

Release history Release notifications | RSS feed

This version

0.13.1

Jul 12, 2025

0.13.0

Jan 14, 2025

0.12.1

Sep 6, 2024

0.12.0

Sep 6, 2024

0.11.0

Jul 19, 2024

0.10.7

Jun 7, 2024

0.10.6

May 16, 2024

0.10.5

Mar 8, 2024

0.10.4

Mar 4, 2024

0.10.3

Feb 17, 2024

0.10.2

Feb 13, 2024

0.10.1

Nov 28, 2023

0.10.0

Nov 21, 2023

0.9.1

Jul 12, 2023

0.9.0

Jun 26, 2023

0.8.6

May 24, 2023

0.8.5

Mar 29, 2023

0.8.4

Mar 20, 2023

0.8.3

Jan 31, 2023

0.8.2

Jan 24, 2023

0.8.1

Dec 1, 2022

0.8.0

Nov 22, 2022

0.7.0

Nov 7, 2022

0.6.3

Sep 22, 2022

0.6.2

Sep 16, 2022

0.6.1

Sep 9, 2022

0.6.0

Sep 7, 2022

0.5.3

Aug 30, 2022

0.5.2

Aug 17, 2022

0.5.1

Aug 16, 2022

0.5.0

Jul 7, 2022

0.4.1

May 19, 2022

0.4.0

May 13, 2022

0.3.2

May 3, 2022

0.3.1

Apr 11, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nannyml-0.13.1.tar.gz (22.6 MB view details)

Uploaded Jul 12, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

nannyml-0.13.1-py3-none-any.whl (23.0 MB view details)

Uploaded Jul 12, 2025 Python 3

File details

Details for the file nannyml-0.13.1.tar.gz.

File metadata

Download URL: nannyml-0.13.1.tar.gz
Upload date: Jul 12, 2025
Size: 22.6 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for nannyml-0.13.1.tar.gz
Algorithm	Hash digest
SHA256	`9a616e540790b464491e05e2711656545e0730d8856cdde0feffc669424380a5`
MD5	`ae4b5e0fda98082ea583525acc8d6648`
BLAKE2b-256	`e28206567ebe4a3be50d7bd849175398ab68bf7c57ad9372c1a6e056ed2ec538`

See more details on using hashes here.

File details

Details for the file nannyml-0.13.1-py3-none-any.whl.

File metadata

Download URL: nannyml-0.13.1-py3-none-any.whl
Upload date: Jul 12, 2025
Size: 23.0 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for nannyml-0.13.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`cf5f4a33489834ae6773cb1baa33a0f98c0a3ee6ed9b43049478b25ce22fe869`
MD5	`60bbe48712cc6e1033256884463dc8f9`
BLAKE2b-256	`0bebbcba1e1510c2b664e360bc3c2aee31d3bea875455e9a6b1a33556ca68982`

See more details on using hashes here.

nannyml 0.13.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

💡 What is NannyML?

☔ Why use NannyML?

🧠 GO DEEP

🔱 Features

1. Performance estimation and monitoring

2. Data drift detection

3. Intelligent alerting

🚀 Getting started

Install NannyML

Extras

Quick Start

📖 Documentation

🦸 Contributing and Community

🙋 Get help

🥷 Stay updated

📍 Roadmap

📝 Citing NannyML

Version 0.13.1

📄 License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes