Skip to main content

A lightweight ML orchestration library with preprocessing, anomaly detection, and explainability tools

Project description

cherrypick-ml logo


cherrypick-ml: A Machine Learning Orchestration and Pipeline Toolkit

Testing Structured validation of preprocessing, orchestration, and explainability components
Package PyPI distribution for cherrypick-ml
Meta MIT License, Python-based machine learning pipeline framework

What is it?

cherrypick-ml is a Python package that provides a unified interface for building, managing, and evaluating machine learning workflows. It integrates preprocessing, anomaly detection, model orchestration, and explainability into a single, modular framework.

The library is designed to simplify real-world machine learning development by reducing repetitive code while maintaining flexibility and transparency in model pipelines.


Table of Contents


Main Features

cherrypick-ml provides the following core capabilities:

  • Automated model orchestration for classification and regression tasks
  • Integrated preprocessing utilities including encoding and missing value handling
  • Outlier detection using statistical method such as Inter quartile range(IQR), Z-score, modified Z-score, Isolation Forest and Local Outlier Factor based outlier pruning
  • SHAP-based explainability for feature importance and model interpretation
  • Flexible train-test splitting utilities
  • Modular design allowing independent usage of components
  • Designed for practical, real-world machine learning workflows

Core Components

The library is structured into the following modules:

  • Orchestrator
    High-level interface for training, evaluating, and selecting models with explainable visualisation

  • preprocessing
    Tools for encoding, imputation, and feature preparation

  • anomaly
    Outlier detection and data pruning utilities

  • explain
    Model explainability using SHAP-based analysis

  • splits
    Utilities for dataset partitioning


Where to get it

The source code is currently hosted on GitHub at:

https://github.com/Sujal-G-Sanyasi/cherrypick-ml

Binary installers for the latest released version are available at the Python Package Index (PyPI):

pip install cherrypick-ml

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cherrypick_ml-0.1.3.tar.gz (16.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cherrypick_ml-0.1.3-py3-none-any.whl (16.9 kB view details)

Uploaded Python 3

File details

Details for the file cherrypick_ml-0.1.3.tar.gz.

File metadata

  • Download URL: cherrypick_ml-0.1.3.tar.gz
  • Upload date:
  • Size: 16.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for cherrypick_ml-0.1.3.tar.gz
Algorithm Hash digest
SHA256 34e295e2bded14e36cc5c5d8ce1d0d10ab47888f2288aff9f7d2f6184bb42bf0
MD5 540ccb063a18175204bab9173dc07054
BLAKE2b-256 78c3a6285d1df95eefd7388d9e90577bf78d8772ca8d5e49d996db2464089bce

See more details on using hashes here.

File details

Details for the file cherrypick_ml-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: cherrypick_ml-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 16.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for cherrypick_ml-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 f00bcd2c11c4359cc64322516c8a92d6df1b2ff853d3ba00756d74212d0492c2
MD5 107cd665d01eb30d522265c9b03ed114
BLAKE2b-256 a7c721b7bd4dc3e526dbe0e1ad20e3ec6f821d0a9803c85925cd78b7ad4ffdcf

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page