A lightweight ML orchestration library with preprocessing, anomaly detection, and explainability tools

cherrypick-ml: A Machine Learning Orchestration and Pipeline Toolkit

Testing: Structured validation of preprocessing, orchestration, and explainability components
Package: PyPI distribution for cherrypick-ml
Meta: MIT License, Python-based machine learning pipeline framework

What is it?

cherrypick-ml is a Python package that provides a unified interface for building, managing, and evaluating machine learning workflows. It integrates preprocessing, anomaly detection, model orchestration, and explainability into a single, modular framework.

The library is designed to simplify real-world machine learning development by reducing repetitive code while maintaining flexibility and transparency in model pipelines.


Main Features

cherrypick-ml provides the following core capabilities:

  • Automated model orchestration for classification and regression tasks
  • Integrated preprocessing utilities including encoding and missing value handling
  • Outlier detection and pruning using statistical methods such as interquartile range (IQR), Z-score, and modified Z-score, as well as Isolation Forest and Local Outlier Factor
  • SHAP-based explainability for feature importance and model interpretation
  • Flexible train-test splitting utilities
  • Modular design allowing independent usage of components
  • Designed for practical, real-world machine learning workflows
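To make the statistical outlier rules named above concrete, here is a minimal standard-library sketch of IQR-based and modified Z-score pruning. It illustrates the general techniques only and is not cherrypick-ml's API: the function names, the fence multiplier `k=1.5`, and the `3.5` threshold are assumptions chosen for illustration (both are common conventions).

```python
import statistics


def iqr_bounds(values, k=1.5):
    """Return the (lower, upper) Tukey fences for a sequence of numbers."""
    # quantiles(..., n=4) returns the three quartile cut points Q1, Q2, Q3.
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def prune_iqr(values, k=1.5):
    """Keep only the values that fall inside the Tukey fences."""
    lower, upper = iqr_bounds(values, k)
    return [v for v in values if lower <= v <= upper]


def modified_z_scores(values):
    """Score each value by its distance from the median, scaled by the MAD."""
    med = statistics.median(values)
    mad = statistics.median(abs(v - med) for v in values)
    if mad == 0:
        # Simplification for this sketch: with zero spread around the
        # median, treat every point as an inlier instead of dividing by 0.
        return [0.0 for _ in values]
    # 0.6745 rescales the MAD so scores are comparable to standard Z-scores.
    return [0.6745 * (v - med) / mad for v in values]


def prune_modified_z(values, threshold=3.5):
    """Keep only the values whose absolute modified Z-score is within threshold."""
    scores = modified_z_scores(values)
    return [v for v, s in zip(values, scores) if abs(s) <= threshold]


if __name__ == "__main__":
    sample = [10, 12, 11, 13, 12, 11, 95]
    print(prune_iqr(sample))
    print(prune_modified_z(sample))
```

Both rules rely on the median and quartiles rather than the mean and standard deviation, so a single extreme value does not distort the cut-offs used to detect it.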

Core Components

The library is structured into the following modules:

  • Orchestrator
    High-level interface for training, evaluating, and selecting models, with explainability visualisations

  • preprocessing
    Tools for encoding, imputation, and feature preparation

  • anomaly
    Outlier detection and data pruning utilities

  • explain
    Model explainability using SHAP-based analysis

  • splits
    Utilities for dataset partitioning
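As an illustration of what dataset partitioning involves, the sketch below performs a seeded, shuffled train-test split using only the standard library. The function name `split_rows` and its parameters are hypothetical and are not taken from the `splits` module; the sketch shows the general idea only.

```python
import random


def split_rows(rows, test_size=0.2, seed=None):
    """Split rows into (train, test) lists after a seeded shuffle.

    Hypothetical helper for illustration; not part of cherrypick-ml.
    """
    if not 0.0 < test_size < 1.0:
        raise ValueError("test_size must be strictly between 0 and 1")
    indices = list(range(len(rows)))
    # A dedicated Random instance keeps the split reproducible for a
    # given seed without touching the global random state.
    random.Random(seed).shuffle(indices)
    n_test = max(1, round(len(rows) * test_size))
    test_idx = set(indices[:n_test])
    # Rows keep their original relative order inside each partition.
    train = [row for i, row in enumerate(rows) if i not in test_idx]
    test = [row for i, row in enumerate(rows) if i in test_idx]
    return train, test


if __name__ == "__main__":
    train, test = split_rows(list(range(10)), test_size=0.3, seed=42)
    print(len(train), len(test))
```

Fixing the seed makes the partition repeatable, which matters when comparing several models on the same held-out data.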


Where to get it

The source code is currently hosted on GitHub at:

https://github.com/Sujal-G-Sanyasi/cherrypick-ml

Binary installers for the latest released version are available at the Python Package Index (PyPI):

pip install cherrypick-ml
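After installing, one way to confirm that the distribution is present is to query its metadata with the standard library. This is a small sketch: the helper name `installed_version` is made up for illustration, and it relies only on the distribution name `cherrypick-ml` shown above, not on the package's import name.

```python
from importlib import metadata


def installed_version(distribution="cherrypick-ml"):
    """Return the installed version string, or None if it is not installed."""
    try:
        return metadata.version(distribution)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_version()
    if version is None:
        print("cherrypick-ml is not installed in this environment")
    else:
        print(f"cherrypick-ml {version} is installed")
```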

Download files

Source Distribution

cherrypick_ml-0.1.2.tar.gz (16.7 kB)

Uploaded Source

Built Distribution

cherrypick_ml-0.1.2-py3-none-any.whl (16.8 kB)

Uploaded Python 3

File details

Details for the file cherrypick_ml-0.1.2.tar.gz.

File metadata

  • Download URL: cherrypick_ml-0.1.2.tar.gz
  • Upload date:
  • Size: 16.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for cherrypick_ml-0.1.2.tar.gz
Algorithm     Hash digest
SHA256        32c6004f7d9b67f31a85d6483aa1a4d76a6385d3b64e900bc0df03abf100a361
MD5           4ba3589ce8f0920d21e085dca961bd7c
BLAKE2b-256   e94fb1b5302b90d7b5c95d1c1416b58a38ff48e79e5e2f43d3bdf5b92d5866ec
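The published digests can be used to check a downloaded file before installing it. The following sketch computes a file's SHA256 with `hashlib` and compares it to the value listed for the source distribution. The helper names and the assumption that the archive sits in the current directory are illustrative.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for cherrypick_ml-0.1.2.tar.gz.
EXPECTED_SHA256 = "32c6004f7d9b67f31a85d6483aa1a4d76a6385d3b64e900bc0df03abf100a361"


def file_sha256(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """Return True if the file's SHA256 equals the expected hex digest."""
    return file_sha256(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Assumed location: the archive was downloaded into the working directory.
    archive = Path("cherrypick_ml-0.1.2.tar.gz")
    if not archive.exists():
        print(f"{archive} not found; download it first")
    elif matches_expected(archive, EXPECTED_SHA256):
        print("SHA256 matches the published digest")
    else:
        print("SHA256 does NOT match the published digest")
```

Reading in chunks keeps memory use constant regardless of file size. The same approach works for the wheel by substituting its file name and SHA256 digest.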

File details

Details for the file cherrypick_ml-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: cherrypick_ml-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 16.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for cherrypick_ml-0.1.2-py3-none-any.whl
Algorithm     Hash digest
SHA256        2f1d82640b24d0b249ad38a65c97432aeccf6521cb9e3b5f8880f8da3a6e1587
MD5           96bbad323805da5689d71d4eda7bbfb1
BLAKE2b-256   e13e3d42a32b24b6f34d29df075db49ac6de751cc3bb00e86dd05781b6212bed