Skip to main content

A lightweight ML orchestration library with preprocessing, anomaly detection, and explainability tools

Project description

cherrypick-ml logo


cherrypick-ml: A Machine Learning Orchestration and Pipeline Toolkit

Testing Structured validation of preprocessing, orchestration, and explainability components
Package PyPI distribution for cherrypick-ml
Meta MIT License, Python-based machine learning pipeline framework

What is it?

cherrypick-ml is a Python package that provides a unified interface for building, managing, and evaluating machine learning workflows. It integrates preprocessing, anomaly detection, model orchestration, and explainability into a single, modular framework.

The library is designed to simplify real-world machine learning development by reducing repetitive code while maintaining flexibility and transparency in model pipelines.


Table of Contents


Main Features

cherrypick-ml provides the following core capabilities:

  • Automated model orchestration for classification and regression tasks
  • Integrated preprocessing utilities including encoding and missing value handling
  • Outlier detection using statistical method such as Inter quartile range(IQR), Z-score, modified Z-score, Isolation Forest and Local Outlier Factor based outlier pruning
  • SHAP-based explainability for feature importance and model interpretation
  • Flexible train-test splitting utilities
  • Modular design allowing independent usage of components
  • Designed for practical, real-world machine learning workflows

Core Components

The library is structured into the following modules:

  • Orchestrator
    High-level interface for training, evaluating, and selecting models with explainable visualisation

  • preprocessing
    Tools for encoding, imputation, and feature preparation

  • anomaly
    Outlier detection and data pruning utilities

  • explain
    Model explainability using SHAP-based analysis

  • splits
    Utilities for dataset partitioning


Where to get it

The source code is currently hosted on GitHub at:

https://github.com/Sujal-G-Sanyasi/cherrypick-ml

Binary installers for the latest released version are available at the Python Package Index (PyPI):

pip install cherrypick-ml

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cherrypick_ml-0.1.0.tar.gz (16.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cherrypick_ml-0.1.0-py3-none-any.whl (16.7 kB view details)

Uploaded Python 3

File details

Details for the file cherrypick_ml-0.1.0.tar.gz.

File metadata

  • Download URL: cherrypick_ml-0.1.0.tar.gz
  • Upload date:
  • Size: 16.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for cherrypick_ml-0.1.0.tar.gz
Algorithm Hash digest
SHA256 456170403097612ecc717c1dee0cc7d5b191b2f936f4dc7385d971f26cd67b2b
MD5 9b09f6fc40411062de2dde66427c073a
BLAKE2b-256 a7a8b50eb874fef634de382b6f133c8f763f07ac4a7bf661098589647ea07574

See more details on using hashes here.

File details

Details for the file cherrypick_ml-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: cherrypick_ml-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 16.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for cherrypick_ml-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 09be060e7c016270a76722e362e3d653f0e2358f83ccbdc30d7a6bc6542a6c64
MD5 087879c82e26d762eba5b50970f326a4
BLAKE2b-256 9acd4b38fee77a71446ab74967c9ac57a16ef092c31188fc66378fad4fba37cc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page