A lightweight ML orchestration library with preprocessing, anomaly detection, and explainability tools

cherrypick-ml: A Machine Learning Orchestration and Pipeline Toolkit

Testing: Structured validation of preprocessing, orchestration, and explainability components
Package: PyPI distribution for cherrypick-ml
Meta: MIT License, Python-based machine learning pipeline framework

What is it?

cherrypick-ml is a Python package that provides a unified interface for building, managing, and evaluating machine learning workflows. It integrates preprocessing, anomaly detection, model orchestration, and explainability into a single, modular framework.

The library is designed to simplify real-world machine learning development by reducing repetitive code while maintaining flexibility and transparency in model pipelines.


Main Features

cherrypick-ml provides the following core capabilities:

  • Automated model orchestration for classification and regression tasks
  • Integrated preprocessing utilities including encoding and missing value handling
  • Outlier detection and pruning using statistical methods such as the interquartile range (IQR), Z-score, and modified Z-score, as well as Isolation Forest and Local Outlier Factor
  • SHAP-based explainability for feature importance and model interpretation
  • Flexible train-test splitting utilities
  • Modular design allowing independent usage of components
  • Designed for practical, real-world machine learning workflows
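The statistical outlier rules named in the list above can be sketched in a few lines. The following is an illustration only, written against the Python standard library; the function names, default thresholds, and return types are assumptions made for this example and are not cherrypick-ml's API.

```python
import statistics


def iqr_mask(values, k=1.5):
    """Return True for each value inside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [low <= v <= high for v in values]


def zscore_mask(values, threshold=3.0):
    """Return True for each value whose |z-score| is within the threshold."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)  # population standard deviation
    if std == 0:
        return [True] * len(values)  # constant data: nothing to flag
    return [abs(v - mean) / std <= threshold for v in values]


def modified_zscore_mask(values, threshold=3.5):
    """Return True for each value whose modified z-score is within the threshold.

    Uses the median and the median absolute deviation (MAD), which are
    less affected by extreme values than the mean and standard deviation.
    """
    median = statistics.median(values)
    mad = statistics.median([abs(v - median) for v in values])
    if mad == 0:
        return [True] * len(values)  # MAD of zero: rule is undefined, keep all
    return [abs(0.6745 * (v - median) / mad) <= threshold for v in values]


def prune(values, mask):
    """Keep only the values whose mask entry is True."""
    return [v for v, keep in zip(values, mask) if keep]


sample = [10, 12, 11, 13, 12, 11, 10, 95]
pruned = prune(sample, iqr_mask(sample))
```

On this eight-point sample, the IQR and modified Z-score rules flag the value 95, while the plain Z-score rule at a threshold of 3 keeps every point, because the outlier itself inflates the mean and standard deviation. This is one reason a toolkit may offer several rules rather than one.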

Core Components

The library is structured into the following modules:

  • Orchestrator
    High-level interface for training, evaluating, and selecting models, with explainability visualisations

  • preprocessing
    Tools for encoding, imputation, and feature preparation

  • anomaly
    Outlier detection and data pruning utilities

  • explain
    Model explainability using SHAP-based analysis

  • splits
    Utilities for dataset partitioning
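As an illustration of what a dataset partitioning utility does, here is a minimal sketch of a seeded, shuffled train-test split over row indices. It uses only the standard library; the function name `train_test_split_indices` and its parameters are hypothetical and do not describe the interface of the splits module.

```python
import random


def train_test_split_indices(n_rows, test_size=0.2, seed=0):
    """Split row indices 0..n_rows-1 into shuffled train and test lists.

    test_size is the fraction of rows placed in the test set.
    The same seed always produces the same split.
    """
    if n_rows < 2:
        raise ValueError("need at least two rows to split")
    if not 0.0 < test_size < 1.0:
        raise ValueError("test_size must be strictly between 0 and 1")

    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)  # local RNG, global state untouched

    # Keep at least one row on each side of the split.
    n_test = min(n_rows - 1, max(1, round(n_rows * test_size)))
    return indices[n_test:], indices[:n_test]


train_idx, test_idx = train_test_split_indices(10, test_size=0.2, seed=42)
```

Returning indices rather than copies of the data keeps the sketch independent of the container type, so the same split can be applied to a feature table and its target column.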


Where to get it

The source code is currently hosted on GitHub at:

https://github.com/Sujal-G-Sanyasi/cherrypick-ml

The latest released version is available from the Python Package Index (PyPI) as a source distribution and a pure-Python wheel:

pip install cherrypick-ml
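After installing, one way to confirm that the distribution is present is to query its metadata from Python. This sketch relies only on the standard library's `importlib.metadata`; the helper name `installed_version` is made up for this example, and the distribution name is taken from the pip command above.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name="cherrypick-ml"):
    """Return the installed version string, or None if it is not installed."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


found = installed_version()
if found is None:
    print("cherrypick-ml is not installed in this environment")
else:
    print(f"cherrypick-ml {found} is installed")
```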

Download files

Source Distribution

cherrypick_ml-0.1.1.tar.gz (16.5 kB)

  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.0
  • SHA256: ff8ca4d4bbb091f1f6264a69c9510f254ac57c062757f77c34781af34fe869f4
  • MD5: 61214a8ae791e8d896c2210f6fa9788c
  • BLAKE2b-256: 6bbfcbec1d8e993ad1d9d233c42794daa96f2b87bf83e44a41ee495b0cdd54cd

Built Distribution

cherrypick_ml-0.1.1-py3-none-any.whl (16.7 kB)

  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.0
  • SHA256: b1a926fa2385f09da5af9b0b0d6e1184fd5b8ecf3b27e566b2ccd369e71d9ebf
  • MD5: d6460550f35de99a95fe7f67efe1f211
  • BLAKE2b-256: 84f63e338d272df3bcece5af8fca744f340df72b289fea6c46fe712fea196048