A lightweight ML orchestration library with preprocessing, anomaly detection, and explainability tools
Project description
cherrypick-ml: A Machine Learning Orchestration and Pipeline Toolkit
| Testing | Structured validation of preprocessing, orchestration, and explainability components |
| Package | PyPI distribution for cherrypick-ml |
| Meta | MIT License, Python-based machine learning pipeline framework |
What is it?
cherrypick-ml is a Python package that provides a unified interface for building, managing, and evaluating machine learning workflows. It integrates preprocessing, anomaly detection, model orchestration, and explainability into a single, modular framework.
The library is designed to simplify real-world machine learning development by reducing repetitive code while maintaining flexibility and transparency in model pipelines.
Table of Contents
- Main Features
- Core Components
- Where to get it
- Dependencies
- Installation from sources
- Basic Usage
- License
- Documentation
Main Features
cherrypick-ml provides the following core capabilities:
- Automated model orchestration for classification and regression tasks
- Integrated preprocessing utilities including encoding and missing value handling
- Outlier detection using statistical method such as Inter quartile range(IQR), Z-score, modified Z-score, Isolation Forest and Local Outlier Factor based outlier pruning
- SHAP-based explainability for feature importance and model interpretation
- Flexible train-test splitting utilities
- Modular design allowing independent usage of components
- Designed for practical, real-world machine learning workflows
Core Components
The library is structured into the following modules:
-
Orchestrator
High-level interface for training, evaluating, and selecting models with explainable visualisation -
preprocessing
Tools for encoding, imputation, and feature preparation -
anomaly
Outlier detection and data pruning utilities -
explain
Model explainability using SHAP-based analysis -
splits
Utilities for dataset partitioning
Where to get it
The source code is currently hosted on GitHub at:
https://github.com/Sujal-G-Sanyasi/cherrypick-ml
Binary installers for the latest released version are available at the Python Package Index (PyPI):
pip install cherrypick-ml
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file cherrypick_ml-0.1.3.tar.gz.
File metadata
- Download URL: cherrypick_ml-0.1.3.tar.gz
- Upload date:
- Size: 16.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
34e295e2bded14e36cc5c5d8ce1d0d10ab47888f2288aff9f7d2f6184bb42bf0
|
|
| MD5 |
540ccb063a18175204bab9173dc07054
|
|
| BLAKE2b-256 |
78c3a6285d1df95eefd7388d9e90577bf78d8772ca8d5e49d996db2464089bce
|
File details
Details for the file cherrypick_ml-0.1.3-py3-none-any.whl.
File metadata
- Download URL: cherrypick_ml-0.1.3-py3-none-any.whl
- Upload date:
- Size: 16.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f00bcd2c11c4359cc64322516c8a92d6df1b2ff853d3ba00756d74212d0492c2
|
|
| MD5 |
107cd665d01eb30d522265c9b03ed114
|
|
| BLAKE2b-256 |
a7c721b7bd4dc3e526dbe0e1ad20e3ec6f821d0a9803c85925cd78b7ad4ffdcf
|