Skip to main content

OptimalFlow is a high-level API to help data scientists building models in ensemble way, and automating Machine Learning workflow with simple code.

Project description

OptimalFlow

PyPI Latest Release Github Issues License Last Commit Python Version

Author: Tony Dong

OptimalFlow is a high-level API toolkit to help data scientists building models in ensemble way, and automating Machine Learning workflow with simple codes.

Comparing other popular "AutoML or Automatic Machine Learning" APIs, OptimalFlow is designed as an omni-ensembled ML workflow optimizer with higher-level API targeting to avoid manual repetitive train-along-evaluate experiments in general pipeline building.

To achieve that, OptimalFlow applies Pipeline Cluster Traversal Experiments algorithm to assemble all cross-matching pipelines covering major tasks of Machine Learning workflow, and apply traversal-experiment to search the optimal model. Besides, by modularizing all key pipeline components in reuseable packages, it allows all components to be custom tunable along with high scalability.

The core concept in OptimalFlow is Pipeline Cluster Traversal Experiments, which is a theory, first raised by Tony Dong during Genpact 2020 GVector Conference, to optimize and automate Machine Learning Workflow using ensemble pipelines algorithm.

Comparing other automatic or classic machine learning workflow's repetitive experiments using single pipeline, Pipeline Cluster Traversal Experiments is more powerful, with larger coverage scope, to find the best model without manual intervention, and also more flexible with elasticity to cope with unseen data due to its ensemble designs in each component.

Documentation: https://optimal-flow.readthedocs.io/

Installation

pip install optimalflow

License:

MIT (C)Tony Dong

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

optimalflow-0.1.0.tar.gz (21.3 kB view hashes)

Uploaded Source

Built Distribution

optimalflow-0.1.0-py3-none-any.whl (27.4 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page