A framework to define a machine learning pipeline

These details have not been verified by PyPI

Project links

Homepage

Environment
- Console
License
- OSI Approved :: MIT License
Natural Language
- English
Operating System
- POSIX :: Linux
Programming Language
- Python :: 3
Topic
- Scientific/Engineering
- Scientific/Engineering :: Artificial Intelligence

Project description

mlpipeline

This is a simple frawork to organize you machine learning workflow. It automates most of the basic functionalities such as logging, a framework for testing models and gluing together different steps at different stages. This project came about as a result of me abstracting the boilerplate code and automating different parts of the process.

The aim of this simple framework is to consolidate the different sub-problems (such as loading data, model configurations, training process, evalutaion process, exporting trained models, etc.) when working/researching with machine learning models. This allows the user to define how the different sub-problems are to be solved using their choice of tools and mlpipeline would handle piecing them together.

Core operations

This framework chains the different operations (sub-problems) depending on the mode it is executed in. mlpipeline currently has 3 modes:

TEST mode: When in TEST mode, it doesn't perform any logging or tracking. It creates a temporory empty directory for the experiment to store the artifacts of an experiment in. When developing and testing the different operations, this mode can be used.
RUN mode: In this mode, logging and tracking is performed. In addition, for each experiment run (refered to as a experiment version in mlpipeline) a directory is created for artifacts to be stored.
EXPORT mode: In this mode, the exporting related operations will be executed instead of the training/evaluation related operations.

In addition to providing different modes, the pipeline also supports logging and recording various details. Currently mlpipeline records all logs, metrics and artifacts using a bacis log files as well using mlflow <https://github.com/databricks/mlflow>_.

The following information is recorded:

The scripts that were executed/impoerted in relation to an experiment.
The any output results
The metrics and parameters

Documentation

The documentation is hosted at ReadTheDocs <https://mlpipeline.readthedocs.io/>_.

Installing

Can be installed directly using the Python Package Index using pip: pip install mlpipeline

Usage

work in progress

Project details

These details have not been verified by PyPI

Project links

Homepage

Environment
- Console
License
- OSI Approved :: MIT License
Natural Language
- English
Operating System
- POSIX :: Linux
Programming Language
- Python :: 3
Topic
- Scientific/Engineering
- Scientific/Engineering :: Artificial Intelligence

Release history Release notifications | RSS feed

2.0a7.post1 pre-release

Dec 17, 2020

2.0a7 pre-release

Dec 13, 2020

2.0a6 pre-release

Apr 12, 2020

2.0a5 pre-release

Apr 12, 2020

2.0a4.post19 pre-release

Feb 20, 2020

2.0a4.post18 pre-release

Sep 16, 2019

2.0a4.post17 pre-release

Sep 12, 2019

2.0a4.post16 pre-release

Sep 2, 2019

2.0a4.post14 pre-release

Aug 15, 2019

2.0a4.post13 pre-release

Aug 15, 2019

2.0a4.post12 pre-release

Jul 30, 2019

2.0a4.post11 pre-release

Jul 28, 2019

2.0a4.post10 pre-release

Jul 28, 2019

2.0a4.post9 pre-release

Jul 28, 2019

2.0a4.post8 pre-release

Jul 28, 2019

2.0a4.post7 pre-release

Jul 28, 2019

2.0a4.post6 pre-release

Jul 27, 2019

2.0a4.post5 pre-release

Jul 25, 2019

2.0a4.post4 pre-release

Jul 25, 2019

2.0a4.post3 pre-release

Jul 24, 2019

2.0a4.post2 pre-release

Jul 24, 2019

2.0a4.post1 pre-release

Jul 24, 2019

2.0a4 pre-release

Jul 24, 2019

2.0a3.post2 pre-release

Jul 23, 2019

2.0a3.post1 pre-release

Jul 23, 2019

This version

2.0a3 pre-release

Jul 15, 2019

2.0a2 pre-release

Jul 8, 2019

2.0a1 pre-release

Jun 18, 2019

1.1a3.post12 pre-release

Jun 12, 2019

1.1a3.post11 pre-release

Jun 9, 2019

1.1a3.post10 pre-release

Jun 9, 2019

1.1a3.post9 pre-release

Apr 24, 2019

1.1a3.post8 pre-release

Mar 4, 2019

1.1a3.post7 pre-release

Jan 8, 2019

1.1a3.post6 pre-release

Dec 3, 2018

1.1a3.post5 pre-release

Dec 3, 2018

1.1a3.post4 pre-release

Dec 3, 2018

1.1a3.post3 pre-release

Nov 29, 2018

1.1a3.post2 pre-release

Nov 17, 2018

1.1a3.post1 pre-release

Nov 15, 2018

1.1a3 pre-release

Nov 14, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mlpipeline-2.0a3.tar.gz (20.2 kB view hashes)

Uploaded Jul 15, 2019 Source

Built Distribution

mlpipeline-2.0a3-py3-none-any.whl (24.3 kB view hashes)

Uploaded Jul 15, 2019 Python 3

Hashes for mlpipeline-2.0a3.tar.gz

Hashes for mlpipeline-2.0a3.tar.gz
Algorithm	Hash digest
SHA256	`e89275b895c507be254b6536e358bd4d759df1b633efd6b5e8e0f383bb581c48`
MD5	`f0861072ff2fd40b31b332cfe5fd4262`
BLAKE2b-256	`21c622307eaa5a471bf83c569da91cd67df2eb13ea590628383f66b0ab277d4a`

Hashes for mlpipeline-2.0a3-py3-none-any.whl

Hashes for mlpipeline-2.0a3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6095af5ddeb96c3a48959751ff118c1e99952db2896167cab9bbf952cc8d0283`
MD5	`a7fa4ca2b598bfdbb0823734c05cff0f`
BLAKE2b-256	`ebaf826649ad62e11921c1f2bdb94c2c7fa38f5f91083dd8cccbfd4d3be41f5e`