A framework to define a machine learning pipeline

These details have not been verified by PyPI

Project links

Homepage

Environment
- Console
License
- OSI Approved :: MIT License
Natural Language
- English
Operating System
- POSIX :: Linux
Programming Language
- Python :: 3
Topic
- Scientific/Engineering
- Scientific/Engineering :: Artificial Intelligence

Project description

mlpipeline

This is a simple framework to organize you machine learning workflow. It automates most of the basic functionalities such as logging, a framework for testing models and gluing together different steps at different stages. This project came about as a result of me abstracting the boilerplate code and automating different parts of the process.

The aim of this simple framework is to consolidate the different sub-problems (such as loading data, model configurations, training process, evaluation process, exporting trained models, etc.) when working/researching with machine learning models. This allows the user to define how the different sub-problems are to be solved using their choice of tools and mlpipeline would handle piecing them together.

Core operations

This framework chains the different operations (sub-problems) depending on the mode it is executed in. mlpipeline currently has 3 modes:

TEST mode: When in TEST mode, it doesn't perform any logging or tracking. It creates a temporary empty directory for the experiment to store the artifacts of an experiment in. When developing and testing the different operations, this mode can be used.
RUN mode: In this mode, logging and tracking is performed. In addition, for each experiment run (referred to as a experiment version in mlpipeline) a directory is created for artifacts to be stored.
EXPORT mode: In this mode, the exporting related operations will be executed instead of the training/evaluation related operations.

In addition to providing different modes, the pipeline also supports logging and recording various details. Currently mlpipeline records all logs, metrics and artifacts using a basic log files as well using mlflow <https://github.com/databricks/mlflow>_.

The following information is recorded:

The scripts that were executed/imported in relation to an experiment.
The any output results
The metrics and parameters

Documentation

The documentation is hosted at ReadTheDocs <https://mlpipeline.readthedocs.io/>_.

Installing

Can be installed directly using the Python Package Index using pip::

pip install mlpipeline

Usage

work in progress

Project details

These details have not been verified by PyPI

Project links

Homepage

Environment
- Console
License
- OSI Approved :: MIT License
Natural Language
- English
Operating System
- POSIX :: Linux
Programming Language
- Python :: 3
Topic
- Scientific/Engineering
- Scientific/Engineering :: Artificial Intelligence

Release history Release notifications | RSS feed

This version

2.0a7.post1 pre-release

Dec 17, 2020

2.0a7 pre-release

Dec 13, 2020

2.0a6 pre-release

Apr 12, 2020

2.0a5 pre-release

Apr 12, 2020

2.0a4.post19 pre-release

Feb 20, 2020

2.0a4.post18 pre-release

Sep 16, 2019

2.0a4.post17 pre-release

Sep 12, 2019

2.0a4.post16 pre-release

Sep 2, 2019

2.0a4.post14 pre-release

Aug 15, 2019

2.0a4.post13 pre-release

Aug 15, 2019

2.0a4.post12 pre-release

Jul 30, 2019

2.0a4.post11 pre-release

Jul 28, 2019

2.0a4.post10 pre-release

Jul 28, 2019

2.0a4.post9 pre-release

Jul 28, 2019

2.0a4.post8 pre-release

Jul 28, 2019

2.0a4.post7 pre-release

Jul 28, 2019

2.0a4.post6 pre-release

Jul 27, 2019

2.0a4.post5 pre-release

Jul 25, 2019

2.0a4.post4 pre-release

Jul 25, 2019

2.0a4.post3 pre-release

Jul 24, 2019

2.0a4.post2 pre-release

Jul 24, 2019

2.0a4.post1 pre-release

Jul 24, 2019

2.0a4 pre-release

Jul 24, 2019

2.0a3.post2 pre-release

Jul 23, 2019

2.0a3.post1 pre-release

Jul 23, 2019

2.0a3 pre-release

Jul 15, 2019

2.0a2 pre-release

Jul 8, 2019

2.0a1 pre-release

Jun 18, 2019

1.1a3.post12 pre-release

Jun 12, 2019

1.1a3.post11 pre-release

Jun 9, 2019

1.1a3.post10 pre-release

Jun 9, 2019

1.1a3.post9 pre-release

Apr 24, 2019

1.1a3.post8 pre-release

Mar 4, 2019

1.1a3.post7 pre-release

Jan 8, 2019

1.1a3.post6 pre-release

Dec 3, 2018

1.1a3.post5 pre-release

Dec 3, 2018

1.1a3.post4 pre-release

Dec 3, 2018

1.1a3.post3 pre-release

Nov 29, 2018

1.1a3.post2 pre-release

Nov 17, 2018

1.1a3.post1 pre-release

Nov 15, 2018

1.1a3 pre-release

Nov 14, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mlpipeline-2.0a7.post1.tar.gz (26.6 kB view details)

Uploaded Dec 17, 2020 Source

File details

Details for the file mlpipeline-2.0a7.post1.tar.gz.

File metadata

Download URL: mlpipeline-2.0a7.post1.tar.gz
Upload date: Dec 17, 2020
Size: 26.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/50.3.2 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.8.6

File hashes

Hashes for mlpipeline-2.0a7.post1.tar.gz
Algorithm	Hash digest
SHA256	`10d0df79ff72987cf5825e0b94060c80e8c0e07baf5192bfd6b155daafdb0e2b`
MD5	`bd2f0c1a3439b74de7597fe5cbac857a`
BLAKE2b-256	`35e4b042535df64d8e4e95686878bf3e7e382019256d55ef894f2d1f39961bb5`

See more details on using hashes here.

mlpipeline 2.0a7.post1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

mlpipeline

Core operations

Documentation

Installing

Usage

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes