
ZenML: Write production-ready ML code.


Join our Slack Community and become part of the ZenML family
Have questions? Join our weekly Slack community hour and talk to us directly
Give us a GitHub star to show your love

Why?

Ichi Wa Zen, Zen Wa Ichi.

ZenML is built for ML practitioners who are ramping up their ML workflows towards production. We built ZenML because we could not find an easy framework that translates the patterns observed in the research phase with Jupyter notebooks into a production-ready ML environment. Here is what's hard to replicate in production:

  • It's hard to version data, code, configuration, and models.
  • It's difficult to reproduce experiments across environments.
  • There is no gold standard for organizing ML code and managing technical debt as complexity grows.
  • It's a struggle to establish a reliable link between training and deployment.
  • It's arduous to track metadata and artifacts that are produced.

ZenML is not here to replace the great tools that solve the individual problems above. Rather, it uses them as integrations to expose a coherent, simple path to getting any ML model in production.

What is ZenML?

ZenML is an extensible, open-source MLOps framework for creating production-ready Machine Learning pipelines - in a simple way.

A user of ZenML is asked to break down their ML development into individual Steps, each representing an individual task in the ML development process. A sequence of steps put together is a Pipeline. Each pipeline contains a Datasource, which represents a snapshot of a versioned dataset in time. Lastly, every pipeline (and indeed almost every step) can run in Backends, that specify how and where a step is executed.

By developing in pipelines, ML practitioners give themselves a platform to transition from research to production from the very beginning, and are also helped in the research phase by the powerful automations introduced by ZenML.
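To make the Step/Pipeline idea above concrete, here is a minimal, framework-free Python sketch of the pattern: each step is a named task, and a pipeline runs them in order, feeding each step's output into the next. Note that this is an illustration of the concept only, not the actual ZenML API.

```python
# Illustration of the step -> pipeline pattern (not the real ZenML API).
class Step:
    """A single task in the ML workflow, e.g. splitting or training."""
    def __init__(self, name, fn):
        self.name = name
        self.fn = fn

class Pipeline:
    """An ordered sequence of steps; each step consumes the previous output."""
    def __init__(self, name):
        self.name = name
        self.steps = []

    def add_step(self, step):
        self.steps.append(step)
        return self  # allow chaining, like ZenML's add_* methods

    def run(self, data):
        for step in self.steps:
            data = step.fn(data)
        return data

pipeline = Pipeline("toy")
pipeline.add_step(Step("double", lambda xs: [x * 2 for x in xs]))
pipeline.add_step(Step("total", sum))
print(pipeline.run([1, 2, 3]))  # 12
```

In real ZenML pipelines the steps are richer objects (splits, preprocessers, trainers, evaluators), and datasources and backends take care of versioning and execution, but the mental model is the same.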

Quickstart

The quickest way to get started is to create a simple pipeline. The dataset used here is the Pima Indians Diabetes Dataset (originally from the National Institute of Diabetes and Digestive and Kidney Diseases)

Step 0: Installation

ZenML is available for easy installation into your environment via PyPI:

pip install zenml

Alternatively, if you’re feeling brave, feel free to install the bleeding edge. NOTE: Do so at your own risk; no guarantees given!

pip install git+https://github.com/maiot-io/zenml.git@main --upgrade

Step 1: Initialize a ZenML repo from within a git repo

zenml init

Step 2: Assemble, run and evaluate your pipeline locally

from zenml.datasources import CSVDatasource
from zenml.pipelines import TrainingPipeline
from zenml.steps.evaluator import TFMAEvaluator
from zenml.steps.split import RandomSplit
from zenml.steps.preprocesser import StandardPreprocesser
from zenml.steps.trainer import TFFeedForwardTrainer

training_pipeline = TrainingPipeline(name='Quickstart')

# Add a datasource. This will automatically track and version it.
ds = CSVDatasource(name='Pima Indians Diabetes Dataset',
                   path='gs://zenml_quickstart/diabetes.csv')
training_pipeline.add_datasource(ds)

# Add a random 70/20/10 train-eval-test split
training_pipeline.add_split(RandomSplit(split_map={'train': 0.7, 
                                                   'eval': 0.2,
                                                   'test': 0.1}))

# StandardPreprocesser() has sane defaults for normal preprocessing methods
training_pipeline.add_preprocesser(
    StandardPreprocesser(
        features=['times_pregnant', 'pgc', 'dbp', 'tst', 
                  'insulin', 'bmi', 'pedigree', 'age'],
        labels=['has_diabetes'],
        overwrite={'has_diabetes': {
            'transform': [{'method': 'no_transform', 'parameters': {}}]}}
    ))

# Add a trainer
training_pipeline.add_trainer(TFFeedForwardTrainer(
    loss='binary_crossentropy',
    last_activation='sigmoid',
    output_units=1,
    metrics=['accuracy'],
    epochs=20))

# Add an evaluator
training_pipeline.add_evaluator(
    TFMAEvaluator(slices=[['has_diabetes']],
                  metrics={'has_diabetes': ['binary_crossentropy',
                                            'binary_accuracy']}))

# Run the pipeline locally
training_pipeline.run()

While the above is great to get a quick flavor of ZenML, a more practical way to start is to follow our guide to convert your legacy codebase into ZenML code here.

Leverage powerful integrations

Once code is organized into a ZenML pipeline, you can supercharge your ML development through powerful integrations. Some of the benefits you get are:

Work locally but switch seamlessly to the cloud

Switching from local experiments to cloud-based pipelines doesn't need to be complex.

From local to cloud with one parameter
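As a toy illustration of the "one parameter" idea, here is a plain-Python sketch (again, not the ZenML API): the pipeline code stays unchanged, and only the backend object passed in decides where execution happens. The `LocalBackend`/`CloudBackend` names here are hypothetical stand-ins for ZenML's real backends.

```python
# Sketch: the same pipeline code runs anywhere; only the backend differs.
class LocalBackend:
    def execute(self, fn, data):
        return fn(data)  # runs in the current process

class CloudBackend:
    def __init__(self, project):
        self.project = project

    def execute(self, fn, data):
        # A real cloud backend would submit the job remotely;
        # here we just tag the result with the target project.
        return ("submitted to " + self.project, fn(data))

def run_pipeline(data, backend=LocalBackend()):
    return backend.execute(lambda d: sum(d), data)

print(run_pipeline([1, 2, 3]))                          # 6
print(run_pipeline([1, 2, 3], CloudBackend("my-gcp")))  # ('submitted to my-gcp', 6)
```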

Versioning galore

For every pipeline, ZenML makes sure you can trust that:

✅ Code is versioned
✅ Data is versioned
✅ Models are versioned
✅ Configurations are versioned
ZenML declarative config

Automatically detect schema

# See the schema of your data
training_pipeline.view_schema()

Automatic schema detection

View statistics

# See statistics of train and eval
training_pipeline.view_statistics()
ZenML statistics visualization

Evaluate the model using built-in evaluators

# Creates a notebook for evaluation
training_pipeline.evaluate()
Tensorboard built-in

Compare training pipelines

repo.compare_training_runs()

ZenML built-in pipeline comparison

Distribute preprocessing to the cloud

Leverage distributed compute powered by Apache Beam:

training_pipeline.add_preprocesser(
    StandardPreprocesser(...).with_backend(
      ProcessingDataFlowBackend(
        project=GCP_PROJECT,
        num_workers=10,
    ))
)
ZenML distributed processing

Train on spot instances

Easily train on spot instances to save up to 80% in cost.

training_pipeline.run(
  OrchestratorGCPBackend(
    preemptible=True,  # reduce costs by using preemptible instances
    machine_type='n1-standard-4',
    gpu='nvidia-tesla-k80',
    gpu_count=1,
    ...
  )
  ...
)

Deploy models automatically

Automatically deploy each model with powerful Deployment integrations like Cortex.

training_pipeline.add_deployment(
    CortexDeployer(
        api_spec=api_spec,
        predictor=PythonPredictor,
    )
)

The best part is that ZenML is easily extensible and can be molded to your use case. You can add your own custom logic, or open a PR and contribute to the ZenML community so that everyone can benefit.

Community

Our community is the backbone of making ZenML a success! We are currently actively maintaining two main channels for community discussions:

  • Our Slack Channel: Chat with us here.
  • The GitHub Community: Create your first thread here.

From March 23, 2021 onwards, we are hosting a weekly community hour with the entire ZenML fam. Come talk to us about ZenML (or whatever else tickles your fancy)! Community hour happens on Wednesdays at 5 PM GMT+2. Register in advance here to join.

Contributing

We would love to receive your contributions! Check our Contributing Guide for more details on how to contribute best.

Copyright

ZenML is distributed under the terms of the Apache License Version 2.0. A complete version of the license is available in the LICENSE.md in this repository.

Any contribution made to this project will be licensed under the Apache License Version 2.0.

Credit

ZenML is built on the shoulders of giants: we leverage, and would like to give credit to, existing open-source libraries like TFX. The goal of our framework is neither to replace these libraries nor to diminish their usage. ZenML is simply an opinionated, higher-level interface with a focus purely on ease of use and coherent, intuitive design. You can read more about why we started building ZenML at our blog.
