Write maintainable, production-ready pipelines using Jupyter or your favorite text editor. Develop locally, deploy to the cloud.
Project description
Join our community | Newsletter | Contact us | Docs | Blog | Website | YouTube
Ploomber is the fastest way to build data pipelines. Use your favorite editor (Jupyter, VSCode, PyCharm) to develop interactively and deploy without code changes (Kubernetes, Airflow, AWS Batch, and SLURM). Do you have legacy notebooks? Refactor them into modular pipelines with a single command.
Installation
Compatible with Python 3.6 and higher.
Install with pip
:
pip install ploomber
Or with conda
:
conda install ploomber -c conda-forge
Getting started
Use Binder to try out Ploomber without setting up an environment:
Or run an example locally:
# ML pipeline example
ploomber examples -n templates/ml-basic -o ml-basic
cd ml-basic
# if using pip
pip install -r requirements.txt
# if using conda
conda env create --file environment.yml
conda activate ml-basic
# run pipeline
ploomber build
Pipeline output saved in the output/
folder. Check out the pipeline definition
in the pipeline.yaml
file.
To get a list of examples, run ploomber examples
.
Click here to go to our examples repository.
Community
Main Features
⚡️ Get started quickly
A simple YAML API to get started quickly, a powerful Python API for total flexibility.
https://user-images.githubusercontent.com/989250/150660813-fc289c6c-0ed5-432d-b6df-063ce98c0093.mp4
⏱ Shorter development cycles
Automatically cache your pipeline’s previous results and only re-compute tasks that have changed since your last execution.
https://user-images.githubusercontent.com/989250/150660820-9a3a0abd-5904-492b-97ff-5494285dfebf.mp4
☁️ Deploy anywhere
Run as a shell script in a single machine or distributively in Kubernetes, Airflow, AWS Batch, or SLURM.
https://user-images.githubusercontent.com/989250/150660830-3f81c9a2-5392-49e5-976d-cb8a38441ecb.mp4
📙 Automated migration from legacy notebooks
Bring your old monolithic notebooks, and we’ll automatically convert them into maintainable, modular pipelines.
https://user-images.githubusercontent.com/989250/150660840-b0c12f85-504c-4233-8c3d-6724d291f1aa.mp4
Resources
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for ploomber-0.14.7-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2497fe1ddf745e33d40bde79172a2f461ad5e0c782b54fd487a6efa8f29c1ad4 |
|
MD5 | 5e6fb7e383e38b9aa5612d4e243c2083 |
|
BLAKE2b-256 | 073f051bbad72397aae852746c25e61c8d8be9e68e297c4ff26dc9b0096cc309 |