Code-First Pipelines
A framework built on top of Ploomber that allows code-first definition of pipelines. No YAML needed!
Installation
TBA
Usage
ML Pipelines
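import pandas as pd
from cf_pipelines.ml import MLPipeline

my_pipeline = MLPipeline("My Cool Pipeline")

# Register the function below as the pipeline's data ingestion step.
@my_pipeline.data_ingestion
def data_ingestion():
    input_data = pd.read_csv('input_data.csv')
    # Keep only the rows where age is greater than 18.
    adult_data = input_data[input_data['age'] > 18]
    # The returned dict maps an output name to the resulting DataFrame.
    return {'adult_data.csv': adult_data}

my_pipeline.run()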
See the tutorial notebook for a more comprehensive example.
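To illustrate the decorator pattern that the example above relies on, here is a minimal, standard-library-only sketch of how code-first step registration can work in general. `TinyPipeline` and its `step` decorator are hypothetical names invented for this illustration; they are not part of this package or of Ploomber, and nothing here reflects how the framework is actually implemented. Plain dicts stand in for the rows of a DataFrame.

```python
from typing import Any, Callable, Dict, List


class TinyPipeline:
    """Hypothetical stand-in that mimics decorator-based step registration."""

    def __init__(self, name: str) -> None:
        self.name = name
        self._steps: List[Callable[[], Dict[str, Any]]] = []

    def step(self, func: Callable[[], Dict[str, Any]]) -> Callable[[], Dict[str, Any]]:
        # Record the function and return it unchanged so it stays callable.
        self._steps.append(func)
        return func

    def run(self) -> Dict[str, Any]:
        # Call the steps in registration order and merge their named outputs.
        products: Dict[str, Any] = {}
        for func in self._steps:
            products.update(func())
        return products


pipeline = TinyPipeline("My Cool Pipeline")


@pipeline.step
def data_ingestion() -> Dict[str, Any]:
    # In-memory rows replace the CSV file used in the example above.
    rows = [{"name": "Ana", "age": 17}, {"name": "Bo", "age": 42}]
    adults = [row for row in rows if row["age"] > 18]
    return {"adult_data": adults}


products = pipeline.run()
print(products)
```

The point of the sketch is only that a decorator can collect ordinary Python functions into a pipeline object, so the pipeline is described in code rather than in a separate configuration file.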
Getting started with a template
Once installed, you can create a new pipeline template by running:
pipelines new [pipeline name]
Development
Extra dependencies
This project requires Graphviz to be installed on your system; follow the official Graphviz installation instructions for your platform.
We use Poetry to manage this project's dependencies and virtual environment. Once you have cloned the repository, run `poetry install` to install them. Any time you want to work on this project, run `poetry shell` to activate the virtual environment, and you will be ready.
We use some tools to enforce code formatting. To make sure your code meets these standards, run `make fmt` (this will modify the source files automatically) and then `make lint` to spot potential deficiencies.
Make sure you add tests for any new code contributed to this repo, and run all the tests with `make test` before committing or opening a new pull request.
Source Distribution
Built Distribution
File details
Details for the file code-first-pipelines-0.8.0.tar.gz.
File metadata
- Download URL: code-first-pipelines-0.8.0.tar.gz
- Upload date:
- Size: 13.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.1.14 CPython/3.8.11 Darwin/21.6.0
File hashes
Algorithm | Hash digest
---|---
SHA256 | b7a51f53e5a4bcaa0617b8530d5702587c9e047e5927bb58d1083693c28a4046
MD5 | d878a4d5ce3b0162d1c4658710d5d7e0
BLAKE2b-256 | 1b247dead577016ff64c53bd2c9e6a94aaa8bc422031efc0d2d0c9261bd34981
File details
Details for the file code_first_pipelines-0.8.0-py3-none-any.whl.
File metadata
- Download URL: code_first_pipelines-0.8.0-py3-none-any.whl
- Upload date:
- Size: 19.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.1.14 CPython/3.8.11 Darwin/21.6.0
File hashes
Algorithm | Hash digest
---|---
SHA256 | af3a537ab7f76875b58d7cb2600519b5e42ca3705b58912769703fefed1f492d
MD5 | 1ddf1cc6b18f9021d43cc1924cf00a85
BLAKE2b-256 | 3bec51119f6f19ac3293899ff4dda02e1297fba1cb63bf62db1d759e6f991fd9