Flypipe

These details have not been verified by PyPI

Project links

Home

Development Status
- 1 - Planning
- 3 - Alpha
Intended Audience
- Developers
License
- OSI Approved :: Apache Software License
Programming Language
- Python :: 3

Project description

Flypipe

Flypipe is a Python framework to simplify development, management and maintenance of transformation pipelines, which are commonly used in the data, feature and ML model space.

Each transformation is implemented in a small, composable function, a special decorator is then used to define it as a Flypipe node, which is the primary model Flypipe uses. Metadata on the node decorator allows for multiple nodes to be linked together into a Directed Acyclic Graph (DAG).

from flypipe.node import node


@node(
  type="pandas",
  dependencies=[t0.select("fruit").alias("df")]
)
def t1(df):
  categories = {'mango': 'sweet', 'lemon': 'sour'}
  df['flavour'] = df['fruit']
  df = df.replace({'flavour': categories})
  return df

Flypipe Pipelines

As each node (transformation) is connected to its ancestors, we can easily view the pipeline graphically in a html page (my_graph.html()) or execute it by invoking my_graph.run()

Flypipe Graph Pipeline

What Flypipe aims to facilitate?

End-to-end transformation lineage
Create development standards for Data Engineers, Machine Learning Engineers and Data Scientists
Improve re-usability of transformations in different pipelines & contexts via composable nodes
Faster integration and portability of pipelines to different contexts with different available technology stacks:
- Flexibility to use and mix up pyspark/pandas on spark/pandas in transformations seamlessly
- As a simple wheel package, it's very lightweight and unopinionated about runtime environment. This allows for it to be easily integrated into Databricks and independently of Databricks.
Low latency for on-demand feature generation and predictions
Framework level optimisations and dynamic transformations help to make even complex transformation pipelines low latency. This in turn allows for on-demand feature generation/predictions.

Project details

These details have not been verified by PyPI

Project links

Home

Development Status
- 1 - Planning
- 3 - Alpha
Intended Audience
- Developers
License
- OSI Approved :: Apache Software License
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

4.0.1

Aug 20, 2024

4.0.0

Aug 1, 2023

3.0.3

Mar 28, 2023

3.0.2

Mar 27, 2023

3.0.1

Mar 27, 2023

3.0.0

Mar 22, 2023

2.0.0

Mar 10, 2023

1.0.1

Jan 31, 2023

This version

1.0.0

Jan 24, 2023

0.0.0

Sep 5, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

flypipe-1.0.0.tar.gz (1.2 MB view hashes)

Uploaded Jan 24, 2023 Source

Built Distribution

flypipe-1.0.0-py3-none-any.whl (87.3 kB view hashes)

Uploaded Jan 24, 2023 Python 3

Hashes for flypipe-1.0.0.tar.gz

Hashes for flypipe-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`2b36c3f8d4a6a9476e79fc10718afd6e019ffde09f78133a6ae0223a3c5db529`
MD5	`3ae8e6085fbcda9cfef92321fa69c22f`
BLAKE2b-256	`d4601eb2596fc9c4fde3ae6f5ab9941dbb34c74101bc9d07eed43aa5953abcd5`

Hashes for flypipe-1.0.0-py3-none-any.whl

Hashes for flypipe-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b17b3177a9f47d46383587e52f58108b8c633e860782dbc70963c11bce05a300`
MD5	`fea83f0456f0926b206b871ebdd7e706`
BLAKE2b-256	`8ae05179507e5607ea42bb844534751002ef05cf713262ee8f7282e509fd1833`