Skip to main content

A tool for building feature stores - Transform your raw data into beautiful features.

Project description

Butterfree

A tool for building feature stores. Transform your raw data into beautiful features.

Release Python Version License Code style: black

Source Downloads Page Installation Command
PyPi PyPi Downloads Link pip install butterfree

Build status

Develop Stable Documentation Sonar
Test Publish Documentation Status Quality Gate Status

Made with :heart: by the MLOps team from QuintoAndar

This library supports Python version 3.7+ and meant to provide tools for building ETL pipelines for Feature Stores using Apache Spark.

The library is centered on the following concetps:

  • ETL: central framework to create data pipelines. Spark-based Extract, Transform and Load modules ready to use.
  • Declarative Feature Engineering: care about what you want to compute and not how to code it.
  • Feature Store Modeling: the library easily provides everything you need to process and load data to your Feature Store.

To understand the main concepts of Feature Store modeling and library main features you can check Butterfree's Documentation, which is hosted by Read the Docs.

To learn how to use Butterfree in practice, see Butterfree's notebook examples

Requirements and Installation

Butterfree depends on Python 3.7+ and it is Spark 3.0 ready :heavy_check_mark:

PyPI hosts reference to a pip-installable module of this library, using it is as straightforward as including it on your project's requirements.

pip install butterfree

Or after listing butterfree in your requirements.txt file:

pip install -r requirements.txt

Dev Package are available for testing using the .devN versions of the Butterfree on PyPi.

License

Apache License 2.0

Contributing

All contributions are welcome! Feel free to open Pull Requests. Check the development and contributing guidelines described here.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

butterfree-1.5.0.tar.gz (74.0 kB view details)

Uploaded Source

Built Distribution

butterfree-1.5.0-py3-none-any.whl (110.9 kB view details)

Uploaded Python 3

File details

Details for the file butterfree-1.5.0.tar.gz.

File metadata

  • Download URL: butterfree-1.5.0.tar.gz
  • Upload date:
  • Size: 74.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.12.1.1 requests/2.32.3 setuptools/70.0.0 requests-toolbelt/1.0.0 tqdm/4.67.1 CPython/3.9.21

File hashes

Hashes for butterfree-1.5.0.tar.gz
Algorithm Hash digest
SHA256 9d1b17da847b7a9c651da28c5229e6f3d4c5281a089126a3966803a97220dbb6
MD5 e36a90c6c882fe4ad24f34b12e7ad2a8
BLAKE2b-256 f80315f79b4907adff8e6a61b8316ea96fdbcf822d9d78c58912ac387127a133

See more details on using hashes here.

File details

Details for the file butterfree-1.5.0-py3-none-any.whl.

File metadata

  • Download URL: butterfree-1.5.0-py3-none-any.whl
  • Upload date:
  • Size: 110.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.12.1.1 requests/2.32.3 setuptools/70.0.0 requests-toolbelt/1.0.0 tqdm/4.67.1 CPython/3.9.21

File hashes

Hashes for butterfree-1.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d5f49890cd2a0c958a9873e352af666d5e22eed5bf9a151b918a9836a37aaa44
MD5 aae20f7cb4ba78708f70d83993eb6e9d
BLAKE2b-256 3834ea8b00b001b1bf14960db81fd497e2aaf7d8032bb4837131158a35025082

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page