Butterfree

A tool for building feature stores. Transform your raw data into beautiful features.

Made with ❤️ by the MLOps team from QuintoAndar

This library supports Python 3.6+ and is meant to provide tools for building ETL pipelines for Feature Stores using Apache Spark.

The library is centered on the following concepts:

  • ETL: central framework to create data pipelines. Spark-based Extract, Transform and Load modules ready to use.
  • Declarative Feature Engineering: care about what you want to compute and not how to code it.
  • Feature Store Modeling: the library easily provides everything you need to process and load data to your Feature Store.
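
The three concepts above can be illustrated with a minimal, library-free sketch. It does not use Butterfree's API or Spark: `FeatureSpec`, `extract`, `transform`, `load`, and the sample `house_id`/`rent`/`area_m2` data are all hypothetical names chosen for illustration. The point is only the shape of the idea: features are declared as data (what to compute), and a generic pipeline decides how to run them.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List

Row = Dict[str, object]


@dataclass(frozen=True)
class FeatureSpec:
    """Declares WHAT to compute: an output name and a function of the input row."""
    name: str
    compute: Callable[[Row], object]


def extract(records: Iterable[Row]) -> List[Row]:
    # Extract: copy raw records from a source (here, an in-memory list).
    return [dict(record) for record in records]


def transform(rows: List[Row], key: str, features: List[FeatureSpec]) -> List[Row]:
    # Transform: apply every declared feature to every row, keeping the key column.
    return [
        {key: row[key], **{f.name: f.compute(row) for f in features}}
        for row in rows
    ]


def load(rows: List[Row], store: Dict[object, Row], key: str) -> None:
    # Load: write each feature row to the store, indexed by its key
    # (a plain dict stands in for a feature store in this sketch).
    for row in rows:
        store[row[key]] = row


# Hypothetical raw data and feature declarations.
raw = [
    {"house_id": 1, "rent": 2000.0, "area_m2": 50.0},
    {"house_id": 2, "rent": 3000.0, "area_m2": 100.0},
]
features = [
    FeatureSpec("rent", lambda r: r["rent"]),
    FeatureSpec("rent_per_m2", lambda r: r["rent"] / r["area_m2"]),
]

store: Dict[object, Row] = {}
load(transform(extract(raw), "house_id", features), store, "house_id")
print(store[1])
```

Adding a new feature in this sketch means appending one `FeatureSpec` to the list; the extract, transform, and load steps stay unchanged. Note that `dataclasses` requires Python 3.7+.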

To understand the main concepts of Feature Store modeling and the library's main features, check Butterfree's Wiki.

To learn how to use Butterfree in practice, see Butterfree's notebook examples.

Requirements and Installation

Butterfree depends on Python 3.6+ and is Spark 3.0 ready ✔️

The Python Package Index hosts a pip-installable package of this library. Using it is as straightforward as including it in your project's requirements.

pip install butterfree

Or after listing butterfree in your requirements.txt file:

pip install -r requirements.txt

You can also try our preview build (unstable) by installing from the staging branch:

pip install git+https://github.com/quintoandar/butterfree.git@staging
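
After running any of the install commands above, one way to confirm that the package is visible to your interpreter is to query its installed version. This is a small sketch using only the standard library; the helper name `installed_version` is an assumption made for illustration, and `importlib.metadata` requires Python 3.8+ (on 3.6 or 3.7 the `importlib_metadata` backport would be needed instead).

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


# Prints the version string if butterfree is installed, otherwise None.
print(installed_version("butterfree"))
```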

Documentation

The official documentation is hosted on Read the Docs.

License

Apache License 2.0

Contributing

All contributions are welcome! Feel free to open Pull Requests. Check the development and contributing guidelines described here.
