An S3 feature store client for data pipelines.
Project description
openfs
An S3 feature store client for data pipelines.
Openfs provides a simple api to boost the quality of your training data while keeping your data pipelines clean and manageable.
Installation
pip install openfs
Quick Start
Creating a Store
import openfs as fs
from openfs.stores import FeatureStore
from openfs.boosters import Booster
import os
# <- import files to upload here
# Connect to store bucket
fs.client.connect(
region_name=os.environ['FSTORE_REGION'],
endpoint_url=os.environ['FSTORE_ENDPOINT_URL'],
access_key_id=os.environ['FSTORE_ACCESS_KEY'],
secret_access_key=os.environ['FSTORE_SECRET_KEY']
)
# Create store
store = FeatureStore("store_name", "description of store", "some_primary_key")
# Upload store
response = store.upload(files, filenames)
Creating a Booster Dataset
# Create booster
booster = Booster(
name="booster_name",
description="booster description",
store_id=response['store_id'])
# Add features
booster.add_single("feature_1", alias="alias_for_feature")
booster.add_group(["feature_2", "feature_3"], alias="grouped_feature", how='sum')
# pull features from store (for testing)
df = booster.create_df()
# upload booster
booster.upload()
Viewing Stores
fb.client.list_stores()
Contributors
We'd love to welcome contributors to openfs
to help make training data
richer and more open for everyone. We're working on our contributor docs at the
moment, but if you're interested in contributing, please send us a message at
contact@xplainable.io.
Thanks for trying openfs!
Made with ❤️ in Australia
© copyright xplainable pty ltd
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
File details
Details for the file openfs-1.0.0-py3-none-any.whl
.
File metadata
- Download URL: openfs-1.0.0-py3-none-any.whl
- Upload date:
- Size: 9.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 26630a352fa04b75f4e43e61dc8a115d3437fc797595307a4e1de9280e11a926 |
|
MD5 | d59af2e8cccf08371bbca9c3f3045783 |
|
BLAKE2b-256 | c71c018533b3fc6daa55c85ad7be5dc6e7a894db672a3e5e7921eb7e512376bc |