Skip to main content

Spark support for Feast offline store

Project description

Feast Spark Offline Store plugin

This repo contains a plugin for feast to run an offline store on Spark. It can be installed from pip and configured in the feature_store.yaml configuration file to interface with DataSources using Spark.

Note that this repository has not yet had a major release as it is still work in progress.

Contributing

We strongly encourage you to contribute to our repository. Find out more in our contribution guidelines

Requirements

Currently only supports Feast versions >=0.15.0.

Installation

pip install feast-spark-offline-store

Or to install from source:

git clone git@github.com:Adyen/feast-spark-offline-store.git
cd feast-spark-offline-store
pip install -e '.[dev]'

Usage

Install feast and feast_spark_offline_store and change the Feast configurations in feature_store.yaml to use feast_spark_offline_store.SparkOfflineStore:

project: example_feature_repo
registry: data/registry.db
provider: local
online_store:
    ...
offline_store:
    type: feast_spark_offline_store.spark.SparkOfflineStore
    spark_conf:
        spark.master: "local[*]"
        spark.ui.enabled: "false"
        spark.eventLog.enabled: "false"
        spark.sql.catalogImplementation: "hive"
        spark.sql.parser.quotedRegexColumnNames: "true"
        spark.sql.session.timeZone: "UTC"

Documentation

See Feast documentation on offline stores and adding custom offline stores.

License

MIT license. For more information, see the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

feast_spark_offline_store-0.0.5.tar.gz (13.8 kB view hashes)

Uploaded Source

Built Distribution

feast_spark_offline_store-0.0.5-py3-none-any.whl (10.8 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page