Python SDK and ff CLI for Redis Feature Form
Project description
Redis Feature Form Python SDK
redis-featureform is the Python SDK and ff CLI for Redis Feature Form.
Redis Feature Form gives data and ML teams a declarative workflow for defining providers, datasets, transformations, entities, features, labels, training sets, and feature views in Python while keeping existing offline data systems in place. Redis is the low-latency online store for feature serving.
Install from PyPI as redis-featureform, then import it in Python as
featureform.
Official Redis Feature Form docs:
- Overview:
https://redis.io/docs/latest/develop/ai/featureform/overview/ - Quickstart:
https://redis.io/docs/latest/develop/ai/featureform/quickstart/ - Documentation hub:
https://redis.io/docs/latest/develop/ai/featureform/
Install
pip install redis-featureform
Optional extras:
pip install "redis-featureform[otel]"
pip install "redis-featureform[pyspark]"
What Redis Feature Form Does
Redis Feature Form helps you:
- register offline systems and Redis as providers
- define feature engineering resources as Python code
- submit a complete resource graph into a workspace with
ff applyorclient.apply() - inspect the resulting plan before execution
- query datasets and training sets over Arrow Flight
- serve online feature values from Redis-backed feature views
Core Workflow
The Redis Feature Form workflow is:
- Create a workspace.
- Register providers for offline systems and the Redis online store.
- Author resources in Python.
- Apply or merge the resource graph into the workspace.
- Let Redis Feature Form plan and run the required execution work.
- Query offline results for training or fetch online features for inference.
Core Concepts
- Workspace: logical boundary for a Redis Feature Form resource graph
- Provider: connection to an external system such as Postgres, Snowflake, Spark, S3, Iceberg, or Redis
- Resource: Python declaration of an entity, dataset, transformation, feature, label, training set, or feature view
- Apply: single-shot workspace update that submits one combined resource set, computes a delta, and optionally runs an apply job
- Merge: apply strategy that preserves omitted resources already present in the workspace
- Execution mode: planning mode for runtime work, currently
NORMAL,UPDATE, orFULL_REMATERIALIZE - Training set: offline, model-ready dataset composed from features and labels
- Feature view: online-serving surface backed by Redis for inference-time lookup
Connect And Register Providers
The CLI is usually the fastest way to bootstrap Redis Feature Form resources:
ff --server https://api.example.com --transport rest auth login
ff --server https://api.example.com --transport rest \
workspace create fraud-detection
ff --server https://api.example.com --transport rest \
provider register analytics-postgres \
--workspace fraud-detection \
--type postgres \
--pg-host postgres.example.com \
--pg-port 5432 \
--pg-database analytics \
--pg-user ff_user \
--pg-password-secret env:PG_PASSWORD
ff --server https://api.example.com --transport rest \
provider register online-store \
--workspace fraud-detection \
--type redis \
--redis-host redis.example.com \
--redis-port 6379
Providers are registered separately from apply. Resource definitions then refer to those provider names when you define datasets, transformations, training sets, and Redis-backed feature views.
What ff apply Does
ff apply is the main authoring workflow in Redis Feature Form.
At a high level it:
- loads one Python entry file or package directory
- collects resources from either an explicit
resources = [...]list or the global registry - serializes the full resource graph into one request for one workspace
- compares the submitted graph with the current workspace graph
- returns a dry-run plan or commits the new version and optionally waits for the apply job
ff apply is graph-wide. It does not patch one object at a time.
Example Apply File
from datetime import timedelta
import featureform as ff
from featureform.types.resource import FeatureView, MaterializationEngine, TrainingSetType
postgres = ff.get_postgres("analytics-postgres")
customer = ff.Entity(
name="customer",
description="Customer entity for fraud models",
)
transactions = postgres.dataset(
name="transactions",
schema="public",
table="transactions",
timestamp_column="event_ts",
description="Raw payment events",
)
@postgres.sql_transformation(
name="customer_daily_rollups",
description="Daily transaction totals per customer",
inputs=[transactions],
)
def customer_daily_rollups() -> str:
return """
SELECT
customer_id,
date_trunc('day', event_ts) AS event_day,
SUM(amount) AS total_amount,
COUNT(*) AS transaction_count
FROM {{transactions}}
GROUP BY 1, 2
"""
customer_amount_7d = (
ff.Feature(name="customer_amount_7d")
.from_dataset(
customer_daily_rollups,
entity="customer",
entity_column="customer_id",
value="total_amount",
timestamp="event_day",
)
.with_provider("analytics-postgres")
.aggregate(function=ff.AggregateFunction.SUM, window=timedelta(days=7))
)
fraud_label = (
ff.Label()
.from_dataset(transactions)
.value("is_fraud")
.timestamp("event_ts")
.entity("customer", column="customer_id")
.build()
)
fraud_training = ff.TrainingSet(
name="fraud_training",
provider="analytics-postgres",
features=[customer_amount_7d],
label=fraud_label,
type=TrainingSetType.STATIC,
)
customer_features = FeatureView(
name="customer_features",
entity="customer",
features=[customer_amount_7d.name],
inference_store="online-store",
materialization_engine=MaterializationEngine.K8S,
)
resources = [
customer,
transactions,
customer_daily_rollups,
customer_amount_7d,
fraud_label,
fraud_training,
customer_features,
]
Apply Strategies And Execution Modes
Redis Feature Form now exposes apply strategy and execution mode as separate axes.
Apply strategies:
- default apply: submitted resources replace the desired graph for the workspace
- merge: submitted resources merge into the workspace and omitted resources are retained
Execution modes:
NORMAL: standard diff-based planningUPDATE: revisit the target graph in update modeFULL_REMATERIALIZE: revisit the target graph in full-rematerialize mode
That means combinations such as these are valid:
- apply + normal
- apply + update
- apply + full rematerialize
- merge + normal
- merge + update
- merge + full rematerialize
Apply From The CLI
Preview the graph delta without changing the workspace:
ff --server https://api.example.com --transport rest \
apply -f resources.py --workspace fraud-detection --plan
Apply the submitted graph and wait for completion:
ff --server https://api.example.com --transport rest \
apply -f resources.py \
--workspace fraud-detection \
--message "initial fraud pipeline" \
--wait --wait-for finished
Merge into the existing workspace instead of replacing omitted resources:
ff --server https://api.example.com --transport rest \
apply -f resources.py \
--workspace fraud-detection \
--merge \
--wait --wait-for finished
Run a plan in update mode:
ff --server https://api.example.com --transport rest \
apply -f resources.py \
--workspace fraud-detection \
--plan \
--update
Request a merge with full rematerialization semantics:
ff --server https://api.example.com --transport rest \
apply -f resources.py \
--workspace fraud-detection \
--merge \
--full-rematerialize
Useful flags:
--plan: show what would change without applying--merge: keep omitted resources already present in the workspace--update: use update execution mode--full-rematerialize: use full rematerialization execution mode--wait --wait-for finished: block until the apply job completes and the workspace version is visible
Apply From Python
Use client.apply() for standard apply semantics:
import os
import featureform as ff
import resources
client = ff.Client(
base_url="grpc.example.com:443",
transport="grpc",
token=os.environ["FEATUREFORM_TOKEN"],
)
result = client.apply(
resources=resources.resources,
workspace="fraud-detection",
message="initial fraud pipeline",
wait=True,
)
print(result.version)
for change in result.changes:
print(change.resource_type, change.resource_name, change.action)
Use execution mode directly from Python:
plan = client.apply(
resources=resources.resources,
workspace="fraud-detection",
dry_run=True,
execution_mode="UPDATE",
)
Use merge semantics without deleting omitted resources:
result = client.merge(
resources=resources.resources,
workspace="fraud-detection",
execution_mode="FULL_REMATERIALIZE",
wait=True,
)
If resources is omitted, client.apply(...) and client.merge(...) use the
global resource registry. That works well when definition modules register
resources during import.
Query Offline Data
Use Arrow Flight helpers to pull datasets, training sets, or feature-view snapshots into pandas or stream them incrementally.
training = client.query_training_set(
"fraud_training",
workspace="fraud-detection",
flight_server="grpc.example.com:9090",
).to_pandas()
print(training.head())
You can also query datasets directly:
transactions = client.query_dataset(
"transactions",
workspace="fraud-detection",
limit=1000,
).to_pandas()
Serve Online Features From Redis
Once a feature view is materialized, Redis Feature Form serves online values from
Redis with client.serve(...).
features = client.serve(
"customer_features",
entity="customer_123",
workspace="fraud-detection",
)
print(features["customer_amount_7d"])
Batch serving is supported as well:
result = client.serve(
"customer_features",
entities=["customer_123", "customer_456"],
workspace="fraud-detection",
return_type="pandas",
)
Package Surface
featureform.Client: client for workspaces, providers, apply and merge, offline queries, and online servingff: CLI for auth, apply, workspace, provider, catalog, graph, scheduler, and RBAC operationsfeatureform.resources: typed resource DSL for authoring Redis Feature Form resource graphsfeatureform.types: request and response models used by the client surface
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file redis_featureform-3.0.0.tar.gz.
File metadata
- Download URL: redis_featureform-3.0.0.tar.gz
- Upload date:
- Size: 351.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a16dd894d6190a31d1eddc5fa68363228e7bc99c3dc1447bf4952202040342f2
|
|
| MD5 |
33c03e4a60c14f4d857564f9b831fba6
|
|
| BLAKE2b-256 |
858e19979ac56215a7092929d598ea3eed01168c470f70d82c6856e4f06ac99a
|
File details
Details for the file redis_featureform-3.0.0-py3-none-any.whl.
File metadata
- Download URL: redis_featureform-3.0.0-py3-none-any.whl
- Upload date:
- Size: 427.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6abb4fcc1d8a1e9fdb10124d82c3df2968dadc01ea0adb95601e2a5d65e7945e
|
|
| MD5 |
e6bd67bd8aba80f28005f22c99569d5b
|
|
| BLAKE2b-256 |
8454312d01dadfb6b6afa636623a5d8000f3db2e938943a874b91aeef9d90b81
|