Skip to main content

Python SDK and ff CLI for Redis Feature Form

Project description

Redis Feature Form Python SDK

redis-featureform is the Python SDK and ff CLI for Redis Feature Form.

Redis Feature Form gives data and ML teams a declarative workflow for defining providers, datasets, transformations, entities, features, labels, training sets, and feature views in Python while keeping existing offline data systems in place. Redis is the low-latency online store for feature serving.

Install from PyPI as redis-featureform, then import it in Python as featureform.

Official Redis Feature Form docs:

  • Overview: https://redis.io/docs/latest/develop/ai/featureform/overview/
  • Quickstart: https://redis.io/docs/latest/develop/ai/featureform/quickstart/
  • Documentation hub: https://redis.io/docs/latest/develop/ai/featureform/

Install

pip install redis-featureform

Optional extras:

pip install "redis-featureform[otel]"
pip install "redis-featureform[pyspark]"

What Redis Feature Form Does

Redis Feature Form helps you:

  • register offline systems and Redis as providers
  • define feature engineering resources as Python code
  • submit a complete resource graph into a workspace with ff apply or client.apply()
  • inspect the resulting plan before execution
  • query datasets and training sets over Arrow Flight
  • serve online feature values from Redis-backed feature views

Core Workflow

The Redis Feature Form workflow is:

  1. Create a workspace.
  2. Register providers for offline systems and the Redis online store.
  3. Author resources in Python.
  4. Apply or merge the resource graph into the workspace.
  5. Let Redis Feature Form plan and run the required execution work.
  6. Query offline results for training or fetch online features for inference.

Core Concepts

  • Workspace: logical boundary for a Redis Feature Form resource graph
  • Provider: connection to an external system such as Postgres, Snowflake, Spark, S3, Iceberg, or Redis
  • Resource: Python declaration of an entity, dataset, transformation, feature, label, training set, or feature view
  • Apply: single-shot workspace update that submits one combined resource set, computes a delta, and optionally runs an apply job
  • Merge: apply strategy that preserves omitted resources already present in the workspace
  • Execution mode: planning mode for runtime work, currently NORMAL, UPDATE, or FULL_REMATERIALIZE
  • Training set: offline, model-ready dataset composed from features and labels
  • Feature view: online-serving surface backed by Redis for inference-time lookup

Connect And Register Providers

The CLI is usually the fastest way to bootstrap Redis Feature Form resources:

ff --server https://api.example.com --transport rest auth login

ff --server https://api.example.com --transport rest \
  workspace create fraud-detection

ff --server https://api.example.com --transport rest \
  provider register analytics-postgres \
  --workspace fraud-detection \
  --type postgres \
  --pg-host postgres.example.com \
  --pg-port 5432 \
  --pg-database analytics \
  --pg-user ff_user \
  --pg-password-secret env:PG_PASSWORD

ff --server https://api.example.com --transport rest \
  provider register online-store \
  --workspace fraud-detection \
  --type redis \
  --redis-host redis.example.com \
  --redis-port 6379

Providers are registered separately from apply. Resource definitions then refer to those provider names when you define datasets, transformations, training sets, and Redis-backed feature views.

What ff apply Does

ff apply is the main authoring workflow in Redis Feature Form.

At a high level it:

  1. loads one Python entry file or package directory
  2. collects resources from either an explicit resources = [...] list or the global registry
  3. serializes the full resource graph into one request for one workspace
  4. compares the submitted graph with the current workspace graph
  5. returns a dry-run plan or commits the new version and optionally waits for the apply job

ff apply is graph-wide. It does not patch one object at a time.

Example Apply File

from datetime import timedelta

import featureform as ff
from featureform.types.resource import FeatureView, MaterializationEngine, TrainingSetType

postgres = ff.get_postgres("analytics-postgres")

customer = ff.Entity(
    name="customer",
    description="Customer entity for fraud models",
)

transactions = postgres.dataset(
    name="transactions",
    schema="public",
    table="transactions",
    timestamp_column="event_ts",
    description="Raw payment events",
)


@postgres.sql_transformation(
    name="customer_daily_rollups",
    description="Daily transaction totals per customer",
    inputs=[transactions],
)
def customer_daily_rollups() -> str:
    return """
        SELECT
            customer_id,
            date_trunc('day', event_ts) AS event_day,
            SUM(amount) AS total_amount,
            COUNT(*) AS transaction_count
        FROM {{transactions}}
        GROUP BY 1, 2
    """


customer_amount_7d = (
    ff.Feature(name="customer_amount_7d")
    .from_dataset(
        customer_daily_rollups,
        entity="customer",
        entity_column="customer_id",
        value="total_amount",
        timestamp="event_day",
    )
    .with_provider("analytics-postgres")
    .aggregate(function=ff.AggregateFunction.SUM, window=timedelta(days=7))
)

fraud_label = (
    ff.Label()
    .from_dataset(transactions)
    .value("is_fraud")
    .timestamp("event_ts")
    .entity("customer", column="customer_id")
    .build()
)

fraud_training = ff.TrainingSet(
    name="fraud_training",
    provider="analytics-postgres",
    features=[customer_amount_7d],
    label=fraud_label,
    type=TrainingSetType.STATIC,
)

customer_features = FeatureView(
    name="customer_features",
    entity="customer",
    features=[customer_amount_7d.name],
    inference_store="online-store",
    materialization_engine=MaterializationEngine.K8S,
)

resources = [
    customer,
    transactions,
    customer_daily_rollups,
    customer_amount_7d,
    fraud_label,
    fraud_training,
    customer_features,
]

Apply Strategies And Execution Modes

Redis Feature Form now exposes apply strategy and execution mode as separate axes.

Apply strategies:

  • default apply: submitted resources replace the desired graph for the workspace
  • merge: submitted resources merge into the workspace and omitted resources are retained

Execution modes:

  • NORMAL: standard diff-based planning
  • UPDATE: revisit the target graph in update mode
  • FULL_REMATERIALIZE: revisit the target graph in full-rematerialize mode

That means combinations such as these are valid:

  • apply + normal
  • apply + update
  • apply + full rematerialize
  • merge + normal
  • merge + update
  • merge + full rematerialize

Apply From The CLI

Preview the graph delta without changing the workspace:

ff --server https://api.example.com --transport rest \
  apply -f resources.py --workspace fraud-detection --plan

Apply the submitted graph and wait for completion:

ff --server https://api.example.com --transport rest \
  apply -f resources.py \
  --workspace fraud-detection \
  --message "initial fraud pipeline" \
  --wait --wait-for finished

Merge into the existing workspace instead of replacing omitted resources:

ff --server https://api.example.com --transport rest \
  apply -f resources.py \
  --workspace fraud-detection \
  --merge \
  --wait --wait-for finished

Run a plan in update mode:

ff --server https://api.example.com --transport rest \
  apply -f resources.py \
  --workspace fraud-detection \
  --plan \
  --update

Request a merge with full rematerialization semantics:

ff --server https://api.example.com --transport rest \
  apply -f resources.py \
  --workspace fraud-detection \
  --merge \
  --full-rematerialize

Useful flags:

  • --plan: show what would change without applying
  • --merge: keep omitted resources already present in the workspace
  • --update: use update execution mode
  • --full-rematerialize: use full rematerialization execution mode
  • --wait --wait-for finished: block until the apply job completes and the workspace version is visible

Apply From Python

Use client.apply() for standard apply semantics:

import os

import featureform as ff
import resources

client = ff.Client(
    base_url="grpc.example.com:443",
    transport="grpc",
    token=os.environ["FEATUREFORM_TOKEN"],
)

result = client.apply(
    resources=resources.resources,
    workspace="fraud-detection",
    message="initial fraud pipeline",
    wait=True,
)

print(result.version)
for change in result.changes:
    print(change.resource_type, change.resource_name, change.action)

Use execution mode directly from Python:

plan = client.apply(
    resources=resources.resources,
    workspace="fraud-detection",
    dry_run=True,
    execution_mode="UPDATE",
)

Use merge semantics without deleting omitted resources:

result = client.merge(
    resources=resources.resources,
    workspace="fraud-detection",
    execution_mode="FULL_REMATERIALIZE",
    wait=True,
)

If resources is omitted, client.apply(...) and client.merge(...) use the global resource registry. That works well when definition modules register resources during import.

Query Offline Data

Use Arrow Flight helpers to pull datasets, training sets, or feature-view snapshots into pandas or stream them incrementally.

training = client.query_training_set(
    "fraud_training",
    workspace="fraud-detection",
    flight_server="grpc.example.com:9090",
).to_pandas()

print(training.head())

You can also query datasets directly:

transactions = client.query_dataset(
    "transactions",
    workspace="fraud-detection",
    limit=1000,
).to_pandas()

Serve Online Features From Redis

Once a feature view is materialized, Redis Feature Form serves online values from Redis with client.serve(...).

features = client.serve(
    "customer_features",
    entity="customer_123",
    workspace="fraud-detection",
)

print(features["customer_amount_7d"])

Batch serving is supported as well:

result = client.serve(
    "customer_features",
    entities=["customer_123", "customer_456"],
    workspace="fraud-detection",
    return_type="pandas",
)

Package Surface

  • featureform.Client: client for workspaces, providers, apply and merge, offline queries, and online serving
  • ff: CLI for auth, apply, workspace, provider, catalog, graph, scheduler, and RBAC operations
  • featureform.resources: typed resource DSL for authoring Redis Feature Form resource graphs
  • featureform.types: request and response models used by the client surface

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

redis_featureform-3.0.2.tar.gz (352.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

redis_featureform-3.0.2-py3-none-any.whl (429.4 kB view details)

Uploaded Python 3

File details

Details for the file redis_featureform-3.0.2.tar.gz.

File metadata

  • Download URL: redis_featureform-3.0.2.tar.gz
  • Upload date:
  • Size: 352.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for redis_featureform-3.0.2.tar.gz
Algorithm Hash digest
SHA256 01670353b57f14926f30dcc0a0a0cfce683ff79ba4c43121e15f842f9b7ef414
MD5 c52603d9d0b13524c9f30af07723e2b3
BLAKE2b-256 ec537012d2240d4fbd4be8a864b77f92b3f608a750c9a0dc44102626d7f6cd23

See more details on using hashes here.

File details

Details for the file redis_featureform-3.0.2-py3-none-any.whl.

File metadata

File hashes

Hashes for redis_featureform-3.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 6a44a602f890d9f93f466451e4721c08c9700da2584e7991f844df2dd07c3a1c
MD5 8a1eda7b81981a7a1d18d06aeba91459
BLAKE2b-256 e9ab1072032e2124669e967f0afa3c6ddba804368932febc737a916bd60171e7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page