Skip to main content

Python SDK and ff CLI for Redis Feature Form

Project description

Redis Feature Form Python SDK

redis-featureform is the Python SDK and ff CLI for Redis Feature Form.

Redis Feature Form gives data and ML teams a declarative workflow for defining providers, datasets, transformations, entities, features, labels, training sets, and feature views in Python while keeping existing offline data systems in place. Redis is the low-latency online store for feature serving.

Install from PyPI as redis-featureform, then import it in Python as featureform.

Official Redis Feature Form docs:

  • Overview: https://redis.io/docs/latest/develop/ai/featureform/overview/
  • Quickstart: https://redis.io/docs/latest/develop/ai/featureform/quickstart/
  • Documentation hub: https://redis.io/docs/latest/develop/ai/featureform/

Install

pip install redis-featureform

Optional extras:

pip install "redis-featureform[otel]"
pip install "redis-featureform[pyspark]"

What Redis Feature Form Does

Redis Feature Form helps you:

  • register offline systems and Redis as providers
  • define feature engineering resources as Python code
  • submit a complete resource graph into a workspace with ff apply or client.apply()
  • inspect the resulting plan before execution
  • query datasets and training sets over Arrow Flight
  • serve online feature values from Redis-backed feature views

Core Workflow

The Redis Feature Form workflow is:

  1. Create a workspace.
  2. Register providers for offline systems and the Redis online store.
  3. Author resources in Python.
  4. Apply or merge the resource graph into the workspace.
  5. Let Redis Feature Form plan and run the required execution work.
  6. Query offline results for training or fetch online features for inference.

Core Concepts

  • Workspace: logical boundary for a Redis Feature Form resource graph
  • Provider: connection to an external system such as Postgres, Snowflake, Spark, S3, Iceberg, or Redis
  • Resource: Python declaration of an entity, dataset, transformation, feature, label, training set, or feature view
  • Apply: single-shot workspace update that submits one combined resource set, computes a delta, and optionally runs an apply job
  • Merge: apply strategy that preserves omitted resources already present in the workspace
  • Execution mode: planning mode for runtime work, currently NORMAL, UPDATE, or FULL_REMATERIALIZE
  • Training set: offline, model-ready dataset composed from features and labels
  • Feature view: online-serving surface backed by Redis for inference-time lookup

Connect And Register Providers

The CLI is usually the fastest way to bootstrap Redis Feature Form resources:

ff --server https://api.example.com --transport rest auth login

ff --server https://api.example.com --transport rest \
  workspace create fraud-detection

ff --server https://api.example.com --transport rest \
  provider register analytics-postgres \
  --workspace fraud-detection \
  --type postgres \
  --pg-host postgres.example.com \
  --pg-port 5432 \
  --pg-database analytics \
  --pg-user ff_user \
  --pg-password-secret env:PG_PASSWORD

ff --server https://api.example.com --transport rest \
  provider register online-store \
  --workspace fraud-detection \
  --type redis \
  --redis-host redis.example.com \
  --redis-port 6379

Providers are registered separately from apply. Resource definitions then refer to those provider names when you define datasets, transformations, training sets, and Redis-backed feature views.

What ff apply Does

ff apply is the main authoring workflow in Redis Feature Form.

At a high level it:

  1. loads one Python entry file or package directory
  2. collects resources from either an explicit resources = [...] list or the global registry
  3. serializes the full resource graph into one request for one workspace
  4. compares the submitted graph with the current workspace graph
  5. returns a dry-run plan or commits the new version and optionally waits for the apply job

ff apply is graph-wide. It does not patch one object at a time.

Example Apply File

from datetime import timedelta

import featureform as ff
from featureform.types.resource import FeatureView, MaterializationEngine, TrainingSetType

postgres = ff.get_postgres("analytics-postgres")

customer = ff.Entity(
    name="customer",
    description="Customer entity for fraud models",
)

transactions = postgres.dataset(
    name="transactions",
    schema="public",
    table="transactions",
    timestamp_column="event_ts",
    description="Raw payment events",
)


@postgres.sql_transformation(
    name="customer_daily_rollups",
    description="Daily transaction totals per customer",
    inputs=[transactions],
)
def customer_daily_rollups() -> str:
    return """
        SELECT
            customer_id,
            date_trunc('day', event_ts) AS event_day,
            SUM(amount) AS total_amount,
            COUNT(*) AS transaction_count
        FROM {{transactions}}
        GROUP BY 1, 2
    """


customer_amount_7d = (
    ff.Feature(name="customer_amount_7d")
    .from_dataset(
        customer_daily_rollups,
        entity="customer",
        entity_column="customer_id",
        value="total_amount",
        timestamp="event_day",
    )
    .with_provider("analytics-postgres")
    .aggregate(function=ff.AggregateFunction.SUM, window=timedelta(days=7))
)

fraud_label = (
    ff.Label()
    .from_dataset(transactions)
    .value("is_fraud")
    .timestamp("event_ts")
    .entity("customer", column="customer_id")
    .build()
)

fraud_training = ff.TrainingSet(
    name="fraud_training",
    provider="analytics-postgres",
    features=[customer_amount_7d],
    label=fraud_label,
    type=TrainingSetType.STATIC,
)

customer_features = FeatureView(
    name="customer_features",
    entity="customer",
    features=[customer_amount_7d.name],
    inference_store="online-store",
    materialization_engine=MaterializationEngine.K8S,
)

resources = [
    customer,
    transactions,
    customer_daily_rollups,
    customer_amount_7d,
    fraud_label,
    fraud_training,
    customer_features,
]

Apply Strategies And Execution Modes

Redis Feature Form now exposes apply strategy and execution mode as separate axes.

Apply strategies:

  • default apply: submitted resources replace the desired graph for the workspace
  • merge: submitted resources merge into the workspace and omitted resources are retained

Execution modes:

  • NORMAL: standard diff-based planning
  • UPDATE: revisit the target graph in update mode
  • FULL_REMATERIALIZE: revisit the target graph in full-rematerialize mode

That means combinations such as these are valid:

  • apply + normal
  • apply + update
  • apply + full rematerialize
  • merge + normal
  • merge + update
  • merge + full rematerialize

Apply From The CLI

Preview the graph delta without changing the workspace:

ff --server https://api.example.com --transport rest \
  apply -f resources.py --workspace fraud-detection --plan

Apply the submitted graph and wait for completion:

ff --server https://api.example.com --transport rest \
  apply -f resources.py \
  --workspace fraud-detection \
  --message "initial fraud pipeline" \
  --wait --wait-for finished

Merge into the existing workspace instead of replacing omitted resources:

ff --server https://api.example.com --transport rest \
  apply -f resources.py \
  --workspace fraud-detection \
  --merge \
  --wait --wait-for finished

Run a plan in update mode:

ff --server https://api.example.com --transport rest \
  apply -f resources.py \
  --workspace fraud-detection \
  --plan \
  --update

Request a merge with full rematerialization semantics:

ff --server https://api.example.com --transport rest \
  apply -f resources.py \
  --workspace fraud-detection \
  --merge \
  --full-rematerialize

Useful flags:

  • --plan: show what would change without applying
  • --merge: keep omitted resources already present in the workspace
  • --update: use update execution mode
  • --full-rematerialize: use full rematerialization execution mode
  • --wait --wait-for finished: block until the apply job completes and the workspace version is visible

Apply From Python

Use client.apply() for standard apply semantics:

import os

import featureform as ff
import resources

client = ff.Client(
    base_url="grpc.example.com:443",
    transport="grpc",
    token=os.environ["FEATUREFORM_TOKEN"],
)

result = client.apply(
    resources=resources.resources,
    workspace="fraud-detection",
    message="initial fraud pipeline",
    wait=True,
)

print(result.version)
for change in result.changes:
    print(change.resource_type, change.resource_name, change.action)

Use execution mode directly from Python:

plan = client.apply(
    resources=resources.resources,
    workspace="fraud-detection",
    dry_run=True,
    execution_mode="UPDATE",
)

Use merge semantics without deleting omitted resources:

result = client.merge(
    resources=resources.resources,
    workspace="fraud-detection",
    execution_mode="FULL_REMATERIALIZE",
    wait=True,
)

If resources is omitted, client.apply(...) and client.merge(...) use the global resource registry. That works well when definition modules register resources during import.

Query Offline Data

Use Arrow Flight helpers to pull datasets, training sets, or feature-view snapshots into pandas or stream them incrementally.

training = client.query_training_set(
    "fraud_training",
    workspace="fraud-detection",
    flight_server="grpc.example.com:9090",
).to_pandas()

print(training.head())

You can also query datasets directly:

transactions = client.query_dataset(
    "transactions",
    workspace="fraud-detection",
    limit=1000,
).to_pandas()

Serve Online Features From Redis

Once a feature view is materialized, Redis Feature Form serves online values from Redis with client.serve(...).

features = client.serve(
    "customer_features",
    entity="customer_123",
    workspace="fraud-detection",
)

print(features["customer_amount_7d"])

Batch serving is supported as well:

result = client.serve(
    "customer_features",
    entities=["customer_123", "customer_456"],
    workspace="fraud-detection",
    return_type="pandas",
)

Package Surface

  • featureform.Client: client for workspaces, providers, apply and merge, offline queries, and online serving
  • ff: CLI for auth, apply, workspace, provider, catalog, graph, scheduler, and RBAC operations
  • featureform.resources: typed resource DSL for authoring Redis Feature Form resource graphs
  • featureform.types: request and response models used by the client surface

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

redis_featureform-3.0.0.tar.gz (351.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

redis_featureform-3.0.0-py3-none-any.whl (427.9 kB view details)

Uploaded Python 3

File details

Details for the file redis_featureform-3.0.0.tar.gz.

File metadata

  • Download URL: redis_featureform-3.0.0.tar.gz
  • Upload date:
  • Size: 351.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for redis_featureform-3.0.0.tar.gz
Algorithm Hash digest
SHA256 a16dd894d6190a31d1eddc5fa68363228e7bc99c3dc1447bf4952202040342f2
MD5 33c03e4a60c14f4d857564f9b831fba6
BLAKE2b-256 858e19979ac56215a7092929d598ea3eed01168c470f70d82c6856e4f06ac99a

See more details on using hashes here.

File details

Details for the file redis_featureform-3.0.0-py3-none-any.whl.

File metadata

File hashes

Hashes for redis_featureform-3.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6abb4fcc1d8a1e9fdb10124d82c3df2968dadc01ea0adb95601e2a5d65e7945e
MD5 e6bd67bd8aba80f28005f22c99569d5b
BLAKE2b-256 8454312d01dadfb6b6afa636623a5d8000f3db2e938943a874b91aeef9d90b81

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page