
tdfs4ds — A Feature Store Library for Data Scientists working with ClearScape Analytics

tdfs4ds (Teradata Feature Store for Data Scientists) is a Python package for managing temporal feature stores in Teradata Vantage databases. It provides easy-to-use functions for creating, registering, storing, and retrieving features — with full time-travel support, lineage tracking, and process operationalization.

Installation

pip install tdfs4ds

Quick Start

Import tdfs4ds after establishing a teradataml connection so the package can auto-detect your default database:

import teradataml as tdml
tdml.create_context(host=..., username=..., password=...)

import tdfs4ds
# tdfs4ds.SCHEMA is auto-set from the teradataml context;
# override if needed: tdfs4ds.SCHEMA = 'my_database'

# Data domain management — use the dedicated functions:
tdfs4ds.create_data_domain('MY_PROJECT')   # create and activate a new domain
# or
tdfs4ds.select_data_domain('MY_PROJECT')   # activate an existing domain
# or
tdfs4ds.get_data_domains()                 # list all available domains (* marks the active one)

Core API

tdfs4ds.setup(database)
    Create the feature catalog, process catalog, and follow-up tables in database.
tdfs4ds.upload_features(df, entity_id, feature_names, metadata={})
    Ingest features from a teradataml DataFrame into the feature store.
tdfs4ds.build_dataset(entity_id, selected_features, view_name, comment='dataset')
    Assemble a dataset view from registered features.
tdfs4ds.run(process_id)
    Re-execute a registered feature engineering process.
tdfs4ds.roll_out(...)
    Operationalize processes at scale.
tdfs4ds.connect(database)
    Connect to an existing feature store.

entity_id must specify SQL data types (dict, not list)

entity_id = {'CUSTOMER_ID': 'BIGINT', 'EVENT_DATE': 'DATE'}   # correct
entity_id = ['CUSTOMER_ID', 'EVENT_DATE']                      # wrong

Walkthrough Example

Step 1 — Set up a feature store

import teradataml as tdml
tdml.create_context(host=..., username=..., password=...)

import tdfs4ds
tdfs4ds.setup(database='my_database')

Step 2 — Configure the active context

tdfs4ds.SCHEMA = 'my_database'   # override if not auto-detected

# Use dedicated functions to manage the data domain:
tdfs4ds.create_data_domain('DATA_QUALITY')   # create and activate (first time)
# tdfs4ds.select_data_domain('DATA_QUALITY') # activate an existing domain
# tdfs4ds.get_data_domains()                 # list all domains

Step 3 — Define your feature engineering view

df = tdml.DataFrame(tdml.in_schema('my_database', 'my_feature_view'))
# If teradataml created intermediate views, make them permanent first:
# tdfs4ds.crystallize_view(df)

Step 4 — Upload and operationalize

entity_id     = {'EVENT_DT': 'DATE', 'ID': 'BIGINT'}
feature_names = ['KPI1', 'KPI2']

tdfs4ds.upload_features(
    df=df,
    entity_id=entity_id,
    feature_names=feature_names,
    metadata={'project': 'data quality'}
)

This registers the entities and features if they are not already present, records the feature engineering process in the process catalog, and writes the feature values into the feature store.
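
To sanity-check the ingestion, you can inspect the catalogs (a minimal sketch; feature_catalog() and process_catalog() are part of the public API listed under Package Structure below):

# Assumes both helpers return teradataml DataFrames you can display directly
tdfs4ds.feature_catalog()   # registered features and their versions
tdfs4ds.process_catalog()   # registered feature engineering processes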

Step 5 — Re-run a process

# List all registered processes to find the process ID
tdfs4ds.process_catalog()

# Re-execute by process ID
tdfs4ds.run(process_id)
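
For a programmatic lookup instead of browsing the catalog, the process-store helpers listed under Package Structure can be used; the exact signatures below are assumptions:

from tdfs4ds.process_store.process_query_administration import (
    list_processes,
    get_process_id,
)

processes = list_processes()                      # assumed: returns all registered processes
# process_id = get_process_id('my_feature_view')  # hypothetical: resolve a process by its view name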

Step 6 — Build a dataset

selected_features = {
    'KPI1': '<process_uuid>',
    'KPI2': '<process_uuid>',
}

dataset = tdfs4ds.build_dataset(
    entity_id={'ID': 'BIGINT'},
    selected_features=selected_features,
    view_name='my_dataset',
    comment='Dataset for churn model'
)

selected_features maps each feature name to the UUID of the process that computed it.
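
If you do not have the UUIDs at hand, get_feature_versions() (see Discover Registered Features below) can retrieve them; a sketch, assuming it returns a feature-name-to-UUID mapping you can pass straight to build_dataset():

from tdfs4ds.feature_store.feature_query_retrieval import get_feature_versions

# Assumed return shape: {'KPI1': '<process_uuid>', 'KPI2': '<process_uuid>', ...}
selected_features = get_feature_versions()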

Configuration

Programmatic (in-session)

tdfs4ds.SCHEMA                = 'my_database'        # target database (auto-set from context)
# Data domain: use tdfs4ds.create_data_domain() / select_data_domain() / get_data_domains()
tdfs4ds.FEATURE_STORE_TIME    = None                 # None = current; '2024-01-01 00:00:00' = time travel
tdfs4ds.DISPLAY_LOGS          = True                 # verbose logging
tdfs4ds.DEBUG_MODE            = False
tdfs4ds.STORE_FEATURE         = 'MERGE'              # 'MERGE' or 'UPDATE_INSERT'

# GenAI documentation
tdfs4ds.INSTRUCT_MODEL_PROVIDER = 'openai'           # or 'bedrock'
tdfs4ds.INSTRUCT_MODEL_MODEL    = 'gpt-4o'
tdfs4ds.INSTRUCT_MODEL_API_KEY  = 'sk-...'           # prefer env var instead (see below)

Config file (persistent per-project or per-user)

Create a tdfs4ds.json file in your project directory (or ~/.tdfs4ds/config.json for user-wide defaults) to avoid repeating the setup cell in every notebook:

{
    "schema": "MY_DATABASE",
    "data_domain": "MY_PROJECT",
    "display_logs": true,
    "store_feature": "MERGE",
    "varchar_size": 1024,
    "instruct_model_provider": "openai",
    "instruct_model_model": "gpt-4o",
    "instruct_model_url": null
}

Keys are case-insensitive. instruct_model_api_key is rejected from JSON config to prevent accidental commits — use a .env file or OS env var for credentials.

.env file (local secrets and overrides)

Place a .env file in your project directory (or ~/.tdfs4ds/.env for user-wide defaults). Only TDFS4DS_* variables are read — the file is parsed without touching os.environ:

TDFS4DS_SCHEMA=MY_DATABASE
TDFS4DS_DATA_DOMAIN=MY_PROJECT
TDFS4DS_INSTRUCT_MODEL_API_KEY=sk-...
TDFS4DS_INSTRUCT_MODEL_PROVIDER=openai
TDFS4DS_INSTRUCT_MODEL_MODEL=gpt-4o

Add .env to .gitignore to keep secrets out of source control. Quoted values and export KEY=VALUE syntax are supported.

Environment variables

All settings can also be set via TDFS4DS_<VAR_NAME> OS environment variables (useful in CI/CD):

Variable                          Corresponding setting
TDFS4DS_SCHEMA                    tdfs4ds.SCHEMA
TDFS4DS_DATA_DOMAIN               tdfs4ds.DATA_DOMAIN
TDFS4DS_DISPLAY_LOGS              tdfs4ds.DISPLAY_LOGS
TDFS4DS_DEBUG_MODE                tdfs4ds.DEBUG_MODE
TDFS4DS_STORE_FEATURE             tdfs4ds.STORE_FEATURE
TDFS4DS_VARCHAR_SIZE              tdfs4ds.VARCHAR_SIZE
TDFS4DS_INSTRUCT_MODEL_PROVIDER   tdfs4ds.INSTRUCT_MODEL_PROVIDER
TDFS4DS_INSTRUCT_MODEL_MODEL      tdfs4ds.INSTRUCT_MODEL_MODEL
TDFS4DS_INSTRUCT_MODEL_URL        tdfs4ds.INSTRUCT_MODEL_URL
TDFS4DS_INSTRUCT_MODEL_API_KEY    tdfs4ds.INSTRUCT_MODEL_API_KEY
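
For example, a CI/CD job can export these variables before Python starts; the sketch below sets them from inside Python instead and then reloads the configuration (the explicit load_config() call is only needed if tdfs4ds was imported before the variables were set):

import os

os.environ['TDFS4DS_SCHEMA'] = 'CI_DATABASE'
os.environ['TDFS4DS_DISPLAY_LOGS'] = 'false'

import tdfs4ds
tdfs4ds.load_config()   # env vars now take effect (see the priority chain below)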

load_config() — explicit reload

# Reload from default search paths
tdfs4ds.load_config()

# Point at specific files
tdfs4ds.load_config(
    path='/configs/feature_store.json',
    dotenv_path='/project/.env.production',
)

Priority chain

programmatic (tdfs4ds.X = value)
  > OS environment variable (TDFS4DS_X)
  > .env file (./.env or ~/.tdfs4ds/.env)
  > JSON config file (./tdfs4ds.json or ~/.tdfs4ds/config.json)
  > teradataml auto-detection (SCHEMA only)
  > built-in defaults

Time Travel

All catalogs and feature stores are temporal. Point-in-time queries are available via:

tdfs4ds.FEATURE_STORE_TIME = '2024-01-01 00:00:00'   # query historical state
tdfs4ds.FEATURE_STORE_TIME = None                     # back to current state
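
For example, to materialize a dataset as of a past date and then return to the present (reusing the build_dataset() call from the walkthrough):

# Build the dataset as it would have looked on 2024-01-01
tdfs4ds.FEATURE_STORE_TIME = '2024-01-01 00:00:00'
dataset_2024 = tdfs4ds.build_dataset(
    entity_id={'ID': 'BIGINT'},
    selected_features=selected_features,
    view_name='my_dataset_20240101',
)

# Back to the current state
tdfs4ds.FEATURE_STORE_TIME = None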

Package Structure

tdfs4ds/
├── __init__.py                    — Global config variables & re-exported public API
├── config.py                      — External config loading (JSON, .env, env vars); load_config()
├── lifecycle.py                   — setup(), connect()
├── execution.py                   — run(), upload_features(), roll_out()
├── catalog.py                     — feature_catalog(), process_catalog(), dataset_catalog()
├── data_domain.py                 — get_data_domains(), select_data_domain(), create_data_domain()
├── datasets.py                    — Utility dataset helpers
├── feature_store/
│   ├── entity_management.py       — register_entity(), remove_entity()
│   ├── feature_data_processing.py — prepare_feature_ingestion(), store_feature(), apply_collect_stats()
│   ├── feature_query_retrieval.py — get_list_features(), get_available_features(), get_feature_versions()
│   └── feature_store_management.py — register_features(), feature_store_table_creation()
├── process_store/
│   ├── process_followup.py        — followup_open(), followup_close(), follow_up_report()
│   ├── process_query_administration.py — list_processes(), get_process_id(), remove_process()
│   ├── process_registration_management.py — register_process_view()
│   └── process_store_catalog_management.py — process_store_catalog_creation()
├── dataset/
│   ├── builder.py                 — build_dataset(), build_dataset_opt(), augment_source_with_features()
│   ├── dataset.py                 — Dataset class
│   └── dataset_catalog.py        — DatasetCatalog class
├── genai/
│   └── documentation.py          — LLM-powered auto-documentation of SQL processes (OpenAI / Bedrock)
├── lineage/
│   ├── lineage.py                 — SQL query parsing, DDL analysis
│   ├── network.py                 — Dependency graph construction
│   └── indexing.py                — Lineage indexing utilities
└── utils/
    ├── query_management.py        — execute_query(), execute_query_wrapper()
    ├── filter_management.py       — FilterManager class
    ├── time_management.py         — TimeManager class
    ├── lineage.py                 — crystallize_view(), analyze_sql_query(), generate_view_dependency_network()
    ├── info.py                    — update_varchar_length(), get_column_types(), seconds_to_dhms()
    └── visualization.py           — plot_graph(), visualize_graph(), display_table()

GenAI Documentation

The genai module provides two complementary ways to document the feature store.

LLM-powered process documentation

document_process() calls an LLM (OpenAI, Azure, vLLM, or AWS Bedrock) to generate:

  • Business-logic description of the SQL query
  • Entity description and per-column annotations
  • EXPLAIN-plan quality score (1–5) with warnings and recommendations

import tdfs4ds
from tdfs4ds.genai import document_process

# Configure the LLM (or use TDFS4DS_INSTRUCT_MODEL_* env vars / .env file)
tdfs4ds.INSTRUCT_MODEL_PROVIDER = 'openai'
tdfs4ds.INSTRUCT_MODEL_MODEL    = 'gpt-4o'
tdfs4ds.INSTRUCT_MODEL_API_KEY  = 'sk-...'

process_info = document_process(process_id='<UUID>', show_explain_plan=True)

LLM-powered dataset documentation

document_dataset_incremental() documents a dataset by walking its full lineage bottom-up:

  1. Source tables — uses the business dictionary if available
  2. Intermediate views — auto-documented via LLM if undocumented
  3. Process views — actively calls document_process_incremental if undocumented
  4. Feature/entity column descriptions are propagated from process docs (no extra LLM call)
  5. A single JSON-constrained LLM call generates five structured sections for the dataset

from tdfs4ds.genai import document_dataset_incremental

result = document_dataset_incremental(
    dataset_id   = '<UUID>',  # from dataset_catalog()
    force_update = False,
    upload       = True,
)

# result['DATASET_SECTIONS'] contains:
#   OVERVIEW, ENTITY, FEATURE_THEMES, BUSINESS_QUESTIONS, INTENDED_AUDIENCE

Each section is stored as an independent row in FS_BUSINESS_DICTIONARY_SECTIONS — no chunking needed for RAG retrieval.
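
Because each section is an ordinary row, retrieval needs no special API; a sketch, assuming the table lives in the feature store database (tdfs4ds.SCHEMA):

import teradataml as tdml
import tdfs4ds

# Read the section rows directly, e.g. to feed a RAG index
sections = tdml.DataFrame(
    tdml.in_schema(tdfs4ds.SCHEMA, 'FS_BUSINESS_DICTIONARY_SECTIONS')
)
sections[sections.OBJECT_NAME == 'DATASET_CUSTOMER']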

Business dictionary (no LLM required)

Three temporal tables store business-oriented descriptions for any database object, its columns, and its documentation sections. They form a 3-level hierarchy designed for chunking-free hierarchical RAG:

Level 0: FS_BUSINESS_DICTIONARY_OBJECTS
    Key: (DATABASE_NAME, OBJECT_NAME)
    One summary per object (OBJECT_TYPE: 'T'/'V'/'D')
Level 1: FS_BUSINESS_DICTIONARY_SECTIONS
    Key: (DATABASE_NAME, OBJECT_NAME, SECTION_NAME)
    One row per documentation section per object
Level 2: FS_BUSINESS_DICTIONARY_COLUMNS
    Key: (DATABASE_NAME, TABLE_NAME, COLUMN_NAME)
    One description per column

All tables are VALIDTIME temporal and provisioned automatically by tdfs4ds.connect(create_if_missing=True).
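
For example (a sketch; the create_if_missing flag comes from the sentence above):

import tdfs4ds
tdfs4ds.connect(database='my_database', create_if_missing=True)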

import pandas as pd
from tdfs4ds.genai import (
    upload_business_dictionary_objects,
    upload_business_dictionary_columns,
    upload_business_dictionary_sections,
)

# Level 0 — Object-level descriptions
upload_business_dictionary_objects(pd.DataFrame([
    {
        'DATABASE_NAME'       : 'MY_DB',
        'OBJECT_NAME'         : 'CUSTOMER',
        'OBJECT_TYPE'         : 'T',
        'BUSINESS_DESCRIPTION': 'Core customer table. Each row represents a unique enrolled customer.',
    },
]))

# Level 1 — Section-level descriptions (typically LLM-generated for datasets)
upload_business_dictionary_sections(pd.DataFrame([
    {
        'DATABASE_NAME'  : 'MY_DB',
        'OBJECT_NAME'    : 'DATASET_CUSTOMER',
        'SECTION_NAME'   : 'OVERVIEW',
        'SECTION_CONTENT': 'Customer-level analytical dataset combining spending and category features...',
    },
]))

# Level 2 — Column-level descriptions
upload_business_dictionary_columns(pd.DataFrame([
    {
        'DATABASE_NAME'       : 'MY_DB',
        'TABLE_NAME'          : 'CUSTOMER',
        'COLUMN_NAME'         : 'CUSTOMER_ID',
        'BUSINESS_DESCRIPTION': 'Unique customer identifier assigned at enrolment.',
    },
]))

All three functions validate required columns and perform a CURRENT VALIDTIME MERGE — re-running them updates existing descriptions and preserves the full change history.

Discover Registered Features

from tdfs4ds.feature_store.feature_query_retrieval import (
    get_list_entity,
    get_list_features,
    get_available_features,
    get_feature_versions,
)
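
A usage sketch; the calling conventions below are assumptions, see Package Structure above for where these helpers live:

entities = get_list_entity()       # assumed: entities in the active data domain
features = get_list_features()     # assumed: all registered features

# Assumed: features currently available for dataset building for this entity
available = get_available_features(entity_id={'ID': 'BIGINT'})

# Assumed: feature name -> process UUID mapping, as expected by build_dataset()
versions = get_feature_versions()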

Lineage

The lineage module builds end-to-end dependency graphs from a SQL query or a dataset view DDL.

Dependency graph

from tdfs4ds.lineage import build_teradata_dependency_graph, plot_lineage_sankey, show_plotly_robust

# Start from a dataset view DDL (obtained via SHOW VIEW)
sql = tdml.execute_sql("SHOW VIEW DATASET_CUSTOMER").fetchall()[0][0]

graph = build_teradata_dependency_graph(sql_query=sql)
# Returns: {"nodes": {...}, "edges": [...], "roots": [...]}

By default (expand_datasets_via_process_catalog=True) dataset nodes are resolved through the process catalog: FEATURE_VERSION UUIDs embedded in the dataset DDL are matched to PROCESS_ID entries in FS_V_PROCESS_CATALOG, and edges are drawn directly to the registered feature-engineering views.

DATASET_CUSTOMER  →  FEAT_ENG_CUST  →  DB_SOURCE.TRANSACTIONS

Set expand_datasets_via_process_catalog=False to connect the dataset directly to the raw feature-store storage tables (previous behaviour).
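
For example:

# Raw storage-level lineage (no process catalog expansion)
graph_raw = build_teradata_dependency_graph(
    sql_query=sql,
    expand_datasets_via_process_catalog=False,
)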

fig = plot_lineage_sankey(graph, title="Customer Dataset Lineage")
show_plotly_robust(fig)

Migration manifest

graph_to_migration_manifest converts any lineage graph into a flat, JSON-serialisable dict — useful for planning a feature store migration.

from tdfs4ds.lineage import graph_to_migration_manifest
import json

# All databases
manifest = graph_to_migration_manifest(graph)

# Scoped to the feature store schema only (cross-boundary edges excluded)
manifest_fs = graph_to_migration_manifest(graph, filter_database=tdfs4ds.SCHEMA)
print(json.dumps(manifest_fs, indent=2))
# {
#   "views":  [{"database": "demo_user", "name": "DATASET_CUSTOMER", "type": "dataset"},
#              {"database": "demo_user", "name": "FEAT_ENG_CUST",    "type": "view"}],
#   "tables": [],
#   "edges":  [{"from": "demo_user.DATASET_CUSTOMER", "to": "demo_user.FEAT_ENG_CUST"}]
# }

with open("migration_manifest.json", "w") as f:
    json.dump(manifest_fs, f, indent=2)

Requirements

  • Python >= 3.6
  • teradataml >= 17.20
  • Active Teradata Vantage connection
  • VALIDTIME temporal tables must be enabled on the Teradata Vantage system — all feature catalogs, process catalogs, and feature stores rely on VALIDTIME support
