A Python package that simplifies the use of a feature store on Teradata Vantage.
tdfs4ds — A Feature Store Library for Data Scientists working with ClearScape Analytics
tdfs4ds (Teradata Feature Store for Data Scientists) is a Python package for managing temporal feature stores in Teradata Vantage databases. It provides easy-to-use functions for creating, registering, storing, and retrieving features — with full time-travel support, lineage tracking, and process operationalization.
Installation
pip install tdfs4ds
Quick Start
Import tdfs4ds after establishing a teradataml connection so the package can auto-detect your default database:
import teradataml as tdml
tdml.create_context(host=..., username=..., password=...)
import tdfs4ds
# tdfs4ds.SCHEMA is auto-set from the teradataml context;
# override if needed: tdfs4ds.SCHEMA = 'my_database'
# Data domain management — use the dedicated functions:
tdfs4ds.create_data_domain('MY_PROJECT') # create and activate a new domain
# or
tdfs4ds.select_data_domain('MY_PROJECT') # activate an existing domain
# or
tdfs4ds.get_data_domains() # list all available domains (* marks the active one)
Core API
| Function | Description |
|---|---|
| tdfs4ds.setup(database) | Create the feature catalog, process catalog, and follow-up tables in database |
| tdfs4ds.upload_features(df, entity_id, feature_names, metadata={}) | Ingest features from a teradataml DataFrame into the feature store |
| tdfs4ds.build_dataset(entity_id, selected_features, view_name, comment='dataset') | Assemble a dataset view from registered features |
| tdfs4ds.run(process_id) | Re-execute a registered feature engineering process |
| tdfs4ds.roll_out(...) | Operationalize processes at scale |
| tdfs4ds.connect(database) | Connect to an existing feature store |
Note: entity_id must be a dict mapping entity column names to SQL data types, not a plain list of column names:
entity_id = {'CUSTOMER_ID': 'BIGINT', 'EVENT_DATE': 'DATE'} # correct
entity_id = ['CUSTOMER_ID', 'EVENT_DATE'] # wrong
Walkthrough Example
Step 1 — Set up a feature store
import teradataml as tdml
tdml.create_context(host=..., username=..., password=...)
import tdfs4ds
tdfs4ds.setup(database='my_database')
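If the catalogs have already been created in the target database, you can attach to them instead of running setup() again. A minimal sketch using the connect() function from the Core API (same imports as above):
# Attach to an existing feature store instead of re-creating it
tdfs4ds.connect(database='my_database')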
Step 2 — Configure the active context
tdfs4ds.SCHEMA = 'my_database' # override if not auto-detected
# Use dedicated functions to manage the data domain:
tdfs4ds.create_data_domain('DATA_QUALITY') # create and activate (first time)
# tdfs4ds.select_data_domain('DATA_QUALITY') # activate an existing domain
# tdfs4ds.get_data_domains() # list all domains
Step 3 — Define your feature engineering view
df = tdml.DataFrame(tdml.in_schema('my_database', 'my_feature_view'))
# If teradataml created intermediate views, make them permanent first:
# tdfs4ds.crystallize_view(df)
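The feature engineering itself is ordinary teradataml work. The sketch below is purely illustrative: the source table my_transactions and the columns AMOUNT and NB_TXN are hypothetical placeholders for your own logic, and the resulting DataFrame must still carry the entity columns (EVENT_DT, ID) used in the next step.
# Hypothetical example: derive KPI columns from a source table before upload
src = tdml.DataFrame(tdml.in_schema('my_database', 'my_transactions'))
df = src.assign(
    KPI1=src.AMOUNT / src.NB_TXN,   # e.g. average transaction amount
    KPI2=src.AMOUNT * 0.2           # e.g. a derived ratio, placeholder logic
)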
Step 4 — Upload and operationalize
entity_id = {'EVENT_DT': 'DATE', 'ID': 'BIGINT'}
feature_names = ['KPI1', 'KPI2']
tdfs4ds.upload_features(
    df=df,
    entity_id=entity_id,
    feature_names=feature_names,
    metadata={'project': 'data quality'}
)
This registers the entities and features (if not already present), records the feature engineering process in the process catalog, and writes the feature values into the feature store.
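You can verify the registration by browsing the feature catalog (see catalog.py in the package structure below); a minimal sketch:
# List the features now known to the store in the active data domain
tdfs4ds.feature_catalog()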
Step 5 — Re-run a process
# List all registered processes to find the process ID
tdfs4ds.process_catalog()
# Re-execute by process ID
tdfs4ds.run(process_id)
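process_id is the UUID recorded in the process catalog; copy it from the catalog output above. A minimal sketch with a placeholder value:
# Placeholder UUID taken from the process catalog output
process_id = '123e4567-e89b-12d3-a456-426614174000'
tdfs4ds.run(process_id)   # recompute the features defined by that process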
Step 6 — Build a dataset
selected_features = {
    'KPI1': '<process_uuid>',
    'KPI2': '<process_uuid>',
}
dataset = tdfs4ds.build_dataset(
    entity_id={'ID': 'BIGINT'},
    selected_features=selected_features,
    view_name='my_dataset',
    comment='Dataset for churn model'
)
selected_features maps each feature name to the UUID of the process that computed it.
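However build_dataset() hands back its result, the assembled data also lives in the my_dataset view, so it can be read like any other table with plain teradataml. A minimal sketch, assuming the view is created in the database set by tdfs4ds.SCHEMA:
# Open the generated dataset view and pull a sample client-side
dataset_df = tdml.DataFrame(tdml.in_schema('my_database', 'my_dataset'))
print(dataset_df.to_pandas().head())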
Key Configuration Variables
tdfs4ds.SCHEMA = 'my_database' # target database (auto-set from context)
# Data domain: use tdfs4ds.create_data_domain() / select_data_domain() / get_data_domains()
tdfs4ds.FEATURE_STORE_TIME = None # None = current; '2024-01-01 00:00:00' = time travel
tdfs4ds.DISPLAY_LOGS = True # verbose logging
tdfs4ds.DEBUG_MODE = False
tdfs4ds.STORE_FEATURE = 'MERGE' # 'MERGE' or 'UPDATE_INSERT'
# GenAI documentation
tdfs4ds.INSTRUCT_MODEL_PROVIDER = 'openai' # or 'bedrock'
tdfs4ds.INSTRUCT_MODEL_MODEL = 'gpt-4o'
tdfs4ds.INSTRUCT_MODEL_API_KEY = 'sk-...'
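Rather than hardcoding the key, you can read it from an environment variable (the name OPENAI_API_KEY below is a common convention, not something tdfs4ds requires):
import os
# Keep secrets out of source code; load the key from the environment
tdfs4ds.INSTRUCT_MODEL_API_KEY = os.environ.get('OPENAI_API_KEY')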
Time Travel
All catalogs and feature stores are temporal. Point-in-time queries are available via:
tdfs4ds.FEATURE_STORE_TIME = '2024-01-01 00:00:00' # query historical state
tdfs4ds.FEATURE_STORE_TIME = None # back to current state
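Combined with build_dataset(), this makes it possible to rebuild a training set exactly as the feature store looked at a past date; a minimal sketch reusing the objects from the walkthrough:
# Rebuild the dataset as of January 1st, 2024
tdfs4ds.FEATURE_STORE_TIME = '2024-01-01 00:00:00'
dataset_2024 = tdfs4ds.build_dataset(
    entity_id={'ID': 'BIGINT'},
    selected_features=selected_features,
    view_name='my_dataset_20240101',
    comment='Point-in-time snapshot for reproducibility'
)
# Return to the current state when done
tdfs4ds.FEATURE_STORE_TIME = None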
Package Structure
tdfs4ds/
├── __init__.py — Global config variables & re-exported public API
├── lifecycle.py — setup(), connect()
├── execution.py — run(), upload_features(), roll_out()
├── catalog.py — feature_catalog(), process_catalog(), dataset_catalog()
├── data_domain.py — get_data_domains(), select_data_domain(), create_data_domain()
├── datasets.py — Utility dataset helpers
├── feature_store/
│ ├── entity_management.py — register_entity(), remove_entity()
│ ├── feature_data_processing.py — prepare_feature_ingestion(), store_feature(), apply_collect_stats()
│ ├── feature_query_retrieval.py — get_list_features(), get_available_features(), get_feature_versions()
│ └── feature_store_management.py — register_features(), feature_store_table_creation()
├── process_store/
│ ├── process_followup.py — followup_open(), followup_close(), follow_up_report()
│ ├── process_query_administration.py — list_processes(), get_process_id(), remove_process()
│ ├── process_registration_management.py — register_process_view()
│ └── process_store_catalog_management.py — process_store_catalog_creation()
├── dataset/
│ ├── builder.py — build_dataset(), build_dataset_opt(), augment_source_with_features()
│ ├── dataset.py — Dataset class
│ └── dataset_catalog.py — DatasetCatalog class
├── genai/
│ └── documentation.py — LLM-powered auto-documentation of SQL processes (OpenAI / Bedrock)
├── lineage/
│ ├── lineage.py — SQL query parsing, DDL analysis
│ ├── network.py — Dependency graph construction
│ └── indexing.py — Lineage indexing utilities
└── utils/
├── query_management.py — execute_query(), execute_query_wrapper()
├── filter_management.py — FilterManager class
├── time_management.py — TimeManager class
├── lineage.py — crystallize_view(), analyze_sql_query(), generate_view_dependency_network()
├── info.py — update_varchar_length(), get_column_types(), seconds_to_dhms()
└── visualization.py — plot_graph(), visualize_graph(), display_table()
Discover Registered Features
from tdfs4ds.feature_store.feature_query_retrieval import (
get_list_entity,
get_list_features,
get_available_features,
get_feature_versions,
)
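These helpers query the catalogs of the active data domain. Their exact signatures are not documented here, so the calls below are a hedged sketch assuming they can be invoked without arguments:
# List registered entities and features in the active data domain
print(get_list_entity())
print(get_list_features())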
Requirements
- Python >= 3.6
- teradataml >= 17.20
- Active Teradata Vantage connection
- VALIDTIME temporal tables must be supported on the Teradata Vantage system; all feature catalogs, process catalogs, and feature stores rely on VALIDTIME support