Skip to main content

Wedata Feature Engineering Library Development

Project description

Wedata Feature Engineering Library

Wedata 特征工程库,提供特征存储、特征管理、离在线同步等功能,适用于腾讯云 WeData 平台上的 ML 特征工程场景。

功能特性

  • 特征表管理:创建、删除、修改特征表,支持 Iceberg 等多种存储引擎
  • 特征数据读写:基于 Spark 的高效特征数据写入和读取
  • 离在线同步:一键将离线特征表发布为在线特征表,支持周期性同步和一次性同步
  • 在线特征服务:基于 Redis 的在线特征存储和实时查询
  • 训练集构建:通过 Feature Lookup 自动关联多个特征表,构建训练数据集

安装

pip install tencent-wedata-feature-engineering

快速开始

from wedata.feature_store.feature_table_client import FeatureTableClient

# 初始化客户端(在 WeData Notebook 中自动获取环境配置)
client = FeatureTableClient(spark)

# 创建特征表
feature_table = client.create_table(
    name="user_features",
    database_name="my_feature_db",
    primary_keys=["user_id"],
    df=features_df,
    timestamp_key="event_timestamp"
)

# 发布为在线特征表
client.publish_table(
    table_name="user_features",
    data_source_name="my_datasource"
)

使用环境

本库需要在腾讯云 WeData 平台的 Notebook 或 CodeStudio 环境中使用,依赖平台提供的 Spark 引擎和 Feast 特征服务。

许可证

Apache 2.0

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tencent_wedata_feature_engineering-0.1.51.3.tar.gz (151.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file tencent_wedata_feature_engineering-0.1.51.3.tar.gz.

File metadata

File hashes

Hashes for tencent_wedata_feature_engineering-0.1.51.3.tar.gz
Algorithm Hash digest
SHA256 b4c9107981d737d12a4ee6d8d46ec76ed5c43af97d50e9a3554c00f3502ce4be
MD5 a722dccd55ef294b5670b4a0438d9587
BLAKE2b-256 f6ed8c0821a719f46af82cf98d4995a918ee18d095e07b20d289deaab1ca6718

See more details on using hashes here.

File details

Details for the file tencent_wedata_feature_engineering-0.1.51.3-py3-none-any.whl.

File metadata

File hashes

Hashes for tencent_wedata_feature_engineering-0.1.51.3-py3-none-any.whl
Algorithm Hash digest
SHA256 80030f30151853d688f7156db685e435ebbfc6ae9f026e7f99e00ba2aaa664bf
MD5 0f1cf5accfbe76d06eb9b4ef604a1b07
BLAKE2b-256 7ab223ab3f1e78b2d880854553ab6667b0b696c8f6efe565531da4927a585e02

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page