A two-dimensional data object with labeled axes and typing information.

Woodwork

DataTables are common data objects for use with Featuretools, EvalML, and general ML workflows. A DataTable object records the physical, logical, and semantic data types present in the data. It also stores metadata about the data.
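To make the three kinds of type information more concrete, here is a minimal conceptual sketch in plain Python. It is not Woodwork's implementation: the classes ColumnTyping and TableTyping, their field names, and the type strings "int64", "object", "Integer", and "Unknown" are all hypothetical and chosen only for illustration. Only the names Categorical and NaturalLanguage come from the example further down.

```python
from dataclasses import dataclass, field
from typing import Dict, Set


@dataclass
class ColumnTyping:
    """Hypothetical record of the typing information kept for one column."""
    physical_type: str   # how the values are stored, e.g. "int64"
    logical_type: str    # how the values should be interpreted, e.g. "Categorical"
    semantic_tags: Set[str] = field(default_factory=set)  # extra meaning, e.g. {"index"}


@dataclass
class TableTyping:
    """Hypothetical container: typing per column plus table-level metadata."""
    name: str
    columns: Dict[str, ColumnTyping] = field(default_factory=dict)
    metadata: Dict[str, str] = field(default_factory=dict)

    def set_types(self, logical_types: Dict[str, str]) -> None:
        """Override the logical type of the named columns."""
        for column, logical_type in logical_types.items():
            if column not in self.columns:
                raise KeyError(f"unknown column: {column}")
            self.columns[column].logical_type = logical_type


# Illustrative starting state: storage types with placeholder logical types.
retail = TableTyping(
    name="retail",
    columns={
        "store_id": ColumnTyping("int64", "Integer"),
        "comments": ColumnTyping("object", "Unknown"),
    },
    metadata={"source": "illustrative example"},
)

# Changing the logical type leaves the physical storage type untouched.
retail.set_types({"store_id": "Categorical", "comments": "NaturalLanguage"})
```

The point of the sketch is the separation of concerns: the same stored integers in store_id can be reinterpreted as categories without touching how they are stored, and semantic tags and metadata travel with the table rather than living in separate bookkeeping.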

Installation

Clone repo

git clone https://github.com/FeatureLabs/woodwork.git
cd woodwork

Install with pip in editable mode

python -m pip install -e .

Example

Below is an example of using Woodwork.

from woodwork import DataTable
from woodwork.logical_types import Datetime, Categorical, NaturalLanguage

import pandas as pd

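# Load the raw data into a pandas DataFrame
df = pd.read_csv(...)

# Wrap the DataFrame in a DataTable
dt = DataTable(df,
               name='retail',  # defaults to df.name
               index=None,
               time_index=None)

# Set the logical types of specific columns
dt.set_types({
    "datetime": Datetime,
    "comments": NaturalLanguage,
    "store_id": Categorical
})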

Built at Alteryx Innovation Labs


