a two-dimensional data object with labeled axes and typing information
Project description
Woodwork
DataTable are common data objects to use with Featuretools, EvalML, and general ML. A DataTable object contains the physical, logical, and semantic data types present in the data. In addition, it stores metadata about the data.
Installation
Clone repo
git clone https://github.com/FeatureLabs/woodwork.git
cd woodwork
Install with pip in editable mode
python -m pip install -e .
Example
Below is an example of using Woodwork.
from woodwork import DataTable
from woodwork.logical_types import Datetime, Categorical, NaturalLanguage
import pandas as pd
df = pd.read_csv(...)
dt = DataTable(df,
name='retail', # default to df.name
index=None,
time_index=None)
dt.set_types({
"datetime": Datetime,
"comments": NaturalLanguage,
"store_id": Categorical
})
Built at Alteryx Innovation Labs
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
woodwork-0.0.1.tar.gz
(22.7 kB
view hashes)
Built Distribution
woodwork-0.0.1-py3-none-any.whl
(27.9 kB
view hashes)