Deep-learning Toolkit for Tabular datasets
Project description
DeepTables
DeepTables: Deep-learning Toolkit for Tabular data
DeepTables(DT) is a easy-to-use toolkit that enables deep learning to unleash great power on tabular data.
Overview
MLP (also known as Fully-connected neural networks) have been shown inefficient in learning distribution representation. The "add" operations of the perceptron layer have been proven poor performance to exploring multiplicative feature interactions. In most cases, manual feature engineering is necessary and this work requires extensive domain knowledge and very cumbersome. How learning feature interactions efficiently in neural networks becomes the most important problem.
Various models have been proposed to CTR prediction and continue to outperform existing state-of-the-art approaches to the late years. Well-known examples include FM, DeepFM, Wide&Deep, DCN, PNN, etc. These models can also provide good performance on tabular data under reasonable utilization.
DT aims to utilize the latest research findings to provide users with an end-to-end toolkit on tabular data.
DT has been designed with these key goals in mind:
- Easy to use, non-experts can also use.
- Provide good performance out of the box.
- Flexible architecture and easy expansion by user.
Example:
import numpy as np
from deeptables.models import deeptable, deepnets
from examples.datasets import utils as dsutils
from sklearn.model_selection import train_test_split
#loading data
df = dsutils.load_bank()
df_train, df_test = train_test_split(df, test_size=0.2, random_state=42)
y = df_train.pop('y')
y_test = df_test.pop('y')
config = deeptable.ModelConfig(nets=deepnets.DeepFM)
#training
dt = deeptable.DeepTable(config=config)
model, history = dt.fit(df_train, y, epochs=10)
#evaluation
result = dt.evaluate(df_test,y_test, batch_size=512, verbose=0)
print(result)
#scoring
preds = dt.predict(df_test)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file deeptables-0.1.3.tar.gz
.
File metadata
- Download URL: deeptables-0.1.3.tar.gz
- Upload date:
- Size: 55.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/44.0.0.post20200106 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.7.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3bd323fb8fa93a6c7c4c6a8087b51c04c74271395214944e13a41b5ee99cfe3a |
|
MD5 | ede96fc1270b1996ffd8130e192ef36a |
|
BLAKE2b-256 | 2e2458ea730d09ed0990b51e457bd5ac97012105c3742e607dffec03a03b860c |
File details
Details for the file deeptables-0.1.3-py3-none-any.whl
.
File metadata
- Download URL: deeptables-0.1.3-py3-none-any.whl
- Upload date:
- Size: 2.2 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/44.0.0.post20200106 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.7.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 11f9232809beb11d0b5aed70cef5e027ac0f93b97a5a80deba484cfcf230bb6d |
|
MD5 | 7dbfabec8a322e95872100864fe87563 |
|
BLAKE2b-256 | dcc06e78190296163ae576e9d1bb000ccc74c9383f5670ed0a40e23d12898ad7 |