Skip to main content

Deep-learning Toolkit for Tabular datasets

Project description

DeepTables

DeepTables: Deep-learning Toolkit for Tabular data

DeepTables(DT) is a easy-to-use toolkit that enables deep learning to unleash great power on tabular data.

Overview

MLP (also known as Fully-connected neural networks) have been shown inefficient in learning distribution representation. The "add" operations of the perceptron layer have been proven poor performance to exploring multiplicative feature interactions. In most cases, manual feature engineering is necessary and this work requires extensive domain knowledge and very cumbersome. How learning feature interactions efficiently in neural networks becomes the most important problem.

Various models have been proposed to CTR prediction and continue to outperform existing state-of-the-art approaches to the late years. Well-known examples include FM, DeepFM, Wide&Deep, DCN, PNN, etc. These models can also provide good performance on tabular data under reasonable utilization.

DT aims to utilize the latest research findings to provide users with an end-to-end toolkit on tabular data.

DT has been designed with these key goals in mind:

  • Easy to use, non-experts can also use.
  • Provide good performance out of the box.
  • Flexible architecture and easy expansion by user.

Example:

import numpy as np
from deeptables.models import deeptable, deepnets
from examples.datasets import utils as dsutils
from sklearn.model_selection import train_test_split

#loading data
df = dsutils.load_bank()
df_train, df_test = train_test_split(df, test_size=0.2, random_state=42)

y = df_train.pop('y')
y_test = df_test.pop('y')

config = deeptable.ModelConfig(nets=deepnets.DeepFM)

#training
dt = deeptable.DeepTable(config=config)
model, history = dt.fit(df_train, y, epochs=10)

#evaluation
result = dt.evaluate(df_test,y_test, batch_size=512, verbose=0)
print(result)

#scoring
preds = dt.predict(df_test)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deeptables-0.1.3.tar.gz (55.0 kB view details)

Uploaded Source

Built Distribution

deeptables-0.1.3-py3-none-any.whl (2.2 MB view details)

Uploaded Python 3

File details

Details for the file deeptables-0.1.3.tar.gz.

File metadata

  • Download URL: deeptables-0.1.3.tar.gz
  • Upload date:
  • Size: 55.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/44.0.0.post20200106 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.7.5

File hashes

Hashes for deeptables-0.1.3.tar.gz
Algorithm Hash digest
SHA256 3bd323fb8fa93a6c7c4c6a8087b51c04c74271395214944e13a41b5ee99cfe3a
MD5 ede96fc1270b1996ffd8130e192ef36a
BLAKE2b-256 2e2458ea730d09ed0990b51e457bd5ac97012105c3742e607dffec03a03b860c

See more details on using hashes here.

File details

Details for the file deeptables-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: deeptables-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 2.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/44.0.0.post20200106 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.7.5

File hashes

Hashes for deeptables-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 11f9232809beb11d0b5aed70cef5e027ac0f93b97a5a80deba484cfcf230bb6d
MD5 7dbfabec8a322e95872100864fe87563
BLAKE2b-256 dcc06e78190296163ae576e9d1bb000ccc74c9383f5670ed0a40e23d12898ad7

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page