
Deep-learning Toolkit for Tabular datasets


DeepTables


DeepTables: Deep-learning Toolkit for Tabular data

DeepTables (DT) is an easy-to-use toolkit that brings the power of deep learning to tabular data.

Overview

MLPs (also known as fully connected neural networks) have been shown to be inefficient at learning distribution representations. The additive operations of a perceptron layer have been shown to perform poorly at capturing multiplicative feature interactions. In most cases manual feature engineering is therefore necessary, and that work is cumbersome and requires extensive domain knowledge. How to learn feature interactions efficiently in neural networks has become the central problem.

Many models have been proposed for CTR prediction, and in recent years they have continued to outperform earlier state-of-the-art approaches. Well-known examples include FM, DeepFM, Wide&Deep, DCN, and PNN. Used appropriately, these models can also perform well on tabular data.
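To make the idea of a multiplicative feature interaction concrete, here is a minimal sketch of the second-order term used by FM-style models, written in plain Python. It is not DeepTables code: the function names, the toy input x and the embedding table v are all illustrative assumptions. The sketch compares the pair-by-pair definition with the well-known linear-time reformulation.

```python
import random
from itertools import combinations


def fm_pairwise_naive(x, v):
    """Sum over all pairs i < j of <v[i], v[j]> * x[i] * x[j]."""
    total = 0.0
    for i, j in combinations(range(len(x)), 2):
        dot = sum(a * b for a, b in zip(v[i], v[j]))
        total += dot * x[i] * x[j]
    return total


def fm_pairwise_fast(x, v):
    """Same quantity in O(n * k): 0.5 * sum_f ((sum_i v_if x_i)^2 - sum_i (v_if x_i)^2)."""
    k = len(v[0])
    total = 0.0
    for f in range(k):
        s = sum(v[i][f] * x[i] for i in range(len(x)))
        s_sq = sum((v[i][f] * x[i]) ** 2 for i in range(len(x)))
        total += s * s - s_sq
    return 0.5 * total


# Hypothetical toy data: 5 features, each with a 3-dimensional embedding.
rng = random.Random(0)
n_features, k = 5, 3
x = [rng.uniform(-1, 1) for _ in range(n_features)]
v = [[rng.uniform(-1, 1) for _ in range(k)] for _ in range(n_features)]

naive = fm_pairwise_naive(x, v)
fast = fm_pairwise_fast(x, v)
print(abs(naive - fast) < 1e-9)  # the two formulations agree up to rounding
```

Each pair of features contributes a product x[i] * x[j] weighted by the inner product of their embeddings, which is the kind of interaction a purely additive layer has to approximate indirectly.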

DT aims to utilize the latest research findings to provide users with an end-to-end toolkit on tabular data.

DT has been designed with these key goals in mind:

  • Easy to use, even for non-experts.
  • Good performance out of the box.
  • A flexible architecture that users can easily extend.

Tutorials

Please refer to the official docs at https://deeptables.readthedocs.io/en/latest/.

Installation

pip install deeptables

GPU Setup (Optional)

pip install deeptables[gpu]

Verify the install:

python -c "from deeptables.utils.quicktest import test; test()"

Example:

import numpy as np
from deeptables.models import deeptable, deepnets
from deeptables.datasets import dsutils
from sklearn.model_selection import train_test_split

# Load the bundled bank dataset and hold out 20% of the rows for testing
df = dsutils.load_bank()
df_train, df_test = train_test_split(df, test_size=0.2, random_state=42)

# Separate the target column 'y' from the features
y = df_train.pop('y')
y_test = df_test.pop('y')

# Train a DeepFM model
config = deeptable.ModelConfig(nets=deepnets.DeepFM)
dt = deeptable.DeepTable(config=config)
model, history = dt.fit(df_train, y, epochs=10)

# Evaluate on the held-out test set
result = dt.evaluate(df_test, y_test, batch_size=512, verbose=0)
print(result)

# Score the test set
preds = dt.predict(df_test)
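Once predictions are available, a quick sanity check is to compare them with the held-out labels. The helper below is a minimal sketch in plain Python and is not part of DeepTables. It assumes the predictions are class labels in the same order and with the same label values as the targets, and it uses hypothetical toy lists in place of y_test and preds so that it runs on its own.

```python
def accuracy(y_true, y_pred):
    """Fraction of positions where the predicted label equals the true label."""
    y_true = list(y_true)
    y_pred = list(y_pred)
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("cannot score an empty prediction set")
    matches = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return matches / len(y_true)


# Hypothetical toy labels standing in for y_test and preds
toy_true = ["no", "yes", "no", "no"]
toy_pred = ["no", "no", "no", "yes"]
print(accuracy(toy_true, toy_pred))  # 2 of 4 labels match -> 0.5
```

Under the same assumptions, it could be called on the example's own outputs as accuracy(y_test, preds).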
