Skip to main content

Tools for processing tabular datasets for PyTorch

Project description

https://circleci.com/gh/keitakurita/torchtable.svg?style=svg Documentation Status

torchtable

Torchtable is a library for handling tabular datasets in PyTorch. It is heavily inspired by torchtext and uses a similar API but without some of the limitations (e.g. only one field per column). Torchtable aims to be simple to use and easily extensible. It provides sensible defaults while allowing the user to define their own custom pipelines, putting all of this behind an intuitive interface.

Installation

Install via pip.

$ pip install torchtable

Documentation

Documentation is a work in progress, but the current docs can be read here. In addition, you can read the notebooks in the examples directory or dev_nb directory to learn more.

Usage

Torchtable uses a declarative API similar to torchtext. Here is an example of how you might handle an imaginary dataset where you are supposed to predict the price of some product.

>>> train = TabularDataset.from_csv('data/train.csv',
...    fields={'seller_id': CategoricalField(min_freq=3),
...            'timestamp': [DayofWeekField(), HourField()],
...            'price': NumericalField(fill_missing="median", is_target=True)
...    })
...

See the examples directory for more examples.

TODO

  • Add more models

  • Implement default field selection

  • Implement text field/operations

  • Implement swap noise

  • Implement input/output validation

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

torchtable-0.1.0.tar.gz (18.0 kB view details)

Uploaded Source

Built Distribution

torchtable-0.1.0-py3-none-any.whl (24.7 kB view details)

Uploaded Python 3

File details

Details for the file torchtable-0.1.0.tar.gz.

File metadata

  • Download URL: torchtable-0.1.0.tar.gz
  • Upload date:
  • Size: 18.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.7

File hashes

Hashes for torchtable-0.1.0.tar.gz
Algorithm Hash digest
SHA256 2cc9575905226c0ad6c701d03017402f7a7187d189b8da5023ce04f2cb1aa860
MD5 b15096d94e80ea12b4b4700297ec6680
BLAKE2b-256 11e74332b142118efb307ee91e24841f5ce781c06e6427fb6d851bc9eef2202b

See more details on using hashes here.

File details

Details for the file torchtable-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: torchtable-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 24.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.7

File hashes

Hashes for torchtable-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 29b5768fb6ee3195eea3dc8e23d3b072e17fff2d69aa1e2309b8cc1ccbbfb4b1
MD5 ba54e0fb26eefd7b8d9e776f4b7fedaa
BLAKE2b-256 d7a40eab2395179abce676828882466e41991de388fa9235f0795db349c8dc1a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page