Skip to main content

PyTorch dataset wrapper for the

Project description

adult-dataset

A PyTorch dataset wrapper for the Adult (Census Income) dataset. Adult is a popular dataset in machine learning fairness research.

This package provides the adult.Adult class: atorch.utils.data.Datasets loading and, optionally, downloading the Adult dataset. It can be used like the MNIST dataset in torchvision.

Beyond adult.Adult, this package also provides adult.AdultRaw, which works just as adult.Adult, but does not standardize the features in the dataset and does not apply one-hot encoding.

Installation

pip install adult-dataset

Basic Usage

from adult import Adult

# load (if necessary, download) the Adult training dataset 
train_set = Adult(root="datasets", download=True)
# load the test set
test_set = Adult(root="datasets", train=False, download=True)

inputs, target = train_set[0]  # retrieve the first sample of the training set

# iterate over the training set
for inputs, target in iter(train_set):
    ...  # Do something with a single sample

# use a PyTorch data loader
from torch.utils.data import DataLoader

loader = DataLoader(test_set, batch_size=32, shuffle=True)
for epoch in range(100):
    for inputs, targets in iter(loader):
        ...  # Do something with a batch of samples

Advanced Usage

Turn off status messages while downloading the dataset:

Adult(root=..., output_fn=None)

Use the logging module for logging status messages while downloading the dataset instead of placing the status messages on sys.stdout.

import logging

Adult(root=..., output_fn=logging.info)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

adult_dataset-3.0.0.tar.gz (11.1 kB view details)

Uploaded Source

Built Distribution

adult_dataset-3.0.0-py3-none-any.whl (8.9 kB view details)

Uploaded Python 3

File details

Details for the file adult_dataset-3.0.0.tar.gz.

File metadata

  • Download URL: adult_dataset-3.0.0.tar.gz
  • Upload date:
  • Size: 11.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.31.0

File hashes

Hashes for adult_dataset-3.0.0.tar.gz
Algorithm Hash digest
SHA256 ff1822a4c3b7c8fa9e1be9bf9055f372127eb79d68131abc7167017ae94f31e0
MD5 a373bc69b9bfdb1ca196d15319b6971d
BLAKE2b-256 03651e1491346966603f5c23f7ae3c8a0d87e695fae1e2d7967853763442b0b4

See more details on using hashes here.

File details

Details for the file adult_dataset-3.0.0-py3-none-any.whl.

File metadata

File hashes

Hashes for adult_dataset-3.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 8d711c0ce7cd1b07c7c02f6c7e14b8603bed0ec173ca4e3b7c20cca1fb4a0ba2
MD5 7840bb17e68f17ecb9078bbd781ce254
BLAKE2b-256 358242bdf580e63e70c68c7c6f6c027cb6a49ceca128521750041fda9a11ed41

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page