Skip to main content

Easy access to datasets from the UCI Machine Learning Repository

Project description

Build Status Documentation Status

An API to the UCI Machine Learning Repository

This is a python package to enable easy access to datasets in the UCI Machine Learning Repository. Note that this is not an official API. Any usage of datasets should be cited according to instructions in the UCI Machine Learning Repository.

The project is at an early alpha stage, so suggestion for changes or additions are very welcome.

Link to documentation.

Basic usage

from ucimlr import regression_datasets
abalone = regression_datasets.Abalone('dataset_folder')
print(abalone.type_)
>>> regression
print(len(abalone))
>>> 3341

Independent and dependent variables are accessed as numpy arrays:

print(abalone.x.shape)
>>> (3341, 10)
print(abalone.y.shape)
>>> (3341, 1)

Or by element access:

x, y = dataset[0]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ucimlr-0.3.0.tar.gz (9.1 kB view hashes)

Uploaded source

Built Distribution

ucimlr-0.3.0-py3-none-any.whl (20.6 kB view hashes)

Uploaded py3

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page