Skip to main content

Fetcher for datasets

Project description

image0 image1

A python util to fetch datasets from different databases.

Currently supported databases are:

Getting design matrix and target variable is as easy as:

from libsvmdata import fetch_dataset
X, y = fetch_dataset("news20.binary")

Currently supported datasets are in libsvmdata.supported and can be displayed as:

from libsvmdata import print_supported_datasets
print_supported_datasets()

There is no need to specify the database name.

Files are saved under DATA_HOME/<database_name>, where the value of DATA_HOME is:

  • the environment variable LIBSVMDATA_HOME if it exists,

  • else, the environment variable XDG_DATA_HOME if it exists,

  • else, $HOME/data.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

libsvmdata-0.5.tar.gz (9.5 kB view details)

Uploaded Source

File details

Details for the file libsvmdata-0.5.tar.gz.

File metadata

  • Download URL: libsvmdata-0.5.tar.gz
  • Upload date:
  • Size: 9.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.14

File hashes

Hashes for libsvmdata-0.5.tar.gz
Algorithm Hash digest
SHA256 292b675ce8ea363dfbf7d32b4b6a5e13b59b74ffff545a2198237270da710e46
MD5 ab627d4dd917a135e71da3e6cb7be4b4
BLAKE2b-256 63a46ce78f70a5ef600ea534c1ba45403783e6f3750230f83e85850f8282732b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page