Skip to main content

Collection of Kaggle Datasets ready to use for Everyone

Project description

kaggledatasets

Collection of Kaggle Datasets ready to use for Everyone

License Release Platform Support

System Python 3.5 Python 3.6 Python 3.7
Linux Build Status Build Status Build Status
macOS Build Status Build Status Build Status
Windows Build Status Build Status Build Status

More About Kaggle Datasets

import kaggledatasets as kd

dataset = kd.structured.CreditCardFraudDetection(download=True)
# Returns the split for train and test in Scikit and Tensorflow
train, test = dataset.load()
# Returns the train and test data loader for PyTorch
train_dataloader, test_dataloader = dataset.dataloader()

Installation

Binaries

Commands to install from binaries via Conda or pip wheels are on our website: https://kaggledatasets.github.io

From Source

Install Dependencies

pip install -r requirements.txt

Get the kaggledatasets Source

git clone --recursive https://github.com/kaggledatasets/kaggledatasets
cd kaggledatasets

Install kaggledatasets

python setup.py install

Getting Started

Communication

  • GitHub Issues: bug reports, feature requests, dataset requests, install issues, help wanted, thoughts, etc.
  • Slack: The Kaggle Datasets Slack hosts a primary audience of moderate to experienced Kaggle Datasets users and developers for general chat, online discussions, collaboration etc. If you need a slack invite, please visit: http://bit.ly/kdslack

Releases and Contributing

kaggledatasets is expecting to have a 30 day release cycle (major releases). Please let us know if you encounter a bug by filing an issue.

We appreciate all contributions and make sure you go through our Contributing Guide. If you are planning to contribute back bug-fixes, please do so without any further discussion.

If you plan to contribute new features, new datasets, utility functions or extensions to the core, please first open an issue and discuss the feature with us. Sending a PR without discussion might end up resulting in a rejected PR, because we might be taking kaggledatasets in a different direction than you might be aware of.

License

kaggledatasets is Apache-2.0 licensed, as found in the LICENSE file.

Project details


Release history Release notifications

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for kaggledatasets, version 0.0.1
Filename, size File type Python version Upload date Hashes
Filename, size kaggledatasets-0.0.1-py2.py3-none-any.whl (15.2 kB) File type Wheel Python version py2.py3 Upload date Hashes View hashes
Filename, size kaggledatasets-0.0.1.tar.gz (7.4 kB) File type Source Python version None Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page