Skip to main content

Collection of Kaggle Datasets ready to use for Everyone

Project description


Collection of Kaggle Datasets ready to use for Everyone

License Release Platform Support

System Python 3.5 Python 3.6 Python 3.7
Linux Build Status Build Status Build Status
macOS Build Status Build Status Build Status
Windows Build Status Build Status Build Status

More About Kaggle Datasets

import kaggledatasets as kd

dataset = kd.structured.CreditCardFraudDetection(download=True)
# Returns the split for train and test in Scikit and Tensorflow
train, test = dataset.load()
# Returns the train and test data loader for PyTorch
train_dataloader, test_dataloader = dataset.dataloader()



Commands to install from binaries via Conda or pip wheels are on our website:

From Source

Install Dependencies

pip install -r requirements.txt

Get the kaggledatasets Source

git clone --recursive
cd kaggledatasets

Install kaggledatasets

python install

Getting Started


  • GitHub Issues: bug reports, feature requests, dataset requests, install issues, help wanted, thoughts, etc.
  • Slack: The Kaggle Datasets Slack hosts a primary audience of moderate to experienced Kaggle Datasets users and developers for general chat, online discussions, collaboration etc. If you need a slack invite, please visit:

Releases and Contributing

kaggledatasets is expecting to have a 30 day release cycle (major releases). Please let us know if you encounter a bug by filing an issue.

We appreciate all contributions and make sure you go through our Contributing Guide. If you are planning to contribute back bug-fixes, please do so without any further discussion.

If you plan to contribute new features, new datasets, utility functions or extensions to the core, please first open an issue and discuss the feature with us. Sending a PR without discussion might end up resulting in a rejected PR, because we might be taking kaggledatasets in a different direction than you might be aware of.


kaggledatasets is Apache-2.0 licensed, as found in the LICENSE file.

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kaggledatasets-0.0.1.tar.gz (7.4 kB view hashes)

Uploaded source

Built Distribution

kaggledatasets-0.0.1-py2.py3-none-any.whl (15.2 kB view hashes)

Uploaded py2 py3

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page