Skip to main content

Collection of Kaggle Datasets ready to use for Everyone

Project description

kaggledatasets

Collection of Kaggle Datasets ready to use for Everyone

License Release Platform Support

System Python 3.5 Python 3.6 Python 3.7
Linux Build Status Build Status Build Status
macOS Build Status Build Status Build Status
Windows Build Status Build Status Build Status

More About Kaggle Datasets

import kaggledatasets as kd

dataset = kd.structured.CreditCardFraudDetection(download=True)
# Returns the split for train and test in Scikit and Tensorflow
train, test = dataset.load()
# Returns the train and test data loader for PyTorch
train_dataloader, test_dataloader = dataset.dataloader()

Installation

Binaries

Commands to install from binaries via Conda or pip wheels are on our website: https://kaggledatasets.github.io

From Source

Install Dependencies

pip install -r requirements.txt

Get the kaggledatasets Source

git clone --recursive https://github.com/kaggledatasets/kaggledatasets
cd kaggledatasets

Install kaggledatasets

python setup.py install

Getting Started

Communication

  • GitHub Issues: bug reports, feature requests, dataset requests, install issues, help wanted, thoughts, etc.
  • Slack: The Kaggle Datasets Slack hosts a primary audience of moderate to experienced Kaggle Datasets users and developers for general chat, online discussions, collaboration etc. If you need a slack invite, please visit: http://bit.ly/kdslack

Releases and Contributing

kaggledatasets is expecting to have a 30 day release cycle (major releases). Please let us know if you encounter a bug by filing an issue.

We appreciate all contributions and make sure you go through our Contributing Guide. If you are planning to contribute back bug-fixes, please do so without any further discussion.

If you plan to contribute new features, new datasets, utility functions or extensions to the core, please first open an issue and discuss the feature with us. Sending a PR without discussion might end up resulting in a rejected PR, because we might be taking kaggledatasets in a different direction than you might be aware of.

License

kaggledatasets is Apache-2.0 licensed, as found in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kaggledatasets-0.0.1.tar.gz (7.4 kB view hashes)

Uploaded Source

Built Distribution

kaggledatasets-0.0.1-py2.py3-none-any.whl (15.2 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page