A curated collection of datasets for data analysis & machine learning, downloadable with a single Python command
Project description
opendatasets
A curated collection of datasets for data analysis & machine learning, downloadable with a single Python command.
Try it out online: https://jovian.ml/aakashns/opendatasets-demo
Installation & Usage
Install the library using pip
:
pip install opendatasets --upgrade
To use the library, just import it and use the download
function.
import opendatasets as od
od.download('stackoverflow-developer-survey-2020')
See the next section for a list of available dataset IDs.
Datasets
Dataset ID | Description | Source |
---|---|---|
stackoverflow-developer-survey-2020 |
Stack Overflow Developer Survey 2020 | Stack Overflow |
Contributing
This is an open source project and we welcome contributions.
Local Development Setup
- Clone the repository:
git clone https://github.com/JovianML/opendatasets.git
- Setup the Python environment for development
conda create -n opendatasets python=3.5
conda activate opendatasets
pip install -r requirements.txt
- Open up the project in VS code and make your changes. Make sure to install the Python Extension for VS Code and select the
opendatasets
conda environment.
Adding Datasets
TODO - more details will be added here later
This package is developed and mainted by the Jovian.ml team.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
opendatasets-0.0.108.tar.gz
(4.0 kB
view hashes)
Built Distribution
Close
Hashes for opendatasets-0.0.108-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ab73d632816fca5dca3943aabf08f4f2925d728c30ba3ff3b8abba8a4c149239 |
|
MD5 | 2bbe35b47e5dfd04f76796c7b23d0bb5 |
|
BLAKE2b-256 | 28edc6aac97ff7a006b0fc9fb8808a25c6454f86d8ccf7df6682de085df5a9dc |