opendatasets
A curated collection of datasets for data analysis & machine learning, downloadable with a single Python command.
Try it out online: https://jovian.ml/aakashns/opendatasets-demo
Installation & Usage
Install the library using pip:
pip install opendatasets --upgrade
To use the library, just import it and use the download function.
import opendatasets as od
od.download('stackoverflow-developer-survey-2020')
See the next section for a list of available dataset IDs.
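Once a download finishes, a quick scan of the working directory shows which data files arrived. The snippet below is only a minimal sketch: `find_csv_files` is a hypothetical helper, not part of opendatasets, and it assumes the downloaded files land somewhere under the current directory and include CSV files. Neither assumption is stated in this README.

```python
from pathlib import Path


def find_csv_files(root="."):
    """Return every CSV file found below `root`, sorted by path.

    Hypothetical helper (not part of opendatasets). It assumes the
    downloaded files are stored somewhere under `root`.
    """
    return sorted(Path(root).rglob("*.csv"))


# Possible usage after od.download('stackoverflow-developer-survey-2020'):
#   for csv_path in find_csv_files("."):
#       print(csv_path)
```

If the files turn out to be stored elsewhere or in another format, adjust `root` or the glob pattern accordingly.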
Datasets
Dataset ID | Description | Source |
---|---|---|
stackoverflow-developer-survey-2020 | Stack Overflow Developer Survey 2020 | Stack Overflow |
owid-covid-19-latest | Covid-19 Stats by Our World in Data | Our World in Data |
state-of-javascript-2016 | State of Javascript Annual Survey 2016 | StateOfJS |
state-of-javascript-2017 | State of Javascript Annual Survey 2017 | StateOfJS |
state-of-javascript-2018 | State of Javascript Annual Survey 2018 | StateOfJS |
state-of-javascript-2019 | State of Javascript Annual Survey 2019 | StateOfJS |
countries-languages-spoken | Languages Spoken in Different Countries | Infoplease |
More datasets will be added soon.
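To fetch several of the datasets listed above in one go, a small wrapper can check each ID against the table before calling the library. This is a sketch under stated assumptions: `KNOWN_DATASET_IDS` and `download_many` are hypothetical names introduced here, the ID list is copied by hand from the table and will go stale as datasets are added, and the only library call used is `od.download(dataset_id)` as shown in the usage section.

```python
# Dataset IDs copied by hand from the table above (assumption: this
# list is not provided by opendatasets and may be out of date).
KNOWN_DATASET_IDS = (
    "stackoverflow-developer-survey-2020",
    "owid-covid-19-latest",
    "state-of-javascript-2016",
    "state-of-javascript-2017",
    "state-of-javascript-2018",
    "state-of-javascript-2019",
    "countries-languages-spoken",
)


def download_many(dataset_ids, download=None):
    """Download each dataset in `dataset_ids`, rejecting unlisted IDs.

    Hypothetical helper. `download` defaults to opendatasets.download
    and can be replaced with a stub, for example in tests.
    """
    dataset_ids = list(dataset_ids)
    unknown = [d for d in dataset_ids if d not in KNOWN_DATASET_IDS]
    if unknown:
        raise ValueError("Unknown dataset IDs: {}".format(", ".join(unknown)))
    if download is None:
        # Imported lazily so the ID check works without the package.
        import opendatasets as od
        download = od.download
    for dataset_id in dataset_ids:
        download(dataset_id)
```

Validating every ID up front means a typo is reported before any download starts, rather than halfway through the batch.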
Contributing
This is an open source project and we welcome contributions.
Local Development Setup
- Clone the repository:
git clone https://github.com/JovianML/opendatasets.git
- Set up the Python environment for development:
conda create -n opendatasets python=3.5
conda activate opendatasets
pip install -r requirements.txt
- Open up the project in VS Code and make your changes. Make sure to install the Python Extension for VS Code and select the opendatasets conda environment.
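A quick way to confirm that the right environment is active before making changes is to inspect it from Python. The check below is a rough sketch, not part of the project: `check_dev_environment` is a hypothetical helper, and it relies on the `CONDA_DEFAULT_ENV` variable that `conda activate` sets to the name of the active environment.

```python
import os
import sys


def check_dev_environment(expected_env="opendatasets", environ=None):
    """Return a list of problems with the active environment.

    Hypothetical helper. An empty list means the expected conda
    environment is active and Python is at least version 3.5.
    """
    environ = os.environ if environ is None else environ
    problems = []
    active = environ.get("CONDA_DEFAULT_ENV")
    if active != expected_env:
        problems.append(
            "active conda env is {!r}, expected {!r}".format(active, expected_env)
        )
    if sys.version_info < (3, 5):
        problems.append("Python 3.5 or newer is required")
    return problems


# Possible usage from a terminal inside the project:
#   for problem in check_dev_environment():
#       print("warning:", problem)
```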
Adding Datasets
TODO - more details will be added here later
This package is developed and maintained by the Jovian.ml team.