A curated collection of datasets for data analysis & machine learning, downloadable with a single Python command
Project description
opendatasets
A curated collection of datasets for data analysis & machine learning, downloadable with a single Python command.
Try it out online: https://jovian.ml/aakashns/opendatasets-demo
Installation & Usage
Install the library using pip
:
pip install opendatasets --upgrade
To use the library, just import it and use the download
function.
import opendatasets as od
od.download('stackoverflow-developer-survey-2020')
See the next section for a list of available dataset IDs.
Datasets
Dataset ID | Description | Source |
---|---|---|
stackoverflow-developer-survey-2020 |
Stack Overflow Developer Survey 2020 | Stack Overflow |
owid-covid-19-latest |
Covid-19 Stats by Our World in Data | Our World in Data |
state-of-javascript-2016 |
State of Javascript Annual Survey 2016 | StateOfJS |
state-of-javascript-2017 |
State of Javascript Annual Survey 2017 | StateOfJS |
state-of-javascript-2018 |
State of Javascript Annual Survey 2018 | StateOfJS |
state-of-javascript-2019 |
State of Javascript Annual Survey 2019 | StateOfJS |
countries-languages-spoken |
Languages Spoken in Different Countries | Infoplease |
More datasets will be added soon..
Contributing
This is an open source project and we welcome contributions.
Local Development Setup
- Clone the repository:
git clone https://github.com/JovianML/opendatasets.git
- Setup the Python environment for development
conda create -n opendatasets python=3.5
conda activate opendatasets
pip install -r requirements.txt
- Open up the project in VS code and make your changes. Make sure to install the Python Extension for VS Code and select the
opendatasets
conda environment.
Adding Datasets
TODO - more details will be added here later
This package is developed and mainted by the Jovian.ml team.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for opendatasets-0.0.109-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a418fcb3247d0d5ac80f641dc397a12677f5c85d9462773c0b23d34904e1b519 |
|
MD5 | 4e7786fa6041c840fa167f5351754c96 |
|
BLAKE2b-256 | 45af4d1b479d5861dc48127b4ee5651123b4cd8aa9f28fc4b08db803e67c8237 |