opendatasets

A curated collection of datasets for data analysis & machine learning, downloadable with a single Python command.

Try it out online: https://jovian.ml/aakashns/opendatasets-demo

Installation & Usage

Install the library using pip:

pip install opendatasets --upgrade

To use the library, import it and call the download function with a dataset ID:

import opendatasets as od
od.download('stackoverflow-developer-survey-2020')

See the next section for a list of available dataset IDs.
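The call above does not say where the downloaded files end up. The following is a minimal sketch, assuming the download writes somewhere under the current working directory (an assumption, not something this page confirms). It compares the directory tree before and after a download and reports what is new. `snapshot`, `new_paths_after` and `download_and_report` are hypothetical helpers for illustration, not part of opendatasets.

```python
import os


def snapshot(root):
    """Return the set of all file and directory paths under root."""
    found = set()
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            found.add(os.path.join(dirpath, name))
    return found


def new_paths_after(action, root="."):
    """Run action() and return the sorted paths under root that it created."""
    before = snapshot(root)
    action()
    return sorted(snapshot(root) - before)


def download_and_report(dataset_id, root="."):
    """Download a dataset and print every path that appeared under root.

    Assumption: od.download() saves its files below `root`.
    """
    import opendatasets as od  # imported lazily; requires the package

    created = new_paths_after(lambda: od.download(dataset_id), root=root)
    for path in created:
        print(path)
    return created
```

Calling download_and_report('stackoverflow-developer-survey-2020') from a scratch directory should list whatever the download created there. If nothing is printed, the files were probably written outside that directory.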

Datasets

| Dataset ID | Description | Source |
| --- | --- | --- |
| stackoverflow-developer-survey-2020 | Stack Overflow Developer Survey 2020 | Stack Overflow |
| owid-covid-19-latest | Covid-19 Stats by Our World in Data | Our World in Data |
| state-of-javascript-2016 | State of Javascript Annual Survey 2016 | StateOfJS |
| state-of-javascript-2017 | State of Javascript Annual Survey 2017 | StateOfJS |
| state-of-javascript-2018 | State of Javascript Annual Survey 2018 | StateOfJS |
| state-of-javascript-2019 | State of Javascript Annual Survey 2019 | StateOfJS |
| countries-languages-spoken | Languages Spoken in Different Countries | Infoplease |

More datasets will be added soon.
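Because a dataset is selected by its ID string, a typo is easy to make. As a rough sketch, the IDs from the table above can be checked before any download starts. `LISTED_DATASET_IDS` and `download_listed` are hypothetical names introduced here, and the list is only a copy of the table; the library itself may accept other IDs.

```python
# Dataset IDs copied from the table above (assumption: the list may be
# incomplete or out of date relative to what the library accepts).
LISTED_DATASET_IDS = (
    "stackoverflow-developer-survey-2020",
    "owid-covid-19-latest",
    "state-of-javascript-2016",
    "state-of-javascript-2017",
    "state-of-javascript-2018",
    "state-of-javascript-2019",
    "countries-languages-spoken",
)


def download_listed(dataset_id, downloader=None):
    """Download dataset_id only if it appears in LISTED_DATASET_IDS.

    `downloader` defaults to opendatasets.download; passing another
    callable makes the function usable without network access.
    """
    if dataset_id not in LISTED_DATASET_IDS:
        raise ValueError(
            f"Unknown dataset ID {dataset_id!r}; expected one of: "
            + ", ".join(LISTED_DATASET_IDS)
        )
    if downloader is None:
        import opendatasets as od  # imported lazily; requires the package

        downloader = od.download
    return downloader(dataset_id)
```

With this wrapper, download_listed('owid-covid-19-latest') behaves like the plain download call, while a misspelled ID fails immediately with a message listing the known IDs.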

Contributing

This is an open source project and we welcome contributions.

Local Development Setup

  1. Clone the repository:
git clone https://github.com/JovianML/opendatasets.git
  2. Set up the Python environment for development:
conda create -n opendatasets python=3.5
conda activate opendatasets
pip install -r requirements.txt
  3. Open the project in VS Code and make your changes. Make sure to install the Python extension for VS Code and select the opendatasets conda environment.

Adding Datasets

TODO - more details will be added here later

This package is developed and maintained by the Jovian.ml team.
