Skip to main content

A Python package for interfacing with the Mozilla Data Collective's API

Project description

Project logo

Published Docs Tests

Mozilla Data Collective Python API Library

Python library for interfacing with the Mozilla Data Collective REST API.

Installation

pip install datacollective

Quick Start

  1. Get your API key from the Mozilla Data Collective dashboard

  2. Set the API key in your environment variable (or create .env file add it there):

IMPORTANT NOTE: Before trying to access any dataset, make sure you have thoroughly read and agreed to the specific dataset's conditions & licensing terms.

export MDC_API_KEY=your-api-key-here
  1. Get your dataset ID from the last section of the dataset URL at the MDC website.

[!TIP] You can find the dataset-id by looking at the URL of the dataset's page on MDC platform. The ID is the unique string of characters located at the very end of the URL, after the /datasets/ path. For example, for URL https://datacollective.mozillafoundation.org/datasets/cminc35no007no707hql26lzk dataset id will be cminc35no007no707hql26lzk.

  1. Save a dataset locally:
from datacollective import save_dataset_to_disk

dataset_path = save_dataset_to_disk("your-dataset-id")
  1. Get information & metadata about a dataset:
from datacollective import get_dataset_details

details = get_dataset_details("your-dataset-id")
  1. Load the dataset into a pandas DataFrame (Only Common Voice datasets are supported right now):
from datacollective import load_dataset

dataset = load_dataset("your-dataset-id")

For more details, visit our docs

License

This project is released under MPL (Mozilla Public License) 2.0.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datacollective-0.2.0.tar.gz (12.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datacollective-0.2.0-py3-none-any.whl (14.8 kB view details)

Uploaded Python 3

File details

Details for the file datacollective-0.2.0.tar.gz.

File metadata

  • Download URL: datacollective-0.2.0.tar.gz
  • Upload date:
  • Size: 12.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for datacollective-0.2.0.tar.gz
Algorithm Hash digest
SHA256 21c3a0f1e99e1681fb11c51e44926e405d412c11650a11788ae5c1629edece84
MD5 905cf0305525ba8403aa239eb47b02b0
BLAKE2b-256 452fbd30f523b93709a634bcfc3612596505016474c0f5a5853078b6d8a40999

See more details on using hashes here.

File details

Details for the file datacollective-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: datacollective-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 14.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for datacollective-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a66c4236a7416ec2586c6c78fa9906b2d75cb653cf2ece2c60c3f94778e87fad
MD5 fcd836527461ea646e1a97fcad8f4f60
BLAKE2b-256 8ddb3a3a76e52842b6b4ac48e196e39ebb06c8158e86eba3b346a12d11ce2031

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page