Skip to main content

Python library for loading and working with sound datasets.

Project description

soundata

Python library for downloading, loading & working with sound datasets. Check the API documentation and the contributing instructions.
For Music Information Retrieval (MIR) datasets please check mirdata.

CI status Formatting status Linting status Downloads

codecov Documentation Status GitHub PyPI version PRs Welcome

This library provides tools for working with common sound datasets, including tools for:

  • Downloading datasets to a common location and format
  • Validating that the files for a dataset are all present
  • Loading annotation files to a common format
  • Parsing clip-level metadata for detailed evaluations

Here's soundata's list of currently supported datasets.

Installation

To install, simply run:

pip install soundata

Quick example

import soundata

dataset = soundata.initialize('urbansound8k')
dataset.download()  # download the dataset
dataset.validate()  # validate that all the expected files are there

example_clip = dataset.choice_clip()  # choose a random example clip
print(example_clip)  # see the available data

See the documentation for more examples and the API reference.

Contributing a new dataset loader

We welcome and encourage contributions to this library, especially new dataset loaders. Please see contributing for guidelines. Feel free to open an issue if you have any doubt or your run into problems when working on the library.

Citing

TBA

When working with datasets, please cite the version of soundata that you are using AND include the reference of the dataset, which can be found in the respective dataset loader using the cite() method.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

soundata-1.0.1.tar.gz (142.0 kB view hashes)

Uploaded Source

Built Distribution

soundata-1.0.1-py3-none-any.whl (162.0 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page