This package aims at simplifying the download of the AudioCaps dataset.
Project description
AudioCaps Download
DISCLAIMER: This repository is a modified version of the AudioSet Download repository.
This repository contains code for downloading the AudioCaps dataset. The repository is not officially affiliated with the AudioCaps dataset.
Requirements
- Python 3.9 (it may work with other versions, but it has not been tested)
Installation
# Install ffmpeg
sudo apt install ffmpeg
# Install audiocaps-download
pip install audiocaps-download
Usage
The following code snippet downloads the complete dataset in WAV format, and stores it in the test
directory.
from audiocaps_download import Downloader
d = Downloader(root_path='audiocaps/', n_jobs=16)
d.download(format = 'wav')
Implementation
The main class is audiocaps_download.Downloader
. It is initialized using the following parameters:
root_path
: the path to the directory where the dataset will be downloaded.n_jobs
: the number of parallel downloads. Default is 1.
The methods of the class are:
download(format='vorbis', quality=5)
: downloads the dataset.- The format can be one of the following (supported by yt-dlp
--audio-format
parameter):vorbis
: downloads the dataset in Ogg Vorbis format. This is the default.wav
: downloads the dataset in WAV format.mp3
: downloads the dataset in MP3 format.m4a
: downloads the dataset in M4A format.flac
: downloads the dataset in FLAC format.opus
: downloads the dataset in Opus format.webm
: downloads the dataset in WebM format.- ... and many more.
- The quality can be an integer between 0 and 10. Default is 5.
load_dataset()
: reads the csv files from the original repository. It is not used externally.download_file(...)
: downloads a single file. It is not used externally.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
File details
Details for the file audiocaps_download-1.0.1-py3-none-any.whl
.
File metadata
- Download URL: audiocaps_download-1.0.1-py3-none-any.whl
- Upload date:
- Size: 5.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4606a77ff2d8fda22e3afb0d386980e66374ad9863812dd29d78fb4a9952216a |
|
MD5 | bf4af19d1a2da56e2717ff8d567a43f1 |
|
BLAKE2b-256 | ed440021eeed653567146deb78912fa73b5f0a05a1267aac567338ebf9d4d274 |