A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Project description

Audiomentations

A Python library for audio data augmentation, inspired by albumentations. Useful for deep learning. Runs on CPU. Supports mono and multichannel audio. Can be integrated into training pipelines in frameworks such as TensorFlow/Keras or PyTorch. Has helped people get world-class results in Kaggle competitions. Is used by companies making next-generation audio products.

Need a PyTorch-specific alternative with GPU support? Check out torch-audiomentations!

Setup

pip install audiomentations

Usage example

from audiomentations import Compose, AddGaussianNoise, TimeStretch, PitchShift, Shift
import numpy as np

augment = Compose([
    AddGaussianNoise(min_amplitude=0.001, max_amplitude=0.015, p=0.5),
    TimeStretch(min_rate=0.8, max_rate=1.25, p=0.5),
    PitchShift(min_semitones=-4, max_semitones=4, p=0.5),
    Shift(min_fraction=-0.5, max_fraction=0.5, p=0.5),
])

# Generate 2 seconds of dummy audio for the sake of example
samples = np.random.uniform(low=-0.2, high=0.2, size=(32000,)).astype(np.float32)

# Augment/transform/perturb the audio data
augmented_samples = augment(samples=samples, sample_rate=16000)
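
As a rough illustration of how an augmentation callable might be plugged into a training pipeline, the sketch below wraps it in a framework-agnostic batch generator. The names `augmented_batches` and `placeholder_augment` are hypothetical and not part of audiomentations; the only assumption about the callable is that it accepts `samples` and `sample_rate` keyword arguments, as `augment` does in the usage example above, and returns audio of the same length as its input.

```python
import numpy as np


def augmented_batches(waveforms, sample_rate, augment, batch_size=4, seed=0):
    """Yield float32 batches of shape (batch, num_samples), augmenting each waveform.

    Hypothetical helper: `augment` is any callable taking `samples` and
    `sample_rate` keyword arguments and returning a 1D array.
    """
    rng = np.random.default_rng(seed)
    order = rng.permutation(len(waveforms))  # shuffle once per pass over the data
    for start in range(0, len(order), batch_size):
        indices = order[start:start + batch_size]
        batch = [augment(samples=waveforms[i], sample_rate=sample_rate) for i in indices]
        # np.stack needs equal lengths, so this assumes length-preserving transforms
        yield np.stack(batch).astype(np.float32)


def placeholder_augment(samples, sample_rate):
    """Stand-in for a Compose pipeline so the sketch runs without audiomentations."""
    return samples * 0.5


# Ten dummy 2-second clips at 16 kHz, in the same style as the usage example
waveforms = np.random.uniform(low=-0.2, high=0.2, size=(10, 32000)).astype(np.float32)
batches = list(augmented_batches(waveforms, 16000, placeholder_augment))
```

In real use, `placeholder_augment` would be replaced by the `augment` object built with `Compose`, and each yielded batch could be converted to the tensor type of the chosen framework.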

Documentation

See https://iver56.github.io/audiomentations/

Transforms

Changelog

See https://iver56.github.io/audiomentations/changelog/

Acknowledgements

Thanks to Nomono for backing audiomentations.

Thanks to all contributors who help improve audiomentations.

Download files

Source Distribution

audiomentations-0.29.0.tar.gz (40.9 kB)

Uploaded Source

Built Distribution

audiomentations-0.29.0-py3-none-any.whl (66.1 kB)

Uploaded Python 3

File details

Details for the file audiomentations-0.29.0.tar.gz.

File metadata

  • Download URL: audiomentations-0.29.0.tar.gz
  • Size: 40.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for audiomentations-0.29.0.tar.gz
Algorithm Hash digest
SHA256 cb0599f8c16d2568c1da0bbe87954f22988a1cfe26d17eac24b403eb1f9b646b
MD5 3124e5810326a804c0d8af5ec06180a2
BLAKE2b-256 c4f343ff10bc91fbf1dbbfdb2d39f7b3bcc4c64a63307185a8f044612e9ba583

File details

Details for the file audiomentations-0.29.0-py3-none-any.whl.

File hashes

Hashes for audiomentations-0.29.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7592b7b42b9acb1b17d6db8d7285891ca37b9b7f0c7adc508afa290ae3829288
MD5 ba43ee2663802f513ff553188e7b1530
BLAKE2b-256 9afc07a2318f958cb3daf6cedcaae29897909fe22e9f578fda7631191ad7318f
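
A downloaded file can be checked against the SHA256 digests listed above using only the Python standard library. The following is a minimal sketch, not an official verification tool; the helper name `sha256_matches` and the local file path in the comment are assumptions for illustration.

```python
import hashlib
import os
import tempfile


def sha256_matches(path, expected_hex, chunk_size=1 << 20):
    """Return True if the file at `path` has the given SHA256 hex digest."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in chunks so large files do not need to fit in memory
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_hex.strip().lower()


# Self-check on a throwaway file, so the sketch runs without any download
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"example bytes")
demo_path = tmp.name
demo_ok = sha256_matches(demo_path, hashlib.sha256(b"example bytes").hexdigest())
demo_bad = sha256_matches(demo_path, "0" * 64)
os.remove(demo_path)

# For the source distribution listed above (assuming it was saved locally):
# sha256_matches(
#     "audiomentations-0.29.0.tar.gz",
#     "cb0599f8c16d2568c1da0bbe87954f22988a1cfe26d17eac24b403eb1f9b646b",
# )
```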
