A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for deep learning. Runs on CPU. Supports mono audio and multichannel audio. Can be integrated into training pipelines in e.g. TensorFlow/Keras or PyTorch. Has helped people get world-class results in Kaggle competitions. Is used by companies making next-generation audio products.

Need a PyTorch-specific alternative with GPU support? Check out torch-audiomentations!

Setup

pip install audiomentations

Usage example

from audiomentations import Compose, AddGaussianNoise, TimeStretch, PitchShift, Shift
import numpy as np

augment = Compose([
    AddGaussianNoise(min_amplitude=0.001, max_amplitude=0.015, p=0.5),
    TimeStretch(min_rate=0.8, max_rate=1.25, p=0.5),
    PitchShift(min_semitones=-4, max_semitones=4, p=0.5),
    Shift(min_fraction=-0.5, max_fraction=0.5, p=0.5),
])

# Generate 2 seconds of dummy audio for the sake of example
samples = np.random.uniform(low=-0.2, high=0.2, size=(32000,)).astype(np.float32)

# Augment/transform/perturb the audio data
augmented_samples = augment(samples=samples, sample_rate=16000)
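To make the role of the `p` argument in the example above more concrete, here is a minimal, standard-library-only sketch of a probability-gated pipeline. It is not the library's implementation: `compose`, `add_gaussian_noise` and `shift` are hypothetical stand-ins written for illustration, and the wrap-around behaviour of `shift` is an assumption made for this sketch.

```python
import random


def add_gaussian_noise(samples, min_amplitude=0.001, max_amplitude=0.015):
    """Hypothetical stand-in: add Gaussian noise at a randomly drawn amplitude."""
    amplitude = random.uniform(min_amplitude, max_amplitude)
    return [s + amplitude * random.gauss(0.0, 1.0) for s in samples]


def shift(samples, min_fraction=-0.5, max_fraction=0.5):
    """Hypothetical stand-in: shift the signal by a random fraction of its
    length, wrapping samples around (an assumption made for this sketch)."""
    samples = list(samples)
    n = len(samples)
    if n == 0:
        return samples
    offset = round(random.uniform(min_fraction, max_fraction) * n) % n
    return samples[-offset:] + samples[:-offset] if offset else samples


def compose(transforms):
    """Build a pipeline from (function, p) pairs. Each transform is applied
    independently with probability p, in the order given."""
    def apply(samples):
        for transform, p in transforms:
            if random.random() < p:
                samples = transform(samples)
        return samples
    return apply


# Each call draws fresh random decisions, so repeated calls on the same
# input can yield different augmented outputs.
augment_sketch = compose([(add_gaussian_noise, 0.5), (shift, 0.5)])
dummy = [0.1, -0.1, 0.2, -0.2, 0.0, 0.05, -0.05, 0.15]
augmented = augment_sketch(dummy)
print(len(augmented) == len(dummy))
```

With `p=0.5` on every transform, any given call may apply all, some, or none of the transforms, which is why the same training example can look different from one epoch to the next.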

Documentation

See https://iver56.github.io/audiomentations/

Transforms

Changelog

See https://iver56.github.io/audiomentations/changelog/

Acknowledgements

Thanks to Nomono for backing audiomentations.

Thanks to all contributors who help improve audiomentations.

Download files

Source Distribution

audiomentations-0.28.0.tar.gz (40.7 kB)

Uploaded Source

Built Distribution

audiomentations-0.28.0-py3-none-any.whl (66.0 kB)

Uploaded Python 3

File details

Details for the file audiomentations-0.28.0.tar.gz.

File metadata

  • Download URL: audiomentations-0.28.0.tar.gz
  • Size: 40.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for audiomentations-0.28.0.tar.gz
Algorithm Hash digest
SHA256 9475efd47f1279bd3b93a8b7e7d6de5001b5e1356910d1dc22d5b2ff9b52beab
MD5 97da10991925b192f33c856db5418206
BLAKE2b-256 bfe78437a5882c1d0193bf6bad031396c68052a56331aaa69a1bffdda789242f
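One way to check a downloaded archive against the SHA256 digest listed above is sketched below using Python's standard `hashlib` module. The helper names `sha256_of_file` and `matches_expected` are hypothetical, chosen for this illustration, and the sketch assumes the archive has already been downloaded to a local path.

```python
import hashlib

# SHA256 digest copied from the listing above for audiomentations-0.28.0.tar.gz
EXPECTED_SHA256 = "9475efd47f1279bd3b93a8b7e7d6de5001b5e1356910d1dc22d5b2ff9b52beab"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`, reading in chunks
    so that large files do not need to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Compare the file's digest with the expected hex string (case-insensitive)."""
    return sha256_of_file(path) == expected.lower()
```

Calling `matches_expected("audiomentations-0.28.0.tar.gz")` would then return `True` only if the local file's digest equals the listed value. The file name here assumes the archive sits in the current working directory.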

File details

Details for the file audiomentations-0.28.0-py3-none-any.whl.

File hashes

Hashes for audiomentations-0.28.0-py3-none-any.whl
Algorithm Hash digest
SHA256 bf24de9dcbde9d533aba814874a0297799e808de9c1466cd0da8e1db997cb822
MD5 0459220e83527e3b361944e78f550436
BLAKE2b-256 c02a4ba56eb921f8c1cadabc225baa6eb5d0504581ff33d5bcf4b458af6e0ace
