A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Audiomentations

Code style: Black. Licence: MIT.

A Python library for audio data augmentation, inspired by albumentations. Useful for deep learning. Runs on CPU. Supports mono and multichannel audio. Can be integrated into training pipelines in, for example, TensorFlow/Keras or PyTorch. Has helped people get world-class results in Kaggle competitions. Is used by companies making next-generation audio products.

Need a PyTorch-specific alternative with GPU support? Check out torch-audiomentations!

Setup

pip install audiomentations

Usage example

from audiomentations import Compose, AddGaussianNoise, TimeStretch, PitchShift, Shift
import numpy as np

augment = Compose([
    AddGaussianNoise(min_amplitude=0.001, max_amplitude=0.015, p=0.5),
    TimeStretch(min_rate=0.8, max_rate=1.25, p=0.5),
    PitchShift(min_semitones=-4, max_semitones=4, p=0.5),
    Shift(min_fraction=-0.5, max_fraction=0.5, p=0.5),
])

# Generate 2 seconds of dummy audio for the sake of example
samples = np.random.uniform(low=-0.2, high=0.2, size=(32000,)).astype(np.float32)

# Augment/transform/perturb the audio data
augmented_samples = augment(samples=samples, sample_rate=16000)
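
In the usage example, each transform takes a `p` argument: the probability that the transform is applied on a given call. The following is a minimal sketch of that idea in plain Python, using lists of floats instead of NumPy arrays. It is not how audiomentations implements `Compose`; the names `compose`, `halve_gain` and `reverse` are hypothetical and exist only for this illustration.

```python
import random
from typing import Callable, List, Sequence, Tuple

Samples = List[float]
Transform = Callable[[Samples, int], Samples]


def compose(steps: Sequence[Tuple[Transform, float]], rng: random.Random) -> Transform:
    """Chain (transform, p) pairs; each transform runs with probability p per call."""

    def pipeline(samples: Samples, sample_rate: int) -> Samples:
        for transform, p in steps:
            # One independent draw per transform per call.
            # rng.random() is in [0.0, 1.0), so p=1.0 always applies and p=0.0 never does.
            if rng.random() < p:
                samples = transform(samples, sample_rate)
        return samples

    return pipeline


def halve_gain(samples: Samples, sample_rate: int) -> Samples:
    """Hypothetical transform: scale every sample by 0.5."""
    return [0.5 * s for s in samples]


def reverse(samples: Samples, sample_rate: int) -> Samples:
    """Hypothetical transform: reverse the signal in time."""
    return samples[::-1]


rng = random.Random(0)  # seeded so that runs are repeatable
always = compose([(halve_gain, 1.0), (reverse, 1.0)], rng)
never = compose([(halve_gain, 0.0), (reverse, 0.0)], rng)
sometimes = compose([(halve_gain, 0.5), (reverse, 0.5)], rng)

dummy = [0.2, -0.1, 0.4]
print(always(dummy, 16000))     # both transforms applied
print(never(dummy, 16000))      # input returned unchanged
print(sometimes(dummy, 16000))  # each transform applied with probability 0.5
```

With `p=0.5` on every transform, as in the usage example, any subset of the transforms may fire on a given call, including none of them, so some training examples pass through unchanged.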

Documentation

See https://iver56.github.io/audiomentations/

Changelog

See https://iver56.github.io/audiomentations/changelog/

Acknowledgements

Thanks to Nomono for backing audiomentations.

Thanks to all contributors who help improve audiomentations.

Download files

Source Distribution

audiomentations-0.27.0.tar.gz (42.0 kB)

Uploaded Source

Built Distribution

audiomentations-0.27.0-py3-none-any.whl (64.8 kB)

Uploaded Python 3

File details

Details for the file audiomentations-0.27.0.tar.gz.

File metadata

  • Download URL: audiomentations-0.27.0.tar.gz
  • Size: 42.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for audiomentations-0.27.0.tar.gz
Algorithm Hash digest
SHA256 eb7800669c7206330ee338c1eb3e2422f5c15e1def2af443bb3ea4f1a977b759
MD5 cc82ff7a482c8ab73cd3494b606bd384
BLAKE2b-256 236b4ba8850b0ba5d0b447ed4ade948309a153617adfd4bfa4339eb1b023e262

File details

Details for the file audiomentations-0.27.0-py3-none-any.whl.

File hashes

Hashes for audiomentations-0.27.0-py3-none-any.whl
Algorithm Hash digest
SHA256 4e95d627cb76d4034bd7b0d8bb2b9aad1dfc52c44c2d564e071c7f6cad9e983e
MD5 c44c67ae4ef5c5c4bf33134cee519be3
BLAKE2b-256 8302a78a5b3c1f9a2ab9391444c9ac9688a5fcd277af9ac275d29b2d54e15191
