A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Audiomentations

Audiomentations is a Python library for audio data augmentation, built to be fast and easy to use. Its API is inspired by albumentations. It's useful for making audio deep learning models work well in the real world, not just in the lab. Audiomentations runs on CPU, supports mono and multichannel audio, and integrates well into training pipelines, such as those built with TensorFlow/Keras or PyTorch. It has helped users achieve world-class results in Kaggle competitions and is trusted by companies building next-generation audio products with AI.

Need a PyTorch-specific alternative with GPU support? Check out torch-audiomentations!

Setup

Supported operating systems: Linux, macOS, Windows

pip install audiomentations
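After installing, it can be handy to confirm which version ended up in the environment. The following is a minimal sketch that uses only the standard library; the helper name `installed_version` is hypothetical and not part of audiomentations.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package_name):
    """Return the installed version string of a package, or None if it is missing."""
    try:
        return version(package_name)
    except PackageNotFoundError:
        return None


found = installed_version("audiomentations")
if found is None:
    print("audiomentations is not installed in this environment")
else:
    print(f"audiomentations {found} is installed")
```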

Usage example

from audiomentations import Compose, AddGaussianNoise, TimeStretch, PitchShift, Shift
import numpy as np

augment = Compose([
    AddGaussianNoise(min_amplitude=0.001, max_amplitude=0.015, p=0.5),
    TimeStretch(min_rate=0.8, max_rate=1.25, p=0.5),
    PitchShift(min_semitones=-4, max_semitones=4, p=0.5),
    Shift(p=0.5),
])

# Generate 2 seconds of dummy audio for the sake of example
samples = np.random.uniform(low=-0.2, high=0.2, size=(32000,)).astype(np.float32)

# Augment/transform/perturb the audio data
augmented_samples = augment(samples=samples, sample_rate=16000)
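To make the role of the `p` argument in the example above more concrete, here is a rough NumPy-only sketch of probability-gated composition: each step runs with probability `p`, otherwise the audio passes through unchanged. This is an illustration under assumptions, not the library's implementation; the names `compose`, `add_noise` and `invert_polarity` are hypothetical.

```python
import numpy as np


def compose(steps, rng):
    """Build a callable that applies each (function, p) pair with probability p."""
    def apply(samples, sample_rate):
        for fn, p in steps:
            if rng.random() < p:
                samples = fn(samples, sample_rate, rng)
        return samples
    return apply


def add_noise(samples, sample_rate, rng):
    # Hypothetical stand-in for a noise transform: random amplitude per call
    amplitude = rng.uniform(0.001, 0.015)
    noise = rng.standard_normal(samples.shape)
    return (samples + amplitude * noise).astype(np.float32)


def invert_polarity(samples, sample_rate, rng):
    # Flip the sign of every sample
    return -samples


rng = np.random.default_rng(0)
augment = compose([(add_noise, 0.5), (invert_polarity, 0.5)], rng)

# 2 seconds of dummy audio at 16 kHz, as in the example above
samples = rng.uniform(-0.2, 0.2, size=32000).astype(np.float32)
augmented = augment(samples, 16000)
```

Because each step is gated independently, repeated calls on the same input can yield the original audio, one effect, or both.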

Documentation

The API documentation, along with guides, example code, illustrations and example sounds, is available at https://iver56.github.io/audiomentations/

Transforms

AddBackgroundNoise: Mixes in another sound to add background noise
AddColorNoise: Adds noise with specific color
AddGaussianNoise: Adds Gaussian noise to the audio samples
AddGaussianSNR: Injects Gaussian noise using a randomly chosen signal-to-noise ratio
AddShortNoises: Mixes in various short noise sounds
AdjustDuration: Trims or pads the audio to fit a target duration
AirAbsorption: Applies frequency-dependent attenuation simulating air absorption
Aliasing: Produces aliasing artifacts by downsampling without low-pass filtering and then upsampling
ApplyImpulseResponse: Convolves the audio with a randomly chosen impulse response
BandPassFilter: Applies band-pass filtering within randomized parameters
BandStopFilter: Applies band-stop (notch) filtering within randomized parameters
BitCrush: Applies bit reduction without dithering
Clip: Clips audio samples to specified minimum and maximum values
ClippingDistortion: Distorts the signal by clipping a random percentage of samples
Gain: Multiplies the audio by a random gain factor
GainTransition: Gradually changes the gain over a random time span
HighPassFilter: Applies high-pass filtering within randomized parameters
HighShelfFilter: Applies a high shelf filter with randomized parameters
Lambda: Applies a user-defined transform
Limiter: Applies dynamic range compression limiting the audio signal
LoudnessNormalization: Applies gain to match a target loudness
LowPassFilter: Applies low-pass filtering within randomized parameters
LowShelfFilter: Applies a low shelf filter with randomized parameters
Mp3Compression: Compresses the audio to lower the quality
Normalize: Applies gain so that the highest signal level becomes 0 dBFS
Padding: Replaces a random part of the beginning or end with padding
PeakingFilter: Applies a peaking filter with randomized parameters
PitchShift: Shifts the pitch up or down without changing the tempo
PolarityInversion: Flips the audio samples upside down, reversing their polarity
RepeatPart: Repeats a subsection of the audio a number of times
Resample: Resamples the signal to a randomly chosen sampling rate
Reverse: Reverses the audio along its time axis
RoomSimulator: Simulates the effect of a room on an audio source
SevenBandParametricEQ: Adjusts the volume of 7 frequency bands
Shift: Shifts the samples forwards or backwards
TanhDistortion: Applies tanh distortion to distort the signal
TimeMask: Makes a random part of the audio silent
TimeStretch: Changes the speed without changing the pitch
Trim: Trims leading and trailing silence from the audio
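As a rough illustration of what two of the descriptions above mean numerically, the sketch below adds Gaussian noise at a chosen signal-to-noise ratio and applies peak normalization to 0 dBFS. It is written in plain NumPy under stated assumptions and is not the library's code; the function names `add_noise_at_snr` and `peak_normalize` are hypothetical, and float audio is assumed to use 1.0 as full scale.

```python
import numpy as np


def rms(x):
    """Root mean square level, computed in float64 for accuracy."""
    return float(np.sqrt(np.mean(np.square(x, dtype=np.float64))))


def add_noise_at_snr(samples, snr_db, rng):
    """Add white Gaussian noise so that signal RMS / noise RMS matches snr_db."""
    noise_rms = rms(samples) / (10.0 ** (snr_db / 20.0))
    noise = rng.normal(0.0, noise_rms, size=samples.shape)
    return (samples + noise).astype(np.float32)


def peak_normalize(samples):
    """Scale so the largest absolute sample becomes 1.0 (0 dBFS for float audio)."""
    peak = float(np.max(np.abs(samples)))
    if peak == 0.0:
        return samples.copy()  # silent input: nothing to scale
    return (samples / peak).astype(np.float32)


rng = np.random.default_rng(42)
t = np.arange(16000) / 16000.0
clean = (0.1 * np.sin(2 * np.pi * 440.0 * t)).astype(np.float32)

noisy = add_noise_at_snr(clean, snr_db=20.0, rng=rng)
measured_snr_db = 20.0 * np.log10(rms(clean) / rms(noisy - clean))
normalized = peak_normalize(noisy)
```

The measured SNR should land close to the requested 20 dB, with a small deviation because the noise is a finite random sample.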

Changelog

[0.42.0] - 2025-07-04

Added

Add support for Python 3.13
Add support for librosa 0.11.0

Changed

Make Mp3Compression 25-300% faster (depending on hardware, audio properties such as duration and number of channels, and parameters such as bitrate) with the new backend="fast-mp3-augment" (now the default). This requires the extra dependency fast-mp3-augment, which uses a few tricks for faster execution.
Make Limiter 30% faster and easier to install (extra dependency is now numpy-audio-limiter instead of cylimiter). The Limiter behavior has not changed, although there are minor numerical differences.

Fixed

Handle non-contiguous audio ndarray input to PitchShift and TimeStretch properly
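For readers unfamiliar with the term, the sketch below shows one common way a non-contiguous audio array arises: transposing audio stored as (samples, channels) into a (channels, samples) view. It uses NumPy only. The (channels, samples) layout is assumed here to be the multichannel convention, and the explicit `np.ascontiguousarray` copy is a general workaround for code that requires contiguous memory, not something the fixed transforms are stated to need.

```python
import numpy as np

rng = np.random.default_rng(0)

# Stereo audio stored sample-major: shape (samples, channels)
sample_major = rng.uniform(-0.2, 0.2, size=(16000, 2)).astype(np.float32)

# Transposing gives a (channels, samples) view over the same memory,
# so the result is no longer C-contiguous.
channels_first = sample_major.T
is_contiguous_before = bool(channels_first.flags["C_CONTIGUOUS"])

# An explicit copy lays the data out contiguously in (channels, samples) order.
contiguous = np.ascontiguousarray(channels_first)
is_contiguous_after = bool(contiguous.flags["C_CONTIGUOUS"])

print(is_contiguous_before, is_contiguous_after)
```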

For the full changelog, including older versions, see https://iver56.github.io/audiomentations/changelog/

Acknowledgements

Thanks to Nomono for backing audiomentations.

Thanks to all contributors who help improve audiomentations.

Release history

Version	Release date
0.42.0	Jul 4, 2025
0.41.0	May 5, 2025
0.40.0	Mar 20, 2025
0.39.0	Feb 12, 2025
0.38.0	Dec 6, 2024
0.37.0	Sep 3, 2024
0.36.1	Aug 20, 2024
0.36.0	Jun 10, 2024
0.35.0	Mar 15, 2024
0.34.1	Nov 24, 2023
0.33.0	Aug 30, 2023
0.32.0	Aug 15, 2023
0.31.0	Jun 21, 2023
0.30.0	May 2, 2023
0.29.0	Mar 15, 2023
0.28.0	Jan 12, 2023
0.27.0	Sep 13, 2022
0.26.0	Aug 19, 2022
0.25.1	Jun 15, 2022
0.25.0	May 30, 2022
0.24.0	Mar 18, 2022
0.23.0	Mar 7, 2022
0.22.0	Feb 18, 2022
0.21.0	Feb 10, 2022
0.20.0	Nov 18, 2021
0.19.0	Oct 18, 2021
0.18.0	Aug 5, 2021
0.17.0	Jun 25, 2021
0.16.0	Feb 11, 2021
0.15.0	Dec 10, 2020
0.14.0	Dec 6, 2020
0.13.0	Nov 10, 2020
0.12.1	Sep 28, 2020
0.12.0	Sep 23, 2020
0.11.0	Aug 27, 2020
0.10.1	Jul 27, 2020
0.10.0	May 5, 2020
0.9.0	Feb 20, 2020
0.8.0	Jan 28, 2020
0.7.0	Jun 14, 2019
0.6.0	May 27, 2019
0.5.0	Feb 23, 2019
0.4.0	Feb 19, 2019
0.3.0	Feb 19, 2019
0.2.0	Feb 18, 2019
0.1.0	Feb 15, 2019

Download files

Source Distribution

audiomentations-0.42.0.tar.gz (84.2 kB)

Uploaded Jul 4, 2025 Source

Built Distribution

audiomentations-0.42.0-py3-none-any.whl (86.5 kB)

Uploaded Jul 4, 2025 Python 3

File details

Details for the file audiomentations-0.42.0.tar.gz.

File metadata

Download URL: audiomentations-0.42.0.tar.gz
Upload date: Jul 4, 2025
Size: 84.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.0.0 CPython/3.10.14

File hashes

Hashes for audiomentations-0.42.0.tar.gz
Algorithm	Hash digest
SHA256	`bcac449911c7d9eedfc4efd00ccbb7f749f6fa6b3254109e5a562efbc3d84eb3`
MD5	`b887c7aba4229666b3b88872c0f84de3`
BLAKE2b-256	`654ab1aae7820db44fe6bfc0fabe245814708bd3ba3ff54d7e0c9c654626b29c`

File details

Details for the file audiomentations-0.42.0-py3-none-any.whl.

File metadata

Download URL: audiomentations-0.42.0-py3-none-any.whl
Upload date: Jul 4, 2025
Size: 86.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.0.0 CPython/3.10.14

File hashes

Hashes for audiomentations-0.42.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7c0b6f2198184c02f850c1ac169459f3fb2cf7a1354cddd347af9acfc45461fc`
MD5	`c0ab826374f28a3fa0c362a16275a1fe`
BLAKE2b-256	`939a88a133ad3e72c33de363149842c6ed8280993b32034e1ecf035ccdf4a48e`
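The SHA256 digests listed above can be checked against locally downloaded files. Below is a minimal sketch using only the standard library; it assumes the files sit in the current directory under their original names, and the helper names `sha256_of_file` and `verify` are hypothetical.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the file hash tables above
EXPECTED_SHA256 = {
    "audiomentations-0.42.0.tar.gz":
        "bcac449911c7d9eedfc4efd00ccbb7f749f6fa6b3254109e5a562efbc3d84eb3",
    "audiomentations-0.42.0-py3-none-any.whl":
        "7c0b6f2198184c02f850c1ac169459f3fb2cf7a1354cddd347af9acfc45461fc",
}


def sha256_of_file(path, chunk_size=1 << 20):
    """Hash a file in chunks so large downloads do not need to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path):
    """Return True if the file's SHA256 matches the digest listed for its name."""
    path = Path(path)
    return sha256_of_file(path) == EXPECTED_SHA256[path.name]


if __name__ == "__main__":
    for name in EXPECTED_SHA256:
        if Path(name).exists():
            print(name, "OK" if verify(name) else "MISMATCH")
        else:
            print(name, "not found, skipped")
```

pip can also enforce digests at install time via its hash-checking mode, which avoids a manual step like this one.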
