A simple audio data augmentation package

augaudio

This package provides several simple audio data augmentations for increasing or testing the robustness of neural networks.

[Figure: examples of the original signal and the Gaussian Noise, Pitch Shift, Time Stretch, and Crush augmentations]

Installation

Install the package with pip:

pip install augaudio

Usage

Example usage:

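import librosa
import augaudio
import soundfile

# Load the audio; librosa resamples to 22050 Hz mono by default.
y, sr = librosa.load('audio.wav')

# Apply augmentation 1 (PitchShift) at intensity 4.
augmented = augaudio.augment(y, 1, 4)

# Write the result at the same sample rate.
soundfile.write('augmented.wav', augmented, sr)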

Directly access one of the augmentations:

...
augmented = augaudio.GaussianNoise(y, 4)
...

Chain augmentations together:

...
augmented = augaudio.GaussianNoise(augaudio.TimeStretch(y, 2), 4)
...
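
Nested calls become hard to read once more than two augmentations are chained. The sketch below folds a list of (function, intensity) steps over the signal with functools.reduce. The helper name chain and the stand-in functions scale and offset are hypothetical and exist only so the block runs without audio data. It assumes every augmentation takes (y, intensity), as GaussianNoise and TimeStretch do in the examples above.

```python
from functools import reduce


def chain(y, steps):
    """Apply each (function, intensity) step to y, left to right."""
    return reduce(lambda signal, step: step[0](signal, step[1]), steps, y)


# Hypothetical stand-ins with the same (y, intensity) call shape.
def scale(y, intensity):
    return [sample * intensity for sample in y]


def offset(y, intensity):
    return [sample + intensity for sample in y]


# Equivalent to offset(scale([1, 2, 3], 2), 4).
result = chain([1, 2, 3], [(scale, 2), (offset, 4)])
print(result)  # [6, 8, 10]
```

With the package installed, the chained example above would presumably read chain(y, [(augaudio.TimeStretch, 2), (augaudio.GaussianNoise, 4)]).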

Iterate over augmentations:

...
for augmentation in range(4):  # indices 0-3, the four named augmentations
    for intensity in range(1, 10):  # intensities 1 through 9
        augmented = augaudio.augment(y, augmentation, intensity)
        soundfile.write(f'{augmentation}-{intensity}.wav', augmented, sr)
...
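
To see how many files the loop above produces and what they are called, the same grid can be enumerated without touching any audio. This is only an illustrative sketch: the function name augmentation_grid is hypothetical, and the file naming simply mirrors the loop above.

```python
from itertools import product


def augmentation_grid(indices=range(4), intensities=range(1, 10)):
    """Yield (augmentation, intensity, filename) for every combination."""
    for augmentation, intensity in product(indices, intensities):
        yield augmentation, intensity, f"{augmentation}-{intensity}.wav"


grid = list(augmentation_grid())
print(len(grid))  # 36 files: 4 augmentations x 9 intensities
print(grid[0])    # (0, 1, '0-1.wav')
print(grid[-1])   # (3, 9, '3-9.wav')
```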

Command Line Usage:

Single File:

augaudio /path/to/audio.wav 0 5

Complete Folder:

augaudio /path/to/input 0 5 /path/to/output
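
When the command line tool is driven from a script, it can help to build the argument list in one place. The sketch below assumes the two numbers are the augmentation index and the intensity, in that order, matching augaudio.augment; the README does not state this explicitly. The helper name build_command is hypothetical.

```python
import shlex


def build_command(path, augmentation, intensity, output=None):
    """Return the argv list for one augaudio invocation."""
    argv = ["augaudio", str(path), str(augmentation), str(intensity)]
    if output is not None:
        argv.append(str(output))  # output folder, as in the folder example
    return argv


single = build_command("/path/to/audio.wav", 0, 5)
folder = build_command("/path/to/input", 0, 5, "/path/to/output")
print(shlex.join(single))  # augaudio /path/to/audio.wav 0 5
print(shlex.join(folder))  # augaudio /path/to/input 0 5 /path/to/output
```

The resulting list can be passed to subprocess.run(argv, check=True) once the package is installed.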

Augmentations

Augmentation    Index   Description                                               Negative intensity
Augmentations   -2      Applies multiple random augmentations                     available
Random          -1      Applies a random augmentation                             available
GaussianNoise   0       Applies Gaussian noise to the audio data                  n/a
PitchShift      1       Shifts the pitch of the audio data                        available
TimeStretch     2       Stretches the time of the audio data                      n/a
Crush           3       Crushes details by applying and reversing a pitch shift   available
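
The numeric indices in the table are easy to mix up in calling code. One option is to mirror them in a small enum, as sketched below. The enum, the NO_NEGATIVE_INTENSITY set, and check_intensity are hypothetical helpers and not part of augaudio. The sketch also assumes that "n/a" means a negative intensity should not be passed; the README does not say how the package reacts if one is.

```python
from enum import IntEnum


class Augmentation(IntEnum):
    """Indices copied from the table above; the enum itself is hypothetical."""
    AUGMENTATIONS = -2  # multiple random augmentations
    RANDOM = -1         # one random augmentation
    GAUSSIAN_NOISE = 0
    PITCH_SHIFT = 1
    TIME_STRETCH = 2
    CRUSH = 3


# Entries the table marks as "n/a" for negative intensity.
NO_NEGATIVE_INTENSITY = {Augmentation.GAUSSIAN_NOISE, Augmentation.TIME_STRETCH}


def check_intensity(augmentation, intensity):
    """Raise ValueError if a negative intensity is not listed as available."""
    augmentation = Augmentation(augmentation)
    if intensity < 0 and augmentation in NO_NEGATIVE_INTENSITY:
        raise ValueError(f"{augmentation.name} does not list negative intensity")
    return int(augmentation), intensity


print(check_intensity(1, -3))  # (1, -3)
```

With the package installed, the checked pair could then be passed on as augaudio.augment(y, *check_intensity(Augmentation.PITCH_SHIFT, -3)).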

Copyright Notice

Copyright 2020 Bastian Schwickert

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
