Skip to main content

Add your description here

Project description

AudioAugmenter

Desktop GUI for importing, augmenting, and exporting WAV/MP3 samples.

Features

  • Drag-and-drop import into a local workspace/ directory.
  • Central table to edit transcriptions, delete rows, and track applied augmentations.
  • One-click augmentation (Gaussian noise, time stretch, pitch shift) via audiomentations.
  • Export to CSV or JSON with optional audio copies.

Screenshot

AudioAugmenter interface

Requirements

  • Python 3.13+
  • Dependencies: PyQt6, audiomentations, pandas, numpy, soundfile, pydub (see pyproject.toml).

Quick start

python -m venv .venv
source .venv/bin/activate
pip install -e .
python app.py

Usage

  • Use Add Files or drag .wav/.mp3 files into the left pane. Files are copied into workspace/.
  • Edit transcriptions directly in the middle table.
  • Select rows, check one or more augmentations (edit parameters in the panel), and click Apply Augmentation to create a single derived file per source with all checked augmentations applied in order.
  • Delete selected rows and their workspace files with Delete Selected.
  • Export selected rows (or all if none selected) to CSV/JSON; exported rows are reindexed from 0 and filenames reset to rec_<n>.wav; audio copies go to exports/audio/ by default.
  • Use Save to workspace to persist the current table to workspace/entries.csv; restarting the app reloads this state.
  • Double-click a row to play the audio (requires pydub playback support, e.g., simpleaudio).

Notes

  • Augmentations rely on audiomentations and soundfile; ensure their native deps are available on your system.
  • Augmented outputs are written as WAV files to maximize compatibility.
  • Playback is not included; integrate pydub/sounddevice if needed.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

audioaugmenter-0.3.0.tar.gz (209.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

audioaugmenter-0.3.0-py3-none-any.whl (11.8 kB view details)

Uploaded Python 3

File details

Details for the file audioaugmenter-0.3.0.tar.gz.

File metadata

  • Download URL: audioaugmenter-0.3.0.tar.gz
  • Upload date:
  • Size: 209.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.14

File hashes

Hashes for audioaugmenter-0.3.0.tar.gz
Algorithm Hash digest
SHA256 99482a6c7eece95623ec1bc68cb2ad91ff2c5ab50c5bdfb17ac31ee5cf1ddba5
MD5 8c4273855b18033a51129ab66947a024
BLAKE2b-256 5a3b0b25da8fb3174ce43cb2f7e98efa7e4de7c7e75c33fb2784eae663726b64

See more details on using hashes here.

File details

Details for the file audioaugmenter-0.3.0-py3-none-any.whl.

File metadata

File hashes

Hashes for audioaugmenter-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d1a415d06f4afca9d61cb6960c667656853b30c933aeeb3d53e8d523f817e840
MD5 963dc5ff82de8e183fd9d19ebf3da863
BLAKE2b-256 b158f07c7df496e49bc381569286fb4879d4480b6f4697102817f72caf7de1d5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page