Skip to main content

Add your description here

Project description

AudioAugmenter

Desktop GUI for importing, augmenting, and exporting WAV/MP3 samples.

Features

  • Drag-and-drop import into a local workspace/ directory.
  • Central table to edit transcriptions, delete rows, and track applied augmentations.
  • One-click augmentation (Gaussian noise, time stretch, pitch shift, shift) via audiomentations.
  • Export to CSV or JSON with optional audio copies.

Requirements

  • Python 3.13+
  • Dependencies: PyQt6, audiomentations, pandas, numpy, soundfile, pydub (see pyproject.toml).

Quick start

python -m venv .venv
source .venv/bin/activate
pip install -e .
python app.py

Usage

  • Use Add Files or drag .wav/.mp3 files into the left pane. Files are copied into workspace/.
  • Edit transcriptions directly in the middle table.
  • Select rows, check one or more augmentations (edit parameters in the panel), and click Apply Augmentation to create a single derived file per source with all checked augmentations applied in order.
  • Delete selected rows and their workspace files with Delete Selected.
  • Export selected rows (or all if none selected) to CSV/JSON; exported rows are reindexed from 0 and filenames reset to rec_<n>.wav; audio copies go to exports/audio/ by default.
  • Use Save to workspace to persist the current table to workspace/entries.csv; restarting the app reloads this state.
  • Double-click a row to play the audio (requires pydub playback support, e.g., simpleaudio).

Notes

  • Augmentations rely on audiomentations and soundfile; ensure their native deps are available on your system.
  • Augmented outputs are written as WAV files to maximize compatibility.
  • Playback is not included; integrate pydub/sounddevice if needed.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

audioaugmenter-0.2.0.tar.gz (209.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

audioaugmenter-0.2.0-py3-none-any.whl (11.8 kB view details)

Uploaded Python 3

File details

Details for the file audioaugmenter-0.2.0.tar.gz.

File metadata

  • Download URL: audioaugmenter-0.2.0.tar.gz
  • Upload date:
  • Size: 209.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.14

File hashes

Hashes for audioaugmenter-0.2.0.tar.gz
Algorithm Hash digest
SHA256 a9ffe69f3ec4201fea725044b1a52f07da3e39cb5ebd92706b8671904c3f9ccc
MD5 cf752c510e930cbf67992ec30790a3cc
BLAKE2b-256 c67ac3ffc95f6e224d283662fa7b4540cf55d67083387e3cc11658a4d69570cb

See more details on using hashes here.

File details

Details for the file audioaugmenter-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for audioaugmenter-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 3b531cea3c41e92cb071947f49ab4872aa5e68a18e72f584687ea3a0302e50c9
MD5 63f801adefbd2f001fc0b874be1c81ec
BLAKE2b-256 c0faa4157550147c676f723afd6138380b2d96cfd94004d423300d5b3047c60f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page