Skip to main content

Structured audio comparison for producers and developers. Think git diff, but for audio.

Project description

sounddiff

CI PyPI Python License: MIT CodeRabbit Reviews

sounddiff is a CLI tool for audio producers and developers to compare two audio files and see exactly what changed. It reports differences in loudness, spectral balance, timing, and flags issues like clipping and silence. Output comes as colored terminal text, structured JSON, or a self-contained HTML report.

Example

$ sounddiff mix-v3.wav mix-v4.wav

sounddiff: mix-v3.wav vs mix-v4.wav

Duration     3:42.108 → 3:42.108  (no change)
Sample Rate  48000 Hz → 48000 Hz  (no change)
Channels     stereo   → stereo    (no change)

Loudness (integrated)
  LUFS       -14.2    → -12.8     (+1.4 dB)
  Peak dBTP  -1.1     → -0.3      (+0.8 dB)
  LRA         8.2     →  6.4      (-1.8 LU)

Spectral
  Low  (20-250 Hz)    +0.8 dB avg
  Mid  (250-4k Hz)    +0.3 dB avg
  High (4k-20k Hz)    +1.9 dB avg

Segments
  0:00-1:12   similar (correlation: 0.97)
  1:12-1:14   ADDED (new content, 2.1s)
  1:14-3:42   similar (correlation: 0.98, shifted +2.1s)

Issues
  ⚠ Clipping detected in mix-v4.wav at 2:31.4 (3 samples)

Installation

pip install sounddiff

Requires Python 3.10 or later. Supports WAV, FLAC, OGG, and AIFF natively. For MP3 and AAC support, install ffmpeg.

Usage

Compare two files with colored terminal output:

sounddiff old-mix.wav new-mix.wav

Get structured JSON for scripts and CI pipelines:

sounddiff old.wav new.wav --format json

Generate an HTML report:

sounddiff old.wav new.wav --format html -o report.html

See docs/usage.md for all options.

What it analyzes

Category Measurements
Loudness Integrated LUFS, true peak (dBTP), loudness range (LRA) per ITU-R BS.1770
Spectral Average energy per frequency band (low, mid, high) with configurable ranges
Temporal Segment-level cross-correlation, added/removed/shifted section detection
Detection Clipping events (timestamp, channel, sample count), silence regions
Metadata Duration, sample rate, channels, bit depth, format

Output formats

Terminal is the default. Colored, grouped by category, designed to be read top to bottom. Uses rich for formatting.

JSON outputs the same data in a structured format. Pipe it to jq, parse it in Python, or use it in CI pipelines for automated regression testing.

HTML generates a self-contained report with inline styles. No external dependencies. Open it in any browser, share it with your team, or archive it alongside your session files.

How it's built

sounddiff is written in Python with a modular architecture. Each analysis type (loudness, spectral, temporal, detection) lives in its own module with no cross-dependencies. The core orchestrator loads two audio files, runs all analyzers, and passes the results to a formatter.

Dependency Purpose
soundfile Audio I/O via libsndfile
numpy Array math, FFT, cross-correlation
scipy Signal processing
pyloudnorm ITU-R BS.1770 loudness measurement
click CLI framework
rich Terminal formatting
jinja2 HTML report templates

See docs/architecture.md for the full module breakdown and data flow.

Documentation

Contributing

Contributions are welcome. See CONTRIBUTING.md for setup instructions and our development workflow.

The issue board has open work organized by milestone. Issues labeled good first issue are scoped for newcomers and have enough context to get started without deep DSP knowledge.

Security

Report vulnerabilities to dev@systemblue.io. See SECURITY.md for our disclosure policy.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sounddiff-0.1.0.tar.gz (36.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sounddiff-0.1.0-py3-none-any.whl (18.8 kB view details)

Uploaded Python 3

File details

Details for the file sounddiff-0.1.0.tar.gz.

File metadata

  • Download URL: sounddiff-0.1.0.tar.gz
  • Upload date:
  • Size: 36.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for sounddiff-0.1.0.tar.gz
Algorithm Hash digest
SHA256 5c465c0b2cae99666eca9e1700ed76c89ea23e3e6930c79f197deb91deb1862b
MD5 4c965be3b08e3d41207b31cb9ddeeee5
BLAKE2b-256 d173aa9e9cf7bc6d8acc171f3aebc3aa1f7881755d3e272f48e2bd5d59419062

See more details on using hashes here.

Provenance

The following attestation bundles were made for sounddiff-0.1.0.tar.gz:

Publisher: release.yml on systemblueteam/sounddiff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file sounddiff-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: sounddiff-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 18.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for sounddiff-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 212b959dbdcbeb8d103dcab0ea955a6ace5f949fb4ec7d8caad159bb294c8db6
MD5 4d2439bb107801708a984af76108d9ce
BLAKE2b-256 74212313c05f5151562fe3fe9c201390026d8fbe2918e26a6941089ef6954251

See more details on using hashes here.

Provenance

The following attestation bundles were made for sounddiff-0.1.0-py3-none-any.whl:

Publisher: release.yml on systemblueteam/sounddiff

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page