audiotq

A high-performance lossy audio compression engine inspired by LLM weight quantization

These details have not been verified by PyPI

Project description

TurboQuant Audio Banner

Version Python

A minimal, high-performance lossy audio compression engine built in Python and NumPy.

TQA adapts the mathematical principles of TurboQuant (originally designed for data-oblivious weight quantization of LLMs on GPU clusters) into a localized, cache-aligned CPU audio codec. By mapping high-dimensional audio amplitudes into zero-centered symmetrical Gaussian distributions via Hadamard rotations, it achieves a ~3x memory reduction while preserving transient fidelity.

Features

Data-Oblivious Energy Flattening: Spreads transient spike energy uniformly across audio blocks using Fast Walsh-Hadamard Transform (FWHT) rotations.
Optimal Centroid Clustering: Employs an iterative Lloyd-Max solver to converge on the Mean-Squared-Error (MSE) optimal 6-bit codebook.
1-Bit QJL Residual Layer: Uses a Quantized Joint Least-Squares error sign layer to track quantization rounding errors and suppress distortion.
Zero-Dependency: Built purely on Python and NumPy.

Directory Structure

├── audiotq/
│   ├── engine.py       # Core compression & decompression pipeline loops
│   ├── rotator.py      # Fast Walsh-Hadamard Transform (FWHT) rotations
│   ├── quantizer.py    # Iterative Lloyd-Max solver & bit-packing mechanics
│   └── wrapper.py      # WAV file read/write wrapper (Supports 8, 16, and 24-bit PCM)
├── audio_cli.py        # CLI for processing audio files and running telemetry
├── main.py             # CLI for generating and compressing synthetic signals
└── tests/
    └── run_tests.py    # System stress tests and SQNR verification

Installation

Ensure you have Python 3.10+ and numpy / scipy installed:

pip install numpy scipy

Usage

1. Run System Verification Tests

To run the automated test suite (unit tests, benchmarks, and verification):

uv run pytest

You can also run the legacy sequential stress tests via:

uv run python3 -m tests.run_tests

2. Compress and Decompress Audio

Use the audio_cli.py utility to process any standard .wav audio track:

uv run audio_cli.py run -i input.wav -o output_reconstructed.wav

3. Compare Reconstructed Audio

Extract mathematical fidelity metrics (MSE, SQNR, correlation, and envelope preservation) between raw and processed signals:

uv run audio_cli.py compare -f1 input.wav -f2 output_reconstructed.wav

4. Run Synthetic Simulations

Generate custom synthetic signals (e.g., sine waves, square waves, noise, transients) with custom parameters:

uv run main.py --type square --frequency 440 --duration 2.0 --spikes 5

Performance Benchmarks

Running uv run pytest executes telemetry benchmarking on a standard high-sample dataset (44.1 kHz stereo/mono):

Metric	Performance Profile
Original Dataset Size	2.52 MB (15.0 seconds)
Compressed On-Disk Footprint	0.65 MB
Compression Ratio	~3.91x smaller footprint (74.4% reduction)
Fidelity (SQNR)	30.24 dB
Compression Throughput	~1.32 MB/s
Decompression Throughput	~1.35 MB/s

Known Limitations & Failure Modes

1. WHT Basis Alignment (Extreme Sparsity Failure)

The codec relies on a Fast Walsh-Hadamard Transform (FWHT) rotation to "Gaussianize" audio blocks, making them fit the Gaussian Lloyd-Max codebook.

Failure Scenario: If an input block perfectly aligns with one of the Walsh-Hadamard basis vectors (e.g. signal = rotator.hadamardSigns), the rotated vector becomes a single extreme Kronecker delta spike.
Result: Because the Lloyd-Max codebook is optimized for normal distributions, it clips this extreme spike to the outermost centroid boundary ($\pm 2.41$ standard deviations). This clipping noise destroys reconstruction quality, dropping the SQNR to ~1.31 dB. (Proven in tests/test_failures.py::test_failure_hadamard_basis_alignment).

2. Silent Block Edge Case

Edge Case: Silent blocks have a standard deviation of 0.0. Dividing by this value during block standardization would lead to NaN or Inf errors.
Resolution: The engine implements a safety threshold guard (std_dev > 1e-6). Silent blocks bypass normalization and are reconstructed as perfect silence. (Proven in tests/test_failures.py::test_boundary_silent_signal).

License

This project is licensed under the GNU General Public License v3.0. See the LICENSE file for details.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.1

Jun 9, 2026

This version

0.1.0

Jun 9, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

audiotq-0.1.0.tar.gz (494.8 kB view details)

Uploaded Jun 9, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

audiotq-0.1.0-py3-none-any.whl (23.3 kB view details)

Uploaded Jun 9, 2026 Python 3

File details

Details for the file audiotq-0.1.0.tar.gz.

File metadata

Download URL: audiotq-0.1.0.tar.gz
Upload date: Jun 9, 2026
Size: 494.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.3 {"installer":{"name":"uv","version":"0.11.3","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for audiotq-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`e0a34a463fb8dc0f0461732e7af7f3dd112d65338fada090aec656e9f4d4b29d`
MD5	`8d4b3b01d0c59908a752ce484d3a93b8`
BLAKE2b-256	`4868b7b3c4b4a1d819dd50cee45ec2890889c4b60b15cb17f0cd76cef6ba217f`

See more details on using hashes here.

File details

Details for the file audiotq-0.1.0-py3-none-any.whl.

File metadata

Download URL: audiotq-0.1.0-py3-none-any.whl
Upload date: Jun 9, 2026
Size: 23.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.3 {"installer":{"name":"uv","version":"0.11.3","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for audiotq-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`26c135fe90a00dc3faf2d4ebe828ea7fdb4988af021edcecbc40ab142c3668cb`
MD5	`a06dd7e039fd818075b9d7bc7dd3c20e`
BLAKE2b-256	`47ea3e44276a5fb04a117aea14d33c791c683895c1a717b0296adcc0d62c3d70`

See more details on using hashes here.

audiotq 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Features

Directory Structure

Installation

Usage

1. Run System Verification Tests

2. Compress and Decompress Audio

3. Compare Reconstructed Audio

4. Run Synthetic Simulations

Performance Benchmarks

Known Limitations & Failure Modes

1. WHT Basis Alignment (Extreme Sparsity Failure)

2. Silent Block Edge Case

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes