Double-ender time alignment engine for podcast production

These details have not been verified by PyPI

Project description

double-ender-sync

double-ender-sync is a CLI tool that aligns each speaker's local recording to a mixed reference recording ("master") for podcast post-production.

It focuses on time alignment and diagnostics, not final audio mixing.

Project status

This project is currently experimental (alpha).

It can produce useful alignment results for some double-ender podcast recordings, but it is not yet a fully validated production-grade editor. Always review generated reports, markers, warnings, and synced audio manually before using outputs in final production.

What this tool does

Detects initial timing offset between each local track and the master.
Estimates long-duration clock drift from multiple anchor points.
Applies global time correction and exports synced WAV files.
Produces alignment diagnostics (sync-report.json, markers, warnings) so editors can review confidence and problem areas.

Offset definition:

offset_seconds = master_time - local_time
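
As an illustration of how the offset definition combines with a linear drift model, the sketch below fits offset_seconds as a straight line over local time using ordinary least squares. It is a minimal sketch, not the tool's actual implementation: the function names and anchor values are hypothetical, and the real pipeline may weight or filter anchors differently.

```python
from statistics import fmean


def fit_linear_drift(anchors):
    """Fit offset_seconds = intercept + slope * local_time by least squares.

    `anchors` is a list of (local_time, master_time) pairs in seconds.
    """
    if len(anchors) < 2:
        raise ValueError("need at least two anchors to estimate drift")
    local_times = [local for local, _ in anchors]
    # Offset definition from this README: offset_seconds = master_time - local_time
    offsets = [master - local for local, master in anchors]
    mean_t = fmean(local_times)
    mean_o = fmean(offsets)
    variance = sum((t - mean_t) ** 2 for t in local_times)
    if variance == 0:
        raise ValueError("anchors must not all share the same local time")
    covariance = sum((t - mean_t) * (o - mean_o) for t, o in zip(local_times, offsets))
    slope = covariance / variance
    intercept = mean_o - slope * mean_t
    return intercept, slope


def local_to_master(local_time, intercept, slope):
    """Map a local timestamp onto the master timeline using the fitted model."""
    return local_time + intercept + slope * local_time


# Hypothetical anchors: a 2.5 s initial offset that grows by 0.06 s every 10 minutes.
anchors = [(0.0, 2.5), (600.0, 602.56), (1200.0, 1202.62), (1800.0, 1802.68)]
intercept, slope = fit_linear_drift(anchors)
```

Here the intercept plays the role of the initial offset and the slope expresses drift in seconds of offset per second of local time. Large residuals against such a fit would suggest that a simple linear model does not describe the recording well.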

Install

Requirements

Python 3.11+
WAV input files for master and local tracks

From source

pip install .

Development install

python -m venv .venv
source .venv/bin/activate
python -m pip install --upgrade pip
pip install -e .

Run tests:

pip install -e ".[dev]"
pytest

# for pitch-preserving stretch option
pip install -e ".[stretch]"

After installation, the command is available as:

double-ender-sync --help

Quick start

Input example:

input/
  master.wav
  speaker-a.wav
  speaker-b.wav

Run:

double-ender-sync \
  --master input/master.wav \
  --track input/speaker-a.wav \
  --track input/speaker-b.wav \
  --out output/

Output files

Typical output:

output/
  speaker-a.synced.wav
  speaker-b.synced.wav
  sync-report.json
  sync-markers.csv
  warnings.txt

Useful options

--analysis-sample-rate 16000
Set the sample rate used for analysis (feature extraction and matching).
--local-adjust-enabled
Enable the experimental local adjustment around large residual errors. This is disabled by default and should only be used after reviewing the report and audio manually.
--local-adjust-threshold-ms 80
Threshold for triggering local adjustment diagnostics/correction.
--normalize-output
Normalize final synced WAV peak level before writing. Disabled by default.
--stretch-ratio-warning-threshold 0.003
Warn when abs(stretch_ratio - 1.0) exceeds threshold (default 0.003 = 0.3%).
--stretch-ratio-auto-continue
Skip interactive confirmation and continue even when stretch ratio warning threshold is exceeded.
--stretch-method {resample,pitch_preserving}
Global correction method. resample is the default. pitch_preserving uses librosa and prioritizes pitch stability for larger drift corrections.
--debug
Enable debug logging to identify which stage is running when resource usage spikes.
--log-file output/debug.log
Write logs to a specific file path (default: output/double-ender-sync.log).

Use double-ender-sync --help for the full option list.
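
To give a feel for the --stretch-ratio-warning-threshold check, the sketch below applies the abs(stretch_ratio - 1.0) rule described above and converts a ratio into the timing change it implies over a session. The function names are hypothetical and the tool's internal logic may differ; only the comparison itself comes from the option description.

```python
def exceeds_stretch_warning(stretch_ratio, threshold=0.003):
    """Return True when the deviation from 1.0 is larger than the threshold."""
    return abs(stretch_ratio - 1.0) > threshold


def implied_timing_change(stretch_ratio, duration_seconds):
    """Seconds of accumulated timing change implied by a ratio over a duration."""
    return (stretch_ratio - 1.0) * duration_seconds


# A small correction stays below the default threshold of 0.3%.
small = exceeds_stretch_warning(1.0001)

# A 0.5% correction in either direction is flagged.
large_up = exceeds_stretch_warning(1.005)
large_down = exceeds_stretch_warning(0.995)

# Over a one-hour recording, a 0.3% deviation amounts to roughly 10.8 seconds.
one_hour_change = implied_timing_change(1.003, 3600.0)
```

A deviation that large is far more than typical clock drift between two recorders, which is one reason a warning at this level is worth investigating before continuing.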

GUI (PySide6, drag & drop)

This project also provides an optional desktop GUI built with PySide6.

Install with GUI dependency:

pip install -e ".[gui]"

Launch GUI:

double-ender-sync-gui

Language option (`--lang`) common specification

Language resolution behaves the same way across the whole project:

--lang <code> is accepted (for example: en, ja).
If --lang is omitted, the system locale is used (LC_ALL, then LANG).
If the normalized language is unsupported, the tool falls back to en.
Regional codes are normalized to their language part before support checks (for example: en-US -> en, ja_JP.UTF-8 -> ja).
The GUI applies this resolver first, and the same resolver can be reused from the CLI and API, so each entry point does not need its own language detection logic.

Examples:

double-ender-sync-gui --lang en
double-ender-sync-gui --lang ja
double-ender-sync-gui
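
The resolution rules above can be sketched in a few lines of Python. This is an illustrative sketch only: normalize_language and resolve_language are hypothetical names, and the contents of SUPPORTED_LANGUAGES here are assumed from the examples rather than taken from the project source.

```python
import os

SUPPORTED_LANGUAGES = ("en", "ja")  # assumption: based on the examples in this README
DEFAULT_LANGUAGE = "en"


def normalize_language(code):
    """Reduce a locale string such as 'ja_JP.UTF-8' or 'en-US' to its language part."""
    language = code.split(".", 1)[0]      # drop the encoding, e.g. '.UTF-8'
    language = language.split("@", 1)[0]  # drop a modifier, e.g. '@euro'
    language = language.replace("-", "_").split("_", 1)[0]
    return language.strip().lower()


def resolve_language(lang=None, environ=None):
    """Resolve the language: explicit value, then LC_ALL, then LANG, then 'en'."""
    environ = os.environ if environ is None else environ
    candidate = lang or environ.get("LC_ALL") or environ.get("LANG") or ""
    language = normalize_language(candidate)
    return language if language in SUPPORTED_LANGUAGES else DEFAULT_LANGUAGE


# Explicit --lang value with a regional suffix.
explicit = resolve_language("en-US", environ={})

# No --lang: LC_ALL takes precedence over LANG.
from_env = resolve_language(None, environ={"LC_ALL": "ja_JP.UTF-8", "LANG": "en_US.UTF-8"})

# Unsupported language falls back to en.
fallback = resolve_language(None, environ={"LANG": "fr_FR.UTF-8"})
```

Passing the environment as a parameter keeps the function easy to test without modifying the real process environment.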

GUI features (current):

Select master.wav
Drag and drop multiple speaker .wav tracks
Choose output directory
Run the same alignment pipeline as CLI

Python API (import from another project)

In addition to CLI usage, you can run the same pipeline from Python.

from pathlib import Path

from double_ender_sync import AlignmentOptions, run_alignment

options = AlignmentOptions(
    master=Path("input/master.wav"),
    tracks=[Path("input/speaker-a.wav"), Path("input/speaker-b.wav")],
    out=Path("output"),
    analysis_sample_rate=16000,
    local_adjust_enabled=False,
    normalize_output=False,
)

exit_code = run_alignment(options)
if exit_code != 0:
    raise RuntimeError(f"alignment failed with exit code {exit_code}")

run_alignment(...) returns an exit code with the same semantics as the CLI main(...).

Translation operation rules

Translation keys are domain-prefixed and stable (gui.*, cli.*, api.*, errors.*, warnings.*).
Never use display text itself as a key.
Missing key behavior is unified:
- If the target locale does not have the key, fall back to en.
- If en also does not have the key, show the key string and emit a warning log.
Placeholder formatting is unified (for example: "File not found: {path}").
- Placeholder names must match exactly across all languages for the same key.
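
The missing-key behavior described above might look like the following sketch. The catalogs, keys, and the translate function are hypothetical stand-ins for illustration; the project's real lookup code and locale contents are not shown in this README.

```python
import logging

logger = logging.getLogger("i18n_sketch")

# Hypothetical in-memory catalogs; the real ones are per-locale JSON files.
CATALOGS = {
    "en": {
        "errors.file_not_found": "File not found: {path}",
        "gui.run_button": "Run",
    },
    "ja": {
        "errors.file_not_found": "ファイルが見つかりません: {path}",
    },
}


def translate(key, lang, **params):
    """Look up `key` in `lang`, fall back to en, then to the key string itself."""
    for locale in (lang, "en"):
        template = CATALOGS.get(locale, {}).get(key)
        if template is not None:
            return template.format(**params)
    # Neither the target locale nor en has the key: log and show the key.
    logger.warning("missing translation key: %s", key)
    return key


localized = translate("errors.file_not_found", "ja", path="input/master.wav")
from_english = translate("gui.run_button", "ja")
unknown = translate("gui.unknown_key", "ja")
```

Because placeholders are filled by name, a translation that renames {path} would fail at format time, which is why placeholder names need to match across languages.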

Adding a new language

Add a locale file: src/double_ender_sync/i18n/locales/<lang>.json.
Add <lang> to SUPPORTED_LANGUAGES in src/double_ender_sync/i18n/resolver.py.
Run required key validation: double-ender-sync-validate-locales (or python -m double_ender_sync.i18n.validate).
Verify UI rendering manually:
- launch double-ender-sync-gui with your locale selected,
- confirm that labels, dialogs, and errors render correctly,
- run one alignment and check runtime messages/logs.
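
Before running the project's validator, it can help to understand what a required-key check involves. The sketch below is an assumption about the kind of check such a validator performs, not the code behind double-ender-sync-validate-locales; the function names and sample catalogs are hypothetical.

```python
from string import Formatter


def placeholder_names(template):
    """Return the set of named placeholders in a str.format-style template."""
    return {field for _, field, _, _ in Formatter().parse(template) if field}


def validate_locale(reference, candidate):
    """Compare a candidate catalog against the reference (en) catalog.

    Returns a list of problems; an empty list means the check passed.
    """
    problems = []
    for key, template in reference.items():
        if key not in candidate:
            problems.append(f"missing key: {key}")
        elif placeholder_names(candidate[key]) != placeholder_names(template):
            problems.append(f"placeholder mismatch: {key}")
    return problems


# Hypothetical catalogs: the candidate renames a placeholder and omits a key.
en_catalog = {
    "errors.file_not_found": "File not found: {path}",
    "gui.run_button": "Run",
}
fr_catalog = {
    "errors.file_not_found": "Fichier introuvable : {chemin}",
}
problems = validate_locale(en_catalog, fr_catalog)
```

In practice the catalogs would be loaded from the locale JSON files with json.load before being compared.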

Intended use case

This tool is intended for podcast double-ender workflows where:

each participant records a local WAV file,
a mixed call recording is available as a timing reference,
local recordings contain enough speech anchors across the session,
the final output is reviewed and edited by a human in a DAW.

It may perform poorly when:

the master recording is heavily compressed/noisy or missing large sections,
a local track contains very little speech,
local and master recordings contain different edits,
long dropouts or repeated phrases confuse anchor matching,
timing changes are non-linear and not well approximated by a simple drift model.

Reviewing the result

After running the tool, inspect:

warnings.txt for low-confidence regions and skipped adjustments,
sync-markers.csv for anchor/residual positions,
sync-report.json for per-track offset/stretch/residual diagnostics,
exported .synced.wav files by listening in your DAW.

Do not treat generated synced files as final mastered audio.

Temporary files

This tool creates temporary memory-mapped files during analysis to reduce peak RAM usage for long recordings. These temporary files are cleaned up at the end of a normal CLI run.
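
For readers unfamiliar with the technique, the sketch below shows one way to back a sample buffer with a temporary memory-mapped file using only the standard library. It is an illustration under stated assumptions: temporary_float_buffer is a hypothetical helper, and the tool's own temporary-file handling may be implemented quite differently.

```python
import mmap
import tempfile
from contextlib import contextmanager


@contextmanager
def temporary_float_buffer(sample_count):
    """Yield a disk-backed buffer of `sample_count` float32 values, removed on exit."""
    if sample_count <= 0:
        raise ValueError("sample_count must be positive")
    with tempfile.TemporaryFile() as handle:
        handle.truncate(sample_count * 4)  # 4 bytes per float32 sample, zero-filled
        with mmap.mmap(handle.fileno(), 0) as mapped:
            raw = memoryview(mapped)
            view = raw.cast("f")  # reinterpret the bytes as float32 values
            try:
                yield view
            finally:
                # Both views must be released before the mapping can close.
                view.release()
                raw.release()


# One second of audio at 48 kHz held in a temporary file instead of a Python list.
with temporary_float_buffer(48_000) as samples:
    samples[0] = 0.5
    samples[47_999] = -0.25
    peak = max(abs(value) for value in samples)
```

The operating system pages data in and out as needed, so the full recording does not have to sit in RAM at once, and the file disappears when the context exits.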

Current implementation status

Implemented pipeline includes:

audio loading and normalization for analysis,
speech-region detection (RMS-based),
anchor selection and matching against master,
initial offset estimation,
multi-anchor linear drift estimation,
global correction and synced WAV export,
detailed reporting with warnings/errors.
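
To make the RMS-based speech-region step more concrete, here is a minimal sketch of frame-wise RMS thresholding. The frame length, threshold, and function names are hypothetical choices for illustration; the tool's actual detector and parameters may differ.

```python
import math


def frame_rms(samples, frame_length):
    """Return the RMS level of each full, non-overlapping frame."""
    return [
        math.sqrt(sum(s * s for s in samples[start:start + frame_length]) / frame_length)
        for start in range(0, len(samples) - frame_length + 1, frame_length)
    ]


def detect_active_regions(samples, sample_rate, frame_seconds=0.02, threshold=0.05):
    """Return (start_seconds, end_seconds) spans whose frame RMS reaches the threshold."""
    frame_length = max(1, round(sample_rate * frame_seconds))
    levels = frame_rms(samples, frame_length)
    regions = []
    region_start = None
    for index, level in enumerate(levels):
        if level >= threshold and region_start is None:
            region_start = index  # a new active region begins
        elif level < threshold and region_start is not None:
            regions.append((region_start * frame_length / sample_rate,
                            index * frame_length / sample_rate))
            region_start = None
    if region_start is not None:
        # The signal was still active at the end of the last full frame.
        regions.append((region_start * frame_length / sample_rate,
                        len(levels) * frame_length / sample_rate))
    return regions


# Synthetic signal at 1 kHz: 0.1 s silence, 0.2 s constant level, 0.1 s silence.
signal = [0.0] * 100 + [0.5] * 200 + [0.0] * 100
regions = detect_active_regions(signal, sample_rate=1000)
```

A fixed threshold like this is sensitive to recording level and background noise, so a real detector would likely need level normalization or an adaptive threshold.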

Scope and non-goals

This project does not do final podcast mastering tasks such as:

noise reduction,
EQ/compression/loudness normalization,
transcript-based editing,
final mixdown/publishing.

The expected workflow is:

raw recordings -> double-ender-sync -> synced WAV + report -> human DAW edit

Licensing and distribution policy

Project code is MIT licensed.

Current policy is source-only distribution from this repository. No official prebuilt binaries are published.

Before publishing any binary builds in the future, review third-party obligations (especially LGPL-related components) and update distribution/legal documentation accordingly.

See:

THIRD_PARTY_NOTICES.md
docs/licensing-source-only.md

Project details


Release history

0.2.4

May 16, 2026

0.2.3

May 11, 2026

0.2.2

May 9, 2026

0.2.1

May 8, 2026

0.2.0

May 6, 2026

0.1.3

May 3, 2026

0.1.1

May 3, 2026

This version

0.1.0

May 3, 2026

Download files


Source Distribution

double_ender_sync-0.1.0.tar.gz (31.8 kB view details)

Uploaded May 3, 2026 Source

File details

Details for the file double_ender_sync-0.1.0.tar.gz.

File metadata

Download URL: double_ender_sync-0.1.0.tar.gz
Upload date: May 3, 2026
Size: 31.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for double_ender_sync-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`cead36a6a579f54eca78a239a18a05b3287017eaa35aa31c23012863695ec77c`
MD5	`b6a491381b75fb055eed7dc296fed9f1`
BLAKE2b-256	`831494d7009755bf5dc0938d13052111e6179a8177714ebeca14c3b0c4d3d499`


Provenance

The following attestation bundles were made for double_ender_sync-0.1.0.tar.gz:

Publisher: publish.yml on ogra/double-ender-sync

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: double_ender_sync-0.1.0.tar.gz
- Subject digest: cead36a6a579f54eca78a239a18a05b3287017eaa35aa31c23012863695ec77c
- Sigstore transparency entry: 1431868258
- Sigstore integration time: May 3, 2026
Source repository:
- Permalink: ogra/double-ender-sync@659e458636935368d258e1c4d8c89aa71de5794a
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/ogra
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@659e458636935368d258e1c4d8c89aa71de5794a
- Trigger Event: push
