Skip to main content

CLI for speeding up long-form talks by removing silence

Project description

Talks Reducer

Talks Reducer shortens long-form presentations by removing silent gaps and optionally re-encoding them to smaller files. The project was renamed from jumpcutter to emphasize its focus on conference talks and lectures.

When CUDA-capable hardware is available the pipeline leans on GPU encoders to keep export times low, but it still runs great on CPUs.

Repository Structure

  • talks_reducer/ — Python package that exposes the CLI and reusable pipeline:
    • cli.py parses arguments and dispatches to the pipeline.
    • pipeline.py orchestrates FFmpeg, audio processing, and temporary assets.
    • audio.py handles audio validation, volume analysis, and phase vocoder processing.
    • chunks.py builds timing metadata and FFmpeg expressions for frame selection.
    • ffmpeg.py discovers the FFmpeg binary, checks CUDA availability, and assembles command strings.
  • requirements.txt — Python dependencies for local development.
  • default.nix — reproducible environment definition for Nix users.
  • CONTRIBUTION.md — development workflow, formatting expectations, and release checklist.
  • AGENTS.md — maintainer tips and coding conventions for this repository.

Example

  • 1h 37m, 571 MB — Original OBS video
  • 1h 19m, 751 MB — Talks Reducer
  • 1h 19m, 171 MB — Talks Reducer --small

The --small preset applies a 720p video scale and 128 kbps audio bitrate, making it useful for sharing talks over constrained connections. Without --small, the script aims to preserve original quality while removing silence.

Highlights

  • Builds on gegell's classic jumpcutter workflow with more efficient frame and audio processing
  • Generates FFmpeg filter graphs instead of writing temporary frames to disk
  • Streams audio transformations in memory to avoid slow intermediate files
  • Accepts multiple inputs or directories of recordings in a single run
  • Provides progress feedback via tqdm
  • Automatically detects NVENC availability, so you no longer need to pass --cuda

Processing Pipeline

  1. Validate that each input file contains an audio stream using ffprobe.
  2. Extract audio and calculate loudness to identify silent regions.
  3. Stretch the non-silent segments with audiotsm to maintain speech clarity.
  4. Stitch the processed audio and video together with FFmpeg, using NVENC if the GPU encoders are detected.

Recent Updates

  • October 2025 — Project renamed to Talks Reducer across documentation and scripts.
  • October 2025 — Added --small preset with 720p/128 kbps defaults for bandwidth-friendly exports.
  • October 2025 — Removed the --cuda flag; CUDA/NVENC support is now auto-detected.
  • October 2025 — Improved --small encoder arguments to balance size and clarity.
  • October 2025 — CLI argument parsing fixes to prevent crashes on invalid combinations.
  • October 2025 — Added example output comparison to the README.

Quick Start

  1. Install FFmpeg and ensure it is on your PATH
  2. Install Talks Reducer with pip install talks-reducer (this exposes the talks-reducer command)
  3. Inspect available options with talks-reducer --help
  4. Process a recording using talks-reducer /path/to/video

Graphical Interface

Prefer a form-based workflow? Launch the bundled Tkinter application with talks-reducer-gui:

  • Simple mode — the default experience shrinks the window to a large drop zone, hides the manual run controls and log, and automatically processes new files as soon as you drop them. Uncheck the box to return to the full layout with file pickers, the Run button, and detailed logging.
  • Input drop zone — drag files or folders from your desktop or add them via the Explorer/Finder dialog; duplicates are ignored.
  • Small video — toggles the --small preset used by the CLI.
  • Advanced — reveals optional controls for the output path, temp folder, timing/audio knobs mirrored from the command line, and an appearance picker that can force dark or light mode or follow your operating system.

Progress updates stream into the 10-line log panel while the processing runs in a background thread. Once every queued job succeeds an Open last output button appears so you can jump straight to the exported file in your system file manager.

Note: Drag and drop support relies on the tkinterdnd2 package. It is installed automatically with Talks Reducer but still requires Tk 8.6 with tkdnd support on your operating system.

Programmatic Usage

The pipeline can be reused outside of the CLI by constructing talks_reducer.models.ProcessingOptions and invoking talks_reducer.pipeline.speed_up_video. A progress reporter implementing talks_reducer.progress.ProgressReporter can be supplied to bridge different user interfaces (for example, a GUI signal emitter or a logging sink).

from pathlib import Path

from talks_reducer.models import ProcessingOptions
from talks_reducer.pipeline import speed_up_video
from talks_reducer.progress import NullProgressReporter

options = ProcessingOptions(input_file=Path("talk.mp4"))
result = speed_up_video(options, reporter=NullProgressReporter())
print("Output created at", result.output_file)

See tests/test_pipeline_service.py for additional examples that stub the FFmpeg layer during unit tests.

Requirements

  • Python 3 with numpy, scipy, audiotsm, and tqdm
  • FFmpeg with optional NVIDIA NVENC support for CUDA acceleration

Contributing

See CONTRIBUTION.md for development setup details and guidance on sharing improvements.

License

Talks Reducer is released under the MIT License. See LICENSE for the full text.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

talks_reducer-0.2.1.tar.gz (25.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

talks_reducer-0.2.1-py3-none-any.whl (25.4 kB view details)

Uploaded Python 3

File details

Details for the file talks_reducer-0.2.1.tar.gz.

File metadata

  • Download URL: talks_reducer-0.2.1.tar.gz
  • Upload date:
  • Size: 25.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.10

File hashes

Hashes for talks_reducer-0.2.1.tar.gz
Algorithm Hash digest
SHA256 c3a1a2b3666e9e8e4689972856697d414a6d511fbedc81a2e617cd1d61c3de2e
MD5 c71a329dea3d45931162351d71e97572
BLAKE2b-256 70e33f6c026772b4cdca801eb484d555094a1381aa50624a5996b1be25d27b8d

See more details on using hashes here.

File details

Details for the file talks_reducer-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: talks_reducer-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 25.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.10

File hashes

Hashes for talks_reducer-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 bb2487fcc045cd3ee9983d0076eaed8068b98d0950c9a98268d3508d6e65f1c7
MD5 ba3e4ceff75c4f79f68dcf773d829ce2
BLAKE2b-256 99a4c2cde7eb203205064adc72005d744ff4508dff2cb604db0906bf836eb982

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page