Skip to main content

CLI for speeding up long-form talks by removing silence

Project description

Talks Reducer

Talks Reducer shortens long-form presentations by removing silent gaps and optionally re-encoding them to smaller files. The project was renamed from jumpcutter to emphasize its focus on conference talks and lectures.

When CUDA-capable hardware is available the pipeline leans on GPU encoders to keep export times low, but it still runs great on CPUs.

Repository Structure

  • talks_reducer/ — Python package that exposes the CLI and reusable pipeline:
    • cli.py parses arguments and dispatches to the pipeline.
    • pipeline.py orchestrates FFmpeg, audio processing, and temporary assets.
    • audio.py handles audio validation, volume analysis, and phase vocoder processing.
    • chunks.py builds timing metadata and FFmpeg expressions for frame selection.
    • ffmpeg.py discovers the FFmpeg binary, checks CUDA availability, and assembles command strings.
  • requirements.txt — Python dependencies for local development.
  • default.nix — reproducible environment definition for Nix users.
  • CONTRIBUTION.md — development workflow, formatting expectations, and release checklist.
  • AGENTS.md — maintainer tips and coding conventions for this repository.

Example

  • 1h 37m, 571 MB — Original OBS video
  • 1h 19m, 751 MB — Talks Reducer
  • 1h 19m, 171 MB — Talks Reducer --small

The --small preset applies a 720p video scale and 128 kbps audio bitrate, making it useful for sharing talks over constrained connections. Without --small, the script aims to preserve original quality while removing silence.

Highlights

  • Builds on gegell's classic jumpcutter workflow with more efficient frame and audio processing
  • Generates FFmpeg filter graphs instead of writing temporary frames to disk
  • Streams audio transformations in memory to avoid slow intermediate files
  • Accepts multiple inputs or directories of recordings in a single run
  • Provides progress feedback via tqdm
  • Automatically detects NVENC availability, so you no longer need to pass --cuda

Processing Pipeline

  1. Validate that each input file contains an audio stream using ffprobe.
  2. Extract audio and calculate loudness to identify silent regions.
  3. Stretch the non-silent segments with audiotsm to maintain speech clarity.
  4. Stitch the processed audio and video together with FFmpeg, using NVENC if the GPU encoders are detected.

Recent Updates

  • October 2025 — Project renamed to Talks Reducer across documentation and scripts.
  • October 2025 — Added --small preset with 720p/128 kbps defaults for bandwidth-friendly exports.
  • October 2025 — Removed the --cuda flag; CUDA/NVENC support is now auto-detected.
  • October 2025 — Improved --small encoder arguments to balance size and clarity.
  • October 2025 — CLI argument parsing fixes to prevent crashes on invalid combinations.
  • October 2025 — Added example output comparison to the README.

Quick Start

  1. Install FFmpeg and ensure it is on your PATH
  2. Install Talks Reducer with pip install talks-reducer (this exposes the talks-reducer command)
  3. Inspect available options with talks-reducer --help
  4. Process a recording using talks-reducer /path/to/video

Graphical Interface

Prefer a form-based workflow? Launch the bundled Tkinter application with talks-reducer-gui:

  • Simple mode — the default experience shrinks the window to a large drop zone, hides the manual run controls and log, and automatically processes new files as soon as you drop them. Uncheck the box to return to the full layout with file pickers, the Run button, and detailed logging.
  • Input drop zone — drag files or folders from your desktop or add them via the Explorer/Finder dialog; duplicates are ignored.
  • Small video — toggles the --small preset used by the CLI.
  • Advanced — reveals optional controls for the output path, temp folder, timing/audio knobs mirrored from the command line, and an appearance picker that can force dark or light mode or follow your operating system.

Progress updates stream into the 10-line log panel while the processing runs in a background thread. Once every queued job succeeds an Open last output button appears so you can jump straight to the exported file in your system file manager.

Note: Drag and drop support relies on the tkinterdnd2 package. It is installed automatically with Talks Reducer but still requires Tk 8.6 with tkdnd support on your operating system.

Programmatic Usage

The pipeline can be reused outside of the CLI by constructing talks_reducer.models.ProcessingOptions and invoking talks_reducer.pipeline.speed_up_video. A progress reporter implementing talks_reducer.progress.ProgressReporter can be supplied to bridge different user interfaces (for example, a GUI signal emitter or a logging sink).

from pathlib import Path

from talks_reducer.models import ProcessingOptions
from talks_reducer.pipeline import speed_up_video
from talks_reducer.progress import NullProgressReporter

options = ProcessingOptions(input_file=Path("talk.mp4"))
result = speed_up_video(options, reporter=NullProgressReporter())
print("Output created at", result.output_file)

See tests/test_pipeline_service.py for additional examples that stub the FFmpeg layer during unit tests.

Requirements

  • Python 3 with numpy, scipy, audiotsm, and tqdm
  • FFmpeg with optional NVIDIA NVENC support for CUDA acceleration

Contributing

See CONTRIBUTION.md for development setup details and guidance on sharing improvements.

License

Talks Reducer is released under the MIT License. See LICENSE for the full text.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

talks_reducer-0.2.0.tar.gz (25.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

talks_reducer-0.2.0-py3-none-any.whl (25.4 kB view details)

Uploaded Python 3

File details

Details for the file talks_reducer-0.2.0.tar.gz.

File metadata

  • Download URL: talks_reducer-0.2.0.tar.gz
  • Upload date:
  • Size: 25.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.10

File hashes

Hashes for talks_reducer-0.2.0.tar.gz
Algorithm Hash digest
SHA256 930d06b4aa8a2903d5e2525627363ea04e625b176ab174e12cbd7a17aa5ed470
MD5 ece3b99a6aea1f7e3d768cbb4ade53af
BLAKE2b-256 dc27b48a9713deddc86f09afe1c3e9fbae081328ba88e4f73b2b83d4984cde03

See more details on using hashes here.

File details

Details for the file talks_reducer-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: talks_reducer-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 25.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.10

File hashes

Hashes for talks_reducer-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 9b7f052cfd5374c933ac497c989f1f62595077058d7e44204689d436efee67ce
MD5 426fbe180381016928bec9f4e162b3a5
BLAKE2b-256 3ab2bc35bd95726f7ec48c40998b8a88161a1bb4218ece6bc39c851ec27c68e7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page