Skip to main content

AI-powered video captioning tool — ASR, subtitle optimization, translation, and synthesis

Project description

VideoCaptioner Logo

VideoCaptioner

基于大语言模型的视频字幕处理工具 — 语音识别、字幕优化、翻译、视频合成一站式处理

在线文档 · CLI 使用 · GUI 桌面版 · Claude Code Skill

安装

pip install videocaptioner          # 仅安装 CLI(轻量,无 GUI 依赖)
pip install videocaptioner[gui]     # 安装 CLI + GUI 桌面版

免费功能(必剪语音识别、必应/谷歌翻译)无需任何配置,安装即用

CLI 命令行

# 语音转录(免费,无需 API Key)
videocaptioner transcribe video.mp4 --asr bijian

# 字幕翻译(免费必应翻译)
videocaptioner subtitle input.srt --translator bing --target-language en

# 全流程:转录 → 优化 → 翻译 → 合成
videocaptioner process video.mp4 --target-language ja

# 字幕烧录到视频
videocaptioner synthesize video.mp4 -s subtitle.srt

# 下载在线视频
videocaptioner download "https://youtube.com/watch?v=xxx"

需要 LLM 功能(字幕优化、大模型翻译)时,配置 API Key:

videocaptioner config set llm.api_key <your-key>
videocaptioner config set llm.api_base https://api.openai.com/v1
videocaptioner config set llm.model gpt-4o-mini

配置优先级:命令行参数 > 环境变量 (VIDEOCAPTIONER_*) > 配置文件 > 默认值。运行 videocaptioner config show 查看当前配置。

所有 CLI 命令一览
命令 说明
transcribe 语音转字幕。引擎:faster-whisperwhisper-apibijian(免费)、jianying(免费)、whisper-cpp
subtitle 字幕优化/翻译。翻译服务:llmbing(免费)、google(免费)
synthesize 字幕烧录到视频(软字幕/硬字幕)
process 全流程处理
download 下载 YouTube、B站等平台视频
config 配置管理(showsetgetpathinit

运行 videocaptioner <命令> --help 查看完整参数。完整 CLI 文档见 docs/cli.md

GUI 桌面版

pip install videocaptioner[gui]
videocaptioner                      # 无参数时自动打开桌面版
其他安装方式:Windows 安装包 / macOS 一键脚本

Windows:从 Release 下载安装包

macOS

curl -fsSL https://raw.githubusercontent.com/WEIFENG2333/VideoCaptioner/main/scripts/run.sh | bash

页面预览 页面预览

LLM API 配置

LLM 仅用于字幕优化和大模型翻译,免费功能(必剪识别、必应翻译)无需配置。

支持所有 OpenAI 兼容接口的服务商:

服务商 官网
VideoCaptioner 中转站 api.videocaptioner.cn — 高并发,性价比高,支持 GPT/Claude/Gemini 等
SiliconCloud cloud.siliconflow.cn
DeepSeek platform.deepseek.com

在软件设置或 CLI 中填入 API Base URL 和 API Key 即可。详细配置教程

Claude Code Skill

本项目提供了 Claude Code Skill,让 AI 编程助手可以直接调用 VideoCaptioner 处理视频。

安装到 Claude Code:

mkdir -p ~/.claude/skills/videocaptioner
cp skills/SKILL.md ~/.claude/skills/videocaptioner/SKILL.md

然后在 Claude Code 中输入 /videocaptioner transcribe video.mp4 --asr bijian 即可使用。

工作原理

音视频输入 → 语音识别 → 字幕断句 → LLM 优化 → 翻译 → 视频合成
  • 词级时间戳 + VAD 语音活动检测,识别准确率高
  • LLM 语义理解断句,字幕阅读体验自然流畅
  • 上下文感知翻译,支持反思优化机制
  • 批量并发处理,效率高

开发

git clone https://github.com/WEIFENG2333/VideoCaptioner.git
cd VideoCaptioner
uv sync && uv run videocaptioner     # 运行 GUI
uv run videocaptioner --help          # 运行 CLI
uv run pyright                        # 类型检查
uv run pytest tests/test_cli/ -q      # 运行测试

许可证

GPL-3.0

Star History Chart

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

videocaptioner-1.4.1.tar.gz (14.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

videocaptioner-1.4.1-py3-none-any.whl (12.1 MB view details)

Uploaded Python 3

File details

Details for the file videocaptioner-1.4.1.tar.gz.

File metadata

  • Download URL: videocaptioner-1.4.1.tar.gz
  • Upload date:
  • Size: 14.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for videocaptioner-1.4.1.tar.gz
Algorithm Hash digest
SHA256 688c8747b5df756980f7a71beccabef2526a7f777b5992edb3a0783816161f85
MD5 915f5a930b8dc4b43e2bec30a3a81b59
BLAKE2b-256 036b2ada6e560f4d790a884f8123f0507fa4c0e7de3911b7170a905c8877b3cb

See more details on using hashes here.

Provenance

The following attestation bundles were made for videocaptioner-1.4.1.tar.gz:

Publisher: publish-pypi.yml on WEIFENG2333/VideoCaptioner

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file videocaptioner-1.4.1-py3-none-any.whl.

File metadata

  • Download URL: videocaptioner-1.4.1-py3-none-any.whl
  • Upload date:
  • Size: 12.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for videocaptioner-1.4.1-py3-none-any.whl
Algorithm Hash digest
SHA256 26eb14256c30116716212fbd8d7b22e0e216be3c63a840557288e7361cccfd23
MD5 f7540d89a0d9619d23d23c9c0f66e3d6
BLAKE2b-256 6ba7aeb4b4559435702718cc72c0a891f4df09d4f928e6e3fb2bf4de7503e5d8

See more details on using hashes here.

Provenance

The following attestation bundles were made for videocaptioner-1.4.1-py3-none-any.whl:

Publisher: publish-pypi.yml on WEIFENG2333/VideoCaptioner

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page