视频英文字幕翻译为中文字幕工具
Project description
🎬 Video Subtitle Translation Tool
Automatically transcribe video audio, translate to target language, and generate subtitle files or embed them into videos. Supports translation between 18 languages.
中文文档 | 日本語ドキュメント | 한국어 문서 | Français | Deutsch
✨ Features
- 🎤 Speech Recognition: High-precision speech recognition using OpenAI Whisper
- 🌐 Multi-language Translation: Supports translation between 18 languages (Chinese, English, Japanese, Korean, French, German, Spanish, etc.)
- 🤖 Multiple Engine Support: Supports DeepSeek, OpenAI, and other translation engines
- 📄 Subtitle Generation: Supports multiple subtitle formats including SRT, VTT, ASS
- 🎥 Subtitle Embedding: Supports both soft and hard subtitle methods
- 🌍 Bilingual Subtitles: Optional bilingual subtitle generation
- 📝 Video Summary: LLM-powered video content summarization with key points, topics, and timeline
- ⚡ Hardware Acceleration: Auto-detect hardware encoding (VideoToolbox/NVENC/QSV/AMF) for faster hard subtitle rendering
- 🖥️ GUI Integration: JSON progress output for seamless integration with graphical interfaces
- 💰 Cost-Effective: DeepSeek API offers affordable pricing with excellent translation quality
- 🏗️ Modular Design: Easy to extend and maintain
🌍 Supported Languages
| Code | Language | Code | Language |
|---|---|---|---|
zh |
Chinese (中文) | en |
English |
ja |
Japanese (日本語) | ko |
Korean (한국어) |
fr |
French (Français) | de |
German (Deutsch) |
es |
Spanish (Español) | ru |
Russian (Русский) |
pt |
Portuguese (Português) | it |
Italian (Italiano) |
nl |
Dutch (Nederlands) | pl |
Polish (Polski) |
tr |
Turkish (Türkçe) | ar |
Arabic (العربية) |
hi |
Hindi (हिन्दी) | th |
Thai (ไทย) |
vi |
Vietnamese (Tiếng Việt) | id |
Indonesian (Bahasa Indonesia) |
Use video-translate --list-languages to view the complete list.
📁 Project Structure
video-translate/
├── src/
│ └── video_translate/
│ ├── __init__.py # Package initialization
│ ├── __main__.py # Entry point
│ ├── cli.py # Command-line interface
│ ├── config.py # Configuration management
│ ├── models.py # Data models
│ ├── transcriber.py # Speech recognition module
│ ├── translator.py # Translation module
│ ├── summarizer.py # Video content summarization module
│ ├── subtitle.py # Subtitle processing module
│ ├── video.py # Video processing module
│ ├── pipeline.py # Processing pipeline
│ └── utils.py # Utility functions
├── pyproject.toml # Project configuration
├── requirements.txt # Dependencies
├── LICENSE # MIT License
├── .gitignore # Git ignore file
└── README.md
📦 Installation
Prerequisites
FFmpeg is required for video processing. Please install it first:
macOS:
# Basic installation (sufficient for soft subtitles)
brew install ffmpeg
# For hard subtitles (--hard-sub), you need FFmpeg with libass support:
brew install ffmpeg-full
echo 'export PATH="/opt/homebrew/opt/ffmpeg-full/bin:$PATH"' >> ~/.zshrc
source ~/.zshrc
Note: The standard
brew install ffmpegdoes not include libass support, which is required for the--hard-subfeature. If you encounter errors like "No option name near force_style", please installffmpeg-fullinstead.
Ubuntu/Debian:
sudo apt update && sudo apt install ffmpeg
The apt package typically includes libass support. If you encounter "No option name near force_style" errors with
--hard-sub, install libass:sudo apt install libass-devand reinstall ffmpeg.
Windows: Download and install FFmpeg (recommended: gyan.dev full build or BtbN builds, which include libass support)
Quick Installation (Recommended)
pip install video-translate
Or use uv (faster):
uv pip install video-translate
Development Installation
If you want to contribute to development or modify the code:
# 1. Clone the project
git clone https://github.com/innovationmech/video-translate.git
cd video-translate
# 2. Install uv (if not already installed)
# macOS/Linux
curl -LsSf https://astral.sh/uv/install.sh | sh
# Windows
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"
# 3. Install dependencies (including dev tools)
uv sync --dev
# Or install with pip in editable mode
pip install -e ".[dev]"
Set up API Key
Register and get an API Key from DeepSeek Open Platform:
export DEEPSEEK_API_KEY='your-api-key-here'
Or use OpenAI:
export OPENAI_API_KEY='your-api-key-here'
🚀 Usage
Command Line Usage
# Basic usage (English → Chinese)
video-translate video.mp4
# Or use python -m
python -m video_translate video.mp4
Multi-language Translation Examples
# English → Chinese (default)
video-translate video.mp4
# Japanese → Chinese
video-translate video.mp4 --source ja --target zh
# English → Japanese
video-translate video.mp4 --source en --target ja
# Chinese → English
video-translate video.mp4 --source zh --target en
# Korean → Japanese
video-translate video.mp4 --source ko --target ja
# French → German
video-translate video.mp4 --source fr --target de
Command Line Options
Basic Options:
| Option | Description |
|---|---|
-s, --source |
Source language code (default: en) |
-t, --target |
Target language code (default: zh) |
--list-languages |
List all supported languages |
-o, --output |
Specify output directory |
-m, --model |
Whisper model size (tiny/base/small/medium/large) |
-v, --version |
Show version |
--verbose |
Show detailed logs |
Translation Options:
| Option | Description |
|---|---|
--translator |
Translation engine (deepseek/openai) |
--api-key |
Translation API Key |
--api-base |
API Base URL (optional, for custom endpoints) |
--llm-model |
LLM model name (optional, override default model) |
Subtitle Options:
| Option | Description |
|---|---|
--target-only |
Output only target language subtitles, without source text |
--source-first |
Source language on top, target language below |
Video Options:
| Option | Description |
|---|---|
--no-embed |
Don't embed subtitles into video, only generate subtitle files |
--hard-sub |
Use hard subtitles (burned into video) |
--font-size |
Hard subtitle font size (default: 24) |
--hw-accel |
Hardware acceleration for hard subtitle encoding (auto/none/videotoolbox/nvenc/qsv/amf, default: auto) |
--video-quality |
Hard subtitle video quality, CRF value (0-51, lower is better, default: 23) |
Summary Options:
| Option | Description |
|---|---|
--no-summary |
Disable video content summary |
--summary-lang |
Summary language code (default: follows target language) |
--max-key-points |
Max number of key points in summary (default: 5) |
--no-timeline |
Exclude timeline from summary |
Advanced Options:
| Option | Description |
|---|---|
--json-progress |
Output JSON-formatted progress (for GUI integration) |
More Examples
# Use a larger model for better accuracy
video-translate video.mp4 --model large
# Only generate subtitle files, don't embed into video
video-translate video.mp4 --no-embed
# Generate hard subtitles (burned into video)
video-translate video.mp4 --hard-sub
# Hard subtitles with NVIDIA hardware acceleration and high quality
video-translate video.mp4 --hard-sub --hw-accel nvenc --video-quality 18
# Output only target language subtitles
video-translate video.mp4 --target-only
# Use OpenAI translation
video-translate video.mp4 --translator openai
# Use a custom API endpoint and model
video-translate video.mp4 --api-base https://your-api.com/v1 --llm-model your-model
# Disable video content summary
video-translate video.mp4 --no-summary
# Generate summary in English with up to 10 key points
video-translate video.mp4 --summary-lang en --max-key-points 10
# Specify output directory
video-translate video.mp4 -o ./output
# JSON progress output for GUI integration
video-translate video.mp4 --json-progress
Use as a Library
from video_translate import (
Config,
TranscriberConfig,
TranslatorConfig,
TranslationPipeline,
WhisperModel,
TranslatorType,
Language,
)
from video_translate.config import SummaryConfig, VideoConfig, HardwareAccel
# Create configuration - Japanese to Chinese translation
config = Config(
transcriber=TranscriberConfig(
model=WhisperModel.BASE,
language="ja" # Source language
),
translator=TranslatorConfig(
type=TranslatorType.DEEPSEEK,
api_key="your-api-key",
source_language=Language.JAPANESE,
target_language=Language.CHINESE,
),
video=VideoConfig(
embed_subtitle=True,
soft_subtitle=False, # Use hard subtitles
hardware_accel=HardwareAccel.AUTO,
),
summary=SummaryConfig(
enabled=True,
max_key_points=5,
include_timeline=True,
),
)
# Create processing pipeline
pipeline = TranslationPipeline(config)
# Process video
result = pipeline.process("video.mp4")
print(f"Subtitle file: {result['subtitle_file']}")
print(f"Output video: {result['output_video']}")
print(f"Summary file: {result['summary_file']}")
# Access summary data
if result['summary']:
summary = result['summary']
print(f"Title: {summary.title}")
print(f"Overview: {summary.overview}")
for point in summary.key_points:
print(f" - {point}")
🤖 Whisper Model Selection
| Model | Size | Memory | Speed | Accuracy |
|---|---|---|---|---|
| tiny | 39M | ~1GB | Fastest | Lower |
| base | 74M | ~1GB | Fast | Medium |
| small | 244M | ~2GB | Medium | Good |
| medium | 769M | ~5GB | Slow | High |
| large | 1550M | ~10GB | Slowest | Highest |
Recommendations:
- Quick preview: Use
tinyorbase - Production use: Use
smallormedium - Highest quality: Use
large
🔌 Extending Translation Engines
The project uses a modular design, making it easy to add new translation engines:
from video_translate.translator import BaseTranslator
class MyTranslator(BaseTranslator):
@property
def name(self) -> str:
return "MyTranslator"
def translate_text(self, text: str, context: str = "") -> str:
# Implement translation logic
pass
def translate_batch(self, texts: list[str]) -> list[str]:
# Implement batch translation logic
pass
📁 Output Files
videoname_{language_code}.srt- Subtitle file (e.g.,video_zh.srt,video_ja.srt)videoname_{language_code}_summary.json- Video content summary in JSON format (title, overview, key points, topics, timeline)videoname_{language_code}.mp4- Video with embedded subtitles (if embedding is selected)
⚠️ Notes
- First run will automatically download the Whisper model, please ensure a stable internet connection
- Hard subtitles will re-encode the video, which takes longer; use
--hw-accelto enable hardware acceleration for faster encoding - Soft subtitles only copy streams, faster but may not be supported by some players
- Ensure FFmpeg is installed on your system
- Apple Silicon Macs will automatically use MPS acceleration for Whisper and VideoToolbox for video encoding
- Video summary is enabled by default and uses the same LLM API as translation; use
--no-summaryto disable
🛠️ Development
# Install development dependencies
uv sync --dev
# Run tests
uv run pytest
# Code formatting
uv run black src/
# Code linting
uv run ruff check src/
# Type checking
uv run mypy src/
📄 License
This project is open-sourced under the MIT License.
Copyright (c) 2026 innovationmech
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file video_translate-1.1.1.tar.gz.
File metadata
- Download URL: video_translate-1.1.1.tar.gz
- Upload date:
- Size: 27.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.10.10 {"installer":{"name":"uv","version":"0.10.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e69951b2006bae00c813e5a9b84513363612b636e508f6d7d4a4b74aaf564af1
|
|
| MD5 |
cf655bbf4bacedec4880a64dfd186ad8
|
|
| BLAKE2b-256 |
0502164a1b5b6f1a2f49a31a0486ba65ea82288756fbe25b038bda2fbeb40d50
|
File details
Details for the file video_translate-1.1.1-py3-none-any.whl.
File metadata
- Download URL: video_translate-1.1.1-py3-none-any.whl
- Upload date:
- Size: 34.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.10.10 {"installer":{"name":"uv","version":"0.10.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
56291b5384037c18150c7cdf1548839a6aa504bf9fa338639ae99c58eb5482f0
|
|
| MD5 |
9e7388a0cee3cdbf0d65682905ccf36b
|
|
| BLAKE2b-256 |
2ae1696c98a3491ac76380c006a8f0f47739329b2b5bb9e7c6f1ec7c557b922f
|