A tool for translating subtitles using Google Gemini AI
Project description
🌟 Gemini SRT Translator
Translate SRT subtitle files using the power of Google Gemini AI! 🚀
✨ Overview
Gemini SRT Translator is a powerful python tool to translate subtitle files using the power of Google Gemini AI. Perfect for anyone needing fast, accurate, and customizable translations for videos, movies, and series.
- 🔤 SRT Translation: Translate
.srtsubtitle files to a wide range of languages supported by Google Gemini AI. - 🎙️ Transcription: Transcribe audio or video files directly into SRT subtitles using Gemini's audio capabilities.
- ⏱️ Timing & Format: Ensures that the translated subtitles maintain the exact timestamps and basic SRT formatting of the original file.
- 💾 Quick Resume: Easily resume interrupted translations from where you left off.
- 🧠 Advanced AI: Leverages thinking and reasoning capabilities for more contextually accurate translations (available on Gemini 2.5 and 3 models).
- 🖥️ CLI Support: Full command-line interface for easy automation and scripting.
- ⚙️ Customizable: Tune model parameters, adjust batch size, and access other advanced settings.
- 🎞️ SRT Extraction: Extract and translate SRT subtitles from video files automatically (requires FFmpeg).
- 🎵 Audio Context: Extract audio from a video file or provide your own to improve translation accuracy (requires FFmpeg).
- 📜 Description Support: Add a description to your translation job to guide the AI in using specific terminology or context.
- 📋 List Models: Easily list all currently available Gemini models to choose the best fit for your needs.
- 🔄 Auto-Update: Keep the tool updated with automatic version checking and update prompts.
- 📝 Logging: Optional saving of progress and 'thinking' process logs for review.
📦 Installation
Basic:
pip install --upgrade gemini-srt-translator
Recommended: Use a Virtual Environment
It's best practice to use a virtual environment. This is especially recommended as gemini-srt-translator installs several dependencies that could potentially conflict with your existing packages:
# Create a virtual environment
python -m venv venv
# Activate the virtual environment
# On Windows:
venv\Scripts\activate
# On macOS/Linux:
source venv/bin/activate
# Install inside the virtual environment
pip install --upgrade gemini-srt-translator
🔑 How to Get Your API Key
- Go to Google AI Studio.
- Sign in with your Google account.
- Click on Generate API Key.
- Copy and keep your key safe.
🔐 Setting Your API Key
You can provide your API key in several ways:
- Environment Variable (Recommended): Set the
GEMINI_API_KEYenvironment variable. This is the most secure and recommended method.
-
macOS/Linux:
export GEMINI_API_KEY="your_api_key_here" export GEMINI_API_KEY2="your_second_api_key_here"
-
Windows (Command Prompt):
set GEMINI_API_KEY=your_api_key_here set GEMINI_API_KEY2=your_second_api_key_here
-
Windows (PowerShell):
$env:GEMINI_API_KEY="your_api_key_here" $env:GEMINI_API_KEY2="your_second_api_key_here"
- Command Line Argument: Use the
-kor--api-keyflag
gst translate -i subtitle.srt -l French -k YOUR_API_KEY
- Interactive Prompt: The tool will prompt you if no key is found
gst translate -i subtitle.srt -l French
- Python API: Set the
gemini_api_keyvariable in your script
import gemini_srt_translator as gst
gst.gemini_api_key = "your_api_key_here"
🚀 Quick Start
🖥️ Using the Command Line Interface (CLI)
Basic Translation
# Using environment variable (recommended)
export GEMINI_API_KEY="your_api_key_here"
gst translate -i subtitle.srt -l French
# Using command line argument
gst translate -i subtitle.srt -l French -k YOUR_API_KEY
# Set output file name
gst translate -i subtitle.srt -l French -o translated_subtitle.srt
# Extract subtitles from video and translate (requires FFmpeg)
gst translate -v movie.mp4 -l Spanish
# Extract and use audio from video for context (requires FFmpeg)
gst translate -v movie.mp4 -l Spanish --extract-audio
# Interactive model selection
gst translate -i subtitle.srt -l "Brazilian Portuguese" --interactive
# Resume translation from a specific line
gst translate -i subtitle.srt -l French --start-line 20
# Suppress output
gst translate -i subtitle.srt -l French --quiet
Advanced Options
# Full-featured translation with custom settings
gst translate \
-i input.srt \
-v video.mp4 \
-l French \
-k YOUR_API_KEY \
-k2 YOUR_SECOND_API_KEY \
-o output_french.srt \
--model gemini-3.5-flash \
--batch-size 150 \
--temperature 0.7 \
--description "Medical TV series, use medical terminology" \
--progress-log \
--thoughts-log \
--extract-audio \
--token-stats \
--no-context
Transcribing Audio/Video
# Transcribe a video file to SRT
gst transcribe -v video.mp4 -o transcription.srt
# Transcribe an audio file
gst transcribe -a audio.mp3 -o transcription.srt
# Transcribe with custom settings
gst transcribe \
-v video.mp4 \
-o transcription.srt \
--model gemini-3.5-flash \
--description "Meeting recording about project X" \
--thinking-level high \
--token-stats
Extracting Audio/Subtitles
# Extract SRT from video
gst extract -v video.mp4 --srt
# Extract Audio from video
gst extract -v video.mp4 --audio
# Extract both with voice isolation (default)
gst extract -v video.mp4 --srt --audio
# Extract audio without voice isolation
gst extract -v video.mp4 --audio --no-voice-isolation
CLI Help
# Show all available commands and options
gst --help
# Show specific command help
gst translate --help
gst transcribe --help
gst extract --help
🐍 Using Python API
Translating an SRT file
import gemini_srt_translator as gst
gst.gemini_api_key = "your_api_key_here"
gst.target_language = "French"
gst.input_file = "subtitle.srt"
gst.translate()
Resuming an Interrupted Translation
Just run again with the same parameters, or specify the start line:
import gemini_srt_translator as gst
gst.gemini_api_key = "your_api_key_here"
gst.target_language = "French"
gst.input_file = "subtitle.srt"
gst.start_line = 20
gst.translate()
Transcribing Audio/Video
import gemini_srt_translator as gst
gst.gemini_api_key = "your_api_key"
gst.video_file = "video.mp4" # Or gst.audio_file = "audio.mp3"
gst.output_file = "transcription.srt"
gst.model_name = "gemini-3.5-flash"
gst.token_stats = True
gst.transcribe()
Extracting from Video
import gemini_srt_translator as gst
gst.video_file = "video.mp4"
# Extract SRT
gst.extract("srt")
# Extract Audio (with voice isolation by default)
gst.extract("audio")
# Extract Audio (without voice isolation)
gst.isolate_voice = False
gst.extract("audio")
⚙️ Advanced Configuration
🔧 GST Parameters
gemini_api_key2: Second key for more quota (useful for free Pro models).video_file: Path to a video file to extract subtitles and/or audio for context (requires FFmpeg).audio_file: Path to an audio file to use as context for translation (requires FFmpeg).extract_audio: Whether to extract and use audio context from the video file (default: False).isolate_voice: Whether to isolate voice from audio (default: True).audio_chunk_size: Audio chunk size in seconds for processing (default: 600).output_file: Name of the translated file.start_line: Starting line for translation.description: Description of the translation job.batch_size: Batch size (default: 1000).free_quota: Signal GST that you are using a free quota (default: True).skip_upgrade: Skip version upgrade check (default: False).use_colors: Activate colors in terminal (default: True).progress_log: Enable progress logging to a file (default: False).thoughts_log: Enable logging of the 'thinking' process to a file (default: False).quiet: Suppress all output (default: False).resume: Skip prompt and set automatic resume mode.token_stats: Show token usage information (default: False).preserve_context: Preserve context between batches (default: True).
🔬 Model Tuning Parameters
model_name: Gemini model (default: "gemini-3.5-flash").temperature: Controls randomness in output. Lower for more deterministic, higher for more creative (range: 0.0-2.0).top_p: Nucleus sampling parameter (range: 0.0-1.0).top_k: Top-k sampling parameter (range: >=0).streaming: Enable streamed responses (default: True).- Set to
Falsefor bad internet connections or when using slower models.
- Set to
thinking: Enable thinking capability for potentially more accurate translations (default: True).- Only available for Gemini 2.5 and 3 models.
thinking_budget: Token budget for the thinking process (range: 0-32768, 0 also disables thinking).- Only available for Gemini 2.5 models.
thinking_level: Controls the depth of thinking process (options: minimal, low, medium, high).- Only available for Gemini 3 models.
💡 Full example:
import gemini_srt_translator as gst
gst.gemini_api_key = "your_api_key_here"
gst.gemini_api_key2 = "your_api_key2_here"
gst.target_language = "French"
gst.input_file = "subtitle.srt"
gst.output_file = "subtitle_translated.srt"
gst.video_file = "video.mp4"
gst.audio_file = "audio.mp3"
gst.extract_audio = False
gst.start_line = 20
gst.description = "Medical TV series, use medical terms"
gst.model_name = "gemini-3.5-flash"
gst.batch_size = 150
gst.streaming = True
gst.thinking = True
gst.thinking_budget = 4096
gst.thinking_level = "high"
gst.temperature = 0.7
gst.top_p = 0.95
gst.top_k = 20
gst.free_quota = False
gst.skip_upgrade = True
gst.use_colors = False
gst.progress_log = True
gst.thoughts_log = True
gst.quiet = False
gst.resume = True
gst.token_stats = True
gst.preserve_context = True
gst.translate()
📚 Listing Available Models
CLI
gst list-models -k YOUR_API_KEY
Python API
import gemini_srt_translator as gst
gst.gemini_api_key = "your_api_key_here"
gst.listmodels()
🎨 Unofficial GUI Applications
If you prefer a user-friendly graphical interface over command-line usage, be sure to check out:
- 🔗 Gemini SRT Translator GUI (by @mkaflowski)
- 🔗 Gemini SRT Translator GUI (by @dane-9)
Perfect for users who want the same powerful translation capabilities with an intuitive visual interface!
📝 License
Distributed under the MIT License. See the LICENSE file for details.
👥 Contributors
Thank you to all who have contributed to this project:
- MaKTaiL - Creator and maintainer
- CevreMuhendisi
- angelitto2005
- sjiampojamarn
- mkaflowski
Special thanks to all users who have reported issues, suggested features, and helped improve the project.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file gemini_srt_translator-3.5.1.tar.gz.
File metadata
- Download URL: gemini_srt_translator-3.5.1.tar.gz
- Upload date:
- Size: 42.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.11.14 {"installer":{"name":"uv","version":"0.11.14","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":null,"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7439fbecf6963cb4fe7aa81447b3966f0ff7a7e485a031e8a942122e6f816dfe
|
|
| MD5 |
9e06547c0e636b10681b63d3d35fc07c
|
|
| BLAKE2b-256 |
eb07ac91dc3c1cbc0dff703ba281aaef09e8eca2378467831539abcdb8c79351
|
File details
Details for the file gemini_srt_translator-3.5.1-py3-none-any.whl.
File metadata
- Download URL: gemini_srt_translator-3.5.1-py3-none-any.whl
- Upload date:
- Size: 38.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.11.14 {"installer":{"name":"uv","version":"0.11.14","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":null,"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9e0de94ac16a9387da9865498f7c064412b90423a0eb18889949d34567d98a63
|
|
| MD5 |
99380d8d4aae791f595c0ecd5f4fb610
|
|
| BLAKE2b-256 |
5a869b0ddbf639f02c73e4acc2d71ef03e68b224238fe60f44d27d26c0786b29
|