Command line tool to transcribe & translate audio from livestreams in real time

Project description

stream-translator-gpt

stream-translator-gpt is a command-line tool for real-time transcription and translation of live streams. We have now added an easier-to-use WebUI entry point.

Try it on Colab:

WebUI	Command Line

(Due to frequent scraping and theft of API keys, we are unable to provide a trial API key. You need to fill in your own API key.)

Prerequisites

Python >= 3.10
FFmpeg (skip if already installed):
- Windows: winget install ffmpeg
- Linux (Debian/Ubuntu): sudo apt install ffmpeg
CUDA, installed on your system.
cuDNN, installed into your CUDA directory, if you want to use Faster-Whisper.
PyTorch (with CUDA), installed into your Python environment.
A Google API key, if you want to use the Gemini API for translation.
An OpenAI API key, if you want to use the OpenAI Transcription API for transcription or the GPT API for translation.
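
Before a first run it can help to confirm that the local prerequisites are visible to Python. The sketch below is not part of stream-translator-gpt; the function name check_prerequisites is hypothetical, and it only checks the Python version, whether ffmpeg is on PATH, and whether PyTorch reports a usable CUDA device. It does not verify cuDNN or any API key.

```python
import shutil
import sys


def check_prerequisites(min_python=(3, 10)):
    """Return a dict of prerequisite -> True/False (None means "not checked")."""
    report = {
        # The README asks for Python >= 3.10.
        "python": sys.version_info >= min_python,
        # FFmpeg must be discoverable on PATH.
        "ffmpeg": shutil.which("ffmpeg") is not None,
        "torch": False,
        "cuda": None,
    }
    try:
        import torch  # imported lazily so the check still runs without PyTorch
    except ImportError:
        return report
    report["torch"] = True
    # True only if this PyTorch build can see a CUDA device.
    report["cuda"] = torch.cuda.is_available()
    return report


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{name}: {ok}")
```

A value of False for cuda usually means either a CPU-only PyTorch build or a missing driver, so it is worth checking before choosing a local transcription backend.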

Installation

WebUI

pip install stream-translator-gpt[webui] -U

Command Line

pip install stream-translator-gpt -U

Usage

The commands on Colab are the recommended way to run the tool. Below are some other commonly used options.

Transcribe a live stream (Whisper is used by default):

stream-translator-gpt {URL} --language {input_language}

Transcribe with Faster-Whisper:

stream-translator-gpt {URL} --language {input_language} --use_faster_whisper

Transcribe with SimulStreaming:

stream-translator-gpt {URL} --language {input_language} --use_simul_streaming

Transcribe with SimulStreaming, using Faster-Whisper as the encoder:

stream-translator-gpt {URL} --language {input_language} --use_simul_streaming --use_faster_whisper

Transcribe with the OpenAI Transcription API:

stream-translator-gpt {URL} --language {input_language} --use_openai_transcription_api --openai_api_key {your_openai_key}

Transcribe with a Hugging Face ASR model (requires pip install stream-translator-gpt[hf_asr]):

stream-translator-gpt {URL} --model {hf_model_name} --use_hf_asr

Only models with pipeline_tag: automatic-speech-recognition on the Hugging Face Hub are supported.
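
To check a model's pipeline_tag before passing it to --model, the public Hub metadata can be inspected. The following is a minimal sketch, assuming the Hub's model endpoint at https://huggingface.co/api/models/{model_name} returns JSON containing a pipeline_tag field; the function names are hypothetical and the example model id in the comment is not taken from this README.

```python
import json
import urllib.request

ASR_TAG = "automatic-speech-recognition"


def is_asr_model(metadata):
    """True if the Hub metadata dict declares the ASR pipeline tag."""
    return metadata.get("pipeline_tag") == ASR_TAG


def fetch_model_metadata(model_name, timeout=10):
    """Fetch public model metadata from the Hugging Face Hub (needs network access)."""
    url = f"https://huggingface.co/api/models/{model_name}"
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Example (performs a network request, so it is left commented out):
# print(is_asr_model(fetch_model_metadata("openai/whisper-small")))
```

Passing this check only says the model is tagged for speech recognition; it does not guarantee that the model loads or performs well with --use_hf_asr.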
Translate into another language with Gemini:

stream-translator-gpt {URL} --language ja --translation_prompt "Translate from Japanese to Chinese" --google_api_key {your_google_key}

Translate into another language with GPT:

stream-translator-gpt {URL} --language ja --translation_prompt "Translate from Japanese to Chinese" --openai_api_key {your_openai_key}

Use the OpenAI Transcription API and Gemini together:

stream-translator-gpt {URL} --language ja --use_openai_transcription_api --openai_api_key {your_openai_key} --translation_prompt "Translate from Japanese to Chinese" --google_api_key {your_google_key}
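
Since API keys are a theft target, one option is to keep them out of scripts and shell history by reading them from environment variables when launching the tool from Python. The sketch below only assembles the flags shown above; the variable names OPENAI_API_KEY and GOOGLE_API_KEY and both function names are assumptions made for this example, and whether the tool reads any environment variable by itself is not stated here.

```python
import os
import shlex


def build_command(url, language, translation_prompt, env=os.environ):
    """Build the argument list for OpenAI transcription plus Gemini translation."""
    # Assumed variable names; export them in your shell before running.
    openai_key = env["OPENAI_API_KEY"]
    google_key = env["GOOGLE_API_KEY"]
    return [
        "stream-translator-gpt", url,
        "--language", language,
        "--use_openai_transcription_api",
        "--openai_api_key", openai_key,
        "--translation_prompt", translation_prompt,
        "--google_api_key", google_key,
    ]


def redacted(command, secret_flags=("--openai_api_key", "--google_api_key")):
    """Return a printable command string with the key values masked."""
    parts = list(command)
    for i, part in enumerate(parts[:-1]):
        if part in secret_flags:
            parts[i + 1] = "***"
    return shlex.join(parts)


# To launch the tool (requires it to be installed):
# import subprocess
# subprocess.run(build_command(url, "ja", "Translate from Japanese to Chinese"), check=True)
```

Passing a list to subprocess.run avoids shell quoting problems with the translation prompt. The keys still appear in the process argument list while the tool runs, so this protects files and logs rather than the running process.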
Use a local video/audio file as input:

stream-translator-gpt /path/to/file --language {input_language}

Record system audio as input:

stream-translator-gpt device --language {input_language}

Record the microphone as input:

stream-translator-gpt device --language {input_language} --mic

Send results to Discord:

stream-translator-gpt {URL} --language {input_language} --discord_webhook_url {your_discord_webhook_url}

Send results to Telegram:

stream-translator-gpt {URL} --language {input_language} --telegram_token {your_telegram_token} --telegram_chat_id {your_telegram_chat_id}

Send results to Cqhttp:

stream-translator-gpt {URL} --language {input_language} --cqhttp_url {your_cqhttp_url} --cqhttp_token {your_cqhttp_token}
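
Before starting a long session it may be worth confirming that the Discord webhook URL or Telegram credentials work. The sketch below builds plain HTTP requests against the public Discord webhook and Telegram Bot API sendMessage endpoints. It is independent of how stream-translator-gpt sends messages internally, the function names and the User-Agent value are hypothetical, and Cqhttp is not covered.

```python
import json
import urllib.parse
import urllib.request

DISCORD_MAX_CONTENT = 2000  # Discord's limit for the "content" field


def discord_request(webhook_url, text):
    """Build (but do not send) a POST request for a Discord webhook."""
    payload = json.dumps({"content": text[:DISCORD_MAX_CONTENT]}).encode("utf-8")
    return urllib.request.Request(
        webhook_url,
        data=payload,
        headers={
            "Content-Type": "application/json",
            # An explicit User-Agent; the default urllib one may be rejected.
            "User-Agent": "webhook-check/0.1",
        },
        method="POST",
    )


def telegram_request(token, chat_id, text):
    """Build (but do not send) a Telegram Bot API sendMessage request."""
    url = f"https://api.telegram.org/bot{token}/sendMessage"
    payload = urllib.parse.urlencode({"chat_id": chat_id, "text": text}).encode("utf-8")
    return urllib.request.Request(url, data=payload, method="POST")


# To actually send a test message (performs a network request):
# urllib.request.urlopen(discord_request(webhook_url, "test"), timeout=10)
```

If the test message arrives, the same values can be passed to --discord_webhook_url, or to --telegram_token and --telegram_chat_id.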
Save results to an .srt subtitle file:

stream-translator-gpt {URL} --language ja --translation_prompt "Translate from Japanese to Chinese" --google_api_key {your_google_key} --hide_transcribe_result --retry_if_translation_fails --output_timestamps --output_file_path ./result.srt
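
The saved subtitles can then be post-processed in Python. The parser below is a minimal sketch that assumes the file follows the standard SubRip layout (an index line, a start --> end timing line, then one or more text lines, with blank lines between cues); the exact output of the tool has not been checked against it, and the function names are hypothetical.

```python
import re
from datetime import timedelta

TIME_RE = re.compile(r"(\d+):(\d{2}):(\d{2})[,.](\d{3})")


def parse_time(stamp):
    """Convert 'HH:MM:SS,mmm' into a timedelta."""
    h, m, s, ms = (int(g) for g in TIME_RE.fullmatch(stamp.strip()).groups())
    return timedelta(hours=h, minutes=m, seconds=s, milliseconds=ms)


def parse_srt(text):
    """Parse SubRip text into a list of (start, end, text) tuples."""
    cues = []
    # Cues are separated by one or more blank lines.
    for block in re.split(r"\n\s*\n", text.strip()):
        lines = block.splitlines()
        # A cue needs an index line, a timing line, and at least one text line.
        if len(lines) < 3 or "-->" not in lines[1]:
            continue
        start, end = (parse_time(part) for part in lines[1].split("-->"))
        cues.append((start, end, "\n".join(lines[2:])))
    return cues


# Example, using the output path from the command above:
# with open("./result.srt", encoding="utf-8") as f:
#     for start, end, line in parse_srt(f.read()):
#         print(start, end, line)
```

Blocks that do not look like a cue are skipped rather than raising, which keeps the parser usable on a file that is still being written.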

Release history

This version

2026.5.28

May 28, 2026

2026.5.20

May 20, 2026

2026.5.12

May 12, 2026

2026.3.12

Mar 12, 2026

2026.3.2

Mar 2, 2026

2026.3.1

Feb 28, 2026

2026.2.26

Feb 26, 2026

2026.1.31

Jan 30, 2026

2026.1.29

Jan 28, 2026

2026.1.12

Jan 12, 2026

2025.12.31.1

Dec 30, 2025

2025.12.31

Dec 30, 2025

2025.12.30.2

Dec 30, 2025

2025.12.30.1

Dec 30, 2025

2025.12.30

Dec 30, 2025

2025.12.25

Dec 25, 2025

2025.12.24

Dec 24, 2025

2025.12.19

Dec 19, 2025

2025.12.16.1

Dec 16, 2025

2025.12.16

Dec 16, 2025

2025.12.16.dev2 pre-release

Dec 16, 2025

2025.12.16.dev1 pre-release

Dec 16, 2025

2025.12.16.dev0 pre-release

Dec 16, 2025

2025.12.8

Dec 7, 2025

2025.12.1

Nov 30, 2025

2025.11.9

Nov 9, 2025

2025.10.30

Oct 29, 2025

2025.10.29.2

Oct 29, 2025

2025.8.10

Aug 9, 2025

2025.7.28

Jul 27, 2025

2025.5.14

May 13, 2025

2025.5.13

May 12, 2025

2025.2.8

Feb 7, 2025

2025.1.13

Jan 13, 2025

2024.12.24

Dec 24, 2024

2024.12.17

Dec 17, 2024

2024.12.11

Dec 11, 2024

2024.12.4

Dec 4, 2024

2024.12.4.dev1 pre-release

Dec 4, 2024

2024.12.4.dev0 pre-release

Dec 4, 2024

2024.11.11

Nov 11, 2024

2024.10.11

Oct 11, 2024

2024.9.20

Sep 20, 2024

2024.8.20

Aug 20, 2024

2024.8.19

Aug 18, 2024

2024.8.17

Aug 17, 2024

2024.7.19

Jul 19, 2024

2024.5.28

May 27, 2024

2024.5.25

May 24, 2024

2024.5.4

May 3, 2024

2024.4.24

Apr 24, 2024

2024.3.26

Mar 26, 2024

2024.3.25

Mar 24, 2024

2024.3.22

Mar 22, 2024

2024.3.9

Mar 9, 2024

2024.3.9.dev2 pre-release

Mar 9, 2024

2024.3.9.dev1 pre-release

Mar 9, 2024

2024.3.9.dev0 pre-release

Mar 9, 2024

2024.3.6

Mar 5, 2024

2024.3.3

Mar 3, 2024

2024.3.3.dev3 pre-release

Mar 3, 2024

2024.3.3.dev2 pre-release

Mar 3, 2024

2024.3.3.dev1 pre-release

Mar 3, 2024

2024.3.3.dev0 pre-release

Mar 3, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

stream_translator_gpt-2026.5.28.tar.gz (2.8 MB)

Uploaded May 28, 2026 Source

Built Distribution

stream_translator_gpt-2026.5.28-py3-none-any.whl (2.8 MB)

Uploaded May 28, 2026 Python 3

File details

Details for the file stream_translator_gpt-2026.5.28.tar.gz.

File metadata

Download URL: stream_translator_gpt-2026.5.28.tar.gz
Upload date: May 28, 2026
Size: 2.8 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.20

File hashes

Hashes for stream_translator_gpt-2026.5.28.tar.gz
Algorithm	Hash digest
SHA256	`1bfb695636935e11dd7a74951cb6c856476f3f8949b60947d7c474f4950f5203`
MD5	`671ab6cb0377733a21ee032303559771`
BLAKE2b-256	`127b7edb48bd0e9b739ec99cd9dd4aeedd8830ba070a712a4fabfa01b8725c40`

File details

Details for the file stream_translator_gpt-2026.5.28-py3-none-any.whl.

File metadata

Download URL: stream_translator_gpt-2026.5.28-py3-none-any.whl
Upload date: May 28, 2026
Size: 2.8 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.20

File hashes

Hashes for stream_translator_gpt-2026.5.28-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1cffcf85ce7b558d064ce9a378b13ef2ed2dbd81ba997649cf4f7f33515c4eb3`
MD5	`d4ac991620af680e8ccedd9fce463ae7`
BLAKE2b-256	`f041cdfd59790251896ad481c83acf1c554301bd437a28253d86ebe52a546858`
