Faster Whisper transcription with CTranslate2

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

Faster Whisper 2 - transcription with CTranslate2

faster-whisper2 is a reimplementation of OpenAI's Whisper model using CTranslate2, which is a fast inference engine for Transformer models.

This implementation is up to 4 times faster than openai/whisper for the same accuracy while using less memory. The efficiency can be further improved with 8-bit quantization on both CPU and GPU.

Requirements

Python 3.9 or greater

Unlike openai-whisper, FFmpeg does not need to be installed on the system. The audio is decoded with the Python library PyAV which bundles the FFmpeg libraries in its package.

GPU

GPU execution requires the following NVIDIA libraries to be installed:

Note: The latest versions of ctranslate2 only support CUDA 12 and cuDNN 9. For CUDA 11 and cuDNN 8, the current workaround is downgrading to the 3.24.0 version of ctranslate2, for CUDA 12 and cuDNN 8, downgrade to the 4.4.0 version of ctranslate2, (This can be done with pip install --force-reinstall ctranslate2==4.4.0 or specifying the version in a requirements.txt).

There are multiple ways to install the NVIDIA libraries mentioned above. The recommended way is described in the official NVIDIA documentation, but we also suggest other installation methods below.

Other installation methods (click to expand)

Note: For all these methods below, keep in mind the above note regarding CUDA versions. Depending on your setup, you may need to install the CUDA 11 versions of libraries that correspond to the CUDA 12 libraries listed in the instructions below.

Use Docker

The libraries (cuBLAS, cuDNN) are installed in this official NVIDIA CUDA Docker images: nvidia/cuda:12.3.2-cudnn9-runtime-ubuntu22.04.

Install with `pip` (Linux only)

On Linux these libraries can be installed with pip. Note that LD_LIBRARY_PATH must be set before launching Python.

pip install nvidia-cublas-cu12 nvidia-cudnn-cu12==9.*

export LD_LIBRARY_PATH=`python3 -c 'import os; import nvidia.cublas.lib; import nvidia.cudnn.lib; print(os.path.dirname(nvidia.cublas.lib.__file__) + ":" + os.path.dirname(nvidia.cudnn.lib.__file__))'`

Installation

The module can be installed from PyPI:

pip install faster-whisper2

Other installation methods (click to expand)

Install the master branch

pip install --force-reinstall "faster-whisper2 @ https://github.com/BBC-Esq/faster-whisper2/archive/refs/heads/master.tar.gz"

Install a specific commit

pip install --force-reinstall "faster-whisper2 @ https://github.com/BBC-Esq/faster-whisper2/archive/a4f1cc8f11433e454c3934442b5e1a4ed5e865c3.tar.gz"

Community integrations (click to expand)

Here is a non exhaustive list of open-source projects using faster-whisper. Feel free to add your project to the list!

speaches is an OpenAI compatible server using faster-whisper. It's easily deployable with Docker, works with OpenAI SDKs/CLI, supports streaming, and live transcription.
WhisperX is an award-winning Python library that offers speaker diarization and accurate word-level timestamps using wav2vec2 alignment
whisper-ctranslate2 is a command line client based on faster-whisper and compatible with the original client from openai/whisper.
whisper-diarize is a speaker diarization tool that is based on faster-whisper and NVIDIA NeMo.
whisper-standalone-win Standalone CLI executables of faster-whisper for Windows, Linux & macOS.
asr-sd-pipeline provides a scalable, modular, end to end multi-speaker speech to text solution implemented using AzureML pipelines.
Open-Lyrics is a Python library that transcribes voice files using faster-whisper, and translates/polishes the resulting text into .lrc files in the desired language using OpenAI-GPT.
wscribe is a flexible transcript generation tool supporting faster-whisper, it can export word level transcript and the exported transcript then can be edited with wscribe-editor
aTrain is a graphical user interface implementation of faster-whisper developed at the BANDAS-Center at the University of Graz for transcription and diarization in Windows (Windows Store App) and Linux.
Whisper-Streaming implements real-time mode for offline Whisper-like speech-to-text models with faster-whisper as the most recommended back-end. It implements a streaming policy with self-adaptive latency based on the actual source complexity, and demonstrates the state of the art.
WhisperLive is a nearly-live implementation of OpenAI's Whisper which uses faster-whisper as the backend to transcribe audio in real-time.
Faster-Whisper-Transcriber is a simple but reliable voice transcriber that provides a user-friendly interface.
Open-dubbing is open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into different languages.
Whisper-FastAPI whisper-fastapi is a very simple script that provides an API backend compatible with OpenAI, HomeAssistant, and Konele (Android voice typing) formats.

Compute type compatibility reference (click to expand)

When you specify a compute_type, faster-whisper2 validates that it is compatible with your device before downloading anything. If an incompatible combination is detected, a clear error is raised. For quantized types (e.g. int8), the library automatically downloads the best source precision for your hardware and CTranslate2 handles the runtime conversion.

Supported compute types by device

Compute Type	CPU	CUDA >= 6.1	CUDA >= 7.0	CUDA >= 8.0
`float32`	Yes	Yes	Yes	Yes
`float16`	No	No	Yes	Yes
`bfloat16`	No	No	No	Yes
`int8`	Yes	Yes	Yes	Yes
`int8_float16`	No	No	Yes	Yes
`int8_float32`	Yes	Yes	Yes	Yes
`int8_bfloat16`	No	No	No	Yes
`int16`	Yes (Intel MKL only)	No	No	No
`auto`	Yes	Yes	Yes	Yes
`default`	Yes	Yes	Yes	Yes

Which model precision is downloaded

Device	Requested Compute Type	Downloaded Precision
CPU	Any	float32
CUDA (any)	`float32`	float32
CUDA (any)	`float16`	float16
CUDA (any)	`bfloat16`	bfloat16
CUDA >= 8.0 (Ampere+)	`int8`, `int8_float16`, `int8_float32`, `int8_bfloat16`, `auto`, `default`	bfloat16
CUDA < 8.0 (pre-Ampere)	`int8`, `int8_float16`, `int8_float32`, `int8_bfloat16`, `auto`, `default`	float16

CUDA compute capability by GPU generation

Generation	Compute Capability	Example GPUs
Maxwell	5.x	GTX 950, GTX 970, GTX 980
Pascal	6.x	GTX 1060, GTX 1070, GTX 1080
Turing	7.x	RTX 2060, RTX 2070, RTX 2080
Ampere	8.x	RTX 3060, RTX 3070, RTX 3080, A100
Ada Lovelace	8.9	RTX 4060, RTX 4070, RTX 4080, RTX 4090
Hopper	9.0	H100
Blackwell	10.0	RTX 5070, RTX 5080, RTX 5090, B200

Contributing

Pull requests are welcome! Before submitting, make sure your code passes the following checks. You can run them locally to catch issues early:

1. Black (code formatting)

Black enforces a consistent code style. It handles things like indentation, trailing commas, quote style, and line wrapping. The project uses a max line length of 100 characters.

black --check .     # check for issues
black .             # auto-fix formatting

2. isort (import sorting)

isort ensures imports are grouped (stdlib, third-party, local) and alphabetically sorted. It's configured to be compatible with Black.

isort --check-only .   # check for issues
isort .                # auto-fix import order

3. Flake8 (linting)

Flake8 catches common issues like unused imports, undefined names, and style violations. Max line length is 100 characters.

flake8 .

4. Pytest (tests)

The test suite must pass. Some tests require model downloads and may take a while on the first run.

pytest -v tests/

Quick setup

Install the dev dependencies and run all checks:

pip install -e ".[dev]"
black . && isort . && flake8 . && pytest -v tests/

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

AI_for_Lawyers

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

2.1.0

Apr 2, 2026

2.0.0

Apr 2, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

faster_whisper2-2.1.0.tar.gz (1.1 MB view details)

Uploaded Apr 2, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

faster_whisper2-2.1.0-py3-none-any.whl (1.1 MB view details)

Uploaded Apr 2, 2026 Python 3

File details

Details for the file faster_whisper2-2.1.0.tar.gz.

File metadata

Download URL: faster_whisper2-2.1.0.tar.gz
Upload date: Apr 2, 2026
Size: 1.1 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for faster_whisper2-2.1.0.tar.gz
Algorithm	Hash digest
SHA256	`ea364fc77212ee9b781a71d934df2fc524e8451a0eb011703e7b55de85327879`
MD5	`bbeaed9afbc42ab374163ecd6e1ff7cf`
BLAKE2b-256	`6b242f8b07cd05c841464ee6f8c0895a6935a4e271e84a0aa94be303fea289ef`

See more details on using hashes here.

Provenance

The following attestation bundles were made for faster_whisper2-2.1.0.tar.gz:

Publisher: publish.yml on BBC-Esq/faster-whisper2

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: faster_whisper2-2.1.0.tar.gz
- Subject digest: ea364fc77212ee9b781a71d934df2fc524e8451a0eb011703e7b55de85327879
- Sigstore transparency entry: 1218121206
- Sigstore integration time: Apr 2, 2026
Source repository:
- Permalink: BBC-Esq/faster-whisper2@cbaca3dd444f008c466629f942d6a858f689df0a
- Branch / Tag: refs/tags/v2.1.0
- Owner: https://github.com/BBC-Esq
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@cbaca3dd444f008c466629f942d6a858f689df0a
- Trigger Event: push

File details

Details for the file faster_whisper2-2.1.0-py3-none-any.whl.

File metadata

Download URL: faster_whisper2-2.1.0-py3-none-any.whl
Upload date: Apr 2, 2026
Size: 1.1 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for faster_whisper2-2.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9f3fea5d347d78de04e37e2f1e3de15a2cec44edc8e55fbaee815716d1c2c3f3`
MD5	`f97eb1d5a69525f5d4e1779f9177a1fa`
BLAKE2b-256	`3dab5bef126b79d73dbfae1d0e83c0288f1d7c6de3f6db8e5cf67fe8206d036a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for faster_whisper2-2.1.0-py3-none-any.whl:

Publisher: publish.yml on BBC-Esq/faster-whisper2

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: faster_whisper2-2.1.0-py3-none-any.whl
- Subject digest: 9f3fea5d347d78de04e37e2f1e3de15a2cec44edc8e55fbaee815716d1c2c3f3
- Sigstore transparency entry: 1218121324
- Sigstore integration time: Apr 2, 2026
Source repository:
- Permalink: BBC-Esq/faster-whisper2@cbaca3dd444f008c466629f942d6a858f689df0a
- Branch / Tag: refs/tags/v2.1.0
- Owner: https://github.com/BBC-Esq
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@cbaca3dd444f008c466629f942d6a858f689df0a
- Trigger Event: push

faster-whisper2 2.1.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Faster Whisper 2 - transcription with CTranslate2

Requirements

GPU

Use Docker

Install with pip (Linux only)

Installation

Install the master branch

Install a specific commit

Supported compute types by device

Which model precision is downloaded

CUDA compute capability by GPU generation

Contributing

1. Black (code formatting)

2. isort (import sorting)

3. Flake8 (linting)

4. Pytest (tests)

Quick setup

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Install with `pip` (Linux only)