A simple tool to transcribe audio files
Project description
Transcriber
A flexible Python package for transcribing audio and video from various sources (SoundCloud, YouTube, or local files) into multiple text formats (txt, docx, srt). The transcriber supports multiple speech recognition models (defaulting to Whisper "large-v3"). It leverages GPU acceleration for faster processing and utilizes concurrency through async and multiprocessing to improve performance.
Features
- Multiple Input Sources: Transcribe from SoundCloud, YouTube, or local audio/video files
- Multiple Models: Support for both
faster-whisperand standardwhispermodels - GPU Acceleration: Utilize GPU processing for faster transcription
- Concurrent Processing: Batch processing and VAD filtering for improved performance
- Various Output Formats: Export transcriptions to multiple formats
- Progress Tracking: Rich progress bars showing transcription status
- Configurable: Extensive configuration options via environment variables or settings
Requirements
- Python 3.11 or higher
- GPU support (optional but recommended for better performance)
Installation
-
Install the package:
uv pip install Transcriber
Configuration
-
Copy the example environment file:
cp .env.example .env
-
Configure the settings in
.env:# Input settings INPUT__URLS_OR_PATHS=["Audio_To_Transcribe"] INPUT__SKIP_IF_OUTPUT_EXIST=false # Output settings OUTPUT__OUTPUT_FORMATS=["all"] OUTPUT__OUTPUT_DIR="Transcripts" # Whisper model settings WHISPER__LANGUAGE="ar"
Usage
Basic Usage
from Transcriber.transcriber import transcribe
# Configure input files in .env, then:
transcribe()
Command Line Usage
uv run --with Transcriber transcribe
Example Configuration
# Example settings in your .env file
INPUT__URLS_OR_PATHS=["path/to/audio.mp3", "https://youtube.com/watch?v=example"]
OUTPUT__OUTPUT_FORMATS=["txt", "srt"]
WHISPER__LANGUAGE="en"
Acknowledgments
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file transcriber-0.1.11.tar.gz.
File metadata
- Download URL: transcriber-0.1.11.tar.gz
- Upload date:
- Size: 13.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.6.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9577810becb7c8c213cbd83fb7432957568fb69795ffda2991fd220b5b6e2919
|
|
| MD5 |
ea59308034fa7efc394d88e9dd3dd537
|
|
| BLAKE2b-256 |
c591a694706edb7d17e6bf849f50b7c9abe1ca3f825a1662a623db249de627d1
|
File details
Details for the file transcriber-0.1.11-py3-none-any.whl.
File metadata
- Download URL: transcriber-0.1.11-py3-none-any.whl
- Upload date:
- Size: 20.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.6.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b30c856083cc0d1baa4d2ebb45b232b7afc549380b36c1e68758c4ab8597d323
|
|
| MD5 |
5510f66e5d0c5cdba48e702017423f8b
|
|
| BLAKE2b-256 |
ea1c22f7057406a828a9516267449d4e2afe0f673e14597b3fb8d759c99add8f
|