A multilingual voice recording and transcription tool with German and English support
Project description
DLU Voice Transcriber
A Python tool for recording audio and transcribing it to text in multiple languages (English and German).
Features
- High-quality audio recording (48kHz sample rate, 16-bit depth)
- Automatic input device detection and selection
- Support for multiple audio formats (WAV, MP3)
- Configurable MP3 quality settings
- Transcription to English and German
- Progress monitoring during recording
- Save transcriptions to text files
- Detailed device information and listing
- Intelligent device selection with native sample rate support
Installation
pip install dlu_voice_transcriber
System Requirements
- Python 3.8 or higher
- Working microphone
- Linux:
sudo apt-get install python3-pyaudio portaudio19-dev
- macOS:
brew install portaudio
- Windows: No additional requirements
Usage
Basic Commands
List available audio devices:
dlu_transcribe --list-devices
Basic recording (5 seconds, default settings):
dlu_transcribe
Advanced Usage
Record with specific settings:
dlu_transcribe --device 2 --duration 10 --output my_recording --format mp3 --mp3-quality 320 --language both --save-text
Command Line Options
--device
: Specify input device index (shown by --list-devices)--duration
: Recording duration in seconds (default: 5)--output
: Output filename without extension (default: recording)--format
: Output format: 'wav' or 'mp3' (default: mp3)--mp3-quality
: MP3 quality in kbps (default: 320)--language
: Transcription language: 'de', 'en', or 'both' (default: both)--save-text
: Save transcriptions to text files--list-devices
: Show available audio input devices
Examples
List all available devices:
dlu_transcribe --list-devices
Record high-quality MP3 for 15 seconds:
dlu_transcribe --duration 15 --format mp3 --mp3-quality 320 --output high_quality_recording
Record using specific device with German-only transcription:
dlu_transcribe --device 2 --language de --save-text --output german_speech
Output Files
The tool generates the following files in the recordings
directory:
- Audio file:
<output>.<format>
(e.g., recording.mp3) - Transcription files (if --save-text is used):
- English:
<output>_en.txt
- German:
<output>_de.txt
- English:
License
This project is licensed under the MIT License - see the LICENSE file for details.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file dlu_voice_transcriber-0.2.3.tar.gz
.
File metadata
- Download URL: dlu_voice_transcriber-0.2.3.tar.gz
- Upload date:
- Size: 8.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9dc995a2749fe7f7d46783632602cd3e1764b8e373366e6f2cfc2b463b7f8d13 |
|
MD5 | f1dcb3b5ed9e2bd7adea75a3123dd4ab |
|
BLAKE2b-256 | 2843cd30cdb3e7c49cd79c2d932b7444dcd2f12c760f41246d8ce1b84d21470c |
File details
Details for the file dlu_voice_transcriber-0.2.3-py3-none-any.whl
.
File metadata
- Download URL: dlu_voice_transcriber-0.2.3-py3-none-any.whl
- Upload date:
- Size: 9.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 17e31a13b50ba8583218a92e5446eff96acae73987747f3e9bf54967469f83a4 |
|
MD5 | e5a2721c1ecb20d8d08c49fb589a49af |
|
BLAKE2b-256 | 57c402e097d052bdf8c87099ada39ba25787351d82736857a313a9f35f86cf25 |