Skip to main content

A multilingual voice recording and transcription tool with German and English support

Project description

DLU Voice Transcriber

A Python tool for recording audio and transcribing it to text in multiple languages (English and German).

Features

  • High-quality audio recording (48kHz sample rate, 16-bit depth)
  • Automatic input device detection and selection
  • Support for multiple audio formats (WAV, MP3)
  • Configurable MP3 quality settings
  • Transcription to English and German
  • Progress monitoring during recording
  • Save transcriptions to text files
  • Detailed device information and listing
  • Intelligent device selection with native sample rate support

Installation

pip install dlu_voice_transcriber

System Requirements

  • Python 3.8 or higher
  • Working microphone
  • Linux: sudo apt-get install python3-pyaudio portaudio19-dev
  • macOS: brew install portaudio
  • Windows: No additional requirements

Usage

Basic Commands

List available audio devices:

dlu_transcribe --list-devices

Basic recording (5 seconds, default settings):

dlu_transcribe

Advanced Usage

Record with specific settings:

dlu_transcribe --device 2 --duration 10 --output my_recording --format mp3 --mp3-quality 320 --language both --save-text

Command Line Options

  • --device: Specify input device index (shown by --list-devices)
  • --duration: Recording duration in seconds (default: 5)
  • --output: Output filename without extension (default: recording)
  • --format: Output format: 'wav' or 'mp3' (default: mp3)
  • --mp3-quality: MP3 quality in kbps (default: 320)
  • --language: Transcription language: 'de', 'en', or 'both' (default: both)
  • --save-text: Save transcriptions to text files
  • --list-devices: Show available audio input devices

Examples

List all available devices:

dlu_transcribe --list-devices

Record high-quality MP3 for 15 seconds:

dlu_transcribe --duration 15 --format mp3 --mp3-quality 320 --output high_quality_recording

Record using specific device with German-only transcription:

dlu_transcribe --device 2 --language de --save-text --output german_speech

Output Files

The tool generates the following files in the recordings directory:

  • Audio file: <output>.<format> (e.g., recording.mp3)
  • Transcription files (if --save-text is used):
    • English: <output>_en.txt
    • German: <output>_de.txt

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dlu_voice_transcriber-0.2.3.tar.gz (8.6 kB view details)

Uploaded Source

Built Distribution

dlu_voice_transcriber-0.2.3-py3-none-any.whl (9.0 kB view details)

Uploaded Python 3

File details

Details for the file dlu_voice_transcriber-0.2.3.tar.gz.

File metadata

  • Download URL: dlu_voice_transcriber-0.2.3.tar.gz
  • Upload date:
  • Size: 8.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.10

File hashes

Hashes for dlu_voice_transcriber-0.2.3.tar.gz
Algorithm Hash digest
SHA256 9dc995a2749fe7f7d46783632602cd3e1764b8e373366e6f2cfc2b463b7f8d13
MD5 f1dcb3b5ed9e2bd7adea75a3123dd4ab
BLAKE2b-256 2843cd30cdb3e7c49cd79c2d932b7444dcd2f12c760f41246d8ce1b84d21470c

See more details on using hashes here.

File details

Details for the file dlu_voice_transcriber-0.2.3-py3-none-any.whl.

File metadata

File hashes

Hashes for dlu_voice_transcriber-0.2.3-py3-none-any.whl
Algorithm Hash digest
SHA256 17e31a13b50ba8583218a92e5446eff96acae73987747f3e9bf54967469f83a4
MD5 e5a2721c1ecb20d8d08c49fb589a49af
BLAKE2b-256 57c402e097d052bdf8c87099ada39ba25787351d82736857a313a9f35f86cf25

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page