Skip to main content

A simple tool to transcribe audio files

Project description

Transcriber

A flexible Python package for transcribing audio and video from various sources (SoundCloud, YouTube, or local files) into multiple text formats (txt, docx, srt). The transcriber supports multiple speech recognition models (defaulting to Whisper "large-v3"). It leverages GPU acceleration for faster processing and utilizes concurrency through async and multiprocessing to improve performance.

Features

  • Multiple Input Sources: Transcribe from SoundCloud, YouTube, or local audio/video files.
  • Multiple Models: Easily switch between different speech recognition models.
  • GPU Acceleration: Utilize GPU processing for faster transcription.
  • Concurrent Processing: Implement async and multiprocessing for improved performance.
  • Various Output Formats: Export transcriptions to txt, docx, or srt formats.

Installation

Install the required packages:

uv pip install -r requirements.txt

Usage

Provide examples on how to use the package:

# Example command to transcribe a YouTube video
import transcriber

transcriber.transcrib(
    source="youtube",
    url="https://www.youtube.com/watch?v=6Jv8GKZlX2A",
    model="whisper-large-v3",
    output_format= ["txt", "docx", "srt"],
    output_dir="output"
)

Contributing

Contributions are welcome. Please submit a pull request or open an issue for suggestions.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

transcriber-0.1.3.tar.gz (10.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

transcriber-0.1.3-py3-none-any.whl (16.9 kB view details)

Uploaded Python 3

File details

Details for the file transcriber-0.1.3.tar.gz.

File metadata

  • Download URL: transcriber-0.1.3.tar.gz
  • Upload date:
  • Size: 10.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.5

File hashes

Hashes for transcriber-0.1.3.tar.gz
Algorithm Hash digest
SHA256 00f60e9036e4443ab70640b064b05d148cf8da5e8dfc8385e4365d505e71a969
MD5 f282de5fd97176ec6e43377bf7bd5754
BLAKE2b-256 6b0fce3d7b82b6128fdf3b8654ea9f7b506a32de54dca27de77cdf56f4d09769

See more details on using hashes here.

File details

Details for the file transcriber-0.1.3-py3-none-any.whl.

File metadata

File hashes

Hashes for transcriber-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 7462be80632c9726698d1b88cdc68e09f2db42b7a5794f89e8f93ec00cc284d8
MD5 6f34231f9ed498a03999188489c33c6d
BLAKE2b-256 67d89b23b34df44bfc83a11eae100e92d77b05887e1b8bc4843655f17cd7727c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page