A simple tool to transcribe audio files

Project description

Transcriber

A flexible Python package for transcribing audio and video from SoundCloud, YouTube, or local files into txt, docx, or srt output. It supports multiple speech recognition models and defaults to Whisper "large-v3". GPU acceleration speeds up processing, and async combined with multiprocessing lets independent work run concurrently.

Features

  • Multiple Input Sources: Transcribe from SoundCloud, YouTube, or local audio/video files.
  • Multiple Models: Easily switch between different speech recognition models.
  • GPU Acceleration: Utilize GPU processing for faster transcription.
  • Concurrent Processing: Uses async and multiprocessing for improved performance.
  • Various Output Formats: Export transcriptions to txt, docx, or srt formats.
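The concurrent processing feature can be pictured with a minimal sketch that combines asyncio with an executor. This is not the package's actual implementation: `transcribe_file` and `transcribe_all` are hypothetical names, and the worker only returns a placeholder string instead of running a model.

```python
import asyncio
from concurrent.futures import Executor, ProcessPoolExecutor
from typing import List, Optional


def transcribe_file(path: str) -> str:
    """Hypothetical stand-in for a CPU- or GPU-bound transcription call."""
    return f"transcript of {path}"


async def transcribe_all(
    paths: List[str], executor: Optional[Executor] = None
) -> List[str]:
    """Run one transcription job per path and collect results in input order.

    With executor=None the event loop's default thread pool is used.
    """
    loop = asyncio.get_running_loop()
    tasks = [loop.run_in_executor(executor, transcribe_file, p) for p in paths]
    return list(await asyncio.gather(*tasks))


if __name__ == "__main__":
    # A process pool moves each job into its own worker process.
    with ProcessPoolExecutor(max_workers=2) as pool:
        print(asyncio.run(transcribe_all(["a.mp3", "b.wav"], executor=pool)))
```

Passing a process pool keeps heavy work out of the event loop's thread, while the async layer stays free for I/O such as downloads.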

Installation

Install the required packages:

uv pip install -r requirements.txt

Usage

The following example transcribes a YouTube video and writes the result in all three output formats:

# Example: transcribe a YouTube video
import transcriber

transcriber.transcribe(
    source="youtube",
    url="https://www.youtube.com/watch?v=6Jv8GKZlX2A",
    model="whisper-large-v3",
    output_format=["txt", "docx", "srt"],
    output_dir="output",
)
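For readers unfamiliar with the srt output format, the sketch below shows how timed segments map to SubRip text. It is an illustration under assumptions, not the package's own writer: `format_timestamp` and `segments_to_srt` are hypothetical helpers, and the segments are assumed to be (start seconds, end seconds, text) tuples.

```python
from typing import Iterable, Tuple


def format_timestamp(seconds: float) -> str:
    """Convert seconds to the SRT timestamp form HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: Iterable[Tuple[float, float, str]]) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        timing = f"{format_timestamp(start)} --> {format_timestamp(end)}"
        blocks.append(f"{index}\n{timing}\n{text.strip()}\n")
    return "\n".join(blocks)


if __name__ == "__main__":
    print(segments_to_srt([(0.0, 1.5, "Hello"), (1.5, 3.0, "world")]))
```

Each cue consists of a running index, a timing line, and the text, which is why srt output needs per-segment timestamps rather than a plain transcript.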

Contributing

Contributions are welcome. Please submit a pull request or open an issue for suggestions.

Download files

Source Distribution

transcriber-0.1.2.tar.gz (55.9 kB view details)

Uploaded Source

Built Distribution

transcriber-0.1.2-py3-none-any.whl (14.7 kB view details)

Uploaded Python 3

File details

Details for the file transcriber-0.1.2.tar.gz.

File metadata

  • Download URL: transcriber-0.1.2.tar.gz
  • Upload date:
  • Size: 55.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.5

File hashes

Hashes for transcriber-0.1.2.tar.gz
Algorithm Hash digest
SHA256 2ba4b8dda5f5907ae537c8062f04924e87a7a857c33abfbed61e8387d0dc4be9
MD5 c0c012990925bc82af2a7e63a39f792e
BLAKE2b-256 7325eac0d33a741fc3dbad34ae734f301977ef7150fbb8ab9a1ff355df2f1079

File details

Details for the file transcriber-0.1.2-py3-none-any.whl.

File hashes

Hashes for transcriber-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 c645a5bab7f4523f996b9249b4e924d028ab16ef8ada4040a725041948c2d997
MD5 722f6a99a5ac777cabffaed750af835b
BLAKE2b-256 66934727f7ff2bea0027ae80185878c9824b6c68de40232edd2d008d3cff981e
