Transcribe your .wav, .mp4, .mp3, and .flac files to text, or record your own audio!
Audio-Transcriber
Version: 0.5.14
This repository is actively maintained. Contributions are welcome!
Contribution Opportunities:
- Support new models
Usage:
Short Flag | Long Flag | Description |
---|---|---|
-h | --help | Show usage information |
-b | --bitrate | Bitrate to use during recording |
-c | --channels | Number of channels to use during recording |
-d | --directory | Directory to save recording |
-e | --export | Export txt, srt, and vtt files |
-f | --file | File to transcribe |
-l | --language | Language to transcribe |
-m | --model | Model to use: <tiny, base, small, medium, large> |
-n | --name | Name of recording |
-r | --record | Number of seconds to record from the microphone |
Example:
audio-transcriber --file ~/Downloads/Federal_Reserve.mp4 --model large
audio-transcriber --record 60 --directory ~/Downloads/ --name my_recording.wav --model tiny
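To transcribe several files in one go, the command shown above can be driven from a short Python script. This is a minimal sketch, not part of the package: `find_media`, `build_command`, and `transcribe_all` are hypothetical helper names, and only the documented `--file` and `--model` flags are used.

```python
from pathlib import Path
import subprocess

# File extensions listed in the project description.
SUPPORTED = {".wav", ".mp4", ".mp3", ".flac"}

def find_media(directory):
    """Return the supported media files in a directory, sorted by path."""
    root = Path(directory).expanduser()  # expand "~" ourselves; no shell is involved
    return sorted(
        p for p in root.iterdir()
        if p.is_file() and p.suffix.lower() in SUPPORTED
    )

def build_command(path, model="base"):
    """Build the argument list for one audio-transcriber invocation."""
    return ["audio-transcriber", "--file", str(path), "--model", model]

def transcribe_all(directory, model="base"):
    """Run the CLI once per file; check=True stops at the first failure."""
    for path in find_media(directory):
        subprocess.run(build_command(path, model), check=True)
```

Calling `transcribe_all("~/Downloads", model="tiny")` would then invoke the CLI once for each supported file in that directory, assuming `audio-transcriber` is on the `PATH`.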
Model Information:
Courtesy of, and credit to, OpenAI: Whisper
Size | Parameters | English-only model | Multilingual model | Required VRAM | Relative speed |
---|---|---|---|---|---|
tiny | 39 M | tiny.en | tiny | ~1 GB | ~32x |
base | 74 M | base.en | base | ~1 GB | ~16x |
small | 244 M | small.en | small | ~2 GB | ~6x |
medium | 769 M | medium.en | medium | ~5 GB | ~2x |
large | 1550 M | N/A | large | ~10 GB | 1x |
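The VRAM column can guide which value to pass to `--model`. The following sketch picks the largest model whose approximate requirement fits a given amount of GPU memory. The function name `largest_model_for` is hypothetical, and the figures are the rough values from the table above, so treat the result as a starting point rather than a guarantee.

```python
# (model name, approximate required VRAM in GB), ordered smallest to largest,
# copied from the model table above.
MODELS = [("tiny", 1), ("base", 1), ("small", 2), ("medium", 5), ("large", 10)]

def largest_model_for(vram_gb):
    """Return the largest model whose approximate VRAM need fits, or None."""
    fitting = [name for name, need in MODELS if need <= vram_gb]
    return fitting[-1] if fitting else None

print(largest_model_for(8))  # medium
```

Larger models are slower (the table lists `large` at 1x against roughly 32x for `tiny`), so a smaller model may still be the better choice when speed matters more than accuracy.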
Installation Instructions:
Install Python Package
python -m pip install audio-transcriber
Ubuntu Dependencies
apt install -y libasound-dev portaudio19-dev libportaudio2 libportaudiocpp0 ffmpeg
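After installing, a quick check can confirm that the system dependencies are visible to Python. This is a rough sketch under assumptions: `missing_dependencies` is a hypothetical helper, it looks for an `ffmpeg` executable on the `PATH` and a shared library named `portaudio`, and library lookup behaviour varies by platform.

```python
import shutil
from ctypes.util import find_library

def missing_dependencies(executables=("ffmpeg",), libraries=("portaudio",)):
    """Return the names of executables and shared libraries that were not found."""
    missing = [exe for exe in executables if shutil.which(exe) is None]
    missing += [f"lib{lib}" for lib in libraries if find_library(lib) is None]
    return missing

if __name__ == "__main__":
    missing = missing_dependencies()
    print("Missing:", ", ".join(missing) if missing else "nothing")
```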
Hashes for audio_transcriber-0.5.14-py2.py3-none-any.whl
Algorithm | Hash digest |
---|---|
SHA256 | 392495e3cc0a915108bb448bea65cb5a5ba5f8fc750a2a7c7d3c28d0fd1edd64 |
MD5 | 04633800ccbf1ea2a4002e6713ed3e39 |
BLAKE2b-256 | 05265f3eeebc24a93981d3487ab51eaf39a957495004b2f168c76531ad38b3e1 |
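A downloaded wheel can be checked against the SHA256 digest listed above using only the standard library. This is a minimal sketch: `sha256_of` and `verify` are hypothetical helper names, and the wheel path in the usage note is an assumption about where the file was saved.

```python
import hashlib

# SHA256 digest published above for audio_transcriber-0.5.14-py2.py3-none-any.whl
EXPECTED_SHA256 = "392495e3cc0a915108bb448bea65cb5a5ba5f8fc750a2a7c7d3c28d0fd1edd64"

def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large files need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def verify(path, expected=EXPECTED_SHA256):
    """Return True when the file's SHA256 digest matches the expected value."""
    return sha256_of(path) == expected.lower()
```

For example, `verify("audio_transcriber-0.5.14-py2.py3-none-any.whl")` should return `True` for an intact copy of this release in the current directory.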