A nendo plugin for speech transcription, based on Whisper by OpenAI.
Project description
Nendo Plugin Transcribe Whisper
A nendo plugin for speech transcription, based on Whisper by OpenAI.
Features
- Fast speech transcription with optional word-level timestamps.
Requirements
Since we depend on transformers
, please make sure that you fulfill their requirements.
You also need Pytorch installed on your system, please refer to the pytorch installation instructions.
Installation
- Install Nendo
pip install nendo-plugin-transcribe-whisper
If you have a cuda GPU on your machine you can also install flash-attn
to get an additional speedup:
pip install flash-attn --no-build-isolation
Then set ATTN_IMPLEMENTATION=flash_attention_2
in your environment variables.
Usage
>>> from nendo import Nendo
>>> nd = Nendo(plugins=["nendo_plugin_transcribe_whisper"])
>>> track = nd.library.add_track(file_path="path/to/file.mp3")
>>> nd.plugins.transcribe_whisper(track=track)
>>> track.get_plugin_value("transcription")
Contributing
Visit our docs to learn all about how to contribute to Nendo: Contributing
License
Nendo: MIT License
Pretrained models: The weights are released under the Apache 2.0 license.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for nendo_plugin_transcribe_whisper-0.1.0.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | e9c5326476588e41495c41838a1f975b88973733f16d9b68d3f3c81a74c92b78 |
|
MD5 | 405cc9921f5acb93872a41749734538a |
|
BLAKE2b-256 | a4f1d3401aedb19b1a2650d10872524f885129a6114ce6dc4a72bea373613256 |
Hashes for nendo_plugin_transcribe_whisper-0.1.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b90d094e63f345a7fc6cdefc6fd8e430921690e899f3a45a4d1fbc5016be0131 |
|
MD5 | 13cd9bf2a13ae4c0e2ebd61aff5b848c |
|
BLAKE2b-256 | 608e1ed3c433c9733ae6ada8d9b7b8e39c1a5a8b074f98feb1666ed933b84b58 |