Transcribe all soundfiles in a folder using Whisper
Project description
swori
LocalWhisper
Introduction
TranscribeTools contains an Python application LocalWhisper which transcribes all sound files in a configurable folder using a local Whisper model. You can choose which Whisper model is to be used
Details
- using Python 3.12.7, openai-whisper https://pypi.org/project/openai-whisper/ (current version 20240930) does not support 3.13 yet.
License
This project is licensed under the Apache 2.0 License - see the LICENSE file for details.
Setup
We use uv for managing virtual environments and package installation. Follow these steps to set up the project:
On macOS:
Install uv
- First install brew if needed from https://github.com/Homebrew/brew/releases/latese
On Windows:
Download the setup script
We need to install UV a tool to install the Python environment and to
install the tool. There are a few possibilities
-
Follow instructions at the UV website
-
Press {Windows button} then type or paste:.
powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex" -
Use
winget
- Open the Windows Powershell: press {Windows} button and type or paste
winget install --id=astral-sh.uv -e
These scripts will:
Installs the uv tool. Check if uv {enter} works. At the moment a reboot is needed on Windows.
Now we can install the tools.
Install tools
uv tool install transcribetools
Install the (commandline) tools in this project. For now
it's only transcribe_folder.
Plans
- Make it a local service, running in the background
- Investigate options to let it run on a central computer, as a service
- Create Docker image
- Add speaker partitioning (see TranscribeWhisperX)
- Adjust models using PyTorch (more control)
Documentation about Whisper on the cloud and local
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file transcribetools-0.5.5.tar.gz.
File metadata
- Download URL: transcribetools-0.5.5.tar.gz
- Upload date:
- Size: 4.5 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
334c045740887c76e64934291fea33ce8e4caec059fe53dc9fe06aefb1a74e4b
|
|
| MD5 |
76257ba3335986c59b21638a16e55843
|
|
| BLAKE2b-256 |
d6237dfa778816c27c466e0e4467fb4868435a4a61407bdc983944ed14ad5f92
|
File details
Details for the file transcribetools-0.5.5-py3-none-any.whl.
File metadata
- Download URL: transcribetools-0.5.5-py3-none-any.whl
- Upload date:
- Size: 12.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a92d82bbf6a6c8844133b267c31cf5044671f3d0f429a31d3f82cbe5cf183035
|
|
| MD5 |
431ea95e0b705cac43e758ed9d9e44cb
|
|
| BLAKE2b-256 |
3fb22498d6b9dd17c9031ee598fbae4f3cc66b6a5f2410eb0677bc8488492ff9
|
File details
Details for the file transcribetools-0.5.5-py2.py3-none-any.whl.
File metadata
- Download URL: transcribetools-0.5.5-py2.py3-none-any.whl
- Upload date:
- Size: 20.8 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c1e0061e3e4d6bdf582cb2310ef1331afb23969826543aeaac609dccdaf990e3
|
|
| MD5 |
32d22a7bcd9c135ec89f600a0ff7ca5e
|
|
| BLAKE2b-256 |
f140ecbee96e6fc2b9a1546bfea33e237e3d6cd9a783a14cf70b05d0d228f453
|