A python CLI tool to extract voice sentences from audio files with speech recognition
Project description
VoiceXTractor
~ A python CLI tool to extract voice sentences from audio files with speech recognition ~
Ko-fi · Installation · Changelog
Developed by @veeso
Current version: 0.1.2 (23/09/2022)
About VXT 🚜
VXT, which stands for VoiceXTractor is a Python command-line utility to extract voice tracks from audio.
How it works:
- You provide VXT with an audio file
- The audio file is split by silence
- for each "track" chunked by the audio file, it gets the speech for it using a customisable speech-to-text engine
- you can at this point work on tracks (amplify, normalize, split, remove...)
- export the tracks to files with the format you prefer
Get started 🚀
You can install VXT with pip:
pip3 install vxt
then you must install dependencies:
pip3 install SpeechRecognition PyInquirer pydub termcolor click
you also need to install ffmpeg
or libav
.
Finally you can run VXT with the following arguments:
python3 -m vxt -l it_IT -o ./output/ ./assets/hackerino.mp3
this will split the hackerino.mp3
audio file into tracks by voice into output/
, the -l
option specifies the audio language is Italian.
vxt supports these options:
-e, --engine TEXT Specify speech2text engine [bing, google,
google-cloud, houndify, ibm, sphinx] (default:
google)
-l, --language TEXT Specify audio language (e.g. it_IT), system
language will be used otherwise
-f, --output-fmt TEXT Specify output format (See readme)
-o, --output-dir TEXT Specify output directory
-A, --api-key TEXT Specify api key (required for: bing, google
-J, --json-credentials TEXT Specify json credentials (required for: google-
cloud)
-C, --client-id TEXT Specify client id (required for: houndify)
-K, --client-key TEXT Specify client key (required for: houndify)
-U, --username TEXT Specify username (required for: ibm)
-P, --password TEXT Specify user password (required for: ibm)
--keyword-entries TEXT Specify keyword entries (required for: sphinx)
--grammar-file TEXT Specify grammar file (required for: sphinx)
--help Show this message and exit.
by default the google
engine will be used for speech-to-text.
Output format
Track filename fmt.
The syntax use parameters which must be preceeded by %
, everything in between will be kept the same.
The following parameters are supported.
%%
: print percentage symbol%d
: current day%H
: current hours%I
: current timestamp ISO8601 syntax%M
: current minutes%m
: current month%S
: current seconds%s
: track speech%s.NUMBER
track speech cut at length (e.g.%s.24
)%t
: track number in track list (from 1 to n)%y
: current year with 2 digits%Y
: current year with 4 digits
Support the developer ☕
If you like VXT and you're grateful for the work I've done, please consider a little donation 🥳
You can make a donation with one of these platforms:
Contributing and issues 🤝🏻
Contributions, bug reports, new features and questions are welcome! 😉 If you have any question or concern, or you want to suggest a new feature, or you want just want to improve VXT, feel free to open an issue or a PR.
Please follow our contributing guidelines
Changelog ⏳
View VXT's changelog HERE
Powered by 💪
VXT is powered by these awesome projects:
License 📃
VXT is licensed under the MIT license.
You can read the entire license HERE
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file vxt-0.1.2.tar.gz
.
File metadata
- Download URL: vxt-0.1.2.tar.gz
- Upload date:
- Size: 27.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.14
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 36fa36ae3526efd193cd1ef7ef7ed3a727f347f90f3bb9e177500e17d1b65c76 |
|
MD5 | efd09afeb2e93286b2a1295a60678b29 |
|
BLAKE2b-256 | 7c2ea362b77524853b18ff978fae23320da7fe54a78788aaaf16ac05be943837 |
File details
Details for the file vxt-0.1.2-py3-none-any.whl
.
File metadata
- Download URL: vxt-0.1.2-py3-none-any.whl
- Upload date:
- Size: 68.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.14
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 009c710c7a188aa5da579edbc0fff08ebf34a7f9e1c85dc2e1af610c40a452ae |
|
MD5 | 135659a4213501b1ffb1a5a4273e814a |
|
BLAKE2b-256 | 02b02a4832c65d58f5eae683785390595f31ecfc5b1c60c0aab83276450a060b |