A Python CLI tool to extract voice sentences from audio files with speech recognition
VoiceXTractor
Developed by @veeso
Current version: 0.1.0 (09/01/2022)
About VXT 🚜
VXT, which stands for VoiceXTractor, is a Python command-line utility that extracts voice tracks from audio files.
How it works:
- You provide VXT with an audio file
- The audio file is split by silence
- For each track obtained from the audio file, VXT transcribes the speech using a customisable speech-to-text engine
- At this point you can work on the tracks (amplify, normalize, split, remove...)
- Finally, you export the tracks to files in the format you prefer
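The "split by silence" step can be sketched on a plain list of amplitude samples. This is only a toy illustration, not VXT's actual implementation: the function name `split_on_silence`, the `threshold` and `min_silence_len` parameters, and the sample-list input are all assumptions made for this example.

```python
from typing import List, Tuple


def split_on_silence(
    samples: List[int], threshold: int, min_silence_len: int
) -> List[Tuple[int, int]]:
    """Return (start, end) index pairs of non-silent chunks (end is exclusive).

    A sample counts as silent when its absolute value is <= threshold.
    A chunk is closed once min_silence_len consecutive silent samples are seen.
    """
    chunks: List[Tuple[int, int]] = []
    start = None    # index where the current chunk began, if any
    quiet_run = 0   # consecutive silent samples seen inside the current chunk

    for i, sample in enumerate(samples):
        if abs(sample) > threshold:
            if start is None:
                start = i
            quiet_run = 0
        elif start is not None:
            quiet_run += 1
            if quiet_run >= min_silence_len:
                # Close the chunk at the first silent sample of this run.
                chunks.append((start, i - quiet_run + 1))
                start = None
                quiet_run = 0

    if start is not None:
        # The input ended while a chunk was open; drop trailing silence.
        chunks.append((start, len(samples) - quiet_run))
    return chunks


samples = [0, 0, 5, 6, 0, 7, 0, 0, 0, 8, 9, 0]
tracks = split_on_silence(samples, threshold=1, min_silence_len=3)
```

In this sketch a short pause (the single `0` between `6` and `7`) stays inside a track, while a pause of three or more samples starts a new one. In a real pipeline each resulting chunk would then be handed to the speech-to-text engine.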
Get started 🚀
You can install VXT with pip:
pip3 install vxt
Then you can run VXT with the following arguments:
vxt -l it_IT -o ./output/ ./hackerino.mp3
This splits the hackerino.mp3 audio file into tracks by voice and writes them to output/. The -l option specifies that the audio language is Italian.
vxt supports these options:
-e, --engine TEXT Specify speech2text engine [bing, google, google-cloud, houndify, ibm, sphinx] (default: google)
-l, --language TEXT Specify audio language (e.g. it_IT), system language will be used otherwise
-f, --output-fmt TEXT Specify output format (See readme)
-o, --output-dir TEXT Specify output directory
-A, --api-key TEXT Specify api key (required for: bing, google
-J, --json-credentials TEXT Specify json credentials (required for: google-cloud)
-C, --client-id TEXT Specify client id (required for: houndify)
-K, --client-key TEXT Specify client key (required for: houndify)
-U, --username TEXT Specify username (required for: ibm)
-P, --password TEXT Specify user password (required for: ibm)
--keyword-entries TEXT Specify keyword entries (required for: sphinx)
--grammar-file TEXT Specify grammar file (required for: sphinx)
--help Show this message and exit.
By default, the google engine is used for speech-to-text.
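When VXT is driven from a script, the command line can be assembled programmatically. The following is a minimal sketch that uses only the flags listed above; the helper name `build_vxt_command` and its parameters are hypothetical and not part of VXT.

```python
from typing import List, Optional

# Engine names as listed in the help text for -e/--engine.
ENGINES = ("bing", "google", "google-cloud", "houndify", "ibm", "sphinx")


def build_vxt_command(
    audio_file: str,
    output_dir: str,
    language: Optional[str] = None,
    engine: Optional[str] = None,
    output_fmt: Optional[str] = None,
) -> List[str]:
    """Build the argument list for a vxt invocation (hypothetical helper)."""
    if engine is not None and engine not in ENGINES:
        raise ValueError(f"unknown engine: {engine}")

    cmd = ["vxt"]
    if engine is not None:
        cmd += ["-e", engine]
    if language is not None:
        cmd += ["-l", language]
    if output_fmt is not None:
        cmd += ["-f", output_fmt]
    cmd += ["-o", output_dir, audio_file]
    return cmd


cmd = build_vxt_command("./hackerino.mp3", "./output/", language="it_IT")
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` on a machine where `vxt` is installed. Credential options such as `-A` or `-J` are left out of this sketch and would need to be appended for engines that require them.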
Output format
The output format is the format used for track filenames.
The syntax uses parameters, each of which must be preceded by %; any other text is kept as is.
The following parameters are supported:
- %%: print percentage symbol
- %d: current day
- %H: current hours
- %I: current timestamp ISO8601 syntax
- %M: current minutes
- %m: current month
- %S: current seconds
- %s: track speech
- %s.NUMBER: track speech cut at length (e.g. %s.24)
- %t: track number in track list (from 1 to n)
- %y: current year with 2 digits
- %Y: current year with 4 digits
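To make the parameter syntax concrete, here is a small sketch of how such a format string could be expanded. It is an illustration rather than VXT's own code: the function name `format_track_name` is hypothetical, and zero-padding of date and time fields as well as leaving unknown parameters untouched are assumptions.

```python
import re
from datetime import datetime

# "%s.NUMBER" is tried first, then any single-character parameter.
_TOKEN = re.compile(r"%(s\.(\d+)|[%dHIMmSstyY])")


def format_track_name(fmt: str, speech: str, number: int, now: datetime) -> str:
    """Expand the output-format parameters in fmt (hypothetical helper)."""
    values = {
        "%": "%",
        "d": f"{now.day:02d}",         # assumption: zero-padded
        "H": f"{now.hour:02d}",
        "I": now.isoformat(),          # ISO 8601 timestamp
        "M": f"{now.minute:02d}",
        "m": f"{now.month:02d}",
        "S": f"{now.second:02d}",
        "s": speech,
        "t": str(number),
        "y": f"{now.year % 100:02d}",
        "Y": f"{now.year:04d}",
    }

    def expand(match: "re.Match[str]") -> str:
        if match.group(2) is not None:
            # %s.NUMBER: track speech cut at the given length
            return speech[: int(match.group(2))]
        return values[match.group(1)]

    return _TOKEN.sub(expand, fmt)


now = datetime(2022, 1, 9, 14, 5, 7)
name = format_track_name("%t_%s.8_%Y-%m-%d.wav", "ciao a tutti quanti", 3, now)
```

With these inputs the name becomes `3_ciao a t_2022-01-09.wav`. Note that in this sketch `%s.` followed by digits is always read as a length cut, so `%s.mp3` expands to the full speech followed by `.mp3`.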
Support the developer ☕
If you like VXT and you're grateful for the work I've done, please consider a little donation 🥳
Contributing and issues 🤝🏻
Contributions, bug reports, new features and questions are welcome! 😉 If you have any questions or concerns, want to suggest a new feature, or just want to improve VXT, feel free to open an issue or a PR.
Please follow our contributing guidelines
Changelog ⏳
View VXT's changelog HERE
Powered by 💪
VXT is powered by these awesome projects:
License 📃
VXT is licensed under the MIT license.
You can read the entire license HERE