README
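Introduction
Generates the k-timed (karaoke) ASS file needed for karaoke lyrics, with the help of whisperx. Note that the accuracy of the output depends on the quality of the input, so make sure the timestamps in the lrc file are accurate. In addition, using a vocal track processed with UVR5 as the input yields more accurate timestamps.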
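To make "k-timed" concrete, here is a minimal sketch of how per-syllable durations map to an ASS karaoke line. In ASS, each `{\kN}` tag gives the duration of the following syllable in centiseconds. The function names, the `Default` style, and the sample syllables are assumptions for illustration; this is not necessarily the exact line layout that whisperx-karaoke writes.

```python
def ass_time(seconds):
    """Format seconds as an ASS timestamp, H:MM:SS.cc (centiseconds)."""
    total_cs = round(seconds * 100)
    hours, rest = divmod(total_cs, 360000)
    minutes, rest = divmod(rest, 6000)
    secs, cs = divmod(rest, 100)
    return f"{hours}:{minutes:02d}:{secs:02d}.{cs:02d}"


def karaoke_line(start, syllables):
    """Build one Dialogue line from (text, duration_seconds) pairs.

    Hypothetical helper: the style name and field values are placeholders.
    """
    end = start + sum(duration for _, duration in syllables)
    # Each syllable is prefixed with {\kN}, N being its length in centiseconds.
    body = "".join(
        f"{{\\k{round(duration * 100)}}}{text}" for text, duration in syllables
    )
    return f"Dialogue: 0,{ass_time(start)},{ass_time(end)},Default,,0,0,0,,{body}"


line = karaoke_line(1.5, [("ka", 0.5), ("ra", 0.25), ("o", 0.75)])
print(line)
```

The lrc file supplies the line start times, and the alignment step supplies the per-syllable durations that end up in the `\k` tags.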
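Usage
The tool can be used locally on Windows or run in Colab.
Prepare the following inputs:
- An audio file
- An lrc file (with line-level timestamps)
Then create a new folder anywhere and put these two files in it.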
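Because the result depends on the lrc timestamps, it can help to sanity-check the file before running the tool. The following is a minimal sketch that parses line-level `[mm:ss.xx]` timestamps and checks that they never go backwards. The helper names and the sample text are assumptions, and the parser used by whisperx-karaoke itself may accept a different set of formats.

```python
import re

# Matches a line-level LRC timestamp such as [01:23.45] or [01:23.456].
LRC_LINE = re.compile(r"^\[(\d+):(\d{2})(?:[.:](\d{1,3}))?\](.*)$")


def parse_lrc(text):
    """Return a list of (start_seconds, lyric) tuples, skipping metadata tags."""
    entries = []
    for raw in text.splitlines():
        match = LRC_LINE.match(raw.strip())
        if not match:
            continue  # metadata such as [ti:Title], or blank lines
        minutes, seconds, fraction, lyric = match.groups()
        start = int(minutes) * 60 + int(seconds)
        if fraction:
            start += int(fraction) / (10 ** len(fraction))
        entries.append((start, lyric.strip()))
    return entries


def is_monotonic(entries):
    """True when the timestamps never go backwards."""
    return all(a[0] <= b[0] for a, b in zip(entries, entries[1:]))


sample = "[ti:Demo]\n[00:01.50]first line\n[00:04.25]second line\n"
entries = parse_lrc(sample)
print(entries, is_monotonic(entries))
```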
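Running in Colab
Set the runtime type to GPU, create a raw folder in the default working directory, and put the audio file and the lrc file in it. Then run the cells in order.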
Running locally
Install the dependencies
conda install pytorch==2.0.0 torchaudio==2.0.0 pytorch-cuda=11.8 -c pytorch -c nvidia
pip install git+https://github.com/m-bain/whisperx.git
pip install whisperx-karaoke
Run
For example, if the files were placed in the ./raw/song1 folder:
python -m whisperx_karaoke ./raw/song1
# usage: __main__.py [-h] [--device DEVICE] [--language LANGUAGE] [--offset OFFSET] dir [dir ...]
# __main__.py: error: the following arguments are required: dir
After a while, the ass file is written to the same directory.
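The usage line shows that more than one directory can be passed (`dir [dir ...]`), so several songs can be processed in one invocation. As a rough sketch, the command could be assembled and launched from Python as follows. `build_command` and `run` are hypothetical helper names, and the option values simply mirror the documented defaults.

```python
import subprocess
import sys
from pathlib import Path


def build_command(dirs, device="cuda", language="ja", offset=0):
    """Assemble the argument list for `python -m whisperx_karaoke`."""
    cmd = [
        sys.executable, "-m", "whisperx_karaoke",
        "--device", device,
        "--language", language,
        "--offset", str(offset),
    ]
    # Positional arguments: one or more song folders.
    cmd.extend(str(Path(d)) for d in dirs)
    return cmd


def run(dirs, **options):
    """Run the tool and raise CalledProcessError if it exits with an error."""
    return subprocess.run(build_command(dirs, **options), check=True)


cmd = build_command(["raw/song1", "raw/song2"])
print(cmd)
```

Calling `run(["raw/song1", "raw/song2"])` would then start the actual processing, provided the package is installed in the current environment.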
Other parameters
- --device DEVICE: the device to use, default cuda
- --language LANGUAGE: the language, default ja
- --offset OFFSET: the offset applied to the lrc lyrics, default 0
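For reference, an argument parser matching the usage line and the defaults listed above could look like the sketch below. This is a reconstruction for illustration, not the package's actual source; in particular, treating `--offset` as a float is an assumption, since the README does not state its type or unit.

```python
import argparse


def build_parser():
    """Reconstruct a parser matching the documented usage line (illustrative)."""
    parser = argparse.ArgumentParser(prog="whisperx_karaoke")
    parser.add_argument("--device", default="cuda", help="device to run on")
    parser.add_argument("--language", default="ja", help="language of the lyrics")
    # Assumption: the offset is numeric; its unit is not documented.
    parser.add_argument("--offset", type=float, default=0, help="lrc offset")
    parser.add_argument("dir", nargs="+", help="one or more song folders")
    return parser


args = build_parser().parse_args(["--language", "en", "./raw/song1", "./raw/song2"])
print(args)
```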
Q&A
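Q: Why does the command line hang for a long time?
A: whisperx tries to connect to Hugging Face. If your network connection is poor, try setting a proxy for the command line. For example, in Windows PowerShell: $Env:http_proxy="http://127.0.0.1:7890";$Env:https_proxy="http://127.0.0.1:7890"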
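If the tool is launched from a Python script rather than an interactive shell, the same two environment variables can be set for the child process only. A minimal sketch, assuming the proxy address from the answer above; `proxied_env` is a hypothetical helper name.

```python
import os


def proxied_env(proxy_url, base_env=None):
    """Return a copy of the environment with http_proxy and https_proxy set."""
    env = dict(os.environ if base_env is None else base_env)
    env["http_proxy"] = proxy_url
    env["https_proxy"] = proxy_url
    return env


env = proxied_env("http://127.0.0.1:7890", base_env={"PATH": "/usr/bin"})
print(env)
```

The resulting mapping can be passed as the `env` argument of `subprocess.run`, which leaves the parent shell's settings untouched.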