No project description provided

Project description

README

介绍

借助whisperx生成卡拉ok歌词所需的k轴ass文件。注意，生成的准确性与输入质量相关，请确保lrc文件的时间戳准确。此外，使用UVR5处理后的人声文件作为输入会得到更准确的时间戳。

使用

可以在windows本地使用或者在 Colab 中运行

需要准备输入：

音频
lrc文件（需要以行为单位的时间戳）

之后在任意位置新建文件夹，放入这两个文件即可。

在 Colab 中运行

选择运行时为 GPU，在默认工作文件夹内新建raw文件夹，之后将音频文件与lrc文件放入其中。依次执行单元格即可。

在本地运行

安装依赖

conda install pytorch==2.0.0 torchaudio==2.0.0 pytorch-cuda=11.8 -c pytorch -c nvidia
pip install git+https://github.com/m-bain/whisperx.git
pip install whisperx-karaoke

运行

例如文件被放置在了 ./raw/song1 文件夹内

python -m whisperx_karaoke ./raw/song1
# usage: __main__.py [-h] [--device DEVICE] [--language LANGUAGE] [--offset OFFSET] dir [dir ...]
# __main__.py: error: the following arguments are required: dir

等待后ass文件会被输出到同一目录内。

其他参数

--device DEVICE

选择设备，默认为 cuda
--language LANGUAGE

选择语言，默认为 ja
--offset OFFSET

lrc 歌词的偏移量，默认为 0

Q&A

Q: 为什么命令行会长时间卡住？ A：whisperx 会尝试连接 huggingface，如果网络连接状况不好，请尝试为命令行设置代理。如windows powershell $Env:http_proxy="http://127.0.0.1:7890";$Env:https_proxy="http://127.0.0.1:7890"

Project details

Release history Release notifications | RSS feed

This version

0.1.1

May 3, 2024

0.1.0

May 1, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

whisperx_karaoke-0.1.1.tar.gz (5.9 kB view hashes)

Uploaded May 3, 2024 Source

Built Distribution

whisperx_karaoke-0.1.1-py3-none-any.whl (6.7 kB view hashes)

Uploaded May 3, 2024 Python 3

Hashes for whisperx_karaoke-0.1.1.tar.gz

Hashes for whisperx_karaoke-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`d5d6b37929eac5276a4c476f5d3f2c0b762effa9848ba7547680a98d03f112fc`
MD5	`57f639bdfba024a74792931975286806`
BLAKE2b-256	`fc6b60941dd2529eb78a27d631d43496b852d25b99f24019e93439f55cee1d21`

Hashes for whisperx_karaoke-0.1.1-py3-none-any.whl

Hashes for whisperx_karaoke-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f7d75b0f5f72d1a248bfba3e132994e26870070d0dda853c84457da9e1af19c8`
MD5	`786d312a9f4e066af764c6cc582cf4ee`
BLAKE2b-256	`6ce6fc96bd2bc911999e1f57fa88715703983b5d7f701006e318bde4de7b1cee`