Open In Colab

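Introduction

Generates the k-timed (karaoke-timed) ass files needed for karaoke lyrics, with the help of whisperx. Note that the accuracy of the output depends on the quality of the input, so make sure the timestamps in the lrc file are accurate. Using a vocal track isolated with UVR5 as the input also yields more accurate timestamps.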
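For readers unfamiliar with k-timing: in an ass subtitle file, karaoke timing is written as `{\k<n>}` override tags placed before each syllable, where `<n>` is the syllable duration in centiseconds. The sketch below illustrates that format only. The `to_k_line` helper and the sample timings are hypothetical and are not taken from whisperx_karaoke, whose actual output may differ.

```python
def to_k_line(syllables):
    """Build the text of an ASS karaoke line from (text, start, end) tuples.

    Times are in seconds; each \\k duration is written in centiseconds.
    Gaps between syllables are ignored in this simplified sketch.
    """
    parts = []
    for text, start, end in syllables:
        centiseconds = round((end - start) * 100)
        parts.append(f"{{\\k{centiseconds}}}{text}")
    return "".join(parts)


# Hypothetical timings, for illustration only.
sample = [("ka", 0.00, 0.25), ("ra", 0.25, 0.50), ("o", 0.50, 1.00), ("ke", 1.00, 1.30)]
print(to_k_line(sample))
```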

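Usage

The tool can be used locally on Windows or run in Colab.

Prepare the following inputs:

  1. An audio file
  2. An lrc file (with line-level timestamps)

Then create a new folder anywhere and put these two files in it.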
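Line-level timestamps in an lrc file conventionally look like `[mm:ss.xx]lyric text`. As a quick sanity check before running the tool, a minimal sketch such as the following can parse them. `parse_lrc` is a hypothetical helper, not part of whisperx_karaoke, and it assumes one timestamp at the start of each lyric line.

```python
import re

# Matches "[mm:ss.xx]text" or "[mm:ss]text"; metadata tags such as "[ti:...]" do not match.
LRC_LINE = re.compile(r"^\[(\d+):(\d{2}(?:\.\d+)?)\](.*)$")


def parse_lrc(text):
    """Return a list of (start_seconds, lyric) pairs from lrc text."""
    entries = []
    for raw in text.splitlines():
        match = LRC_LINE.match(raw.strip())
        if match is None:
            continue  # skip blank lines and metadata tags
        minutes, seconds, lyric = match.groups()
        entries.append((int(minutes) * 60 + float(seconds), lyric.strip()))
    return entries


sample = "[ti:Example]\n[00:12.50]first line\n[01:03.00]second line\n"
for start, lyric in parse_lrc(sample):
    print(f"{start:7.2f}s  {lyric}")
```

If the parsed start times are not in ascending order, or lines are missing, the lrc file probably needs fixing before it is used as input.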

Run in Colab

Set the runtime to GPU, create a raw folder in the default working directory, and put the audio file and the lrc file in it. Then run the cells in order.

Run locally

Install dependencies

conda install pytorch==2.0.0 torchaudio==2.0.0 pytorch-cuda=11.8 -c pytorch -c nvidia
pip install git+https://github.com/m-bain/whisperx.git
pip install whisperx-karaoke
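After installation, it can help to confirm that PyTorch sees a GPU, since the tool defaults to the cuda device. The sketch below is a convenience check under stated assumptions, not part of the package: `pick_device` is a hypothetical helper that falls back to cpu when PyTorch is missing or CUDA is unavailable. Whether the tool runs acceptably on cpu has not been verified here.

```python
import importlib.util


def pick_device():
    """Return "cuda" if PyTorch is installed and sees a GPU, else "cpu"."""
    if importlib.util.find_spec("torch") is None:
        return "cpu"  # PyTorch is not installed in this environment
    import torch

    return "cuda" if torch.cuda.is_available() else "cpu"


print(pick_device())
```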

Run

For example, if the files were placed in the ./raw/song1 folder:

python -m whisperx_karaoke ./raw/song1
# Running without a dir argument prints the usage and an error:
# usage: __main__.py [-h] [--device DEVICE] [--language LANGUAGE] [--offset OFFSET] dir [dir ...]
# __main__.py: error: the following arguments are required: dir

After a wait, the ass file is written to the same directory.
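The usage string accepts more than one directory (`dir [dir ...]`), so several songs can be passed in a single call. The following is a minimal sketch of assembling such a command from Python. `build_command` and `song_folders` are hypothetical helpers, and the layout of one subfolder per song under ./raw is an assumption based on the example path above.

```python
import sys
from pathlib import Path


def build_command(dirs):
    """Assemble the argument list for `python -m whisperx_karaoke dir [dir ...]`."""
    return [sys.executable, "-m", "whisperx_karaoke", *(str(d) for d in dirs)]


def song_folders(root):
    """Return the immediate subfolders of root, sorted by name."""
    root = Path(root)
    if not root.is_dir():
        return []
    return sorted(p for p in root.iterdir() if p.is_dir())


# Assumed layout: ./raw/<song>/ containing one audio file and one lrc file.
print(build_command(song_folders("raw")))
```

Passing the resulting list to `subprocess.run(..., check=True)` would launch the tool once for all folders.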

Other options

  • --device DEVICE

    Device to use. Defaults to cuda.

  • --language LANGUAGE

    Language to use. Defaults to ja.

  • --offset OFFSET

    Offset applied to the lrc lyrics. Defaults to 0.
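To make the defaults concrete, here is a sketch of an argparse parser that reproduces the usage string and the defaults listed above. It is a reconstruction for illustration, not the package's source. In particular, treating the offset as a float is an assumption, and its unit is not stated in this README.

```python
import argparse


def make_parser():
    """Rebuild a parser matching the usage string shown in this README."""
    parser = argparse.ArgumentParser(prog="__main__.py")
    parser.add_argument("--device", default="cuda")
    parser.add_argument("--language", default="ja")
    # Assumption: the offset is numeric; its unit is not documented here.
    parser.add_argument("--offset", type=float, default=0)
    parser.add_argument("dir", nargs="+")
    return parser


# Override one option and keep the other defaults.
args = make_parser().parse_args(["--device", "cpu", "./raw/song1", "./raw/song2"])
print(args.device, args.language, args.offset, args.dir)
```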

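Q&A

Q: Why does the command line hang for a long time?
A: whisperx tries to connect to Hugging Face. If your network connection is poor, try setting a proxy for the command line. For example, in Windows PowerShell:

$Env:http_proxy="http://127.0.0.1:7890";$Env:https_proxy="http://127.0.0.1:7890"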
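When the tool is launched from a Python script rather than an interactive shell, the same proxy variables can be set on the child process environment. This is a minimal sketch: `proxied_env` is a hypothetical helper, and the address is the example from the answer above, so substitute your own proxy.

```python
import os


def proxied_env(proxy_url, base_env=None):
    """Return a copy of the environment with http_proxy/https_proxy set."""
    env = dict(os.environ if base_env is None else base_env)
    env["http_proxy"] = proxy_url
    env["https_proxy"] = proxy_url
    return env


env = proxied_env("http://127.0.0.1:7890")
print(env["https_proxy"])
# Pass it to a child process, e.g. subprocess.run([...], env=env)
```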
