Skip to main content

Transcribe and translate voice into LRC file.

Project description

Open-Lyrics

Open-Lyrics is a open-source project to transcribe ( using faster-whisper) voice file and translate/polish (OpenAI-GPT) the text.

This new project is rapidly underway, and we welcome any issues or pull requests.

Installation

  1. Please install CUDA and cuDNN first according to https://opennmt.net/CTranslate2/installation.html to enable faster-whisper.

  2. Add your OpenAI API key to environment variable OPENAI_API_KEY.

  3. This project can be installed from PyPI:

    pip install openlrc
    

Usage

from openlrc import LRCer

lrcer = LRCer()
lrcer('./data/test.mp3')  # Generate ./data/test.lrc

Todo

  • Add transcribed examples.
    • Song
    • Podcast
    • Audiobook
  • Make translate prompt more robust.
  • Add local LLM support.
  • Multi-thead support for both whisper model and GPT request.
  • Automatically fix json encoder error using GPT.

Credits

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

openlrc-0.0.3.tar.gz (7.8 kB view hashes)

Uploaded Source

Built Distribution

openlrc-0.0.3-py3-none-any.whl (9.8 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page