Automatically create synchronised lyrics files in ASS and MidiCo LRC formats with word-level timestamps, using Whisper and lyrics from Genius and Spotify

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Lyrics Transcriber 🎶

Automatically create synchronised lyrics files in ASS and MidiCo LRC formats with word-level timestamps, using OpenAI Whisper and lyrics from Genius and Spotify, for convenience in use cases such as karaoke video production.

Features 🌟

Automatically transcribe lyrics with word-level timestamps.
Outputs lyrics in ASS and MidiCo LRC formats.
Can fetch lyrics from with Genius and Spotify.
Command Line Interface (CLI) for easy usage.
Can be included and used in other Python projects.

Installation 🛠️

Prerequisites

Python 3.9 or higher
[Optional] A Genius API token if you want to fetch lyrics from Genius

pip install lyrics-transcriber

Warning The package published to PyPI was created by manually editing poetry.lock to remove triton, as it is technically a sub-dependency from openai-whisper but is currently only supported on Linux (whisper still works fine without it, and I want this package to be usable on any platform)

Usage 🚀

As a standalone CLI

To transcribe lyrics from an audio file:

lyrics-transcriber /path/to/your/audiofile.mp3

To specify Genius API token, song artist, and song title for auto-correction:

lyrics-transcriber /path/to/your/audiofile.mp3 --genius_api_token YOUR_API_TOKEN --artist "Artist Name" --title "Song Title"

As a Python package in your project

Import LyricsTranscriber in your Python script:

from lyrics_transcriber import LyricsTranscriber

Create an instance and use it:

transcriber = LyricsTranscriber(audio_filepath='path_to_audio.mp3')
result_metadata = transcriber.generate()

result_metadata contains values as such:

result_metadata = {
    "whisper_json_filepath": str,
    "genius_lyrics": str,
    "genius_lyrics_filepath": str,
    "midico_lrc_filepath": str,
    "singing_percentage": int,
    "total_singing_duration": int,
    "song_duration": int,
}

Requirements 📋

Python >= 3.9
Python Poetry
Dependencies are listed in pyproject.toml

Local Development 💻

To work on the Lyrics Transcriber project locally, you need Python 3.9 or higher. It's recommended to create a virtual environment using poetry.

Clone the repo and cd into it.
Install poetry if you haven’t already.
Run poetry install to install the dependencies.
Run poetry shell to activate the virtual environment.

Contributing 🤝

Contributions are very much welcome! Please fork the repository and submit a pull request with your changes, and I'll try to review, merge and publish promptly!

This project is 100% open-source and free for anyone to use and modify as they wish.
If the maintenance workload for this repo somehow becomes too much for me I'll ask for volunteers to share maintainership of the repo, though I don't think that is very likely

License 📄

This project is licensed under the MIT License.

Credits 🙏

This project uses OpenAI Whisper for transcription, which inspired the entire tool!
Thanks to @linto-ai for the whisper-timestamped project which solved a big chunk for me.
Thanks to Genius for providing an API which makes fetching lyrics easier!

Contact 💌

For questions or feedback, please raise an issue or reach out to @beveradb (Andrew Beveridge) directly.

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

0.13.1

Jan 31, 2024

0.12.9

Nov 22, 2023

0.12.8

Nov 21, 2023

0.12.7

Nov 21, 2023

0.12.6

Nov 19, 2023

0.12.5

Nov 19, 2023

0.12.4

Nov 19, 2023

0.12.3

Nov 19, 2023

0.12.2

Nov 19, 2023

0.12.1

Nov 17, 2023

0.11.0

Nov 17, 2023

0.10.1

Nov 17, 2023

0.9.1

Nov 15, 2023

0.8.0

Oct 10, 2023

0.7.0

Oct 9, 2023

0.6.5

Aug 5, 2023

0.6.4

Jul 9, 2023

0.6.3

Jul 9, 2023

0.6.2

Jul 7, 2023

0.6.1

Jul 7, 2023

0.5.1

Jul 6, 2023

0.5.0

Jul 6, 2023

0.4.1

Jul 3, 2023

0.3.5

Jul 3, 2023

0.3.4

Jul 2, 2023

0.3.2

Jul 1, 2023

0.3.1

Jul 1, 2023

0.3.0

Jul 1, 2023

0.2.0

Jul 1, 2023

0.1.0

Jul 1, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lyrics_transcriber-0.13.1.tar.gz (39.3 kB view hashes)

Uploaded Jan 31, 2024 Source

Built Distribution

lyrics_transcriber-0.13.1-py3-none-any.whl (42.1 kB view hashes)

Uploaded Jan 31, 2024 Python 3

Hashes for lyrics_transcriber-0.13.1.tar.gz

Hashes for lyrics_transcriber-0.13.1.tar.gz
Algorithm	Hash digest
SHA256	`bcd95b3e22bcc32ffaf0855e9095a36155cb59b509eb3dd7446f345e9d20e075`
MD5	`a985cfd45df65adb2a7651b811c8d677`
BLAKE2b-256	`b5fea561844462f6c1826745ceca41aa4b408f3db0171390a8c8823e8f6bb044`

Hashes for lyrics_transcriber-0.13.1-py3-none-any.whl

Hashes for lyrics_transcriber-0.13.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`00abd241befdc4ae4550ee4996eaf699b1f3ff5fbd438b4c56a5bc91c46a4f72`
MD5	`e3949298aacbf3a315770a896db613fb`
BLAKE2b-256	`fd9d07e93c5c57984d7d487557d674ccec837dc257fef138e3ea52632dc4a4ed`