15 projects
whisper-timestamped
Multi-lingual Automatic Speech Recognition (ASR) based on Whisper models, with accurate word timestamps, access to language detection confidence, several options for Voice Activity Detection (VAD), and more.
audio-separator
Easy to use audio stem separation, using various models from UVR trained primarily by @Anjok07
karaoke-prep
Prepare for karaoke video creation, by downloading audio and lyrics for a specified song or playlist from youtube and separating audio stems. After syncing, finalise the video with a title screen!
lyrics-transcriber
Automatically create synchronised lyrics files in ASS and MidiCo LRC formats with word-level timestamps, using Whisper and lyrics from Genius and Spotify
karaoke-lyrics-processor
Process song lyrics to prepare them for karaoke video production, e.g. by splitting long lines
youtube-bulk-upload
Upload all videos in a folder to youtube, e.g. to help re-populate an unfairly terminated channel
karaoke-generator
Fully automated creation of _acceptable_ karaoke music videos from any music on YouTube, using open source tools and AI (e.g. Whisper and MDX-Net)
lyrics-converter
Tool to convert between different text-based lyrics formats (e.g. LRC, MidiCo LRC, ASS, TXT)
logo-diagram-generator
Generate SVG diagrams of a (tech) ecosystem, using logos from each tool organised into groups around a central logo
ultimatevocalremover
karaokenerds-requests-prep
Prepare for bulk karaoke video creation, by downloading audio and lyrics for top requests on karaokenerds.
lrc-adjuster
Simple CLI tool to adjust timestamps in LRC files.
fetch-lyrics-from-genius
A package to fetch lyrics from Genius.com
pysonofflan
Interface for Sonoff devices running original Itead firmware, in LAN mode.
migration-runner
Run MySQL migration scripts sequentially from a specified directory, keeping track of current version in the database.