Tools to make music videos
muvid
Tools to make music videos. Orchestrates the local
ecosystem (falaw, lookbook, lacing, an, mixing) into a
song-to-video pipeline. The user is the director; an agent (Claude in
the terminal, or the local web UI) drives the stages.
Status: v0. The pipeline (init → transcribe → align → cast → environments → script → render → compose) works end to end. Render strategies:
lipsync, image_to_video, text_to_video, animation, still. The CLI, the Claude skill (.claude/skills/muvid/), and a single-page local UI all dispatch to the same Python functions. See misc/docs/design.md for the full design rationale and misc/docs/alignment_references.md for the lyric-alignment literature muvid builds on.
Install
pip install -e ./muvid
pip install -e ./muvid[ui] # adds FastAPI + uvicorn for the web UI
This package depends on local sibling packages (falaw, lookbook,
lacing, mixing); install them editable first.
System: ffmpeg and ffprobe on PATH. Env: ELEVENLABS_API_KEY
(for transcription), FAL_KEY (for fal.ai generation).
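A quick preflight check can catch a missing binary or API key before a long run. The following is a minimal sketch, not part of muvid: `missing_prerequisites` and its `env` / `which` parameters are hypothetical names introduced here, and only the tool and variable names come from the requirements above.

```python
import os
import shutil
from typing import Callable, List, Mapping, Optional

# Names taken from the install notes above.
REQUIRED_TOOLS = ("ffmpeg", "ffprobe")
REQUIRED_ENV = ("ELEVENLABS_API_KEY", "FAL_KEY")


def missing_prerequisites(
    env: Optional[Mapping[str, str]] = None,
    which: Callable[[str], Optional[str]] = shutil.which,
) -> List[str]:
    """Return labels for every required tool or variable that is absent.

    Hypothetical helper: `env` and `which` are injectable so the check
    can be exercised without touching the real environment.
    """
    env = os.environ if env is None else env
    missing = [f"tool:{name}" for name in REQUIRED_TOOLS if which(name) is None]
    # An empty string counts as unset.
    missing += [f"env:{key}" for key in REQUIRED_ENV if not env.get(key)]
    return missing


if __name__ == "__main__":
    problems = missing_prerequisites()
    print("ok" if not problems else "missing: " + ", ".join(problems))
```

Run it once in the shell you intend to use for muvid; an empty result means both binaries resolve on PATH and both keys are set.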
30-second tour
# Bootstrap a project around a song.
muvid init ~/muvid/park-bench --song ~/Downloads/park_bench.mp3 --title "Park Bench"
# Transcribe to a draft lyrics.md (you'll edit it).
muvid transcribe ~/muvid/park-bench
# … you edit lyrics/lyrics.md to fix mishears and add [section] tags …
# Align lyrics.md against the transcript and write lyrics/alignment.annot.
muvid align ~/muvid/park-bench
# Cast a character: card, then images, then lookbook curation.
muvid character ~/muvid/park-bench maya --description "mid-30s, dark curly hair, wary eyes"
muvid character-generate ~/muvid/park-bench maya --n 6
muvid character-curate ~/muvid/park-bench maya --k 8
# Establish an environment.
muvid environment ~/muvid/park-bench park_bench --description "wooden park bench at dusk"
muvid environment-render ~/muvid/park-bench park_bench
# Write/edit script/script.md (let an agent draft it from the lyrics + cast),
# then sync it back into project.json:
muvid script-apply ~/muvid/park-bench
# Render every shot, then composite.
muvid render ~/muvid/park-bench
muvid compose ~/muvid/park-bench
# → ~/muvid/park-bench/output/final.mp4
# Or open the local UI (FastAPI + single HTML page).
muvid serve ~/muvid/park-bench
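The compose step at the end of the tour joins the rendered shots and lays the song over them. The exact command muvid runs is not shown here, so the sketch below only illustrates one common way to do this with ffmpeg's concat demuxer. The function names and the choice of stream copy for video are assumptions; the ffmpeg options themselves are standard.

```python
from pathlib import Path
from typing import List


def concat_list_text(clips: List[Path]) -> str:
    """Body of an ffmpeg concat-demuxer list file, one clip per line."""
    lines = []
    for clip in clips:
        # The demuxer expects a single quote inside a quoted path as '\''.
        escaped = str(clip).replace("'", "'\\''")
        lines.append(f"file '{escaped}'")
    return "\n".join(lines) + "\n"


def compose_command(list_file: Path, song: Path, output: Path) -> List[str]:
    """ffmpeg argv: video from the concatenated clips, audio from the song.

    Assumption: all clips share codec and resolution, so the video
    stream can be copied instead of re-encoded.
    """
    return [
        "ffmpeg", "-y",
        "-f", "concat", "-safe", "0", "-i", str(list_file),
        "-i", str(song),
        "-map", "0:v:0", "-map", "1:a:0",
        "-c:v", "copy", "-c:a", "aac",
        "-shortest",  # stop at the shorter of video and audio
        str(output),
    ]
```

Writing `concat_list_text(...)` to a file and passing `compose_command(...)` to `subprocess.run` would produce the final video; if the clips differ in codec or size, the video would need re-encoding rather than `-c:v copy`.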
How it fits the ecosystem
| Concern | Owner |
|---|---|
| AI media (TTS, image, video, lipsync, voice clone) | falaw |
| Reference image curation (LoRA-style sets) | lookbook |
| Timeline / interval annotations (lyrics, sections) | lacing |
| Structured 2D animation (cutout characters) | an |
| Audio/video editing + ElevenLabs Scribe | mixing |
| Project, pipeline, dispatcher | muvid |
muvid is the orchestrator: a folder layout (project.json + song/,
lyrics/, characters/, environments/, script/, shots/,
output/), a content-addressed cache (re-render only what changed),
and a uniform dispatch layer with three surfaces (CLI, skill, UI)
all calling the same Python functions in muvid.facade.
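To make the "re-render only what changed" idea concrete, here is a minimal sketch of a content-addressed cache key. It is an illustration under assumptions, not muvid's implementation: `cache_key`, `needs_render`, the choice of SHA-256, and the `<key>.mp4` naming are all hypothetical.

```python
import hashlib
import json
from pathlib import Path
from typing import List


def cache_key(spec: dict, input_files: List[Path]) -> str:
    """Hash the shot spec together with the bytes of every input file."""
    h = hashlib.sha256()
    # sort_keys makes the key independent of dict insertion order.
    h.update(json.dumps(spec, sort_keys=True, separators=(",", ":")).encode("utf-8"))
    for path in sorted(input_files):
        # Separate entries so file boundaries cannot blur together.
        h.update(b"\0" + path.name.encode("utf-8") + b"\0")
        h.update(path.read_bytes())
    return h.hexdigest()


def needs_render(spec: dict, input_files: List[Path], cache_dir: Path) -> bool:
    """True when no artifact is stored under the current key."""
    return not (cache_dir / f"{cache_key(spec, input_files)}.mp4").exists()
```

Because the key depends only on content, editing a prompt or replacing an anchor image yields a new key and a cache miss, while untouched shots keep hitting their existing artifacts.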
Render strategies
Each shot picks one. The dispatcher resolves shared inputs (audio slice, lyric lines that fall in the shot interval, character / env anchor images) once and hands them to the strategy:
| strategy | use it for | calls |
|---|---|---|
| lipsync | character singing on screen | falaw.animate_face |
| image_to_video | cinematic shot, env anchor as i2v seed | falaw.image_to_video |
| text_to_video | no anchor, pure prompt | falaw.text_to_video |
| animation | stylized 2D cutout | an.orchestrate |
| still | single image held for the duration | ffmpeg |
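The pattern described above, resolving shared inputs once and then handing them to the chosen strategy, can be sketched with a small registry. This is a toy illustration, not muvid's dispatcher: `Shot`, `LyricLine`, `Context`, `strategy`, and `dispatch` are hypothetical names, the strategy bodies return strings instead of rendering, and the half-open overlap rule is an assumption.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass(frozen=True)
class LyricLine:
    start: float
    end: float
    text: str


@dataclass(frozen=True)
class Shot:
    id: str
    strategy: str
    start: float
    end: float


@dataclass
class Context:
    """Shared inputs, resolved once per shot."""
    shot: Shot
    lyrics: List[LyricLine] = field(default_factory=list)


def lines_in_interval(lines: List[LyricLine], start: float, end: float) -> List[LyricLine]:
    """Lyric lines whose interval overlaps the half-open range [start, end)."""
    return [ln for ln in lines if ln.start < end and ln.end > start]


STRATEGIES: Dict[str, Callable[[Context], str]] = {}


def strategy(name: str):
    """Decorator that registers a render function under a strategy name."""
    def register(fn: Callable[[Context], str]) -> Callable[[Context], str]:
        STRATEGIES[name] = fn
        return fn
    return register


@strategy("still")
def render_still(ctx: Context) -> str:
    return f"still:{ctx.shot.id}"


@strategy("lipsync")
def render_lipsync(ctx: Context) -> str:
    words = " / ".join(ln.text for ln in ctx.lyrics)
    return f"lipsync:{ctx.shot.id}:{words}"


def dispatch(shot: Shot, all_lyrics: List[LyricLine]) -> str:
    """Build the context once, then call the strategy the shot picked."""
    if shot.strategy not in STRATEGIES:
        raise ValueError(f"unknown strategy: {shot.strategy!r}")
    ctx = Context(shot, lines_in_interval(all_lyrics, shot.start, shot.end))
    return STRATEGIES[shot.strategy](ctx)
```

Keeping input resolution in `dispatch` means each strategy stays a small function of the context, and adding a sixth strategy would only need one more registered function.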
The Claude skill
.claude/skills/muvid/SKILL.md walks Claude (or any agent that follows
Claude Code skills) through the eight stages. It will:
- run muvid status first to see where you are
- pick the next stage and offer to run it
- never re-transcribe after you've edited lyrics.md
- never --force a render without asking
- offer to draft script/script.md from your lyrics + cast
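The rules in this list amount to a small decision procedure. The sketch below restates them in Python purely for illustration; the skill itself is a Markdown file of instructions, and `next_stage`, `safe_to_run`, and their flags are hypothetical names. Only the stage order comes from the pipeline described in the status note.

```python
from typing import Iterable, Optional

# Stage order as listed in the status note.
STAGES = (
    "init", "transcribe", "align", "cast",
    "environments", "script", "render", "compose",
)


def next_stage(done: Iterable[str]) -> Optional[str]:
    """First stage, in pipeline order, that has not been completed."""
    finished = set(done)
    unknown = finished - set(STAGES)
    if unknown:
        raise ValueError(f"unknown stages: {sorted(unknown)}")
    for stage in STAGES:
        if stage not in finished:
            return stage
    return None  # everything is done


def safe_to_run(
    stage: str,
    *,
    lyrics_edited: bool = False,
    force: bool = False,
    user_confirmed: bool = False,
) -> bool:
    """Apply the two guard rules from the list above."""
    if stage == "transcribe" and lyrics_edited:
        return False  # would overwrite hand-edited lyrics.md
    if stage == "render" and force and not user_confirmed:
        return False  # a forced render needs explicit consent
    return True
```

For example, `next_stage(["init", "transcribe"])` returns `"align"`, and `safe_to_run("render", force=True)` stays false until the user has confirmed.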
Layout
muvid/
  __init__.py           public surface (the facade)
  __main__.py           CLI (argh)
  schema.py             ProjectSpec, ShotSpec, SectionSpec, …
  project.py            MusicVideoProject (folder facade)
  song.py               (probing via ffprobe lives in project.py)
  lyrics.py             transcribe + parse/render lyrics.md
  align.py              greedy token-match → lacing SqliteStore
  characters.py         cards + ref images + lookbook curation
  environments.py       cards + establishing-image generation
  script.py             script.md ↔ ShotSpec list
  render/
    __init__.py         dispatcher + RenderContext + caching
    lipsync.py          falaw.animate_face
    image_to_video.py   falaw.image_to_video
    text_to_video.py    falaw.text_to_video
    still.py            ffmpeg single-image loop
    animation.py        handoff to `an.orchestrate`
  compose.py            ffmpeg concat + overlay song audio
  facade.py             top-level verbs the CLI/skill/UI call
  ui/
    app.py              FastAPI app
    static/index.html   single-page UI
.claude/skills/muvid/SKILL.md
misc/docs/design.md     full design rationale
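The layout describes align.py as a greedy token match. muvid's actual matcher is not shown in this document, so the following is only a generic sketch of what greedy token matching between edited lyrics and a timestamped transcript can look like. `Word`, `norm`, `greedy_align`, and the `window` look-ahead are hypothetical, and writing the result to a lacing store is left out.

```python
import re
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Word:
    """One transcript word with its timestamps in seconds."""
    text: str
    start: float
    end: float


def norm(token: str) -> str:
    """Lowercase and drop everything except letters and digits."""
    return re.sub(r"[^a-z0-9]", "", token.lower())


def greedy_align(
    lyric_tokens: List[str],
    transcript: List[Word],
    window: int = 5,
) -> List[Optional[Word]]:
    """Match each lyric token to the first equal transcript word ahead.

    The search looks at most `window` words past the cursor and never
    moves backwards. Tokens with no match get None, so a caller could
    interpolate their times from matched neighbours.
    """
    aligned: List[Optional[Word]] = []
    cursor = 0
    for token in lyric_tokens:
        key = norm(token)
        match = None
        for i in range(cursor, min(cursor + window, len(transcript))):
            if key and norm(transcript[i].text) == key:
                match = transcript[i]
                cursor = i + 1  # consume the matched word
                break
        aligned.append(match)
    return aligned
```

With a transcript that misheard "sitting" as "sittin" and "a" as "the", the corrected lyric tokens for those two words stay unmatched while "on", "park", and "bench" pick up their transcript timestamps.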