AI-powered media transcription with Claude Code skills
Project description
omni-captions-skills
Captions Made Easy — Claude Code Caption Skills
"I need bilingual captions for this Fireship vibe coding video https://youtube.com/watch?v=Tw18-4U7mts"
One sentence. Claude handles the download, transcription, and translation.
Install
npx skills add https://github.com/lattifai/omni-captions-skills
Other installation methods
Claude Code Plugin System:
/plugin marketplace add lattifai/omni-captions-skills
/plugin install omnicaptions@lattifai-omni-captions-skills
Local Development:
git clone https://github.com/lattifai/omni-captions-skills.git
claude --plugin-dir ./omni-captions-skills
Try It
❯ Make bilingual captions for this Fireship vibe coding video https://youtube.com/watch?v=Tw18-4U7mts
1
00:00:00,000 --> 00:00:03,200
Mass hysteria satisfies a deep human need.
群体性癔症满足了人类某种深层需求。
2
00:00:03,200 --> 00:00:07,440
Vibe coding is programming without actually writing any code yourself.
Vibe coding 就是不用自己写代码的编程方式。
Skills
| Skill | Description |
|---|---|
transcribe |
YouTube/video → Markdown with timestamps |
translate |
Translate captions, bilingual output supported |
convert |
Convert between 30+ caption formats |
download |
Download YouTube video/audio/captions |
LaiCut |
Forced alignment, word-level timing accuracy |
Invoke via
/omnicaptions:transcribeor/omnicaptions-transcribe
LaiCut: Precise Timing
Standard transcription gives "approximate" timestamps. LaiCut uses LattifAI Lattice-1 model to match text precisely to audio waveforms, achieving word-level accuracy.
Install LaiCut:
# Using uv (recommended, auto-configures package index)
uv pip install "omni-captions-skills[laicut]"
# Using pip
pip install "omni-captions-skills[laicut]" --extra-index-url https://lattifai.github.io/pypi/simple/
Supported languages: English, Chinese, German, and mixed
Recommended workflow: Align before translate (translated text doesn't match original audio)
Setup
| Feature | API Key | Note |
|---|---|---|
| Translation | None required | Uses Claude by default, works out of the box |
| Transcription | Gemini API | Optional, only needed for transcription |
| LaiCut alignment | LattifAI API | Optional, only needed for precise alignment |
Gemini is only used for video transcription. When a video has no captions, you'll be prompted whether to transcribe — configure then. Translation uses Claude by default, works out of the box.
API keys are prompted automatically and saved to ~/.config/omnicaptions/config.json
OmniCaptions CLI Usage
# With captions: download → align → translate
omnicaptions download "https://youtube.com/watch?v=xxx"
omnicaptions LaiCut video.mp4 video.en.vtt -o video_LaiCut.srt
omnicaptions translate video_LaiCut.srt -l zh --bilingual
# Without captions: transcribe → align → translate
omnicaptions transcribe video.mp4
omnicaptions LaiCut video.mp4 video_GeminiUnd.md -o video_LaiCut.srt
omnicaptions translate video_LaiCut.srt -l zh --bilingual
Credits: @dotey for the transcription prompt | Built on lattifai-captions
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file omni_captions_skills-0.1.1.tar.gz.
File metadata
- Download URL: omni_captions_skills-0.1.1.tar.gz
- Upload date:
- Size: 2.6 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d52909a47314f217d5b363696a08c97d3dd72938ee55009d9887128211a9dda3
|
|
| MD5 |
9047e1e41072d1603807f5e9a38f60b1
|
|
| BLAKE2b-256 |
005810e3875bb8b37f34f70d561b8bfc74d89f5be9be32d1e7d7d7b11baa12d9
|
Provenance
The following attestation bundles were made for omni_captions_skills-0.1.1.tar.gz:
Publisher:
publish.yml on lattifai/omni-captions-skills
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
omni_captions_skills-0.1.1.tar.gz -
Subject digest:
d52909a47314f217d5b363696a08c97d3dd72938ee55009d9887128211a9dda3 - Sigstore transparency entry: 909875479
- Sigstore integration time:
-
Permalink:
lattifai/omni-captions-skills@e4d238900317238206e6048128f08931eb6bf9cd -
Branch / Tag:
refs/tags/v0.1.1 - Owner: https://github.com/lattifai
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@e4d238900317238206e6048128f08931eb6bf9cd -
Trigger Event:
push
-
Statement type: