Skip to main content

AI-powered media transcription with Claude Code skills

Project description

omni-captions-skills

Captions Made Easy — Claude Code Caption Skills

"I need bilingual captions for this Fireship vibe coding video https://youtube.com/watch?v=Tw18-4U7mts"

One sentence. Claude handles the download, transcription, and translation.

中文文档

Install

npx skills add https://github.com/lattifai/omni-captions-skills
Other installation methods

Claude Code Plugin System:

/plugin marketplace add lattifai/omni-captions-skills
/plugin install omnicaptions@lattifai-omni-captions-skills

Local Development:

git clone https://github.com/lattifai/omni-captions-skills.git
claude --plugin-dir ./omni-captions-skills

Try It

❯ Make bilingual captions for this Fireship vibe coding video https://youtube.com/watch?v=Tw18-4U7mts
1
00:00:00,000 --> 00:00:03,200
Mass hysteria satisfies a deep human need.
群体性癔症满足了人类某种深层需求。

2
00:00:03,200 --> 00:00:07,440
Vibe coding is programming without actually writing any code yourself.
Vibe coding 就是不用自己写代码的编程方式。

Skills

Skill Description
transcribe YouTube/video → Markdown with timestamps
translate Translate captions, bilingual output supported
convert Convert between 30+ caption formats
download Download YouTube video/audio/captions
LaiCut Forced alignment, word-level timing accuracy

Invoke via /omnicaptions:transcribe or /omnicaptions-transcribe

LaiCut: Precise Timing

Standard transcription gives "approximate" timestamps. LaiCut uses LattifAI Lattice-1 model to match text precisely to audio waveforms, achieving word-level accuracy.

Install LaiCut:

# Using uv (recommended, auto-configures package index)
uv pip install "omni-captions-skills[laicut]"

# Using pip
pip install "omni-captions-skills[laicut]" --extra-index-url https://lattifai.github.io/pypi/simple/

Supported languages: English, Chinese, German, and mixed

Recommended workflow: Align before translate (translated text doesn't match original audio)

Setup

Feature API Key Note
Translation None required Uses Claude by default, works out of the box
Transcription Gemini API Optional, only needed for transcription
LaiCut alignment LattifAI API Optional, only needed for precise alignment

Gemini is only used for video transcription. When a video has no captions, you'll be prompted whether to transcribe — configure then. Translation uses Claude by default, works out of the box.

API keys are prompted automatically and saved to ~/.config/omnicaptions/config.json

OmniCaptions CLI Usage

# With captions: download → align → translate
omnicaptions download "https://youtube.com/watch?v=xxx"
omnicaptions LaiCut video.mp4 video.en.vtt -o video_LaiCut.srt
omnicaptions translate video_LaiCut.srt -l zh --bilingual

# Without captions: transcribe → align → translate
omnicaptions transcribe video.mp4
omnicaptions LaiCut video.mp4 video_GeminiUnd.md -o video_LaiCut.srt
omnicaptions translate video_LaiCut.srt -l zh --bilingual

Credits: @dotey for the transcription prompt | Built on lattifai-captions

MIT License

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

omni_captions_skills-0.1.1.tar.gz (2.6 MB view details)

Uploaded Source

File details

Details for the file omni_captions_skills-0.1.1.tar.gz.

File metadata

  • Download URL: omni_captions_skills-0.1.1.tar.gz
  • Upload date:
  • Size: 2.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for omni_captions_skills-0.1.1.tar.gz
Algorithm Hash digest
SHA256 d52909a47314f217d5b363696a08c97d3dd72938ee55009d9887128211a9dda3
MD5 9047e1e41072d1603807f5e9a38f60b1
BLAKE2b-256 005810e3875bb8b37f34f70d561b8bfc74d89f5be9be32d1e7d7d7b11baa12d9

See more details on using hashes here.

Provenance

The following attestation bundles were made for omni_captions_skills-0.1.1.tar.gz:

Publisher: publish.yml on lattifai/omni-captions-skills

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page