Skip to main content

AI-powered media transcription with Claude Code skills

Project description

omni-captions-skills

Captions Made Easy — Claude Code Caption Skills

"I need bilingual captions for this Fireship vibe coding video https://youtube.com/watch?v=Tw18-4U7mts"

One sentence. Claude handles the download, transcription, and translation.

中文文档

Install

npx skills add https://github.com/lattifai/omni-captions-skills
Other installation methods

Claude Code Plugin System:

/plugin marketplace add lattifai/omni-captions-skills
/plugin install omnicaptions@lattifai-omni-captions-skills

Local Development:

git clone https://github.com/lattifai/omni-captions-skills.git
claude --plugin-dir ./omni-captions-skills

Try It

❯ Make bilingual captions for this Fireship vibe coding video https://youtube.com/watch?v=Tw18-4U7mts
1
00:00:00,000 --> 00:00:03,200
Mass hysteria satisfies a deep human need.
群体性癔症满足了人类某种深层需求。

2
00:00:03,200 --> 00:00:07,440
Vibe coding is programming without actually writing any code yourself.
Vibe coding 就是不用自己写代码的编程方式。

Skills

Skill Description
transcribe YouTube/video → Markdown with timestamps
translate Translate captions, bilingual output supported
convert Convert between 30+ caption formats
download Download YouTube video/audio/captions
LaiCut Forced alignment, word-level timing accuracy

Invoke via /omnicaptions:transcribe or /omnicaptions-transcribe

LaiCut: Precise Timing

Standard transcription gives "approximate" timestamps. LaiCut uses LattifAI Lattice-1 model to match text precisely to audio waveforms, achieving word-level accuracy.

Supported languages: English, Chinese, German, and mixed

Recommended workflow: Align before translate (translated text doesn't match original audio)

Setup

Feature API Key Note
Translation None required Uses Claude by default, works out of the box
Transcription Gemini API Optional, only needed for transcription
LaiCut alignment LattifAI API Optional, only needed for precise alignment

Gemini is only used for video transcription. When a video has no captions, you'll be prompted whether to transcribe — configure then. Translation uses Claude by default, works out of the box.

API keys are prompted automatically and saved to ~/.config/omnicaptions/config.json

OmniCaptions CLI Usage

# With captions: download → align → translate
omnicaptions download "https://youtube.com/watch?v=xxx"
omnicaptions LaiCut video.mp4 video.en.vtt -o video_LaiCut.srt
omnicaptions translate video_LaiCut.srt -l zh --bilingual

# Without captions: transcribe → align → translate
omnicaptions transcribe video.mp4
omnicaptions LaiCut video.mp4 video_GeminiUnd.md -o video_LaiCut.srt
omnicaptions translate video_LaiCut.srt -l zh --bilingual

Credits: @dotey for the transcription prompt | Built on lattifai-captions

MIT License

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

omni_captions_skills-0.1.0.tar.gz (2.6 MB view details)

Uploaded Source

File details

Details for the file omni_captions_skills-0.1.0.tar.gz.

File metadata

  • Download URL: omni_captions_skills-0.1.0.tar.gz
  • Upload date:
  • Size: 2.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for omni_captions_skills-0.1.0.tar.gz
Algorithm Hash digest
SHA256 423b28f868c3f10b556e1dfa3f577f580e3874430721e54783c17ead1636fb67
MD5 49c37f767cc2bd6e28fa7da54b998b00
BLAKE2b-256 19d75af692755e0e38e681d06c6dcceec40c5a8f1b749376b3e9c8c6ae90f2ed

See more details on using hashes here.

Provenance

The following attestation bundles were made for omni_captions_skills-0.1.0.tar.gz:

Publisher: publish.yml on lattifai/omni-captions-skills

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page