High-quality transcription, formatting, and analysis of videos and podcasts
deep-transcribe
Takes a video or audio URL (such as a YouTube link), downloads and caches it, and performs a "deep transcription" of it: a full transcript with speakers identified, sections, timestamps, and annotations added, and frame captures inserted.
By default this needs API keys for Deepgram and Anthropic (Claude).
This is built on kash and its kash-media kit of tools for handling videos.
Usage
See env.template for how to set up DEEPGRAM_API_KEY and ANTHROPIC_API_KEY.
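Before running a transcription, it can help to confirm that both keys are actually visible in the environment. The following is a minimal sketch, not part of deep_transcribe: the two variable names come from this README, while the `missing_keys` helper and `REQUIRED_KEYS` are hypothetical names used only for illustration.

```python
import os
from collections.abc import Mapping

# Variable names are taken from this README; the helper itself is hypothetical.
REQUIRED_KEYS = ("DEEPGRAM_API_KEY", "ANTHROPIC_API_KEY")


def missing_keys(env: Mapping[str, str] = os.environ) -> list[str]:
    """Return the required variable names that are unset or blank in env."""
    return [name for name in REQUIRED_KEYS if not env.get(name, "").strip()]


if __name__ == "__main__":
    missing = missing_keys()
    if missing:
        print("Missing API keys:", ", ".join(missing))
    else:
        print("Both API keys are set.")
```

Passing the environment as a mapping keeps the check easy to exercise without touching the real process environment.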
uv tool install --upgrade deep_transcribe
# Pick a YouTube video, and do a basic, formatted, or fully annotated transcription:
deep_transcribe transcribe https://www.youtube.com/watch?v=ihaB8AFOhZo
deep_transcribe transcribe_format https://www.youtube.com/watch?v=ihaB8AFOhZo
deep_transcribe transcribe_annotate https://www.youtube.com/watch?v=ihaB8AFOhZo
Results will be in the ./transcriptions directory.
To run as an MCP server:
# In stdio mode:
deep_transcribe mcp
# In SSE mode at 127.0.0.1:4440:
deep_transcribe mcp --sse
Or for Claude Desktop, a config like this should work (replace the command path with the location of deep_transcribe on your machine):
{
  "mcpServers": {
    "deep_transcribe": {
      "command": "/Users/levy/.local/bin/deep_transcribe",
      "args": ["mcp"]
    }
  }
}
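Because the command path differs from machine to machine, one option is to generate the config from wherever deep_transcribe is installed. This is a rough sketch under that assumption: `claude_desktop_config` is a hypothetical helper, and the structure it emits simply mirrors the config shown above.

```python
import json
import shutil


def claude_desktop_config(command: str) -> dict:
    """Build the mcpServers entry from this README for a given executable path."""
    return {
        "mcpServers": {
            "deep_transcribe": {
                "command": command,
                "args": ["mcp"],
            }
        }
    }


if __name__ == "__main__":
    # shutil.which returns None when the executable is not on PATH.
    path = shutil.which("deep_transcribe")
    if path is None:
        print("deep_transcribe not found on PATH; install it first")
    else:
        print(json.dumps(claude_desktop_config(path), indent=2))
```

The printed JSON can then be merged into the Claude Desktop configuration by hand.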
To debug MCP logs:
deep_transcribe mcp --logs
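When debugging SSE mode, a quick first check is whether anything is listening at the address given above (127.0.0.1:4440). The sketch below only tests that a TCP connection can be opened; it does not speak MCP, and `port_is_open` is a hypothetical helper rather than part of deep_transcribe.

```python
import socket


def port_is_open(host: str = "127.0.0.1", port: int = 4440, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    state = "listening" if port_is_open() else "not reachable"
    print(f"127.0.0.1:4440 is {state}")
```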
Project Docs
For how to install uv and Python, see installation.md.
For development workflows, see development.md.
For instructions on publishing to PyPI, see publishing.md.
This project was built from simple-modern-uv.