TUI-first tool for transcribing, translating, and analyzing audio/video media

Maintainers

spetros

mediascribe

TUI-first tool for transcribing, translating, and analyzing audio/video media.

mediascribe takes audio or video files and produces transcriptions, translations, subtitles, and AI-powered analysis. It supports local (faster-whisper) and cloud (OpenAI) transcription, speaker diarization, multi-language translation, and customizable prompt profiles.

Install

From PyPI (recommended)

pipx install mediascribe          # isolated install
pip install mediascribe            # or into current environment

With optional extras

pip install "mediascribe[tui]"       # Textual TUI interface
pip install "mediascribe[diarize]"   # speaker diarization (pyannote.audio)
pip install "mediascribe[all]"       # everything (quotes keep shells like zsh from globbing the brackets)

From Homebrew (after first PyPI release)

brew tap shawnpetros/mediascribe
brew install mediascribe

Note: This requires the tap repo to exist; see Publishing > One-time setup for setup instructions.

From source

git clone https://github.com/shawnpetros/mediascribe.git
cd mediascribe
make install                       # editable install with dev tools

Requirements

Python 3.12+
FFmpeg 6+ — install via brew install ffmpeg or apt install ffmpeg
OpenAI API key — for translation and API transcription mode

Quick Start

# Transcribe a video (auto-detects language)
mediascribe transcribe video.mp4

# Transcribe Japanese audio → English subtitles
mediascribe transcribe podcast.mp3 --lang ja --translate en

# Use the anime profile with multiple output formats
mediascribe transcribe anime.mkv --translate en --profile anime --formats srt,vtt

# Translate an existing SRT file
mediascribe translate subtitles.srt --target en --profile anime

# Batch process a folder
mediascribe batch ./recordings/ --translate en --formats srt,txt,json

# Enable speaker diarization and AI analysis
mediascribe transcribe meeting.mp4 --diarize --analyze --formats srt,txt,json

# Launch the interactive TUI
mediascribe tui

Configuration

# Show current settings
mediascribe config show

# Set your API key
mediascribe config set openai_api_key sk-...

# Initialize config directory with profile templates
mediascribe config init

# List available profiles
mediascribe config profiles

Configuration is loaded from (highest priority first):

CLI flags
Environment variables (MEDIASCRIBE_*)
.env file in working directory
~/.config/mediascribe/config.toml
Built-in defaults
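The precedence order above can be sketched as a layered lookup. This is a minimal illustration, not mediascribe's actual loader: the function names, the default values, and the assumption that .env entries carry the same MEDIASCRIBE_ prefix as environment variables are all hypothetical.

```python
from collections import ChainMap

# Hypothetical defaults; the real built-in values are not listed in this README.
DEFAULTS = {"mode": "local", "model": "large-v3", "formats": "srt"}


def env_layer(environ, prefix="MEDIASCRIBE_"):
    """Turn MEDIASCRIBE_FOO=bar into {"foo": "bar"}, ignoring unrelated variables."""
    return {
        key[len(prefix):].lower(): value
        for key, value in environ.items()
        if key.startswith(prefix)
    }


def resolve(cli, environ, dotenv, config_file, defaults=DEFAULTS):
    """Merge the five layers; the earliest layer that defines a key wins."""
    # CLI flags left unset (None) must not shadow lower-priority layers.
    cli_set = {key: value for key, value in cli.items() if value is not None}
    layers = ChainMap(
        cli_set,             # 1. CLI flags
        env_layer(environ),  # 2. environment variables
        env_layer(dotenv),   # 3. .env file (already parsed into a dict)
        config_file,         # 4. ~/.config/mediascribe/config.toml
        defaults,            # 5. built-in defaults
    )
    return dict(layers)


settings = resolve(
    cli={"model": None, "formats": "srt,vtt"},
    environ={"MEDIASCRIBE_MODEL": "small", "PATH": "/usr/bin"},
    dotenv={"MEDIASCRIBE_MODE": "api"},
    config_file={"model": "medium", "mode": "local"},
)
print(settings)
```

In this example the formats value comes from the CLI, the model from the environment, and the mode from the .env file, because each of those layers outranks config.toml and the defaults.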

Profiles

Profiles are named config presets. Built-in profiles: general, anime, podcast, meeting.

Create custom profiles as TOML files in ~/.config/mediascribe/profiles/:

# ~/.config/mediascribe/profiles/lectures.toml
description = "University lecture transcription"

[transcription]
mode = "local"
model = "large-v3"

[translation]
target_language = "en"
enable_review = true
custom_instructions = """
Preserve technical terminology accurately.
Format mathematical expressions clearly.
"""

[output]
formats = ["srt", "txt", "json"]

Development

git clone https://github.com/shawnpetros/mediascribe.git
cd mediascribe
make install          # install editable + dev deps
make test             # run test suite (184 tests)
make lint             # run ruff linter
make format           # auto-format code
make typecheck        # run mypy
make check            # lint + format + typecheck + tests
make build            # build sdist + wheel
make help             # show all targets

Make Targets

Target	Description
`make install`	Install package in editable mode with dev extras
`make install-all`	Install with all optional extras (tui, diarize, dev)
`make test`	Run test suite
`make test-cov`	Run tests with coverage report
`make lint`	Run ruff linter
`make format`	Auto-format code with ruff
`make typecheck`	Run mypy type checker
`make check`	Run all checks (lint + format + types + tests)
`make build`	Build sdist and wheel
`make build-check`	Build and validate distribution with twine
`make publish-test`	Publish to TestPyPI
`make publish`	Publish to PyPI
`make clean`	Remove all build/cache artifacts
`make version`	Show current package version

Publishing

Fully automated (recommended)

The entire release pipeline is automated. To ship a new version:

# 1. Bump version in both files
#    - pyproject.toml:  version = "0.2.0"
#    - src/mediascribe/__init__.py:  __version__ = "0.2.0"
# 2. Commit and merge to main
git add -A && git commit -m "release: v0.2.0"
git push

What happens automatically on merge to main:

release.yml detects the version change in pyproject.toml and creates a v0.2.0 tag
publish.yml triggers on the new tag:
- Runs full CI (tests, lint, typecheck)
- Builds sdist + wheel
- Smoke tests the built wheel (CLI loads, commands respond)
- Publishes to PyPI via trusted publisher (OIDC)
- Creates a GitHub Release with auto-generated notes
- Updates the Homebrew tap formula with the new version + SHA256

One-time setup

PyPI: Configure trusted publisher in your PyPI project settings to trust the publish.yml workflow from your GitHub repo.

Homebrew tap:

Create a GitHub repo named shawnpetros/homebrew-mediascribe with a Formula/ directory
Add a repo secret HOMEBREW_TAP_TOKEN in the mediascribe repo — a personal access token with repo scope on the tap repo
Optionally set a repo variable HOMEBREW_TAP_REPO if the tap is at a different path (defaults to shawnpetros/homebrew-mediascribe)

After setup, users install via:

brew tap shawnpetros/mediascribe
brew install mediascribe

Manual publishing

make build-check     # build + validate
make publish-test    # upload to TestPyPI first
make publish         # upload to PyPI

Architecture

Input File(s)
    │
    ▼
[Detect] → file type, duration, codec
    │
    ▼
[Normalize] → 16kHz mono WAV
    │
    ▼
[Transcribe] → segments (overlap-chunked + validated + deduped)
    │
    ├──▶ [Diarize] → speaker labels (optional)
    │
    ▼
[Timing] → subtitle timing optimization
    │
    ▼
[Translate] → target language (optional, batched + context overlap)
    │
    ▼
[Review] → AI quality check (optional)
    │
    ▼
[Analyze] → summary, topics, action items (optional)
    │
    ▼
[Export] → SRT, VTT, TXT, JSON
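Two of the stages above are easy to illustrate in isolation. The sketch below builds an FFmpeg command that produces 16 kHz mono WAV (the Normalize target) and computes overlapping chunk windows of the kind the Transcribe stage describes. The function names, the 30-second chunk length, and the 2-second overlap are assumptions for illustration; the README does not state the values or the code mediascribe uses.

```python
def normalize_command(src, dst):
    """FFmpeg arguments for converting any input to 16 kHz mono WAV."""
    return [
        "ffmpeg",
        "-y",            # overwrite the output file if it exists
        "-i", src,       # input audio or video file
        "-vn",           # drop any video stream
        "-ac", "1",      # one audio channel (mono)
        "-ar", "16000",  # 16 kHz sample rate
        dst,
    ]


def chunk_windows(duration, chunk=30.0, overlap=2.0):
    """Split [0, duration] into (start, end) windows that overlap by `overlap` seconds."""
    if chunk <= overlap:
        raise ValueError("chunk must be longer than overlap")
    windows = []
    start = 0.0
    step = chunk - overlap
    while start < duration:
        end = min(start + chunk, duration)
        windows.append((start, end))
        if end >= duration:
            break
        start += step
    return windows


print(normalize_command("video.mp4", "audio.wav"))
print(chunk_windows(70.0, chunk=30.0, overlap=5.0))
```

The command list can be passed to subprocess.run. Overlapping windows mean that speech cut off at one chunk boundary appears whole in the next chunk, which is why a deduplication step is needed afterwards.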

See docs/SPEC.md for the full specification and docs/PROJECT.md for implementation status.

License

MIT

Release history

0.10.1 (Mar 4, 2026)
0.9.0 (Mar 2, 2026)
0.2.1 (Feb 27, 2026, this version)
0.2.0 (Feb 26, 2026)
0.1.0 (Feb 26, 2026)

Download files

Source Distribution

mediascribe-0.2.1.tar.gz (76.5 kB)

Uploaded Feb 27, 2026 Source

Built Distribution

mediascribe-0.2.1-py3-none-any.whl (63.4 kB)

Uploaded Feb 27, 2026 Python 3

File details

Details for the file mediascribe-0.2.1.tar.gz.

File metadata

Download URL: mediascribe-0.2.1.tar.gz
Upload date: Feb 27, 2026
Size: 76.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mediascribe-0.2.1.tar.gz
Algorithm	Hash digest
SHA256	`d77cedb36984fcb98da2adfbcee7e59cd94d1d2dbfe1908ff82fb5aadf919d2b`
MD5	`b5c689f1da3deef01155e1ce63e5b7ec`
BLAKE2b-256	`08dd1389d7c37499a3ec7100defffcde87a3850855e5120660fcbaa037c0bd26`
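A downloaded archive can be checked against the SHA256 digest listed above using only the standard library. This is a generic sketch; the function names are illustrative and the local file path is whatever location the file was saved to.

```python
import hashlib

# SHA256 digest for mediascribe-0.2.1.tar.gz, copied from the table above.
EXPECTED_SHA256 = "d77cedb36984fcb98da2adfbcee7e59cd94d1d2dbfe1908ff82fb5aadf919d2b"


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in fixed-size blocks so large files are not loaded at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for block in iter(lambda: handle.read(chunk_size), b""):
            digest.update(block)
    return digest.hexdigest()


def verify(path, expected=EXPECTED_SHA256):
    """True when the file's SHA256 matches the expected hex digest."""
    return sha256_of(path) == expected.lower()
```

Calling verify("mediascribe-0.2.1.tar.gz") on the downloaded file should return True if the download is intact.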

Provenance

The following attestation bundles were made for mediascribe-0.2.1.tar.gz:

Publisher: publish.yml on shawnpetros/mediascribe

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: mediascribe-0.2.1.tar.gz
- Subject digest: d77cedb36984fcb98da2adfbcee7e59cd94d1d2dbfe1908ff82fb5aadf919d2b
- Sigstore transparency entry: 1002680679
- Sigstore integration time: Feb 27, 2026
Source repository:
- Permalink: shawnpetros/mediascribe@cc5e92dd2d3bb7841c164827437d95bb658f0758
- Branch / Tag: refs/heads/main
- Owner: https://github.com/shawnpetros
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@cc5e92dd2d3bb7841c164827437d95bb658f0758
- Trigger Event: workflow_dispatch

File details

Details for the file mediascribe-0.2.1-py3-none-any.whl.

File metadata

Download URL: mediascribe-0.2.1-py3-none-any.whl
Upload date: Feb 27, 2026
Size: 63.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mediascribe-0.2.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e0bf634f895e8a07aab1b1bdf2ca84b82e6e934b24277ecf725ec0f8cae9b7e6`
MD5	`a4c2375802ba76ed4b10710e1092087d`
BLAKE2b-256	`ff42bfab61438e1255ce45a8ed50ba29211fe8e8f2cc80d6c217cbd4f845741e`

Provenance

The following attestation bundles were made for mediascribe-0.2.1-py3-none-any.whl:

Publisher: publish.yml on shawnpetros/mediascribe

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: mediascribe-0.2.1-py3-none-any.whl
- Subject digest: e0bf634f895e8a07aab1b1bdf2ca84b82e6e934b24277ecf725ec0f8cae9b7e6
- Sigstore transparency entry: 1002680735
- Sigstore integration time: Feb 27, 2026
Source repository:
- Permalink: shawnpetros/mediascribe@cc5e92dd2d3bb7841c164827437d95bb658f0758
- Branch / Tag: refs/heads/main
- Owner: https://github.com/shawnpetros
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@cc5e92dd2d3bb7841c164827437d95bb658f0758
- Trigger Event: workflow_dispatch
