Skip to main content

A modular transcription CLI with pluggable providers. Agent-friendly.

Project description

transcribe-cli

A modular audio transcription CLI with pluggable providers. Agent-friendly.

Currently supports AssemblyAI with speaker diarization, sentiment analysis, entity detection, auto chapters, summarization, topic detection, and more.

Installation

# With uv (recommended)
uv tool install transcriber-cli

# With pip
pip install transcriber-cli

# Run without installing
uvx --from transcriber-cli transcribe audio.mp3

Or install from source:

git clone https://github.com/miguelarios/transcribe-cli.git
cd transcribe-cli
uv tool install .

Setup

Get an API key from the AssemblyAI Dashboard and set it:

export ASSEMBLYAI_API_KEY='your_key_here'

Or pass it directly: transcribe --api-key <key> audio.mp3

Usage

# Basic transcription (text output to stdout)
transcribe interview.mp3

# JSON output
transcribe interview.mp3 -f json

# With speaker diarization
transcribe meeting.wav --speaker-labels

# SRT subtitles to file
transcribe lecture.mp3 -f srt -o lecture.srt

# Full-featured transcription
transcribe call.m4a --speaker-labels --entities --sentiment -f json

# Pipe JSON to jq
transcribe interview.mp3 -f json | jq '.segments[] | .speaker, .text'

# Agent-friendly summary (saves full transcript to temp file)
transcribe meeting.mp3 --summary

# Transcribe from URL
transcribe https://example.com/audio.mp3

Output Formats

Format Flag Description
text -f text (default) Timestamped text with optional speaker labels
json -f json Full structured data (segments, metadata, entities)
srt -f srt SRT subtitle format
vtt -f vtt WebVTT subtitle format

All Options

Run transcribe -h for the full list. Key options:

Speaker/Diarization

  • --speaker-labels — Enable speaker diarization
  • --speakers-expected N — Exact speaker count
  • --min-speakers N --max-speakers M — Speaker range
  • --speaker-id-type [role|name] — Speaker identification mode
  • --speaker-names NAME — Known speaker names (repeatable)

Analysis

  • --entities — Detect names, locations, dates, etc.
  • --sentiment — Sentiment analysis per utterance
  • --topics — IAB topic detection
  • --auto-chapters — Generate chapters with headlines
  • --summarize — Generate transcript summary
  • --content-safety — Content moderation

Language

  • --language CODE — Language code (e.g., en_us)
  • --language-detection — Auto-detect language

Advanced

  • --prompt TEXT — Context prompt for transcription
  • --keyterms TERM — Domain-specific terms to boost (repeatable)
  • --multichannel — Multichannel transcription
  • --redact-pii — Redact personally identifiable information
  • --filter-profanity — Filter profanity
  • --disfluencies — Include filler words (um, uh)

Output

  • -f, --format [text|json|srt|vtt] — Output format
  • -o, --output PATH — Write to file instead of stdout
  • --summary — Output metadata summary, save full transcript to temp file
  • -v, --verbose — Show progress and timing on stderr

Meta

  • --list-providers — List available transcription providers
  • --api-key KEY — AssemblyAI API key
  • -V, --version — Show version
  • -h, --help — Show help

Development

git clone https://github.com/miguelarios/transcribe-cli.git
cd transcribe-cli
uv run --extra dev pytest tests/ -v

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

transcriber_cli-0.1.2.tar.gz (53.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

transcriber_cli-0.1.2-py3-none-any.whl (12.1 kB view details)

Uploaded Python 3

File details

Details for the file transcriber_cli-0.1.2.tar.gz.

File metadata

  • Download URL: transcriber_cli-0.1.2.tar.gz
  • Upload date:
  • Size: 53.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for transcriber_cli-0.1.2.tar.gz
Algorithm Hash digest
SHA256 037239fde15475778b4bb4fe89a7a9b70592becf689f93d799f3bb839ef49093
MD5 7fec73603c76c2ec657610ddb1fe39bd
BLAKE2b-256 77ff84d478b5665dc7bd414e95b87dee2719e1f3eabb3ad098a05f2ed741b0ae

See more details on using hashes here.

Provenance

The following attestation bundles were made for transcriber_cli-0.1.2.tar.gz:

Publisher: publish.yml on miguelarios/transcribe-cli

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file transcriber_cli-0.1.2-py3-none-any.whl.

File metadata

File hashes

Hashes for transcriber_cli-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 8b0b09cde0ba78180339f47afd8c7239c186c52e511468a7e067bf4e0f5a52d1
MD5 85807bd131ee015690aa75180502713f
BLAKE2b-256 1d157bc0113424ecd58449fda69ce0156257c5ca6e57111a613e40060812d22a

See more details on using hashes here.

Provenance

The following attestation bundles were made for transcriber_cli-0.1.2-py3-none-any.whl:

Publisher: publish.yml on miguelarios/transcribe-cli

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page