Skip to main content

Internal scribe service that wraps meetscribe for team-scale meeting capture, transcription, summarization, and speaker labeling

Project description

vezir

Self-hosted team intelligence. Record a meeting on any device; Vezir gives you back a diarized transcript, an AI summary, and a PDF — processed on your own GPU server and synced into a private team archive you control. Sign in with Nostr or Google.

Vezir wraps millet (the transcription/diarization/summarization pipeline) and turns it into a multi-user, multi-team service: a scribe records on their laptop or phone, the audio uploads to a central GPU box, and the team gets back labeled transcripts and summaries — with speakers resolved to GitHub handles.

Status

Alpha (0.8.3). Built for small teams that want meeting audio to stay inside their own infrastructure: one GPU server (Linux/CUDA or Apple Silicon) reachable over ordinary HTTPS. Full history in CHANGELOG.md.

What's new in 0.7–0.8:

  • Identity sign-in (0.8.x). Members sign in with Nostr (a remote signer like Amber via NIP-46, or the NIP-55 Android intent flow) or with Google (@workspace-domain accounts via the OAuth device grant). The server mints a short-lived session JWT (~24h); no key or password touches the client. vzr_ bearer tokens are retained for machine/CI use.
  • Public-access front (0.8.x). A small VPS terminates nothing — it WireGuard-forwards TLS to the server, which keeps the cert. Clients reach the server over plain outbound HTTPS, so it works from CGNAT / IPv6-only links (e.g. Starlink) with no per-client VPN.
  • Multi-team by membership (0.7.0). A token/identity is a person, not a team; team scope is supplied per-request via X-Team-Id and validated against a memberships table. One identity covers every team you're in; the TUI/Android auto-discover them. Team keys are stable UUIDs with mutable slugs (vezir team rename).
  • Hardening (0.8.2). NIP-98 replay protection, header-injection-resistant login-URL pinning (VEZIR_PUBLIC_URL), exact Google-domain matching.
  • Resumable uploads (tus.io subset), vezir relabel, vezir pull, per-team voiceprints + sync, vezir doctor.

The JSON-only API (no web dashboard since 0.7.0) is consumed by the TUI, the Android app, and the CLI. Speaker labeling happens in the TUI/Android or via a one-time /login?code=… page link printed after upload.

Linux and macOS (Apple Silicon) laptop clients and an Android client are supported.

Requires millet-pipeline >= 0.12.5 (pinned via the [server] extra).

Sign-in & access

Members authenticate with their own identity; an admin authorizes it once.

# Admin, on the server — authorize an identity (one of):
vezir npub add   --npub npub1…             --github <handle> --label "<who>"
vezir google add --email them@blinkbtc.com --github <handle> --label "<who>"
# …and grant team scope (one handle covers every team they join):
vezir team add-member --team <slug> --role scribe --github <handle>
# Member, on their laptop:
export VEZIR_URL=https://your-vezir-host
vezir login --team <slug>                  # Nostr (remote signer / Amber)
vezir login --method google --team <slug>  # Google (@workspace-domain)

vezir login stores a ~24h session; the client uses it as Authorization: Bearer on every request. Re-run to refresh.

Production note: set VEZIR_PUBLIC_URL=https://your-host on the server so NIP-98 login-URL verification is pinned to a fixed base (not reconstructed from request headers).

Architecture

[Scribe laptop / phone]                 [GPU server]
  vezir tui / scribe   ──HTTPS──▶   vezir serve (FastAPI, 127.0.0.1)
   (record, list,                     │  fronted by Caddy (TLS terminates here)
    label, view,                      │
    pull artifacts)                   │   identity sign-in:
                                      │     NIP-46 / NIP-55 (nostr)  ─┐
  vezir-android        ──HTTPS──▶     │     Google device grant      ─┴─▶ session JWT
   (record + sign in)                 │     vzr_ tokens (machine/CI)
                                      │
                            ┌─ public VPS front (optional) ─┐
   any network ────────────┤  WireGuard + nftables TLS-     │
   (incl. CGNAT/IPv6)       │  passthrough → server :443    │
                            └───────────────────────────────┘
                                      │
                                      ├── sqlite job queue (per-team)
                                      ▼
                                    worker  ── HOME-shim ──▶ millet
                                      │   (per-team voiceprints + sync)
                                      ▼
                                    millet transcribe / label --auto / sync
                                      └──▶ private git repo (per-team)

Millet runs as an unmodified subprocess via a per-job HOME shim that exposes per-team voiceprints + sync config. Vezir owns the job queue, per-team voiceprint DBs, team roster/memberships, and auth.

Clients

Client Best for Install
vezir tui Day-to-day desktop use — record, browse sessions, read transcripts/summaries, label speakers, all in one terminal UI. ctrl+e Teams tab, ctrl+t cycles teams. pip install 'vezir[tui]'
vezir scribe Headless / ssh / scripted recording. Pause-resume with p. pip install vezir
vezir upload <file> An existing WAV/OGG (phone, OBS, etc.); resumable. pip install vezir
vezir pull Download artifacts for meetings others recorded (team sharing without git). pip install vezir
vezir-android Recording from a phone; signs in with Nostr (Amber) or Google. Sideload the release APK

All desktop clients resolve credentials from a vezir login session (stored per-team in ~/.config/vezir/teams.json), or from VEZIR_URL+VEZIR_TOKEN for machine/CI. The TUI/Android auto-discover every team you belong to from /api/me.

Summarization presets

The client sends a preset id as the summary_preset form field; the worker passes it to millet transcribe --summary-preset <id>.

Preset Backend Model Use case
high-quality claudemax Sonnet Default on desktop; highest quality (Claude Max on the server).
confidential tinfoil DeepSeek (TEE) Hardware-attested enclave — prompts not visible to the provider. Default on Android; PDF gets a CONFIDENTIAL watermark.
alternative openrouter Kimi Cheapest cloud option.

When a preset is explicitly chosen the server does not silently fall back to another backend on failure — a silent tinfoil→cloud fallback would defeat the Confidential preset. Set via vezir scribe --preset … or the TUI/Android dropdown.

Privacy toggles (per upload)

Toggle Default When set Sticky?
auto_label ON OFF skips voiceprint matching; routes to manual labeling. Yes
sync ON OFF keeps the session on the server (local-only), not pushed to the team git repo. Retroactively syncable. Yes
personal OFF ON marks it private to you and forces sync off for this recording. No (per-recording)

CLI: --auto-label/--no-auto-label, --sync/--no-sync, --personal on scribe and upload. Server-side kill switches: VEZIR_SKIP_SYNC=1 (global sync off), VEZIR_DELETE_AUDIO=1 (drop audio after artifacts).

Install profiles

Role Install Footprint
Scribe client (CLI) pip install --user vezir ~30 MB
Scribe client + TUI (recommended desktop) pip install --user 'vezir[tui]' ~35 MB
Server (FastAPI + worker + pipeline) pip install --user 'vezir[server]' ~3 GB (Linux/CUDA: whisperx+torch+pyannote); +mlx-whisper on Apple Silicon

The base install uses millet-record (capture only); [server] adds millet-pipeline for transcription/diarization/summarization (plus mlx-whisper on Apple Silicon via a PEP 508 marker for the MLX ASR backend).

Quick start — server

git clone https://github.com/pretyflaco/vezir.git && cd vezir
pip install --user -e '.[server]'

mkdir -p ~/vezir-data
vezir voiceprints seed --from ~/.config/meet/speaker_profiles.json   # optional

vezir team create --id myteam --name "My Team"
vezir team set-sync --id myteam --remote https://github.com/yourorg/meetings.git  # optional

# Authorize yourself + grant scope (identity sign-in):
vezir npub add --npub npub1… --github you --admin --label "laptop"
vezir team add-member --team myteam --role admin --github you

export VEZIR_PUBLIC_URL=https://your-vezir-host   # recommended in prod
vezir serve                                       # binds 127.0.0.1:8000; front with Caddy

TLS via Caddy

cd infra/caddy && ./install-caddy.sh
# edit the Caddyfile for your hostnames, then:
sudo systemctl enable --now caddy

For a public-access deployment (clients on any network, incl. CGNAT), see infra/vps/ — a VPS WireGuard-forwards :443 to the server, which terminates TLS (the VPS sees only ciphertext).

Quick start — scribe client

pip install --user 'vezir[tui]'
export VEZIR_URL=https://your-vezir-host

vezir login --team myteam                 # Nostr / Amber
# or: vezir login --method google --team myteam

vezir tui                                 # record + browse + label
vezir scribe --title "team sync"          # CLI record (p = pause; Ctrl+C = stop)
vezir upload ./recording.ogg --title "…"  # existing file (resumable)
vezir pull                                # artifacts for meetings others recorded
vezir doctor                              # diagnose creds / connectivity / certs

After upload, artifacts (summary, transcript, PDF) auto-download into ~/vezir-meetings/<team>/meeting-…/. Standalone uploads accept .wav/.ogg.

macOS (Apple Silicon) scribe

pip install vezir pulls millet-record, whose macOS wheel ships a Swift sidecar that captures mic + system audio via native APIs (no virtual drivers). Grant both Microphone and System Audio Recording to your terminal app; verify with millet check. The server does the heavy lifting.

Environment variables

Variable Default Effect
VEZIR_DATA ~/vezir-data All server runtime state.
VEZIR_HOST / VEZIR_PORT 127.0.0.1 / 8000 Bind for vezir serve (front with Caddy).
VEZIR_PUBLIC_URL unset Canonical public base URL; pins NIP-98 login-URL verification (recommended in prod).
VEZIR_URL http://localhost:8000 Server URL for clients.
VEZIR_TOKEN vzr_ bearer for machine/CI clients (interactive members use vezir login).
VEZIR_GOOGLE_CLIENT_ID / …_SECRET[_FILE] / …_ALLOWED_DOMAIN unset Enable Google sign-in (server holds the secret).
SSL_CERT_FILE / VEZIR_CADDY_ROOT_CERT_PATH unset Extra internal CA to trust; the client appends it to the public store (0.8.0+), so public + internal hosts both validate.
VEZIR_COOKIE_SECURE unset 1 adds Secure to the session cookie (HTTPS).
VEZIR_SUMMARY_PRESET unset Default preset (high-quality|confidential|alternative).
VEZIR_RECORD_DIR ~/vezir-meetings Local recordings root.
VEZIR_MILLET_* auto Pass-throughs to millet transcribe (device, compute type, ASR backend, MLX model).
VEZIR_SKIP_SYNC / VEZIR_DELETE_AUDIO unset Server-side sync kill switch / audio retention.
VEZIR_MAX_UPLOAD_BYTES 2147483648 Max upload (2 GiB → 413).
VEZIR_DISABLE_RATELIMIT unset Disable the in-process rate limiter. Test/CI only (logs a loud warning if set).

Performance (rough, 1h audio)

Runtime Path Time
NVIDIA CUDA CUDA float16 ~5–20 min
Apple Silicon (MLX) MLX Whisper + MPS ~10–30 min
Apple Silicon (split) CPU CTranslate2 + MPS ~20–45 min
CPU only CPU int8 ~1.5–10 h

Runtime dirs are created 0700, sensitive files 0600; the systemd unit sets UMask=0077.

License

MIT — see LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vezir-0.8.7.tar.gz (312.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vezir-0.8.7-py3-none-any.whl (242.0 kB view details)

Uploaded Python 3

File details

Details for the file vezir-0.8.7.tar.gz.

File metadata

  • Download URL: vezir-0.8.7.tar.gz
  • Upload date:
  • Size: 312.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for vezir-0.8.7.tar.gz
Algorithm Hash digest
SHA256 070da8b4a5e00556c9792156292fed7780fa9bf6ecc61c1a9d635506811d0702
MD5 6309442337d60a8bfa6a1f3cd6bf17d1
BLAKE2b-256 8ef1f79469236dbe503433e119e002f635be65aeed0744d3e4323543f97f6d41

See more details on using hashes here.

Provenance

The following attestation bundles were made for vezir-0.8.7.tar.gz:

Publisher: release.yml on pretyflaco/vezir

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file vezir-0.8.7-py3-none-any.whl.

File metadata

  • Download URL: vezir-0.8.7-py3-none-any.whl
  • Upload date:
  • Size: 242.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for vezir-0.8.7-py3-none-any.whl
Algorithm Hash digest
SHA256 b03d876730aa08499de720e01603755aa0dff861606c58b6e5172cb718dd340b
MD5 a54f874ec32de8b1fb5582bbbd6fa8d4
BLAKE2b-256 cff666d466d13251544d76dd534a74b536447b59b5522d80d8740774946f25f1

See more details on using hashes here.

Provenance

The following attestation bundles were made for vezir-0.8.7-py3-none-any.whl:

Publisher: release.yml on pretyflaco/vezir

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page