Skip to main content

Convert voice notes, videos, and audio files into AI-ready text and images

Project description

ingestible

Convert voice notes, videos, and audio files into AI-ready text and images.

Consultants, researchers, and anyone who works with AI tools faces the same problem: clients and colleagues send voice notes, screen recordings, and video walkthroughs — but your AI workflow needs text and images. Ingestible bridges that gap with a single command.

Quick start

pip install ingestible
export OPENAI_API_KEY=sk-...
ingest client-feedback.opus

That's it. You get a folder with transcript.txt and you're ready to feed it into whatever AI tool you're using.

Install

pip install ingestible

You also need ffmpeg on your PATH:

OS Command
macOS brew install ffmpeg
Ubuntu/Debian sudo apt install ffmpeg
Windows choco install ffmpeg or download from ffmpeg.org

Usage

# Transcribe a voice note
ingest recording.opus

# Process a video (transcript + frame grabs every 15 seconds)
ingest feedback.mp4

# Batch process everything in a folder
ingest *.opus *.mp4

# Grab frames more frequently
ingest --frame-interval 5 demo.mp4

# Use local Whisper instead of OpenAI API
pip install ingestible[local]
ingest --backend faster-whisper recording.opus

# Preview what would be processed
ingest --dry-run *.opus

Output

Each file produces a folder alongside it:

recording/
├── transcript.txt    # Plain text transcription
├── frames/           # Video frame grabs (video only)
│   ├── frame_001.jpg
│   ├── frame_002.jpg
│   └── ...
└── process.log       # Timestamped processing log

Re-running the same command skips already-completed files (idempotent).

Privacy notice

By default, ingestible sends your audio to OpenAI's Whisper API for transcription. Your audio is transmitted to OpenAI's servers. Review OpenAI's data usage policy to understand how your data is handled.

If you need fully local, private transcription — no data leaves your machine:

pip install ingestible[local]
ingest --backend faster-whisper recording.opus

This uses faster-whisper running entirely on your CPU. It's slower but nothing leaves your computer.

Configuration

Set OPENAI_API_KEY in your environment or a .env file in the current directory.

Variable Default Description
OPENAI_API_KEY Required for OpenAI backend
INGESTIBLE_BACKEND openai Default backend (openai or faster-whisper)
OPENAI_WHISPER_MODEL whisper-1 OpenAI model to use
INGESTIBLE_WHISPER_MODEL base Local Whisper model size (base, small, medium, large)

Supported formats

Audio: .opus, .m4a, .ogg, .mp3, .wav

Video: .mp4, .mkv, .mov, .webm

All formats are normalised to MP3 before transcription — this ensures consistent behaviour regardless of input format.

Requirements

  • Python 3.10+
  • ffmpeg on your PATH

Contributing

Found a bug or want to add a format? See CONTRIBUTING.md.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

slurpai-0.1.0.tar.gz (9.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

slurpai-0.1.0-py3-none-any.whl (9.4 kB view details)

Uploaded Python 3

File details

Details for the file slurpai-0.1.0.tar.gz.

File metadata

  • Download URL: slurpai-0.1.0.tar.gz
  • Upload date:
  • Size: 9.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for slurpai-0.1.0.tar.gz
Algorithm Hash digest
SHA256 f94b3153a1662066a35cdb28da1366d8ab9970e1276eac10232654c2d2dd62e3
MD5 1b33a53f355c31121ec915ef9f644408
BLAKE2b-256 9cc6094a96859eb80d419aa077aad767868e4a7ffea5d26153097616679e8e7c

See more details on using hashes here.

File details

Details for the file slurpai-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: slurpai-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 9.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for slurpai-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 88a92ab389f632535d407fb48e3d4cf7ba8e573348740b4a2a700d34fb65be62
MD5 851d0544aef22385f4de00220c7080aa
BLAKE2b-256 e112306382778951a872391017ed4e73ee0c26d97d773e1b70460343b5064c19

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page