5 projects
detstream
Modular object detection for live video feeds
prompt2dataset
Build labeled image datasets from a plain-English prompt
doctape
Split large PDFs into page windows, convert each window with docling, and reassemble the results into Markdown
dinscribe
Audio denoising and transcription pipeline using Demucs, Silero VAD, and Whisper
dinnote
Audio denoising, VAD, speaker diarization, and transcription pipeline using Demucs, Silero VAD, pyannote, and Whisper