9 projects
docframe-ai
A Python framework for normalizing PDF, Word, CSV, Excel, JPG, and PNG files into AI-ready document chunks.
docflow-sager
Document-native DAG runner for preprocessing PDFs, Office files, and email messages into structured evidence artifacts.
clipr-video
CLIPR: Clip Improvement, Processing, and Reframing for Python.
axiomdoc
Open-source document intelligence for extraction, structure preservation, XML export, and RAG indexing.
doctr-index
Open-source document indexing library for building hierarchical trees from PDF and Markdown.
st-autorefresh
A Streamlit component for automatically refreshing the page at a user-defined interval.
fixparser
A Python library for parsing FIX protocol messages and exporting to text and CSV formats.
vokal
A Python library for separating vocals and instruments from audio using Demucs.
hashtagger
A hashtag generator using TensorFlow and NLTK.