Last released Jan 29, 2026
Extract text from PDFs using pypdfium2 with OCR fallback via pytesseract
Last released Oct 15, 2025
A lightweight tool for removing personal data from text before uploading to LLMs
Supported by