Last released Mar 2, 2026
Reliable PDF text extraction with PyMuPDF and configurable OCR engines (Tesseract/PaddleOCR).