Last released Aug 28, 2025
A Python library for extracting plain text from various document formats for LLM and NLP purposes
Supported by