Last released Jan 11, 2025
A Python script that parses a Chinese invention patent and extracts its fields, sections, and subsections.
Last released Dec 18, 2024
A Python script that runs Paddle OCR on a possibly unsearchable PDF to make it searchable.
Last released Dec 6, 2020
A set of extended tools for processing PDF files