6 projects
pix2text
An open-source Python3 tool that recognizes layouts, tables, math formulas, and text in images and converts them into Markdown. A free alternative to Mathpix.
cnstd
Python3 package for Chinese/English Scene Text Detection (STD), Mathematical Formula Detection (MFD), and Layout Analysis, with free pretrained models
cnocr
Python3 package for Chinese/English OCR, with small pretrained models
coin-clip
Enhancing Coin Image Retrieval with CLIP
ragflow
Efficient Document-Based QA with Retrieval-Augmented Generation (RAG) and Large Language Models (LLM).
antiocr
Python3 package for generating text images that AI cannot recognize