12 projects
unstructured-ingest
Local ETL data pipeline to get data RAG-ready
unstructured
A library that prepares raw documents for downstream ML tasks.
unstructured-client
Python Client SDK for Unstructured API
unstructured-inference
A library for performing inference using trained models.
uns-mcp
MCP server implementation providing structured tools for interacting with the Unstructured API, managing sources, destinations, workflows, and jobs
utic-dev-tools
Dev tools for the Unstructured.io ecosystem.
utic-public-types
Public/Open types shared among different projects in the Unstructured.io ecosystem.
unstructured-platform-plugins
Wrapper to convert arbitrary code into a uvicorn/FastAPI implementation for the Unstructured Platform
unstructured-paddleocr
Awesome OCR toolkits based on PaddlePaddle (8.6M ultra-lightweight pre-trained model; supports training and deployment on server, mobile, embedded, and IoT devices)
unstructured.pytesseract
Python-tesseract is a Python wrapper for Google's Tesseract-OCR
unstructured.paddlepaddle
Parallel Distributed Deep Learning
unstructured-api-tools
A library that prepares raw documents for downstream ML tasks.