Skip to main content

Universal Document Agent for extracting and analyzing various documents with Ollama support.

Project description

UniversalDocAgent (unidoc_agent)

UniversalDocAgent is a Python package designed to intelligently detect document types and automatically extract or summarize their contents using a set of specialized tools. It supports a wide range of file types such as PDFs, Word documents, emails, source code, Excel files, XML, OCR-recognizable images, and plain text.

You can optionally integrate with an LLM backend like Ollama to generate summaries and maintain conversation history across sessions.


🚧 Installation

pip install .

Run this from the root directory of your cloned project or package.


📂 Supported Document Types

File Type Handled By
.pdf PDFTool
.docx WordTool
.txt TextTool
.py, .js, etc. CodeTool
.eml EmailTool
.xlsx ExcelTool
.jpg, .png, etc. OCRTool
.xml XMLTool

🚀 Usage Examples

1. Extracting Content

from unidoc_agent.agent import read_document

file_path = "sample.pdf"
content = read_document(file_path)
print(content)

2. Summarizing Content

summary = read_document("example.docx", summarize=True)
print(summary)

🧠 Advanced: Use Ollama LLM Backend

Custom LLM model or session

from unidoc_agent.agent import UniversalDocAgent
from unidoc_agent.agent import tools

agent = UniversalDocAgent(tools=tools, llm_backend="ollama")
summary = agent.summarize_content("report.txt")
print(summary)

💬 Conversation History with OllamaClient

The OllamaClient class is used internally to manage conversation context for summarization.

  • Caching: Stores conversation history locally in ~/.unidoc_ollama_cache/{model}_{session_id}.json
  • Session Management: Custom session IDs let you manage multiple user contexts

Clearing History

from unidoc_agent.ollama_client import OllamaClient

client = OllamaClient(session_id="user123")
client.clear_history()

🔧 API Reference

read_document(file_path, summarize=False)

  • file_path: Path to the input document
  • summarize: If True, returns a summary via LLM; else returns extracted content

UniversalDocAgent

  • extract_content(file_path): Extracts raw content
  • summarize_content(file_path): Summarizes content using the selected tool + LLM

🔮 Tests

Make sure you have pytest or unittest installed:

pytest

Or:

python -m unittest discover tests/

📄 License

This project is licensed under the MIT License. See the LICENSE file for details.


📊 Use Cases

  • ✉️ Email Parsing – Automatically extract the body of .eml files and summarize them.
  • 📄 Document Summary – Get concise summaries of long reports, manuals, or meeting notes.
  • 📈 Spreadsheet Reader – Read .xlsx Excel files and extract tables or data grids.
  • 🔧 OCR Scanning – Use OCRTool to read text from images (e.g., scanned receipts).
  • 📁 Source Code Insight – Extract and analyze comments or logic from .py or .js files.
  • 📖 Multi-format Aggregation – Use the same interface (read_document) for any supported format.

🚀 Contributing

Pull requests are welcome. Please open issues for bugs or feature requests.


✨ Acknowledgements

Thanks to OpenAI, Ollama, and the open-source contributors whose tools helped build this module.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

unidoc_agent-0.2.4.tar.gz (9.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

unidoc_agent-0.2.4-py3-none-any.whl (10.7 kB view details)

Uploaded Python 3

File details

Details for the file unidoc_agent-0.2.4.tar.gz.

File metadata

  • Download URL: unidoc_agent-0.2.4.tar.gz
  • Upload date:
  • Size: 9.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.8.12

File hashes

Hashes for unidoc_agent-0.2.4.tar.gz
Algorithm Hash digest
SHA256 91c947a3a34c7d184aa3672386bd6e853f241cc3bca63d17d5c5e944091a17e1
MD5 229a55a50b880e688af3bc6c6120d3c4
BLAKE2b-256 90939b540b4dfa1034054c438e29b4b9cbcc666c34344c12d43a95a1144c7cb4

See more details on using hashes here.

File details

Details for the file unidoc_agent-0.2.4-py3-none-any.whl.

File metadata

  • Download URL: unidoc_agent-0.2.4-py3-none-any.whl
  • Upload date:
  • Size: 10.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.8.12

File hashes

Hashes for unidoc_agent-0.2.4-py3-none-any.whl
Algorithm Hash digest
SHA256 78bc404727a2fa2d82c7e6f634b8b0c81e660a833605237f4445718eeb9e90cf
MD5 a7fd71f341fdad7fb490ef5ad12198af
BLAKE2b-256 6e0faba2e035fb8d30dbe75e5c3c98772aa83a5e9417c05b7a525a5fcdb2b635

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page