Documents and large language models.
Project description
Docprompt
Docprompt is a lightweight library for working with text-rich multimodal inputs to support Large Language Model Workloads
This library has several goals
- Provide abstractions for working with and processing PDF's and images
- Abstractions for document operations with third party providers
Documents and large language models
- Documentation: https://psu3d0.github.io/docprompt
- GitHub: https://github.com/Page-Leaf/docprompt
- PyPI: https://pypi.org/project/docprompt/
- Free software: Apache-2.0
Features
- Representations for common document layout types -
TextBlock
,BoundingBox
, etc - Generic implementations of OCR providers
Installation
Use the package manager pip to install Docprompt.
pip install docprompt
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
docprompt-0.1.2.tar.gz
(18.4 kB
view hashes)
Built Distribution
docprompt-0.1.2-py3-none-any.whl
(22.0 kB
view hashes)
Close
Hashes for docprompt-0.1.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b0876214d5fec27a2d69ff179bdc7270425261d85afd1d3ad370c800c126a6d0 |
|
MD5 | bc48931b654655ace564340e35170125 |
|
BLAKE2b-256 | 7cdcd762d56eec7d5e643e9d94f1d2edf82598656f76789fe4e5ad4fc6449030 |