Documents and large language models.
Docprompt
Docprompt is a library for working with text-rich multimodal inputs in support of large language model workloads.
This library has several goals:
- Provide abstractions for working with and processing PDFs and images
- Provide abstractions for document operations with third-party providers
- Provide functionality for robust conversion of OCR'd documents into inputs suitable for large language models
- Provide state-of-the-art document masking with text synthesis
- Documentation: https://psu3d0.github.io/docprompt
- GitHub: https://github.com/psu3d0/docprompt
- PyPI: https://pypi.org/project/docprompt/
- Free software: Apache-2.0
Features
- TODO
Installation
Use the package manager pip to install Docprompt.
pip install docprompt
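To confirm that the install succeeded, a minimal sketch along these lines can help. It relies only on the Python standard library (`importlib.metadata`) and does not call any Docprompt API; the helper name `installed_version` is hypothetical and chosen for illustration.

```python
from importlib import metadata


def installed_version(package_name):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(package_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # "docprompt" is the distribution name used in the pip command above.
    version = installed_version("docprompt")
    if version is None:
        print("docprompt is not installed in this environment")
    else:
        print(f"docprompt {version} is installed")
```

Returning `None` instead of raising keeps the check usable in setup scripts that only need a yes/no answer.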
Credits
This package was created with Cookiecutter and the waynerv/cookiecutter-pypackage project template.
Hashes for docprompt-0.1.0-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | c2b33c967e262cb228e27c6d303a2ac1aefd3065aa35c35942c2f8acf658e91f
MD5 | cc737d8ed8a68649c54476109ce1012d
BLAKE2b-256 | bac02ee3fdc8a0acf4b7803c5cd83ccc780d098a741d8f337ce9e76b762640a6
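If you download the wheel manually, you may want to compare its SHA256 digest against the value listed above. The following is a sketch using only the standard library `hashlib` module. The helper names `sha256_of_file` and `matches_published_hash` are hypothetical, and the local file path is an assumption about where the wheel was saved.

```python
import hashlib
from pathlib import Path

# SHA256 digest copied from the hash table above.
EXPECTED_SHA256 = "c2b33c967e262cb228e27c6d303a2ac1aefd3065aa35c35942c2f8acf658e91f"


def sha256_of_file(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Return True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of_file(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Assumed location: the wheel sits in the current working directory.
    wheel = Path("docprompt-0.1.0-py3-none-any.whl")
    if wheel.exists():
        print("hash matches:", matches_published_hash(wheel, EXPECTED_SHA256))
    else:
        print("wheel not found at", wheel)
```

Reading in chunks keeps memory use flat regardless of file size. pip can also enforce hashes itself through its hash-checking mode when installing from a requirements file.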