A family of LLM-enhanced PDF utilities
Project description
pdf-llm-tools
pdf-llm-tools
is a family of AI pdf utilities:
pdfllm-titler
renames a pdf with metadata parsed from the filename and contents. In particular it renames it asYEAR-AUTHOR-TITLE.pdf
.- (todo)
pdfllm-toccer
adds a bookmark structure parsed from the detected contents table of the pdf.
Currently OpenAI's gpt-3.5-turbo-1106
is hardcoded as the LLM backend. The
program requires an OpenAI API key via option, envvar, or manual input.
Installation
pip install pdf-llm-tools
Usage
These utilities require all PDFs to have a correct OCR layer. Run something like OCRmyPDF if needed.
pdfllm-titler
pdfllm-titler a.pdf b.pdf c.pdf
pdfllm-titler --last-page 8 d.pdf
See --help
for full details.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
pdf_llm_tools-0.0.2.tar.gz
(3.3 kB
view hashes)
Built Distribution
Close
Hashes for pdf_llm_tools-0.0.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a817ac62824f6f50abf098dfa9b5ca4add6a6310b39e7df0500a1e18cfd2edd1 |
|
MD5 | ac55d5f8777d678689eace9cf732c4c6 |
|
BLAKE2b-256 | c36a1cc8e687ae594368e5d164c40758e1300d075146de1f1b3be2ad0b0f3703 |