GPT PDF Reader
GPT PDF Reader is a Python package that utilizes GPT-4V and other tools to extract and process information from PDF files.
Features
- Extracts figures from PDF files using the pdffigures2 Scala library.
- Converts PDF pages to images and uploads them to a Google Cloud Storage bucket.
- Uses GPT-4V (GPT-4 with Vision) to generate Markdown content from the PDF, then inserts the image URLs into the Markdown.
Installation
The installation process requires Java and Scala. The following instructions are for macOS users:
brew tap AdoptOpenJDK/openjdk
brew install --cask adoptopenjdk11
brew install jenv
echo 'export PATH="$HOME/.jenv/bin:$PATH"' >> ~/.zshrc
echo 'eval "$(jenv init -)"' >> ~/.zshrc
After updating your shell configuration, close and reopen your terminal, then set Java 11 as the global version using jenv:
jenv add /Library/Java/JavaVirtualMachines/adoptopenjdk-11.jdk/Contents/Home/
jenv global 11.0.11
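Before installing the package, it can help to confirm that the required tools are visible on your PATH. The following is a minimal sketch using only the Python standard library. The helper name find_missing_tools is hypothetical and not part of GPT PDF Reader, and the list of executables is an assumption: only java is checked here, because the installation is stated to require Java.

```python
import shutil


def find_missing_tools(tools=("java",)):
    """Return the names in `tools` that cannot be found on PATH."""
    # shutil.which returns None when no matching executable is found.
    return [name for name in tools if shutil.which(name) is None]


if __name__ == "__main__":
    missing = find_missing_tools()
    if missing:
        print("Missing required tools:", ", ".join(missing))
    else:
        print("All required tools were found on PATH.")
```

If the check reports java as missing after the jenv setup, reopening the terminal so the updated shell configuration is loaded is a reasonable first thing to try.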
Install GPT PDF Reader via pip:
pip install gptpdfreader
Set the required environment variables in your .env file. Write each entry as KEY=value, with no spaces around the equals sign and no unnecessary quotes:
OPENAI_API_KEY=open_ai_key
GOOGLE_ID=google_project_id
GOOGLE_BUCKET=google_bucket_name
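A quick way to catch a missing or empty entry is to parse the .env file and compare it against the three variable names listed above. This is a minimal sketch under stated assumptions: the function names parse_env_text and missing_keys are hypothetical, the parser only handles plain KEY=value lines, and it is not how GPT PDF Reader itself loads its configuration.

```python
from pathlib import Path

# The three variable names documented for the .env file.
REQUIRED_KEYS = ("OPENAI_API_KEY", "GOOGLE_ID", "GOOGLE_BUCKET")


def parse_env_text(text):
    """Parse simple KEY=value lines; blank lines and # comments are skipped."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip()
    return values


def missing_keys(values, required=REQUIRED_KEYS):
    """Return the required keys that are absent or empty in `values`."""
    return [key for key in required if not values.get(key)]


if __name__ == "__main__":
    env_path = Path(".env")
    if env_path.exists():
        missing = missing_keys(parse_env_text(env_path.read_text(encoding="utf-8")))
        print("Missing keys:", missing if missing else "none")
    else:
        print("No .env file found in the current directory.")
```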
Usage
To process a PDF and generate Markdown content:
from gptpdfreader.reader import main
main('path_to_your_pdf.pdf')
This will process the specified PDF and output a Markdown file with the extracted information in the same directory.
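To process several PDFs in one run, the call shown above can be wrapped in a small loop. The sketch below is an illustration only: process_folder is a hypothetical helper, not part of the package, and it takes the processing function as an argument so that it makes no assumptions about what main returns or how the output file is named.

```python
from pathlib import Path


def process_folder(folder, process):
    """Call `process` on each PDF in `folder`; return (done, failed) lists."""
    done, failed = [], []
    for pdf in sorted(Path(folder).glob("*.pdf")):
        try:
            process(str(pdf))
            done.append(pdf.name)
        except Exception as exc:
            # Record the failure and continue with the remaining files.
            failed.append((pdf.name, str(exc)))
    return done, failed


# Example (requires gptpdfreader to be installed and configured;
# "papers" is a placeholder folder name):
# from gptpdfreader.reader import main
# done, failed = process_folder("papers", main)
```

Collecting failures instead of stopping at the first error keeps one problematic PDF from blocking the rest of the batch.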
Limitations
some limitations
Contributing
We welcome contributions! Please open an issue or submit a pull request on our GitHub repository.
Support
For questions and support, please open an issue in the GitHub issue tracker.
License
File details
Details for the file gpt_pdf_reader-1.5.tar.gz.
File metadata
- Download URL: gpt_pdf_reader-1.5.tar.gz
- Upload date:
- Size: 4.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.5
File hashes
Algorithm | Hash digest
---|---
SHA256 | f9c87b0a27fbd033f4c304b91fbf1671d285b5a8be68ae0ad46af2f8f015dc56
MD5 | 6be5368b77b3b3472e44969e619c6793
BLAKE2b-256 | f3e4e4f469dbb48b8189b2796846bc5a533692be8c1b9d6f4c0e9371f5513c57
File details
Details for the file gpt_pdf_reader-1.5-py3-none-any.whl.
File metadata
- Download URL: gpt_pdf_reader-1.5-py3-none-any.whl
- Upload date:
- Size: 5.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.5
File hashes
Algorithm | Hash digest
---|---
SHA256 | d7fa44debe137f3b5853ae949d193e8cd1b7452ed8771885db074484da524b43
MD5 | 8e4be1c2f0d354acf5e7260e56858aad
BLAKE2b-256 | e13eaa0bce4131ad564f898b39774bb2a3116b5e0dd4c4a4b309b2fcd02ac3ab