A Python package that utilizes GPT-4V and other tools to extract and process information from PDF files
Project description
GPT PDF Reader
GPT PDF Reader is a Python package that utilizes GPT-4V and other tools to extract and process information from PDF files.
Features
- Extracts figures from PDF files using the
pdffigures2
Scala library. - Converts PDF pages to images and uploads them to Google Cloud Bucket.
- Utilizes GPT-4V Vision to generate Markdown content from pdf an than inserts image urls into markdown.
Installation
The installation process requires Java and Scala. The following instructions are for macOS users:
brew tap AdoptOpenJDK/openjdk
brew install --cask adoptopenjdk11
brew install jenv
echo 'export PATH="$HOME/.jenv/bin:$PATH"' >> ~/.zshrc
echo 'eval "$(jenv init -)"' >> ~/.zshrc
After updating your shell configuration, close and reopen your terminal, then set Java 11 as the global version using jenv:
jenv add /Library/Java/JavaVirtualMachines/adoptopenjdk-11.jdk/Contents/Home/
jenv global 11.0.11
Install GPT PDF Reader via pip:
pip install gptpdfreader
Configure the required environment variables in your .env file without spaces or unnecessary quotes:
OPENAI_API_KEY=open_ai_key
GOOGLE_ID=google_project_id
GOOGLE_BUCKET=google_bucket_name
Usage
To process a PDF and generate Markdown content:
from gptpdfreader.reader import main
main('path_to_your_pdf.pdf')
This will process the specified PDF and output a Markdown file with the extracted information in the same directory.
Limitations
some limitations
Contributing
We welcome contributions! Please open an issue or submit a pull request on our GitHub repository.
Support
For questions and support, please open an issue in the GitHub issue tracker.
License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for gpt_pdf_reader-1.5-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | d7fa44debe137f3b5853ae949d193e8cd1b7452ed8771885db074484da524b43 |
|
MD5 | 8e4be1c2f0d354acf5e7260e56858aad |
|
BLAKE2b-256 | e13eaa0bce4131ad564f898b39774bb2a3116b5e0dd4c4a4b309b2fcd02ac3ab |