Skip to main content

A Python package that utilizes GPT-4V and other tools to extract and process information from PDF files

Project description

GPT PDF Reader

GPT PDF Reader is a Python package that utilizes GPT-4V and other tools to extract and process information from PDF files.

Features

  • Extracts figures from PDF files using the pdffigures2 Scala library.
  • Converts PDF pages to images and uploads them to Google Cloud Bucket.
  • Utilizes GPT-4V Vision to generate Markdown content from pdf an than inserts image urls into markdown.

Installation

The installation process requires Java and Scala. The following instructions are for macOS users:

brew tap AdoptOpenJDK/openjdk
brew install --cask adoptopenjdk11
brew install jenv
echo 'export PATH="$HOME/.jenv/bin:$PATH"' >> ~/.zshrc
echo 'eval "$(jenv init -)"' >> ~/.zshrc

After updating your shell configuration, close and reopen your terminal, then set Java 11 as the global version using jenv:

jenv add /Library/Java/JavaVirtualMachines/adoptopenjdk-11.jdk/Contents/Home/
jenv global 11.0.11

Install GPT PDF Reader via pip:

pip install gptpdfreader

Configure the required environment variables in your .env file without spaces or unnecessary quotes:

OPENAI_API_KEY=open_ai_key
GOOGLE_ID=google_project_id
GOOGLE_BUCKET=google_bucket_name

Usage

To process a PDF and generate Markdown content:

from gptpdfreader.reader import main

main('path_to_your_pdf.pdf')

This will process the specified PDF and output a Markdown file with the extracted information in the same directory.

Limitations

some limitations

Contributing

We welcome contributions! Please open an issue or submit a pull request on our GitHub repository.

Support

For questions and support, please open an issue in the GitHub issue tracker.

License

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gpt_pdf_reader-1.5.tar.gz (4.9 kB view details)

Uploaded Source

Built Distribution

gpt_pdf_reader-1.5-py3-none-any.whl (5.2 kB view details)

Uploaded Python 3

File details

Details for the file gpt_pdf_reader-1.5.tar.gz.

File metadata

  • Download URL: gpt_pdf_reader-1.5.tar.gz
  • Upload date:
  • Size: 4.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for gpt_pdf_reader-1.5.tar.gz
Algorithm Hash digest
SHA256 f9c87b0a27fbd033f4c304b91fbf1671d285b5a8be68ae0ad46af2f8f015dc56
MD5 6be5368b77b3b3472e44969e619c6793
BLAKE2b-256 f3e4e4f469dbb48b8189b2796846bc5a533692be8c1b9d6f4c0e9371f5513c57

See more details on using hashes here.

File details

Details for the file gpt_pdf_reader-1.5-py3-none-any.whl.

File metadata

File hashes

Hashes for gpt_pdf_reader-1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 d7fa44debe137f3b5853ae949d193e8cd1b7452ed8771885db074484da524b43
MD5 8e4be1c2f0d354acf5e7260e56858aad
BLAKE2b-256 e13eaa0bce4131ad564f898b39774bb2a3116b5e0dd4c4a4b309b2fcd02ac3ab

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page