Skip to main content

PDF form-filling ecosystem: chatbot, doc-upload, mapper and RAG — install any combination

Project description

pdf-autofillr

PDF form-filling ecosystem — chatbot, doc-upload, mapper, and RAG — install any combination.

Install

# Full stack (everything)
pip install pdf-autofillr[all]

# Chatbot + mapper (conversational form filling)
pip install pdf-autofillr[chatbot]

# Doc upload + mapper (extract from document → fill PDF)
pip install pdf-autofillr[doc-upload]

# Chatbot + mapper + RAG (self-learning predictions)
pip install pdf-autofillr[chatbot,rag]

# Doc upload + mapper + RAG
pip install pdf-autofillr[doc-upload,rag]

# Chatbot + doc_upload + mapper (both input methods)
pip install pdf-autofillr[chatbot,doc-upload]

# Individual modules standalone
pip install pdf-autofillr-chatbot
pip install pdf-autofillr-doc-upload
pip install pdf-autofillr-mapper
pip install pdf-autofillr-rag

After install

# Write .env.example, configs/, data/ for your installed combination:
pdf-autofillr setup

# Check that everything is configured correctly:
pdf-autofillr status

Configure

cp .env.example .env
# Edit .env:
#   Set your API key  → OPENAI_API_KEY=sk-...
#   Set your PDF path → chatbot_PDF_PATH=./data/input/blank_form.pdf

Drop your blank (empty) PDF form into data/input/blank_form.pdf.

Start

pdf-autofillr chatbot       # start chatbot server (port 8001)
pdf-autofillr doc-upload    # start doc_upload server (port 8001)
pdf-autofillr mapper        # start mapper server (port 8000)
pdf-autofillr rag           # start RAG server (port 8000)

How the modules connect

User types → CHATBOT ──→ collects fields ──→ MAPPER ──→ fills blank_form.pdf
                                                ↕
User uploads doc → DOC_UPLOAD → extracts fields → MAPPER → fills blank_form.pdf
                                                ↕
                                             RAG ← learns from each run, predicts next time
  • chatbot → mapper: MAPPER_API_URL empty = inprocess (default). Set URL = HTTP server.
  • doc_upload → mapper: same pattern, MAPPER_API_URL.
  • mapper → rag: set RAG_ENABLED=true in .env + [rag] enabled=true in mapper_config.ini.

Cloud storage

Add cloud extras when needed:

pip install "pdf-autofillr[chatbot,s3]"    # chatbot with S3 storage
pip install "pdf-autofillr[all,gcp]"       # full stack with GCP
pip install "pdf-autofillr[all,azure]"     # full stack with Azure

RAG vector store

pip install "pdf-autofillr[chatbot,rag,rag-pinecone]"  # Pinecone
pip install "pdf-autofillr[chatbot,rag,rag-chroma]"    # ChromaDB

Module docs

  • chatbot/README.md
  • doc_upload/README.md
  • mapper/README.md
  • rag/README.md

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pdf_autofillr-1.0.8.tar.gz (14.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pdf_autofillr-1.0.8-py3-none-any.whl (14.8 kB view details)

Uploaded Python 3

File details

Details for the file pdf_autofillr-1.0.8.tar.gz.

File metadata

  • Download URL: pdf_autofillr-1.0.8.tar.gz
  • Upload date:
  • Size: 14.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for pdf_autofillr-1.0.8.tar.gz
Algorithm Hash digest
SHA256 461c24de7019ec61d5b0d48df2e15de2d61810aa42f7dd65aec633cfae6e78de
MD5 012ee0d94ce52001e0163753ed744a2c
BLAKE2b-256 b739f02c8808d1cbe24d8f3580ff5de6a4cf9dfb5563909c8d6ba54cdbe793ba

See more details on using hashes here.

File details

Details for the file pdf_autofillr-1.0.8-py3-none-any.whl.

File metadata

  • Download URL: pdf_autofillr-1.0.8-py3-none-any.whl
  • Upload date:
  • Size: 14.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for pdf_autofillr-1.0.8-py3-none-any.whl
Algorithm Hash digest
SHA256 6886573297a2698503c8af59eaeb7c4f9e523f2fd75ea706b8c3d37f95863564
MD5 e025ea0944eeeac94e211292eb9c2cc6
BLAKE2b-256 06c7b5a31dbc22a869ec1479be4d415f8bddd876cc999f3ef4405932c08b5fbc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page