Skip to main content

PDF form-filling ecosystem: chatbot, doc-upload, mapper and RAG — install any combination

Project description

pdf-autofillr

PDF form-filling ecosystem — chatbot, doc-upload, mapper, and RAG — install any combination.

Install

# Full stack (everything)
pip install pdf-autofillr[all]

# Chatbot + mapper (conversational form filling)
pip install pdf-autofillr[chatbot]

# Doc upload + mapper (extract from document → fill PDF)
pip install pdf-autofillr[doc-upload]

# Chatbot + mapper + RAG (self-learning predictions)
pip install pdf-autofillr[chatbot,rag]

# Doc upload + mapper + RAG
pip install pdf-autofillr[doc-upload,rag]

# Chatbot + doc_upload + mapper (both input methods)
pip install pdf-autofillr[chatbot,doc-upload]

# Individual modules standalone
pip install pdf-autofillr-chatbot
pip install pdf-autofillr-doc-upload
pip install pdf-autofillr-mapper
pip install pdf-autofillr-rag

After install

# Write .env.example, configs/, data/ for your installed combination:
pdf-autofillr setup

# Check that everything is configured correctly:
pdf-autofillr status

Configure

cp .env.example .env
# Edit .env:
#   Set your API key  → OPENAI_API_KEY=sk-...
#   Set your PDF path → chatbot_PDF_PATH=./data/input/blank_form.pdf

Drop your blank (empty) PDF form into data/input/blank_form.pdf.

Start

pdf-autofillr chatbot       # start chatbot server (port 8001)
pdf-autofillr doc-upload    # start doc_upload server (port 8001)
pdf-autofillr mapper        # start mapper server (port 8000)
pdf-autofillr rag           # start RAG server (port 8000)

How the modules connect

User types → CHATBOT ──→ collects fields ──→ MAPPER ──→ fills blank_form.pdf
                                                ↕
User uploads doc → DOC_UPLOAD → extracts fields → MAPPER → fills blank_form.pdf
                                                ↕
                                             RAG ← learns from each run, predicts next time
  • chatbot → mapper: MAPPER_API_URL empty = inprocess (default). Set URL = HTTP server.
  • doc_upload → mapper: same pattern, MAPPER_API_URL.
  • mapper → rag: set RAG_ENABLED=true in .env + [rag] enabled=true in mapper_config.ini.

Cloud storage

Add cloud extras when needed:

pip install "pdf-autofillr[chatbot,s3]"    # chatbot with S3 storage
pip install "pdf-autofillr[all,gcp]"       # full stack with GCP
pip install "pdf-autofillr[all,azure]"     # full stack with Azure

RAG vector store

pip install "pdf-autofillr[chatbot,rag,rag-pinecone]"  # Pinecone
pip install "pdf-autofillr[chatbot,rag,rag-chroma]"    # ChromaDB

Module docs

  • chatbot/README.md
  • doc_upload/README.md
  • mapper/README.md
  • rag/README.md

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pdf_autofillr-1.1.0.tar.gz (14.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pdf_autofillr-1.1.0-py3-none-any.whl (14.8 kB view details)

Uploaded Python 3

File details

Details for the file pdf_autofillr-1.1.0.tar.gz.

File metadata

  • Download URL: pdf_autofillr-1.1.0.tar.gz
  • Upload date:
  • Size: 14.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for pdf_autofillr-1.1.0.tar.gz
Algorithm Hash digest
SHA256 f0d1d55c9314d0f4988b0c298df20d45fa660b22329511b254ecc348e2b83308
MD5 54e0628ac68d554cc9743363ea1f1bd2
BLAKE2b-256 aff53e332694fa2741a21117e0ff376450be27ddc6486bbd219b5aff1d8dcc14

See more details on using hashes here.

File details

Details for the file pdf_autofillr-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: pdf_autofillr-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 14.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for pdf_autofillr-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 cfce8fccfc2129d4296502e60b0def5b2a648ce88d75e966ca30a9177511d01e
MD5 d7d9f39978243605d76b5d82a51d444e
BLAKE2b-256 4c2f930ac0870318ca31e532720780a408c5714bebe901c4dd35dc25a438f3e5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page