Skip to main content

PDF form-filling ecosystem: chatbot, doc-upload, mapper and RAG — install any combination

Project description

pdf-autofillr

PDF form-filling ecosystem — chatbot, doc-upload, mapper, and RAG — install any combination.

Install

# Full stack (everything)
pip install pdf-autofillr[all]

# Chatbot + mapper (conversational form filling)
pip install pdf-autofillr[chatbot]

# Doc upload + mapper (extract from document → fill PDF)
pip install pdf-autofillr[doc-upload]

# Chatbot + mapper + RAG (self-learning predictions)
pip install pdf-autofillr[chatbot,rag]

# Doc upload + mapper + RAG
pip install pdf-autofillr[doc-upload,rag]

# Chatbot + doc_upload + mapper (both input methods)
pip install pdf-autofillr[chatbot,doc-upload]

# Individual modules standalone
pip install pdf-autofillr-chatbot
pip install pdf-autofillr-doc-upload
pip install pdf-autofillr-mapper
pip install pdf-autofillr-rag

After install

# Write .env.example, configs/, data/ for your installed combination:
pdf-autofillr setup

# Check that everything is configured correctly:
pdf-autofillr status

Configure

cp .env.example .env
# Edit .env:
#   Set your API key  → OPENAI_API_KEY=sk-...
#   Set your PDF path → chatbot_PDF_PATH=./data/input/blank_form.pdf

Drop your blank (empty) PDF form into data/input/blank_form.pdf.

Start

pdf-autofillr chatbot       # start chatbot server (port 8001)
pdf-autofillr doc-upload    # start doc_upload server (port 8001)
pdf-autofillr mapper        # start mapper server (port 8000)
pdf-autofillr rag           # start RAG server (port 8000)

How the modules connect

User types → CHATBOT ──→ collects fields ──→ MAPPER ──→ fills blank_form.pdf
                                                ↕
User uploads doc → DOC_UPLOAD → extracts fields → MAPPER → fills blank_form.pdf
                                                ↕
                                             RAG ← learns from each run, predicts next time
  • chatbot → mapper: MAPPER_API_URL empty = inprocess (default). Set URL = HTTP server.
  • doc_upload → mapper: same pattern, MAPPER_API_URL.
  • mapper → rag: set RAG_ENABLED=true in .env + [rag] enabled=true in mapper_config.ini.

Cloud storage

Add cloud extras when needed:

pip install "pdf-autofillr[chatbot,s3]"    # chatbot with S3 storage
pip install "pdf-autofillr[all,gcp]"       # full stack with GCP
pip install "pdf-autofillr[all,azure]"     # full stack with Azure

RAG vector store

pip install "pdf-autofillr[chatbot,rag,rag-pinecone]"  # Pinecone
pip install "pdf-autofillr[chatbot,rag,rag-chroma]"    # ChromaDB

Module docs

  • chatbot/README.md
  • doc_upload/README.md
  • mapper/README.md
  • rag/README.md

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pdf_autofillr-1.1.4.tar.gz (16.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pdf_autofillr-1.1.4-py3-none-any.whl (14.8 kB view details)

Uploaded Python 3

File details

Details for the file pdf_autofillr-1.1.4.tar.gz.

File metadata

  • Download URL: pdf_autofillr-1.1.4.tar.gz
  • Upload date:
  • Size: 16.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for pdf_autofillr-1.1.4.tar.gz
Algorithm Hash digest
SHA256 53fb0be25094bf17bfe89bbd832aabe5ff96a26805d63be1b975c2b673408419
MD5 27024435f865c296da76295ebd3413b7
BLAKE2b-256 8dcd61a9d34daf7986752af92d68fc993c71e864502827772529a6fb1429e511

See more details on using hashes here.

File details

Details for the file pdf_autofillr-1.1.4-py3-none-any.whl.

File metadata

  • Download URL: pdf_autofillr-1.1.4-py3-none-any.whl
  • Upload date:
  • Size: 14.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for pdf_autofillr-1.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 42678d75381de8e81193b4d190934e12c41af0d60c47ed10699fa76d93797a6b
MD5 104aa12ac8491475b46f18ea7cc8e01b
BLAKE2b-256 7ce9852895b10d7070cd0b0fca58a7309bf10e948a2741649b1a6fe36d7ca222

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page