Skip to main content

PDF form-filling ecosystem: chatbot, doc-upload, mapper and RAG — install any combination

Project description

pdf-autofillr

PDF form-filling ecosystem — chatbot, doc-upload, mapper, and RAG — install any combination.

Install

# Full stack (everything)
pip install pdf-autofillr[all]

# Chatbot + mapper (conversational form filling)
pip install pdf-autofillr[chatbot]

# Doc upload + mapper (extract from document → fill PDF)
pip install pdf-autofillr[doc-upload]

# Chatbot + mapper + RAG (self-learning predictions)
pip install pdf-autofillr[chatbot,rag]

# Doc upload + mapper + RAG
pip install pdf-autofillr[doc-upload,rag]

# Chatbot + doc_upload + mapper (both input methods)
pip install pdf-autofillr[chatbot,doc-upload]

# Individual modules standalone
pip install pdf-autofillr-chatbot
pip install pdf-autofillr-doc-upload
pip install pdf-autofillr-mapper
pip install pdf-autofillr-rag

After install

# Write .env.example, configs/, data/ for your installed combination:
pdf-autofillr setup

# Check that everything is configured correctly:
pdf-autofillr status

Configure

cp .env.example .env
# Edit .env:
#   Set your API key  → OPENAI_API_KEY=sk-...
#   Set your PDF path → chatbot_PDF_PATH=./data/input/blank_form.pdf

Drop your blank (empty) PDF form into data/input/blank_form.pdf.

Start

pdf-autofillr chatbot       # start chatbot server (port 8001)
pdf-autofillr doc-upload    # start doc_upload server (port 8001)
pdf-autofillr mapper        # start mapper server (port 8000)
pdf-autofillr rag           # start RAG server (port 8000)

How the modules connect

User types → CHATBOT ──→ collects fields ──→ MAPPER ──→ fills blank_form.pdf
                                                ↕
User uploads doc → DOC_UPLOAD → extracts fields → MAPPER → fills blank_form.pdf
                                                ↕
                                             RAG ← learns from each run, predicts next time
  • chatbot → mapper: MAPPER_API_URL empty = inprocess (default). Set URL = HTTP server.
  • doc_upload → mapper: same pattern, MAPPER_API_URL.
  • mapper → rag: set RAG_ENABLED=true in .env + [rag] enabled=true in mapper_config.ini.

Cloud storage

Add cloud extras when needed:

pip install "pdf-autofillr[chatbot,s3]"    # chatbot with S3 storage
pip install "pdf-autofillr[all,gcp]"       # full stack with GCP
pip install "pdf-autofillr[all,azure]"     # full stack with Azure

RAG vector store

pip install "pdf-autofillr[chatbot,rag,rag-pinecone]"  # Pinecone
pip install "pdf-autofillr[chatbot,rag,rag-chroma]"    # ChromaDB

Module docs

  • chatbot/README.md
  • doc_upload/README.md
  • mapper/README.md
  • rag/README.md

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pdf_autofillr-1.1.1.tar.gz (14.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pdf_autofillr-1.1.1-py3-none-any.whl (14.8 kB view details)

Uploaded Python 3

File details

Details for the file pdf_autofillr-1.1.1.tar.gz.

File metadata

  • Download URL: pdf_autofillr-1.1.1.tar.gz
  • Upload date:
  • Size: 14.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for pdf_autofillr-1.1.1.tar.gz
Algorithm Hash digest
SHA256 c540c3312c2966c340de0b59a8d3563aa69c5527ad5050b00c5aeec73153cd68
MD5 1775a568c92b398856be873bd7949184
BLAKE2b-256 dc52f4759cd85ca6cb33fa4d6487030ca2bb4884d88df0202477c8ec857d3248

See more details on using hashes here.

File details

Details for the file pdf_autofillr-1.1.1-py3-none-any.whl.

File metadata

  • Download URL: pdf_autofillr-1.1.1-py3-none-any.whl
  • Upload date:
  • Size: 14.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for pdf_autofillr-1.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 39606f1d01aaa1ffa5de855fc0148a7b97d0fe6a43e23fb5222e736c390cdb21
MD5 62b81098a2b0939be7726525f5496462
BLAKE2b-256 1571b2044577cbd51714b22debf3b3af677c0e95deb73bce5fe5c8e33e7f0643

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page