Skip to main content

PDF form-filling ecosystem: chatbot, doc-upload, mapper and RAG — install any combination

Project description

pdf-autofillr

PDF form-filling ecosystem — chatbot, doc-upload, mapper, and RAG — install any combination.

Install

# Full stack (everything)
pip install pdf-autofillr[all]

# Chatbot + mapper (conversational form filling)
pip install pdf-autofillr[chatbot]

# Doc upload + mapper (extract from document → fill PDF)
pip install pdf-autofillr[doc-upload]

# Chatbot + mapper + RAG (self-learning predictions)
pip install pdf-autofillr[chatbot,rag]

# Doc upload + mapper + RAG
pip install pdf-autofillr[doc-upload,rag]

# Chatbot + doc_upload + mapper (both input methods)
pip install pdf-autofillr[chatbot,doc-upload]

# Individual modules standalone
pip install pdf-autofillr-chatbot
pip install pdf-autofillr-doc-upload
pip install pdf-autofillr-mapper
pip install pdf-autofillr-rag

After install

# Write .env.example, configs/, data/ for your installed combination:
pdf-autofillr setup

# Check that everything is configured correctly:
pdf-autofillr status

Configure

cp .env.example .env
# Edit .env:
#   Set your API key  → OPENAI_API_KEY=sk-...
#   Set your PDF path → chatbot_PDF_PATH=./data/input/blank_form.pdf

Drop your blank (empty) PDF form into data/input/blank_form.pdf.

Start

pdf-autofillr chatbot       # start chatbot server (port 8001)
pdf-autofillr doc-upload    # start doc_upload server (port 8001)
pdf-autofillr mapper        # start mapper server (port 8000)
pdf-autofillr rag           # start RAG server (port 8000)

How the modules connect

User types → CHATBOT ──→ collects fields ──→ MAPPER ──→ fills blank_form.pdf
                                                ↕
User uploads doc → DOC_UPLOAD → extracts fields → MAPPER → fills blank_form.pdf
                                                ↕
                                             RAG ← learns from each run, predicts next time
  • chatbot → mapper: MAPPER_API_URL empty = inprocess (default). Set URL = HTTP server.
  • doc_upload → mapper: same pattern, MAPPER_API_URL.
  • mapper → rag: set RAG_ENABLED=true in .env + [rag] enabled=true in mapper_config.ini.

Cloud storage

Add cloud extras when needed:

pip install "pdf-autofillr[chatbot,s3]"    # chatbot with S3 storage
pip install "pdf-autofillr[all,gcp]"       # full stack with GCP
pip install "pdf-autofillr[all,azure]"     # full stack with Azure

RAG vector store

pip install "pdf-autofillr[chatbot,rag,rag-pinecone]"  # Pinecone
pip install "pdf-autofillr[chatbot,rag,rag-chroma]"    # ChromaDB

Module docs

  • chatbot/README.md
  • doc_upload/README.md
  • mapper/README.md
  • rag/README.md

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pdf_autofillr-1.0.5.tar.gz (14.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pdf_autofillr-1.0.5-py3-none-any.whl (14.8 kB view details)

Uploaded Python 3

File details

Details for the file pdf_autofillr-1.0.5.tar.gz.

File metadata

  • Download URL: pdf_autofillr-1.0.5.tar.gz
  • Upload date:
  • Size: 14.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for pdf_autofillr-1.0.5.tar.gz
Algorithm Hash digest
SHA256 8bf7f50b2c8984231191c2959e0f82ddf4d500bbee3d9c31f86810117719a792
MD5 974bd984017269344720449ec9c174b2
BLAKE2b-256 0db33639ed900657c1eef2d2cc19f2671029d9e193b2dac9a2eda7f957601b03

See more details on using hashes here.

File details

Details for the file pdf_autofillr-1.0.5-py3-none-any.whl.

File metadata

  • Download URL: pdf_autofillr-1.0.5-py3-none-any.whl
  • Upload date:
  • Size: 14.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for pdf_autofillr-1.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 79aef46dfa74c4731b62f9bf057681962b03c8d0ce9055cffe4a4d3a5a0a2509
MD5 4cd17d68584e7d747b41f23f1dd78bed
BLAKE2b-256 a10ebc1c4f8e537ec62e21266fce5802ca5b08180e1b26059dabacfd59839685

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page