Synthetic logistics document & photo dataset generator

Project description

penquify

OCR in reverse. Make your documents worse.

A Python toolkit that generates photorealistic smartphone photos of logistics documents
with coffee stains, folds, blur, skew — and verified ground truth for every field.

penquify.com · Docs · GitHub

From Chilean slang "penca" (lousy, worse) — because your document photos should look realistically bad, not studio-perfect.

How it works

You start with data. Penquify builds everything else.

ERP purchase order          penquify generates           penquify generates
(or any JSON/PDF)    ──►    dispatch guide PDF     ──►   realistic photos
                            with supplier jargon,        with verified
                            unit mismatches,             ground truth +
                            realistic discrepancies      occlusion manifest

You don't build the PDF. You don't design the document. You give penquify an OC number, a JSON payload, or upload an existing PDF — and it:

Generates a realistic document with supplier-style names (not your ERP master data names), realistic unit mismatches (CJ vs KG, UN vs L), and configurable quantity discrepancies
Renders a clean PDF from Jinja2 templates (dispatch guides, invoices, POs, BOLs)
Produces N photorealistic photos — each a different failure mode (blur, fold, stain, crop, angle)
Verifies every field by blind-extracting from the photo and comparing programmatically against source data
Generates an occlusion manifest explaining which fields are hidden in each variation and why

Before → After

Clean PDF (auto-generated)		Realistic Photo (verified)
	→

Every variation from the same document

`full_picture`	`folded_skewed`	`strong_oblique`
`coffee_stain`	`stain + angle`	`galaxy_s7`

8 built-in presets + infinite custom via JSON or natural language.

# From scratch — penquify generates the document AND the photos
penquify demo

# From an existing PDF — penquify detects the schema and generates variations
penquify upload --image existing_invoice.pdf

# From a description — no JSON needed
penquify config --text "folded paper with grease, shot on old Motorola"

Getting Started

Install

pip install penquify

# or from source
git clone https://github.com/MAXMARDONES/penquify.git
cd penquify && pip install -e ".[all]"

# browser engine for HTML → PDF rendering
playwright install chromium

Environment

export GEMINI_API_KEY="your-key"   # required for photo generation
export PENQUIFY_OUTPUT="./output"  # where files go (default: ~/penquify-output)

Run

# Full demo: PDF + 8 photo variations
penquify demo

# PDF only from JSON
penquify pdf --doc-json invoice.json

# Photos from any document image
penquify photos --image scan.png --presets full_picture blurry coffee_stain

# Full dataset: 10 documents x 3 variations each
penquify dataset --doc-json docs.json --presets full_picture folded_skewed blurry

Python

from penquify.models import Document, DocHeader, DocItem, PhotoVariation, Stain
from penquify.generators.pdf import generate_document_files
from penquify.generators.photo import generate_dataset

doc = Document(
    header=DocHeader(doc_type="guia_despacho", doc_number="00847291", date="16/04/2026",
                     emitter_name="ACME FOODS LTDA.", oc_number="4500000316"),
    items=[
        DocItem(pos=1, code="AF-001", description="FROZEN POTATO WEDGES",
                qty=12, unit="CJ", unit_price=15000, total=180000),
    ],
)

files = await generate_document_files(doc, "output/")
photos = await generate_dataset(files["png"], preset_names=["full_picture", "blurry"])

Document Templates

Template	Description	Status
`guia_despacho`	Chilean dispatch guide (guia de despacho electronica)	Done
`factura_sii`	Chilean tax invoice (DTE tipo 33, SII XML)	Planned
`purchase_order`	Standard purchase order	Planned
`bill_of_lading`	Transport bill of lading (BOL)	Planned
`nota_credito`	Credit note (DTE tipo 61)	Planned
`remito`	Argentine dispatch note	Planned

Templates are Jinja2 HTML — add your own:

penquify pdf --template my_template.html --doc-json data.json

Photo Variations

A fixed system instruction handles base realism (paper physics, camera behavior, operational context). The variation config controls specifics. Every field is optional — override only what you need.

8 Built-in Presets

Preset	What it tests
`full_picture`	Baseline: clean handheld shot, 90% frame coverage
`folded_skewed`	Geometric distortion: dog-ear, crease, 6deg tilt
`zoomed_detail`	Close-up OCR: tight crop, oblique 25-30deg
`blurry`	Motion blur: rushed capture, partial legibility
`cropped_header`	Missing data: top 10-15% cut off
`strong_oblique`	Extreme angle: 45deg, strong curvature
`coffee_stain`	Contamination: stain over text
`stapled_stack`	Multi-page: stapled with sheets behind

Full Variation Schema

{
  "name": "my_variation",
  "camera": "Samsung Galaxy S8",
  "year_device_style": "2017 Android",
  "aspect_ratio": "4:3",
  "document_coverage": "90% of frame",
  "background": "blurred warehouse at edges",
  "curvature": "slight",
  "folds": "dog_ear",
  "wrinkles": "medium",
  "angle": "45 degree oblique",
  "skew": "strong",
  "rotation_degrees": 8,
  "motion_blur": true,
  "glare": "strong",
  "shadow_from_hand": true,
  "jpeg_compression": "heavy",
  "hand_visible": true,
  "grip_type": "both hands",
  "glove": "warehouse glove",
  "stain": {"type": "coffee", "location": "upper_right", "opacity": "heavy", "text_obstruction": "partial"},
  "cropped_header": true,
  "stapled": true,
  "stacked_sheets_behind": 2
}

Every string field is free text — cameras, angles, backgrounds, grip types. Use presets or write whatever describes your scenario.

22 Camera Presets (+ free text)

galaxy_s7 galaxy_s8 galaxy_a5_2017 moto_g5 iphone_7 iphone_8 pixel_2 huawei_p10 xiaomi_note4 galaxy_s9 iphone_xr galaxy_a10 galaxy_a50 iphone_11 galaxy_a21s iphone_12 pixel_4a galaxy_a13 iphone_14 pixel_7 warehouse_generic field_worker

Or any free text: PhotoVariation(camera="Nokia 3310 with cracked screen")

Natural Language Config

Don't know the schema? Just describe it:

from penquify.generators.config import text_to_variation

config = await text_to_variation(
    "blurry photo with coffee stain, strong angle, old Samsung, paper folded in half"
)
# → returns valid PhotoVariation JSON

REST API

uvicorn penquify.api.server:app --port 8080

Method	Path	Description
`POST`	`/generate/document`	Document JSON → PDF + PNG
`POST`	`/generate/photos`	Image → realistic photos
`POST`	`/generate/dataset`	Document → PDF → photos (full pipeline)
`POST`	`/generate/config`	Natural language → variation JSON
`GET`	`/documents`	List generated runs
`GET`	`/documents/{id}/{file}`	Download file
`GET`	`/presets`	Photo presets
`GET`	`/templates`	Document templates

MCP Server

5 tools for Claude Desktop, Cursor, Windsurf, or any MCP client:

{
  "mcpServers": {
    "penquify": {
      "command": "python3",
      "args": ["-m", "penquify.mcp"],
      "env": {"GEMINI_API_KEY": "your-key"}
    }
  }
}

Tools: penquify_generate_document penquify_generate_photos penquify_generate_dataset penquify_text_to_config penquify_list_presets

Claude Code Skills

/penquify          # Full reference: presets, cameras, variation schema
/generate          # Generate a document from description or JSON
/dataset           # Generate large synthetic datasets
/add-template      # Add a new document template

Agent SDK Plugin

from penquify.agent_plugin import penquify_tools

agent = Agent(model="claude-sonnet-4-6", tools=penquify_tools)

Deployment

Docker

docker build -t penquify .
docker run -p 8080:8080 -e GEMINI_API_KEY=xxx penquify

docker-compose (with PostgreSQL)

GEMINI_API_KEY=xxx docker-compose up

Kubernetes

kubectl apply -f k8s/secret.yaml   # set GEMINI_API_KEY first
kubectl apply -f k8s/deployment.yaml

Architecture

penquify/
  templates/         Jinja2 HTML per doc type
  generators/
    pdf.py           HTML → PDF/PNG (Playwright)
    photo.py         PNG → realistic photo (Gemini image gen)
    config.py        text → variation JSON (Gemini text)
  models/
    document.py      DocHeader + DocItem + Document
    variation.py     PhotoVariation + Stain + 8 presets
    cameras.py       22 camera presets + free text
  api/server.py      FastAPI REST
  mcp.py             MCP server (5 tools)
  agent_plugin.py    Agent SDK plugin
  storage/s3.py      AWS S3 upload
  cli.py             CLI entry point

Roadmap

Jinja2 templates + Playwright PDF/PNG
Gemini photo gen with system instruction + variation config
8 photo presets + 22 camera presets
CLI (penquify demo/pdf/photos/dataset)
FastAPI REST server (8 endpoints)
MCP server (5 tools)
Agent SDK plugin
Claude Code skills (4 commands)
Natural language → variation JSON (Gemini)
S3 upload support
Dockerfile + docker-compose + K8s manifests
GitHub Actions CI
CODE_OF_CONDUCT + CONTRIBUTING + LICENSE
PostgreSQL persistent storage
PostgREST auto-API
More templates: factura SII, PO, BOL
SII DTE XML generation
Batch dataset generation with progress bar
PyPI publish
Demo images in README

License

MIT

penquify.com | Docs | GitHub

_{Built by Max Mardones}

Project details

Release history Release notifications | RSS feed

This version

0.1.0

Apr 16, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

penquify-0.1.0.tar.gz (35.5 kB view details)

Uploaded Apr 16, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

penquify-0.1.0-py3-none-any.whl (35.9 kB view details)

Uploaded Apr 16, 2026 Python 3

File details

Details for the file penquify-0.1.0.tar.gz.

File metadata

Download URL: penquify-0.1.0.tar.gz
Upload date: Apr 16, 2026
Size: 35.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for penquify-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`59d010859e75813e5588a595dbd423b9575defda40897bc437abb2d8ddc6c655`
MD5	`7d84f53bedaf554e547a4fb89495772e`
BLAKE2b-256	`2cd12e872cbcb44791e16648e4d4dd94d4eb44c58767c32e125628b46ef62d45`

See more details on using hashes here.

File details

Details for the file penquify-0.1.0-py3-none-any.whl.

File metadata

Download URL: penquify-0.1.0-py3-none-any.whl
Upload date: Apr 16, 2026
Size: 35.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for penquify-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b15c1fc3d5386295d0aaffebf033e7f3acb951b5e8f32e892fe4693ac412c701`
MD5	`ccfd1b63ad1167f32497e8ad53677caa`
BLAKE2b-256	`deb8c651bf20f6d93901848d88e19fd54293b03cf1ad0d808fa81f5e611f8531`

See more details on using hashes here.

penquify 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

penquify

How it works

You start with data. Penquify builds everything else.

Before → After

Every variation from the same document

Getting Started

Install

Environment

Run

Python

Document Templates

Photo Variations

8 Built-in Presets

Full Variation Schema

22 Camera Presets (+ free text)

Natural Language Config

REST API

MCP Server

Claude Code Skills

Agent SDK Plugin

Deployment

Docker

docker-compose (with PostgreSQL)

Kubernetes

Architecture

Roadmap

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes