Python client for PDFDancer API

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

mlahr

These details have not been verified by PyPI

Project links

Project description

PDFDancer Python Client

PDFDancer logo

PDF used to be read-only. We fixed that.

Edit text in real-world PDFs—even ones you didn't create. Move images, reposition headers, and change fonts with pixel-perfect control from Python. The same API is also available for TypeScript and Java.

Need the raw API schema? The latest OpenAPI description lives in docs/openapi.yml and is published at https://bucket.pdfdancer.com/api-doc/development-0.0.yml.

Highlights

Locate paragraphs, text lines, images, vector paths, form fields, and pages by page number, coordinates, or text patterns.
Edit existing content in place with fluent editors and context managers that apply changes safely.
Programmatically control third-party PDFs—modify invoices, contracts, and reports you did not author.
Add content with precise XY positioning using paragraph, image, and vector path builders with custom fonts and colors.
Draw lines, rectangles, and Bezier curves with configurable stroke width, dash patterns, and fill colors.
Redact sensitive content—replace text, images, or form fields with customizable placeholders.
Export results as bytes for downstream processing or save directly to disk with one call.

What Makes PDFDancer Different

Edit text in real-world PDFs: Work with documents from customers, governments, or vendors—even ones you didn't create.
Pixel-perfect positioning: Move or add elements at exact coordinates and keep the original layout intact.
Surgical text replacement: Swap or rewrite paragraphs without reflowing the rest of the page.
Form manipulation: Inspect, fill, and update AcroForm fields programmatically.
Coordinate-based selection: Select objects by position, bounding box, or text patterns.
Vector graphics: Draw lines, rectangles, and Bezier curves with full control over stroke and fill properties.
Secure redaction: Permanently remove sensitive content and replace with customizable markers.
Real PDF editing: Modify the underlying PDF structure instead of merely stamping overlays.

Installation

pip install pdfdancer-client-python

# Editable install for local development
pip install -e .

Requires Python 3.10+ and a PDFDancer API token.

Quick Start — Edit an Existing PDF

from pathlib import Path
from pdfdancer import Color, PDFDancer, StandardFonts

with PDFDancer.open(
    pdf_data=Path("input.pdf"),
    token="your-api-token",             # optional when PDFDANCER_API_TOKEN is set
    base_url="https://api.pdfdancer.com",
) as pdf:
    # Locate and update an existing paragraph
    heading = pdf.page(0).select_paragraphs_starting_with("Executive Summary")[0]
    heading.move_to(72, 680)
    with heading.edit() as editor:
        editor.replace("Overview")

    # Add a new paragraph with precise placement
    pdf.new_paragraph() \
        .text("Generated with PDFDancer") \
        .font(StandardFonts.HELVETICA, 12) \
        .color(Color(70, 70, 70)) \
        .line_spacing(1.4) \
        .at(page_number=1, x=72, y=520) \
        .add()

    # Persist the modified document
    pdf.save("output.pdf")
    # or keep it in memory
    pdf_bytes = pdf.get_bytes()

Create a Blank PDF

from pathlib import Path
from pdfdancer import Color, PDFDancer, StandardFonts

with PDFDancer.new(token="your-api-token") as pdf:
    pdf.new_paragraph() \
        .text("Quarterly Summary") \
        .font(StandardFonts.TIMES_BOLD, 18) \
        .color(Color(10, 10, 80)) \
        .line_spacing(1.2) \
        .at(page_number=1, x=72, y=730) \
        .add()

    pdf.new_image() \
        .from_file(Path("logo.png")) \
        .at(page=0, x=420, y=710) \
        .add()

    pdf.save("summary.pdf")

Work with Forms and Layout

from pdfdancer import PDFDancer

with PDFDancer.open("contract.pdf") as pdf:
    # Inspect global document structure
    pages = pdf.pages()
    print("Total pages:", len(pages))

    # Update form fields
    signature = pdf.select_form_fields_by_name("signature")[0]
    signature.edit().value("Signed by Jane Doe").apply()

    # Trim or move content at specific coordinates
    images = pdf.page(1).select_images()
    for image in images:
        x = image.position.x()
        if x is not None and x < 100:
            image.delete()

Selectors return typed objects (ParagraphObject, TextLineObject, ImageObject, FormFieldObject, PageClient, …) with helpers such as delete(), move_to(x, y), clear_clipping(), redact(), or edit() depending on the object type.

Singular selection methods return the first match (or None) for convenience:

# Instead of: paragraphs = page.select_paragraphs_starting_with("Invoice")[0]
paragraph = page.select_paragraph_starting_with("Invoice")  # Returns first match or None
image = page.select_image_at(100, 200)                      # Returns first match or None
field = pdf.select_form_field_by_name("email")              # Returns first match or None

Draw Vector Paths

Add lines, curves, and shapes to your PDFs with fluent builders:

from pdfdancer import PDFDancer, Color, Point

with PDFDancer.open("document.pdf") as pdf:
    page = pdf.page(0)

    # Draw a simple line
    page.new_line() \
        .from_point(100, 700) \
        .to_point(500, 700) \
        .stroke_color(Color(0, 0, 255)) \
        .stroke_width(2.0) \
        .add()

    # Draw a rectangle
    page.new_rectangle() \
        .at_coordinates(100, 500) \
        .with_size(200, 100) \
        .stroke_color(Color(0, 0, 0)) \
        .fill_color(Color(255, 255, 200)) \
        .add()

    # Draw a bezier curve
    page.new_bezier() \
        .from_point(100, 400) \
        .control_point_1(150, 450) \
        .control_point_2(250, 350) \
        .to_point(300, 400) \
        .stroke_width(1.5) \
        .add()

    # Build complex paths with multiple segments
    page.new_path() \
        .stroke_color(Color(255, 0, 0)) \
        .add_line(Point(50, 200), Point(150, 200)) \
        .add_line(Point(150, 200), Point(100, 280)) \
        .add_line(Point(100, 280), Point(50, 200)) \
        .add()

    pdf.save("annotated.pdf")

Redact Sensitive Content

Remove text, images, or form fields and replace them with redaction markers:

from pdfdancer import PDFDancer, Color

with PDFDancer.open("confidential.pdf") as pdf:
    # Redact paragraphs containing sensitive patterns
    for para in pdf.select_paragraphs():
        if "SSN:" in para.text or "Password:" in para.text:
            para.redact("[REDACTED]")

    # Redact all images on a specific page
    for image in pdf.page(0).select_images():
        image.redact()

    # Bulk redact multiple objects with custom placeholder color
    form_fields = pdf.select_form_fields_by_name("credit_card")
    result = pdf.redact(form_fields, replacement="[REMOVED]", placeholder_color=Color(0, 0, 0))
    print(f"Redacted {result.count} items")

    pdf.save("redacted.pdf")

Configuration

Set PDFDANCER_API_TOKEN for authentication (preferred). PDFDANCER_TOKEN is also supported for backwards compatibility.
Override the API host with PDFDANCER_BASE_URL (e.g., sandbox or local environments). Defaults to https://api.pdfdancer.com.
Tune HTTP read timeouts via the timeout argument on PDFDancer.open() and PDFDancer.new() (default: 30 seconds).
For testing against self-signed certificates, call pdfdancer.set_ssl_verify(False) to temporarily disable TLS verification.

Error Handling

Operations raise subclasses of PdfDancerException:

ValidationException: input validation problems (missing token, invalid coordinates, etc.).
FontNotFoundException: requested font unavailable on the service.
HttpClientException: transport or server errors with detailed context.
SessionException: session creation and lifecycle failures.
RateLimitException: API rate limit exceeded; includes retry-after timing.

Wrap automated workflows in try/except blocks to surface actionable errors to your users.

Development Setup

Prerequisites

Python 3.10 or higher (Python 3.9 has SSL issues with large file uploads)
Git for cloning the repository
PDFDancer API token for running end-to-end tests

Step-by-Step Setup

1. Clone the Repository

git clone https://github.com/MenschMachine/pdfdancer-client-python.git
cd pdfdancer-client-python

2. Create a Virtual Environment

# Create virtual environment
python -m venv venv

# Activate the virtual environment
# On macOS/Linux:
source venv/bin/activate

# On Windows:
venv\Scripts\activate

You should see (venv) in your terminal prompt indicating the virtual environment is active.

3. Install Dependencies

# Install the package in editable mode with development dependencies
pip install -e ".[dev]"

# Alternatively, install runtime dependencies only:
# pip install -e .

This installs:

The pdfdancer package in editable mode (changes reflect immediately)
Development tooling including pytest, pytest-cov, pytest-mock, black, isort, flake8, mypy, build, and twine.

4. Configure API Token

Set your PDFDancer API token as an environment variable:

# On macOS/Linux:
export PDFDANCER_API_TOKEN="your-api-token-here"

# On Windows (Command Prompt):
set PDFDANCER_API_TOKEN=your-api-token-here

# On Windows (PowerShell):
$env:PDFDANCER_API_TOKEN="your-api-token-here"

For permanent configuration, add this to your shell profile (~/.bashrc, ~/.zshrc, etc.).

5. Verify Installation

# Run the test suite
pytest tests/ -v

# Run only unit tests (faster)
pytest tests/test_models.py -v

# Run end-to-end tests (requires API token)
pytest tests/e2e/ -v

All tests should pass if everything is set up correctly.

Common Development Tasks

Running Tests

# Run all tests with verbose output
pytest tests/ -v

# Run specific test file
pytest tests/test_models.py -v

# Run end-to-end tests only
pytest tests/e2e/ -v

# Run with coverage report
pytest tests/ --cov=pdfdancer --cov-report=term-missing

Building Distribution Packages

# Build wheel and source distribution
python -m build

# Verify the built packages
python -m twine check dist/*

Artifacts will be created in the dist/ directory. Package versions are derived from Git tags via setuptools-scm.

Publishing to PyPI

Releases are published automatically to PyPI when a v* tag is pushed to GitHub (via GitHub Actions with Trusted Publishers).

# Create and push a release tag — GitHub Actions handles the rest
git tag v1.1.0
git push origin v1.1.0

Code Quality

# Format code
black src tests
isort src tests

# Lint
flake8 src tests

# Type checking
mypy src/pdfdancer/

Project Structure

pdfdancer-client-python/
├── src/pdfdancer/           # Main package source
│   ├── __init__.py          # Package exports
│   ├── pdfdancer_v1.py      # Core PDFDancer and PageClient classes
│   ├── paragraph_builder.py # Fluent paragraph builders
│   ├── text_line_builder.py # Fluent text line builders
│   ├── image_builder.py     # Fluent image builders
│   ├── path_builder.py      # Vector path builders (lines, beziers, rectangles)
│   ├── page_builder.py      # Page creation builder
│   ├── models.py            # Data models (Position, Font, Color, etc.)
│   ├── types.py             # Object wrappers (ParagraphObject, etc.)
│   └── exceptions.py        # Exception hierarchy
├── tests/                   # Test suite
│   ├── test_models.py       # Model unit tests
│   ├── e2e/                 # End-to-end integration tests
│   └── fixtures/            # Test fixtures and sample PDFs
├── docs/                    # Documentation
├── dist/                    # Build artifacts (created after packaging)
├── pyproject.toml           # Project metadata and dependencies
└── README.md                # This file

Troubleshooting

Virtual Environment Issues

If python -m venv venv fails, ensure you have the venv module:

# On Ubuntu/Debian
sudo apt-get install python3-venv

# On macOS (using Homebrew)
brew install python@3.10

SSL Errors with Large Files

Upgrade to Python 3.10+ if you encounter SSL errors during large file uploads.

Import Errors

Ensure the virtual environment is activated and the package is installed in editable mode:

source venv/bin/activate  # or venv\Scripts\activate on Windows
pip install -e .

Test Failures

Ensure PDFDANCER_API_TOKEN is set for e2e tests
Check network connectivity to the PDFDancer API
Verify you're using Python 3.10 or higher

Contributing

Contributions are welcome via pull request. Please:

Create a feature branch from main
Add tests for new functionality
Ensure all tests pass: pytest tests/ -v
Follow existing code style and patterns
Update documentation as needed

Helpful links

Related SDKs

TypeScript client: https://github.com/MenschMachine/pdfdancer-client-typescript
Java client: https://github.com/MenschMachine/pdfdancer-client-java

License

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

mlahr

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.3.14

Apr 3, 2026

0.3.13

Mar 24, 2026

0.3.12

Mar 16, 2026

0.3.11

Mar 4, 2026

0.3.10

Feb 26, 2026

0.3.9

Feb 20, 2026

0.3.8

Feb 20, 2026

0.3.7

Jan 12, 2026

0.3.6

Jan 7, 2026

0.3.5

Dec 28, 2025

0.3.4

Dec 18, 2025

0.3.3

Dec 8, 2025

0.3.2

Dec 2, 2025

0.3.1

Nov 25, 2025

0.2.29

Nov 19, 2025

0.2.28

Nov 19, 2025

0.2.27

Nov 19, 2025

0.2.26

Nov 19, 2025

0.2.25

Nov 19, 2025

0.2.24

Nov 13, 2025

0.2.23

Nov 11, 2025

0.2.22

Nov 6, 2025

0.2.21

Oct 30, 2025

0.2.20

Oct 28, 2025

0.2.19

Oct 27, 2025

0.2.18

Oct 27, 2025

0.2.17

Oct 23, 2025

0.2.16

Oct 21, 2025

0.2.15

Oct 21, 2025

0.2.14

Oct 20, 2025

0.2.13

Oct 20, 2025

0.2.12

Oct 18, 2025

0.2.11

Oct 18, 2025

0.2.10

Oct 16, 2025

0.2.9

Oct 16, 2025

0.2.8

Oct 16, 2025

0.2.7

Oct 16, 2025

0.2.6

Oct 15, 2025

0.2.5

Oct 15, 2025

0.2.4

Oct 15, 2025

0.2.3

Oct 15, 2025

0.2.2

Oct 15, 2025

0.1.2

Sep 19, 2025

0.1.1

Sep 18, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pdfdancer_client_python-0.3.14.tar.gz (597.7 kB view details)

Uploaded Apr 3, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pdfdancer_client_python-0.3.14-py3-none-any.whl (71.8 kB view details)

Uploaded Apr 3, 2026 Python 3

File details

Details for the file pdfdancer_client_python-0.3.14.tar.gz.

File metadata

Download URL: pdfdancer_client_python-0.3.14.tar.gz
Upload date: Apr 3, 2026
Size: 597.7 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pdfdancer_client_python-0.3.14.tar.gz
Algorithm	Hash digest
SHA256	`ed8bca71abe07b1e500a0454387f8ed75273a7169047f304b58dc7c4342ac226`
MD5	`1ee0861ae8cf34b1551bc3470f7eb095`
BLAKE2b-256	`199e8e3a2a61983f21274654c10c8b21ddfcc424953bda67531cae5a14ab5f33`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pdfdancer_client_python-0.3.14.tar.gz:

Publisher: release.yml on MenschMachine/pdfdancer-client-python

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pdfdancer_client_python-0.3.14.tar.gz
- Subject digest: ed8bca71abe07b1e500a0454387f8ed75273a7169047f304b58dc7c4342ac226
- Sigstore transparency entry: 1223117634
- Sigstore integration time: Apr 3, 2026
Source repository:
- Permalink: MenschMachine/pdfdancer-client-python@8f51df18ac16bf5d64e5185ac0aa39b7bfcca382
- Branch / Tag: refs/tags/v0.3.14
- Owner: https://github.com/MenschMachine
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@8f51df18ac16bf5d64e5185ac0aa39b7bfcca382
- Trigger Event: push

File details

Details for the file pdfdancer_client_python-0.3.14-py3-none-any.whl.

File metadata

Download URL: pdfdancer_client_python-0.3.14-py3-none-any.whl
Upload date: Apr 3, 2026
Size: 71.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pdfdancer_client_python-0.3.14-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b9e0063dec424a4d406f60bbe99e214411bd0e94c8f36e13dc4cd57e0ac1173f`
MD5	`e1ee6a33589c8b66da36e96422e38c49`
BLAKE2b-256	`afe1c7e22850f5f5474cc0c073571fb7100d9f0852b0ad997039814b3dc781d6`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pdfdancer_client_python-0.3.14-py3-none-any.whl:

Publisher: release.yml on MenschMachine/pdfdancer-client-python

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pdfdancer_client_python-0.3.14-py3-none-any.whl
- Subject digest: b9e0063dec424a4d406f60bbe99e214411bd0e94c8f36e13dc4cd57e0ac1173f
- Sigstore transparency entry: 1223117706
- Sigstore integration time: Apr 3, 2026
Source repository:
- Permalink: MenschMachine/pdfdancer-client-python@8f51df18ac16bf5d64e5185ac0aa39b7bfcca382
- Branch / Tag: refs/tags/v0.3.14
- Owner: https://github.com/MenschMachine
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@8f51df18ac16bf5d64e5185ac0aa39b7bfcca382
- Trigger Event: push

pdfdancer-client-python 0.3.14

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

PDFDancer Python Client

PDF used to be read-only. We fixed that.

Highlights

What Makes PDFDancer Different

Installation

Quick Start — Edit an Existing PDF

Create a Blank PDF

Work with Forms and Layout

Draw Vector Paths

Redact Sensitive Content

Configuration

Error Handling

Development Setup

Prerequisites

Step-by-Step Setup

1. Clone the Repository

2. Create a Virtual Environment

3. Install Dependencies

4. Configure API Token

5. Verify Installation

Common Development Tasks

Running Tests

Building Distribution Packages

Publishing to PyPI

Code Quality

Project Structure

Troubleshooting

Virtual Environment Issues

SSL Errors with Large Files

Import Errors

Test Failures

Contributing

Helpful links

Related SDKs

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance