A CLI tool to edit PDF slides using natural language prompts, powered by Gemini 3 Pro Image

Nano PDF Editor

A CLI tool to edit PDF slides using natural language prompts, powered by Google's Gemini 3 Pro Image ("Nano Banana") model.

Features

Natural Language Editing: "Update the graph to include data from 2025", "Change the chart to a bar graph".
Add New Slides: Generate entirely new slides that match your deck's visual style.
Non-Destructive: Keeps your PDF searchable by restoring the text layer on edited pages via OCR re-hydration.
Multi-page & Parallel: Edit multiple pages in a single command with concurrent processing.

How It Works

Nano PDF combines Gemini 3 Pro Image (aka Nano Banana) with PDF manipulation so you can edit PDFs quickly using natural language prompts:

Page Rendering: Converts target PDF pages to images using Poppler
Style References: Optionally sends style reference pages with the generation request so the model can match the deck's visual style (fonts, colors, layout)
AI Generation: Sends images + prompts to Gemini 3 Pro Image, which generates edited versions
OCR Re-hydration: Uses Tesseract to restore a searchable text layer on the generated images
PDF Stitching: Replaces original pages with AI-edited versions while preserving document structure

The tool processes multiple pages in parallel for speed, with configurable resolution (4K/2K/1K) to balance quality vs. cost.
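
To make the page rendering and OCR re-hydration steps concrete, here is a minimal shell sketch that calls the Poppler and Tesseract command-line tools directly. It is not nano-pdf's actual implementation: the function names, the 150 DPI setting, and the file names are illustrative assumptions, and the Gemini generation and PDF stitching steps are left out.

```shell
# Sketch only: nano-pdf's real pipeline may work differently.

# render_page PDF PAGE OUT_PREFIX
# Renders one page to OUT_PREFIX.png with Poppler's pdftoppm.
# 150 DPI is an arbitrary value chosen for illustration.
render_page() {
  pdftoppm -png -r 150 -f "$2" -l "$2" -singlefile "$1" "$3"
}

# ocr_to_pdf IMAGE OUT_BASE
# Runs Tesseract on an image and writes OUT_BASE.pdf with a searchable text layer.
ocr_to_pdf() {
  tesseract "$1" "$2" pdf
}

# Example (requires poppler and tesseract to be installed):
#   render_page my_deck.pdf 2 page2
#   ocr_to_pdf page2.png page2_searchable
```

In the real tool the image passed to OCR would be the AI-edited version of the page rather than the original render.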

Installation

pip install nano-pdf

Configuration

You need a paid Google Gemini API key with billing enabled. Free tier keys do not support image generation.

Get an API key from Google AI Studio
Enable billing on your Google Cloud project
Set it as an environment variable:

export GEMINI_API_KEY="your_api_key_here"

Note: This tool uses Gemini 3 Pro Image, which requires a paid API tier. See pricing for details.
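
Before running the tool, it can help to confirm that the variable is actually visible in the current shell. The helper below is a hypothetical sketch (the name check_gemini_key is not part of nano-pdf); it only checks that the variable is non-empty, not that the key is valid or has billing enabled.

```shell
# Hypothetical helper: succeeds only if GEMINI_API_KEY is set and non-empty.
check_gemini_key() {
  if [ -z "${GEMINI_API_KEY:-}" ]; then
    echo "GEMINI_API_KEY is not set" >&2
    return 1
  fi
  echo "GEMINI_API_KEY is set"
}

# Example:
#   check_gemini_key && nano-pdf edit my_deck.pdf 2 "Change the title to 'Q3 Results'"
```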

Usage

Basic Edit

Edit a single page (e.g., Page 2):

nano-pdf edit my_deck.pdf 2 "Change the title to 'Q3 Results'"

Multi-page Edit

Edit multiple pages in one go:

nano-pdf edit my_deck.pdf \
  1 "Update date to Oct 2025" \
  5 "Add company logo" \
  10 "Fix typo in footer"

Add New Slides

Insert a new AI-generated slide into your deck:

# Add a title slide at the beginning
nano-pdf add my_deck.pdf 0 "Title slide with 'Q3 2025 Review'"

# Add a slide after page 5
nano-pdf add my_deck.pdf 5 "Summary slide with key takeaways as bullet points"

The new slide automatically matches the visual style of your existing slides and uses document context by default for better relevance.

Options

--use-context / --no-use-context: Include the full text of the PDF as context for the model. Disabled by default for edit, enabled by default for add. Use --no-use-context to disable.
--style-refs "1,5": Manually specify which pages to use as style references.
--output "new.pdf": Specify the output filename.
--resolution "4K": Image resolution, one of "4K" (default), "2K", or "1K". Higher resolution improves quality but slows processing.
--disable-google-search: Prevents the model from using Google Search to look up information before generating. Google Search is enabled by default.

Examples

Fixing Presentation Errors

# Fix typos across multiple slides
nano-pdf edit pitch_deck.pdf \
  3 "Fix the typo 'recieve' to 'receive'" \
  7 "Change 'Q4 2024' to 'Q1 2025'"

Visual Design Changes

# Update branding and colors
nano-pdf edit slides.pdf 1 "Make the header background blue and text white" \
  --style-refs "2,3" --output branded_slides.pdf

Content Updates

# Update financial data
nano-pdf edit report.pdf 12 "Update the revenue chart to show Q3 at $2.5M instead of $2.1M"

Batch Processing with Context

# Use full document context for consistency
nano-pdf edit presentation.pdf \
  5 "Update the chart colors to match the theme" \
  8 "Add the company logo in the bottom right" \
  --use-context

Adding New Slides

# Add a new agenda slide at the beginning
nano-pdf add quarterly_report.pdf 0 "Agenda slide with: Overview, Financial Results, Q4 Outlook"

Using Google Search

# Google Search is enabled by default - the model can look up current information
nano-pdf edit deck.pdf 5 "Update the market share data to latest figures"

# Disable Google Search if you want the model to only use provided context
nano-pdf add deck.pdf 3 "Add a summary slide" --disable-google-search

Requirements

Python 3.10+
poppler (for PDF rendering)
tesseract (for OCR)

System Dependencies

macOS

brew install poppler tesseract

Windows

choco install poppler tesseract

Note: After installation, you may need to restart your terminal or add the installation directory to your PATH.

Linux (Ubuntu/Debian)

sudo apt-get install poppler-utils tesseract-ocr

Troubleshooting

"Missing system dependencies" error

Make sure you've installed poppler and tesseract for your platform. After installation, restart your terminal to refresh PATH. Run which pdftotext and which tesseract to verify they're accessible.
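
The same check can be scripted. This is a minimal sketch using the POSIX `command -v` builtin; check_deps is a hypothetical helper name, not something nano-pdf provides.

```shell
# Hypothetical helper: prints each command that is not on PATH
# and returns non-zero if any are missing.
check_deps() {
  missing=0
  for cmd in "$@"; do
    if ! command -v "$cmd" >/dev/null 2>&1; then
      echo "missing: $cmd"
      missing=1
    fi
  done
  return "$missing"
}

# Example:
#   check_deps pdftotext tesseract
```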

"GEMINI_API_KEY not found" error

Set your API key as an environment variable:

export GEMINI_API_KEY="your_key_here"

"Gemini API Error: PAID API key required" error

Gemini 3 Pro Image requires a paid API tier. Visit Google AI Studio to enable billing on your project.

Generated images don't match the style

Try using --style-refs to specify reference pages that have the desired visual style. The model will analyze these pages to better match fonts, colors, and layout.

Text layer is missing or incorrect after editing

The tool uses Tesseract OCR to restore searchable text. For best results, ensure your generated images are high resolution (--resolution "4K"). Note that OCR may not be perfect for stylized fonts or small text.

Pages are processing slowly

Use --resolution "2K" or --resolution "1K" for faster processing.

Running from Source

If you want to run the latest development version:

# Clone the repository
git clone https://github.com/gavrielc/Nano-PDF.git
cd Nano-PDF

# Install dependencies
pip install -e .

# Run the tool
nano-pdf edit my_deck.pdf 2 "Your edit here"

License

MIT

Release history

0.2.1 (Nov 29, 2025)
0.2.0 (Nov 29, 2025), this version
0.1.0 (Nov 28, 2025)

Download files

Source Distribution

nano_pdf-0.2.0.tar.gz (13.6 kB)

Uploaded Nov 29, 2025 Source

Built Distribution

nano_pdf-0.2.0-py3-none-any.whl (12.0 kB)

Uploaded Nov 29, 2025 Python 3

File details

Details for the file nano_pdf-0.2.0.tar.gz.

File metadata

Download URL: nano_pdf-0.2.0.tar.gz
Upload date: Nov 29, 2025
Size: 13.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.12

File hashes

Hashes for nano_pdf-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`6448340dfc1d85e826dd072d935eefde23de626ea626c8218648ba966504d594`
MD5	`68d3e24cbcebf6d7be431031b6d6886f`
BLAKE2b-256	`61520e68a29b1b3aaa5b20276e56bc888109226789ee9b5ff1e3cd4d2133737c`
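
A downloaded file can be checked against the published SHA256 digest. This is a minimal sketch, assuming the GNU coreutils `sha256sum` command is available (on macOS, `shasum -a 256` is the usual substitute); verify_sha256 is a hypothetical helper name.

```shell
# Hypothetical helper: verify_sha256 FILE EXPECTED_HEX
# Prints OK if the file's SHA256 digest matches, otherwise prints MISMATCH
# and returns non-zero.
verify_sha256() {
  actual=$(sha256sum "$1" | cut -d ' ' -f 1)
  if [ "$actual" = "$2" ]; then
    echo "OK"
  else
    echo "MISMATCH"
    return 1
  fi
}

# Example, using the digest listed above:
#   verify_sha256 nano_pdf-0.2.0.tar.gz \
#     6448340dfc1d85e826dd072d935eefde23de626ea626c8218648ba966504d594
```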

File details

Details for the file nano_pdf-0.2.0-py3-none-any.whl.

File metadata

Download URL: nano_pdf-0.2.0-py3-none-any.whl
Upload date: Nov 29, 2025
Size: 12.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.12

File hashes

Hashes for nano_pdf-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1ce09057728191726e236e0853d56e2a4012df5e7f7450136c45faa81f4fe24a`
MD5	`96cecacdad9d8e790bfa46c2d693ef52`
BLAKE2b-256	`889da733f81efe271a7b34bfb6f45da8c6e8da8a99c68c66a15bc45c0e563568`
