Skip to main content

Automatically detect, split, and rotate multiple photos from scanned images

Project description

ScanSplitter Logo

PyPI Python versions License: GPLv3

ScanSplitter

Automatically detect, split, and rotate multiple photos from scanned images.

Drop a scan containing multiple photos and get individual, correctly-oriented images back.

Quick Start

One-time setup - Install uv:

curl -LsSf https://astral.sh/uv/install.sh | sh

Run ScanSplitter (no clone needed):

uvx scansplitter api

Opens at http://localhost:8000 - drag & drop your scans and export cropped photos. If port 8000 is already in use, pick another:

uvx scansplitter api --port 8001

Features

  • Auto-detection - Finds multiple photos in a single scan using contour detection
  • Interactive editing - Adjust, rotate, and resize bounding boxes before cropping
  • Auto-rotation - Detects and corrects 90°/180°/270° rotations
  • PDF support - Extract and process pages from PDF files
  • Web UI - Modern React interface with Fabric.js canvas editor
  • CLI - Batch process files from the command line

Installation Options

Option 1: Run directly with uvx (recommended)

No installation needed - just run:

uvx scansplitter api

Option 2: Install with pipx

pipx install scansplitter
scansplitter api

Option 3: Install from source

git clone https://github.com/janklan/scansplitter
cd scansplitter
uv sync
uv run scansplitter api

Usage

Web Interface

scansplitter api
# or: uvx scansplitter api

Opens at http://localhost:8000 with:

  • Drag & drop file upload (images and PDFs)
  • Interactive bounding box editor (drag, resize, rotate)
  • Multi-file support with tabs
  • PDF page navigation
  • ZIP export

Command Line

# Process a scanned image
uv run scansplitter process scan.jpg -o ./output/

# Process a PDF
uv run scansplitter process document.pdf -o ./output/

# Multiple files
uv run scansplitter process scan1.jpg scan2.png -o ./output/

# Options
uv run scansplitter process scan.jpg \
  --no-rotate \
  --min-area 5 \
  --max-area 70 \
  --format jpg \
  -o ./output/

CLI Options:

Option Description
-o, --output Output directory (default: ./output)
--no-rotate Disable auto-rotation
--min-area Minimum photo size as % of scan (default: 2)
--max-area Maximum photo size as % of scan (default: 80)
--format Output format: png or jpg (default: png)

How It Works

  1. Preprocessing - Convert to grayscale, apply Gaussian blur
  2. Thresholding - Adaptive binary threshold to separate photos from background
  3. Contour Detection - Find distinct regions using OpenCV
  4. Filtering - Keep regions between min/max area thresholds
  5. Interactive Adjustment - User can modify detected boxes in the web UI
  6. Rotation Detection - Score each 90° rotation using Hough line detection
  7. Cropping - Extract photos using adjusted bounding boxes

Development

Frontend Development

# Start API server
uv run scansplitter api --reload

# In another terminal, start frontend dev server
cd frontend
npm install
npm run dev

Frontend runs on http://localhost:5173 with hot reload, proxying API requests to :8000.

Build Frontend

cd frontend
npm run build

Builds to src/scansplitter/static/, which FastAPI serves automatically.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scansplitter-0.2.0.tar.gz (1.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

scansplitter-0.2.0-py3-none-any.whl (1.2 MB view details)

Uploaded Python 3

File details

Details for the file scansplitter-0.2.0.tar.gz.

File metadata

  • Download URL: scansplitter-0.2.0.tar.gz
  • Upload date:
  • Size: 1.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.21 {"installer":{"name":"uv","version":"0.9.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for scansplitter-0.2.0.tar.gz
Algorithm Hash digest
SHA256 c961fb8e865b95d52b7c4e3c317f636d9c2fe53a306aa632a9af9adaed96af2d
MD5 df6beb475dfd84b515ceee781a1236ba
BLAKE2b-256 006d796c1f61ec2f3e558473c42bb1e0a302f60c797ed0390b91004bc980f958

See more details on using hashes here.

File details

Details for the file scansplitter-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: scansplitter-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 1.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.21 {"installer":{"name":"uv","version":"0.9.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for scansplitter-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ce7913a4f8814105f929c7f22d485d03d69986d41814ee8f28d154a5c6cfd890
MD5 c358aaf97fa091dfe3b09ae55655e737
BLAKE2b-256 00cdc8b34add4653ec7023df0b9cbe70c707ec31afe8e7801f5d87973c943bf8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page