Skip to main content

Wine Cellar Management Application with OCR label scanning

Project description

WineBox

A wine cellar management application with OCR label scanning.

Features

  • Label Scanning: Upload wine label images for automatic text extraction via OCR
  • Wine Autocomplete: Search 100K+ wines from the X-Wines dataset with community ratings
  • Inventory Tracking: Check-in and check-out bottles with full history
  • Smart Parsing: Automatically identifies vintage, grape variety, region, and more
  • Search: Find wines by any criteria
  • Web Interface: Simple, mobile-friendly interface

Quick Start

Prerequisites

Installation

From PyPI:

pip install winebox

From source:

# Clone the repository
git clone https://github.com/jdrumgoole/winebox.git
cd winebox

# Install dependencies
uv sync --all-extras

# Install Tesseract OCR
# macOS:
brew install tesseract

# Ubuntu/Debian:
sudo apt-get install tesseract-ocr

Running the Server

# Development mode with auto-reload
invoke start --reload

# Background mode
invoke start-background

# Check status
invoke status

# Stop server
invoke stop

Access the Application

Usage

Check In Wine

  1. Navigate to the Check In page
  2. Upload front label image (required)
  3. Optionally upload back label image
  4. Review/edit auto-detected wine details
  5. Set quantity and add notes
  6. Click "Check In Wine"

Check Out Wine

  1. Go to the Cellar view
  2. Click "Check Out" on a wine card
  3. Enter quantity to remove
  4. Add optional notes (tasting notes, occasion)
  5. Confirm checkout

Search

Use the Search page to find wines by:

  • Text search (name, winery, region)
  • Vintage year
  • Grape variety
  • Region or country
  • Stock status

API

Full REST API available at /api:

Endpoint Method Description
/api/wines/checkin POST Add wine to cellar
/api/wines/{id}/checkout POST Remove wine from cellar
/api/wines GET List all wines
/api/wines/{id} GET Get wine details
/api/cellar GET Current inventory
/api/cellar/summary GET Cellar statistics
/api/transactions GET Transaction history
/api/search GET Search wines
/api/xwines/search GET Autocomplete wine search
/api/xwines/wines/{id} GET X-Wines wine details
/api/xwines/stats GET Dataset statistics

See /docs for interactive API documentation.

Data Storage

Database

The SQLite database is stored at data/winebox.db by default. This can be configured via the WINEBOX_DATABASE_URL environment variable.

Images

Wine label images are stored in the data/images/ directory by default. Each image is saved with a UUID filename to avoid conflicts.

Item Default Location Environment Variable
Database data/winebox.db WINEBOX_DATABASE_URL
Images data/images/ WINEBOX_IMAGE_STORAGE_PATH

Images are served via the API at /api/images/{filename}.

Note: The data/ directory is excluded from git (see .gitignore). Make sure to back up this directory to preserve your wine collection data.

X-Wines Dataset

WineBox integrates the X-Wines dataset for wine autocomplete, providing suggestions from 100,646 wines with 21 million community ratings.

Installing the Dataset

# Run database migration (if not already done)
uv run python -m scripts.migrations.runner up

# Option 1: Test dataset (100 wines, for development)
uv run python -m scripts.import_xwines --version test

# Option 2: Full dataset (100K+ wines, for production)
# First, install gdown and download from Google Drive
uv pip install gdown
mkdir -p data/xwines
uv run gdown --folder "https://drive.google.com/drive/folders/1LqguJNV-aKh1PuWMVx5ELA61LPfGfuu_?usp=sharing" -O data/xwines/
cp data/xwines/X-Wines_Official_Repository/last/XWines_Full_*.csv data/xwines/

# Then import
uv run python -m scripts.import_xwines --version full

The autocomplete appears when typing in the Wine Name field during check-in.

Label Scanning

WineBox uses AI-powered label scanning to extract wine information from photos.

Claude Vision (Recommended)

For best results, configure Claude Vision by setting your Anthropic API key:

export ANTHROPIC_API_KEY=your-api-key
# or
export WINEBOX_ANTHROPIC_API_KEY=your-api-key

Claude Vision provides intelligent label analysis that:

  • Handles decorative and artistic fonts
  • Understands wine-specific terminology
  • Extracts structured data (winery, vintage, grape variety, region, etc.)
  • Works with curved or angled text

Tesseract OCR (Fallback)

If no Anthropic API key is configured, WineBox falls back to Tesseract OCR. This requires Tesseract to be installed on your system:

# macOS
brew install tesseract

# Ubuntu/Debian
sudo apt-get install tesseract-ocr

Authentication

WineBox requires authentication for all API endpoints (except /health).

Creating Users

Use the winebox-admin command to manage users:

# Create an admin user
uv run winebox-admin add admin --email admin@example.com --admin --password yourpassword

# Create a regular user
uv run winebox-admin add username --email user@example.com --password yourpassword

# List all users
uv run winebox-admin list

# Disable/enable a user
uv run winebox-admin disable username
uv run winebox-admin enable username

# Change password
uv run winebox-admin passwd username --password newpassword

# Remove a user
uv run winebox-admin remove username

Server Management

Use the winebox-server command to manage the server:

# Start server (foreground)
uv run winebox-server start --foreground

# Start server (background)
uv run winebox-server start

# Stop server
uv run winebox-server stop

# Restart server
uv run winebox-server restart

# Check status
uv run winebox-server status

API Authentication

The API uses JWT bearer tokens. To authenticate:

  1. POST to /api/auth/token with username and password (form-urlencoded)
  2. Include the returned token in subsequent requests: Authorization: Bearer <token>

Tokens expire after 24 hours.

Development

Running Tests

# Run all tests
invoke test

# With verbose output
invoke test --verbose

# With coverage
invoke test --coverage

Project Structure

winebox/
├── winebox/          # Application package
│   ├── main.py       # FastAPI app
│   ├── models/       # Database models
│   ├── schemas/      # API schemas
│   ├── routers/      # API endpoints
│   ├── services/     # Business logic
│   └── static/       # Web interface
├── tests/            # Test suite
├── docs/             # Documentation
└── tasks.py          # Build tasks

Building Documentation

invoke docs-build
invoke docs-serve

Tech Stack

  • FastAPI: Web framework
  • SQLAlchemy: ORM
  • SQLite: Database
  • Tesseract: OCR engine
  • Vanilla JS: Frontend (no frameworks)

License

MIT License

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

winebox-0.3.1.tar.gz (5.6 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

winebox-0.3.1-py3-none-any.whl (66.3 kB view details)

Uploaded Python 3

File details

Details for the file winebox-0.3.1.tar.gz.

File metadata

  • Download URL: winebox-0.3.1.tar.gz
  • Upload date:
  • Size: 5.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for winebox-0.3.1.tar.gz
Algorithm Hash digest
SHA256 148864bd8455aae09ae9e6b9a2cbe22f2f2d2a783a06e4f031d178f8b617cc4f
MD5 fbb5e5ee7caa02e27765086a6cf6ad55
BLAKE2b-256 552b9d3ef92e00e0788d94f7edbe82093280fd9148cabac21b144003cc401ae7

See more details on using hashes here.

Provenance

The following attestation bundles were made for winebox-0.3.1.tar.gz:

Publisher: publish.yml on jdrumgoole/winebox

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file winebox-0.3.1-py3-none-any.whl.

File metadata

  • Download URL: winebox-0.3.1-py3-none-any.whl
  • Upload date:
  • Size: 66.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for winebox-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 8d9aa35b81ef5ce2ac7bb851eb56631d64906618dcf54a336ce77667ee44de04
MD5 09ecd4a7c86d567223a305b96f245c6e
BLAKE2b-256 9726a0e33bd89b1c89b91218741ed42afd45a089e809c827b6e580161cd303c4

See more details on using hashes here.

Provenance

The following attestation bundles were made for winebox-0.3.1-py3-none-any.whl:

Publisher: publish.yml on jdrumgoole/winebox

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page