Skip to main content

Wine Cellar Management Application with OCR label scanning

Project description

WineBox

A wine cellar management application with OCR label scanning.

Features

  • Label Scanning: Upload wine label images for automatic text extraction via OCR
  • Wine Autocomplete: Search 100K+ wines from the X-Wines dataset with community ratings
  • Inventory Tracking: Check-in and check-out bottles with full history
  • Smart Parsing: Automatically identifies vintage, grape variety, region, and more
  • Search: Find wines by any criteria
  • Web Interface: Simple, mobile-friendly interface

Quick Start

Prerequisites

Installation

From PyPI:

pip install winebox

From source:

# Clone the repository
git clone https://github.com/jdrumgoole/winebox.git
cd winebox

# Install dependencies
uv sync --all-extras

# Install Tesseract OCR
# macOS:
brew install tesseract

# Ubuntu/Debian:
sudo apt-get install tesseract-ocr

Running the Server

# Development mode with auto-reload
invoke start --reload

# Background mode
invoke start-background

# Check status
invoke status

# Stop server
invoke stop

Access the Application

Usage

Check In Wine

  1. Navigate to the Check In page
  2. Upload front label image (required)
  3. Optionally upload back label image
  4. Review/edit auto-detected wine details
  5. Set quantity and add notes
  6. Click "Check In Wine"

Check Out Wine

  1. Go to the Cellar view
  2. Click "Check Out" on a wine card
  3. Enter quantity to remove
  4. Add optional notes (tasting notes, occasion)
  5. Confirm checkout

Search

Use the Search page to find wines by:

  • Text search (name, winery, region)
  • Vintage year
  • Grape variety
  • Region or country
  • Stock status

API

Full REST API available at /api:

Endpoint Method Description
/api/wines/checkin POST Add wine to cellar
/api/wines/{id}/checkout POST Remove wine from cellar
/api/wines GET List all wines
/api/wines/{id} GET Get wine details
/api/cellar GET Current inventory
/api/cellar/summary GET Cellar statistics
/api/transactions GET Transaction history
/api/search GET Search wines
/api/xwines/search GET Autocomplete wine search
/api/xwines/wines/{id} GET X-Wines wine details
/api/xwines/stats GET Dataset statistics

See /docs for interactive API documentation.

Data Storage

Database

The SQLite database is stored at data/winebox.db by default. This can be configured via the WINEBOX_DATABASE_URL environment variable.

Images

Wine label images are stored in the data/images/ directory by default. Each image is saved with a UUID filename to avoid conflicts.

Item Default Location Environment Variable
Database data/winebox.db WINEBOX_DATABASE_URL
Images data/images/ WINEBOX_IMAGE_STORAGE_PATH

Images are served via the API at /api/images/{filename}.

Note: The data/ directory is excluded from git (see .gitignore). Make sure to back up this directory to preserve your wine collection data.

X-Wines Dataset

WineBox integrates the X-Wines dataset for wine autocomplete, providing suggestions from 100,646 wines with 21 million community ratings.

Installing the Dataset

# Run database migration (if not already done)
uv run python -m scripts.migrations.runner up

# Option 1: Test dataset (100 wines, for development)
uv run python -m scripts.import_xwines --version test

# Option 2: Full dataset (100K+ wines, for production)
# First, install gdown and download from Google Drive
uv pip install gdown
mkdir -p data/xwines
uv run gdown --folder "https://drive.google.com/drive/folders/1LqguJNV-aKh1PuWMVx5ELA61LPfGfuu_?usp=sharing" -O data/xwines/
cp data/xwines/X-Wines_Official_Repository/last/XWines_Full_*.csv data/xwines/

# Then import
uv run python -m scripts.import_xwines --version full

The autocomplete appears when typing in the Wine Name field during check-in.

Label Scanning

WineBox uses AI-powered label scanning to extract wine information from photos.

Claude Vision (Recommended)

For best results, configure Claude Vision by setting your Anthropic API key:

export ANTHROPIC_API_KEY=your-api-key
# or
export WINEBOX_ANTHROPIC_API_KEY=your-api-key

Claude Vision provides intelligent label analysis that:

  • Handles decorative and artistic fonts
  • Understands wine-specific terminology
  • Extracts structured data (winery, vintage, grape variety, region, etc.)
  • Works with curved or angled text

Tesseract OCR (Fallback)

If no Anthropic API key is configured, WineBox falls back to Tesseract OCR. This requires Tesseract to be installed on your system:

# macOS
brew install tesseract

# Ubuntu/Debian
sudo apt-get install tesseract-ocr

Authentication

WineBox requires authentication for all API endpoints (except /health).

Creating Users

Use the winebox-admin command to manage users:

# Create an admin user
uv run winebox-admin add admin --email admin@example.com --admin --password yourpassword

# Create a regular user
uv run winebox-admin add username --email user@example.com --password yourpassword

# List all users
uv run winebox-admin list

# Disable/enable a user
uv run winebox-admin disable username
uv run winebox-admin enable username

# Change password
uv run winebox-admin passwd username --password newpassword

# Remove a user
uv run winebox-admin remove username

Server Management

Use the winebox-server command to manage the server:

# Start server (foreground)
uv run winebox-server start --foreground

# Start server (background)
uv run winebox-server start

# Stop server
uv run winebox-server stop

# Restart server
uv run winebox-server restart

# Check status
uv run winebox-server status

API Authentication

The API uses JWT bearer tokens. To authenticate:

  1. POST to /api/auth/token with username and password (form-urlencoded)
  2. Include the returned token in subsequent requests: Authorization: Bearer <token>

Tokens expire after 24 hours.

Development

Running Tests

# Run all tests
invoke test

# With verbose output
invoke test --verbose

# With coverage
invoke test --coverage

Project Structure

winebox/
├── winebox/          # Application package
│   ├── main.py       # FastAPI app
│   ├── models/       # Database models
│   ├── schemas/      # API schemas
│   ├── routers/      # API endpoints
│   ├── services/     # Business logic
│   └── static/       # Web interface
├── tests/            # Test suite
├── docs/             # Documentation
└── tasks.py          # Build tasks

Building Documentation

invoke docs-build
invoke docs-serve

Tech Stack

  • FastAPI: Web framework
  • SQLAlchemy: ORM
  • SQLite: Database
  • Tesseract: OCR engine
  • Vanilla JS: Frontend (no frameworks)

License

MIT License

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

winebox-0.3.0.tar.gz (3.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

winebox-0.3.0-py3-none-any.whl (66.3 kB view details)

Uploaded Python 3

File details

Details for the file winebox-0.3.0.tar.gz.

File metadata

  • Download URL: winebox-0.3.0.tar.gz
  • Upload date:
  • Size: 3.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for winebox-0.3.0.tar.gz
Algorithm Hash digest
SHA256 f7ae1765d1a6658429d1ce12ffb053d424bd053401bf442e51b036e26bcd8507
MD5 2321cc60a1c33246bc4eb251d34aa48c
BLAKE2b-256 4fd1d6a84225a60e734c22d471fb9dbee7776e7955833792cabd5801de1b0357

See more details on using hashes here.

Provenance

The following attestation bundles were made for winebox-0.3.0.tar.gz:

Publisher: publish.yml on jdrumgoole/winebox

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file winebox-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: winebox-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 66.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for winebox-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d10428d749a7c2984923788abb8597772aa92dd735fa9b6340a50f1c483e93be
MD5 f89b452584e64466c65b81e7a7c8412a
BLAKE2b-256 2a0003e67a8f066c45e2e50a9baa8c7b61e59ed6fa1030a03c4d094f0ca7efce

See more details on using hashes here.

Provenance

The following attestation bundles were made for winebox-0.3.0-py3-none-any.whl:

Publisher: publish.yml on jdrumgoole/winebox

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page