Skip to main content

Lightweight image gallery server - no database, no files left behind

Project description

Lenslet

A lightweight image gallery server for fast visual triage. Point it at a directory or a Parquet table and browse instantly in your browser. Lenslet keeps the source images read-only and stores workspace state separately.

Introduction

image image

Lenslet is a self-contained image gallery server designed for simplicity and speed. It indexes directories on-the-fly, generates thumbnails on demand, and serves everything through a clean web interface. Perfect for quickly browsing local image collections or large Parquet-backed datasets without modifying the source images.

Features

  • Workspace-aware: Persists UI state (Smart Folders/views) and optional thumbnail cache under .lenslet/ (or <parquet>.lenslet.json)
  • Read-only sources: Never writes into your image directories or S3 buckets
  • Local + S3 + HTTP: Mix local files, s3:// URIs, and URLs with smart source parsing
  • Metrics & filtering: Sort/filter by numeric metrics from Parquet (histograms + range brushing)
  • Embedding similarity: Find similar images from fixed-size list embeddings (cosine; optional FAISS acceleration)
  • Labels & export: Tag, rate, and annotate items, then export metadata as JSON or CSV
  • Single command: Just point to a directory or Parquet file and go

Installation

pip install lenslet

Optional extras for embedding search:

# NumPy-only similarity search
pip install "lenslet[embeddings]"

# FAISS-accelerated similarity search (CPU)
pip install "lenslet[embeddings-faiss]"

Usage

Command Line Interface

lenslet /path/to/images

Then open the URL printed in the terminal (default http://127.0.0.1:7070, or the next available port).

Options:

lenslet <directory|table.parquet|org/dataset|s3://.../table.parquet> [options]

Options:
  -p, --port PORT              Port to listen on (default: 7070; auto-increment if in use)
  -H, --host HOST              Host to bind to (default: 127.0.0.1)
  --thumb-size SIZE            Thumbnail short edge in pixels (default: 256)
  --thumb-quality QUALITY      Thumbnail WebP quality 1-100 (default: 70)
  --source-column NAME         Column to load image paths from in table mode
  --base-dir PATH              Base directory for resolving relative paths in table mode
  --no-cache-wh                Disable caching width/height back into parquet
  --no-skip-indexing           Probe image dimensions during table load
  --no-thumb-cache             Disable thumbnail cache when a workspace is available
  --no-og-preview              Disable dataset-based social preview image
  --no-write                   Disable workspace writes (.lenslet/) for one-off sessions
  --presence-lifecycle-v2      Enable lifecycle-v2 presence join/move/leave semantics (default)
  --no-presence-lifecycle-v2   Rollback presence transitions to legacy heartbeat semantics
  --embedding-column NAME      Embedding column name (repeatable, comma-separated allowed)
  --embedding-metric NAME:METRIC
                               Embedding metric override (repeatable)
  --embedding-preload          Preload embedding indexes on startup
  --embedding-cache            Enable embedding cache (default)
  --no-embedding-cache         Disable embedding cache
  --embedding-cache-dir PATH   Override embedding cache directory
  --embed                      Run CPU embedding inference on a parquet file before launch
  --batch-size SIZE            Embedding inference batch size (used with --embed)
  --parquet-batch-size SIZE    Rows per parquet batch (used with --embed)
  --num-workers N              Parallel image loading workers (used with --embed)
  --reload                     Enable auto-reload for development
  --share                      Create a public share URL via cloudflared
  --verbose                    Show detailed server logs
  -v, --version                Show version and exit

Examples:

# Serve images from your Pictures folder
lenslet ~/Pictures

# Use a custom port
lenslet ~/Photos --port 8080

# Make accessible on local network
lenslet ~/Images --host 0.0.0.0 --port 7070

# Create a public share URL (prints a trycloudflare.com link)
lenslet ~/Images --share

# Start from a Parquet workspace (paths can be local, s3://, or https://)
lenslet /data/items.parquet --source-column image_path --base-dir /data

# Start from a folder containing items.parquet
lenslet /data/dataset --source-column image_path

# Start from a Hugging Face dataset repo (org/dataset)
lenslet incantor/dit03-twitter-niji7-5k-filtering-metrics --share

# Start from a remote Parquet file
lenslet s3://my-bucket/items.parquet --source-column image_path

# Add embeddings to a local Parquet file before launching
lenslet /data/items.parquet --source-column image_path --embed

Embedding Similarity Search

Lenslet auto-detects fixed-size list embedding columns in items.parquet (or you can force them with --embedding-column). The UI exposes a "Find similar" action, and the API supports path-based or base64 vector queries.

# Search by selected image path
curl -X POST http://127.0.0.1:7070/embeddings/search \
  -H "Content-Type: application/json" \
  -d '{"embedding":"clip","query_path":"/images/cat.jpg","top_k":50,"min_score":0.2}'
# Encode a float32 vector (little-endian) for query_vector_b64
import base64
import numpy as np

vec = np.asarray([0.1, 0.2, 0.3], dtype="<f4")
payload = base64.b64encode(vec.tobytes()).decode("ascii")

Embedding caches live under .lenslet/embeddings_cache/ (or <parquet>.cache/embeddings_cache/) unless you override with --embedding-cache-dir.

For a one-shot embedding write without launching Lenslet, run:

python scripts/embed_parquet_embeddings.py /data/items.parquet --image-column image_path

Programmatic API (Python/Jupyter)

Launch lenslet directly from Python code or notebooks:

import lenslet

datasets = {
    "my_images": ["/path/to/img1.jpg", "/path/to/img2.jpg"],
    "more_images": [
        "s3://bucket/img3.jpg",           # S3 URIs
        "https://example.com/img4.jpg",   # HTTP/HTTPS URLs
    ],
}

# Launch in non-blocking mode (returns immediately)
lenslet.launch(datasets, blocking=False, port=7070)

Key Features:

  • 🚀 Jupyter-friendly: Non-blocking mode for notebooks
  • ☁️ S3 support: Automatically handles S3 URIs via presigned URLs
  • 📁 Multiple datasets: Organize images into named collections
  • 🔗 Mixed sources: Combine local files, S3 URIs, and HTTP URLs

See Programmatic API Documentation for details and examples.

Hotpath API Notes (2026-02)

  • GET /folders recursive mode is paged by default with recursive=1&page=<n>&page_size=<n>. Defaults: page=1, page_size=200; page_size is clamped to 500.
  • Use legacy_recursive=1 with recursive=1 to return the full legacy recursive payload shape while rolling out paged callers.
  • GET /file now streams local file-backed sources and falls back to byte responses for non-local/remote sources.
  • Full-file prefetch is restricted to viewer/compare contexts and sends x-lenslet-prefetch: viewer|compare.
  • GET /health exposes hotpath runtime counters/timers under hotpath.counters and hotpath.timers_ms.

Deferred Performance Backlog

The following items are intentionally deferred from the hotpath sprint:

  • Indexed search path to replace O(N) in-memory scans.
  • Folder tree virtualization/flattening for very large trees.
  • Configurable or batched label persistence writes.

Notes

  • Workspace files: .lenslet/views.json stores Smart Folders; optional thumbnail cache lives under .lenslet/thumbs/
    • For Parquet, views live at <table>.lenslet.json and thumbs at <table>.cache/thumbs/
  • Embedding cache: .lenslet/embeddings_cache/ (or <table>.cache/embeddings_cache/) stores cached embedding indexes
  • Read-only sources: The server never writes into your image directories or S3 buckets
  • Labels: Tags/notes/ratings are editable in the UI (session-only) and exportable as JSON/CSV
  • No-write mode: Pass --no-write to keep the session fully ephemeral (no .lenslet/ or .lenslet.json)
  • Formats: Supports JPEG, PNG, and WebP
  • Hidden files: Files/folders starting with . are ignored

License

MIT License

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lenslet-0.2.13.tar.gz (545.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

lenslet-0.2.13-py3-none-any.whl (211.1 kB view details)

Uploaded Python 3

File details

Details for the file lenslet-0.2.13.tar.gz.

File metadata

  • Download URL: lenslet-0.2.13.tar.gz
  • Upload date:
  • Size: 545.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.16

File hashes

Hashes for lenslet-0.2.13.tar.gz
Algorithm Hash digest
SHA256 7cf8d73b1a19090460624bcd360716cb3866f72d004e6a3196deacf133289a4f
MD5 71e2555e96cdb7d4a64da64640463dc7
BLAKE2b-256 d84d105be23e116eca67ee6cd498badb3796d45f66c22de90cdecefdc098b236

See more details on using hashes here.

File details

Details for the file lenslet-0.2.13-py3-none-any.whl.

File metadata

  • Download URL: lenslet-0.2.13-py3-none-any.whl
  • Upload date:
  • Size: 211.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.16

File hashes

Hashes for lenslet-0.2.13-py3-none-any.whl
Algorithm Hash digest
SHA256 2af2701b90c5b844e523ef02db877972c44faff524d6a14533443e908038d9e9
MD5 130fa185c3ad7fc3c356584f60341f63
BLAKE2b-256 4b475baa518a3310962f5f5dccae51eb89e097de118ed67ab2b173ebe9ae5b6b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page