A local visual search and clustering tool for image archives.
Project description
Nazariya
Nazariya is a local visual search and clustering tool for photo archives.
The current goal is to help find photos that are visually close enough to support a realistic style transfer with human input, but not so identical that simple copy-paste settings would be enough.
In practice, Nazariya helps answer questions like:
- Which candidate sets are visually close to each other?
- Which photos share similar lighting, palette, environment, or editability?
- Which sets should I inspect in Lightroom when building stronger delivery groups?
Nazariya currently works as a Python CLI, with a small Lightroom Classic helper plugin for exporting catalog metadata to CSV.
Current workflow
1. Export candidate metadata from Lightroom Classic
Use the included Lightroom plugin to export a CSV of RAW/DNG photos that are in your candidate keyword sets.
The plugin exports metadata only. It does not export or copy the RAW files.
Example output:
$MATRIX/packages/nazariya/data/inputs/candidates.csv
This CSV includes source paths, candidate keys, keywords, ratings, labels, capture time, camera/lens metadata, and location fields when Lightroom exposes them.
For this project, the current candidate set contains roughly 39,397 RAW photos across candidate keys such as c001 through c325.
2. Create a per-candidate normalization override CSV
Create a CSV where each candidate set can define white balance and exposure normalization settings.
./scripts/nazariya make-overrides-template \
--input data/inputs/candidates.csv \
--output data/config/candidate_overrides.csv
Most rows can use the same default settings. Override only sets that need adjustment.
3. Build a smaller review sample
For fast iteration, sample a few images from each candidate set.
./scripts/nazariya sample \
--input data/inputs/candidates.csv \
--output data/inputs/candidates_sample_003_seed_42.csv \
--per-candidate 3 \
--seed 42
This produces a manageable visual review set. Three images per candidate is usually a good starting point because one image can be misleading.
4. Build normalized previews and inspect them
Nazariya reads the original RAW files, applies analysis-only white balance and exposure normalization, then writes JPEG previews.
The purpose is not to make the previews look beautiful. The purpose is to make visually comparable previews so similar images land close together in feature space.
This step is iterative:
- Build previews.
- Generate contact sheets.
- Inspect exposure and white balance consistency.
- Adjust
data/config/candidate_overrides.csv. - Swap bad random samples when a picked frame does not represent the set.
- Rebuild previews.
See the detailed workflow:
5. Extract features and generate neighbor sheets
Once the previews look usable, generate embeddings and color/light features.
Nazariya currently combines:
- CLIP image embeddings
- LAB/HSV color and tonal histograms
- simple color/light statistics
For edit-family search, color and lighting usually matter more than semantic similarity, so a color-heavy weighting can be useful.
Example:
./scripts/nazariya extract-features \
--preview-map "$WHISK_ML_DATASETS/nazariya/previews/sample_003_seed_42_review/preview_map.csv" \
--output "$WHISK_ML_DATASETS/nazariya/features/sample_003_seed_42_review/features_clip035_color065.npz" \
--metadata "$WHISK_ML_DATASETS/nazariya/features/sample_003_seed_42_review/features_clip035_color065.csv" \
--clip-weight 0.35 \
--color-weight 0.65 \
--batch-size 32
Then generate one neighbor contact sheet per candidate set:
./scripts/nazariya neighbor-sheets \
--features "$WHISK_ML_DATASETS/nazariya/features/sample_003_seed_42_review/features_clip035_color065.npz" \
--preview-map "$WHISK_ML_DATASETS/nazariya/previews/sample_003_seed_42_review/preview_map.csv" \
--output "$WHISK_ML_DATASETS/nazariya/debug_neighbors/by_candidate_clip035_color065" \
--top-k 10 \
--exclude-same-candidate \
--thumb-size 260
See the detailed workflow:
Lightroom plugin
The Lightroom helper plugin lives here:
lightroom/nazariya.lrplugin
Current menu item:
Library > Plug-in Extras > Export Candidate CSV
It exports selected Lightroom photos to the configured CSV path in lightroom/nazariya.lrplugin/Config.lua.
Data layout
Recommended local repo layout:
data/
config/
candidate_overrides.csv
inputs/
candidates.csv
candidates_backup.csv
candidates_sample_003_seed_42.csv
candidates_sample_003_seed_42_swapped.csv
previews/
Generated previews, features, and contact sheets can be written to a larger dataset volume such as:
$WHISK_ML_DATASETS/nazariya/
For long jobs, local scratch is often more reliable than a network volume. You can render locally, then copy results to the dataset volume afterward.
CLI commands currently used
./scripts/nazariya --version
./scripts/nazariya hello
./scripts/nazariya init
./scripts/nazariya sample
./scripts/nazariya swap-sample
./scripts/nazariya make-overrides-template
./scripts/nazariya build-previews
./scripts/nazariya contact-sheets
./scripts/nazariya extract-features
./scripts/nazariya neighbor-sheets
Development
This project uses uv.
uv sync
./scripts/nazariya --help
Build:
uv build
Publish:
uv publish
Notes
Nazariya is still experimental. The current approach is intentionally practical:
- Use Lightroom to identify candidate pools.
- Use Python to normalize RAW previews and compute features.
- Use contact sheets for human visual review.
- Use nearest-neighbor suggestions to rebuild better photo groups manually.
The human review step is part of the design. The tool suggests promising neighborhoods; the final grouping still depends on taste, context, and the editing goal.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file nazariya-0.1.2.tar.gz.
File metadata
- Download URL: nazariya-0.1.2.tar.gz
- Upload date:
- Size: 17.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.11.12 {"installer":{"name":"uv","version":"0.11.12","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8296b891c4187e017c179dce2139fd315213a20f19733f09f58f985b9ab8b720
|
|
| MD5 |
d73056e9aeeef47993a934b4eed875e3
|
|
| BLAKE2b-256 |
e7ccd0350f87ea962638c02eca4e8629cd96c1ca6caaa130566dbe3d59fb68d8
|
File details
Details for the file nazariya-0.1.2-py3-none-any.whl.
File metadata
- Download URL: nazariya-0.1.2-py3-none-any.whl
- Upload date:
- Size: 22.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.11.12 {"installer":{"name":"uv","version":"0.11.12","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e9516bcd28126e9b06ded0a253de9a8b4d179254d585158a5710b25b4b12a88a
|
|
| MD5 |
52919566aa072ecdc533758eb98b3472
|
|
| BLAKE2b-256 |
0e7083bb34a92b7e7d749531f3bf252c88ee96aa1890f231a80877dc049bdc27
|