Skip to main content

CLI tool and library for exporting metadata from Eagle app JSON into parquet or huggingface dataset.

Project description

Eagle Exporter

Eagle Exporter is a command-line utility (and Python library) to parse image metadata JSON files from an Eagle library directory and export them to either a local Parquet file or a Hugging Face Dataset repository.

Features

  • Recursively scans an Eagle library's images/ folder for JSON files.
  • Extracts tags, star ratings, palette info, etc.
  • Optionally merges with s5cmd logs to provide S3 URIs for each file.
  • NEW: Can include the actual image data when exporting to Hugging Face datasets.
  • Exports to either:
    • A local .parquet file
    • A Hugging Face dataset (can be public or private).

Installation

pip install eagle-exporter

Usage

Example command to export an Eagle library to Hugging Face:

eagle-exporter path/to/my_eagle.library --to myuser/my_hf_dataset --hf-public

Export to Hugging Face with image data included:

eagle-exporter path/to/my_eagle.library --to myuser/my_hf_dataset --hf-public --include-images

Or export to a local .parquet file:

eagle-exporter path/to/my_eagle.library --to /tmp/output.parquet

Optionally include an s5cmd file to attach S3 URIs:

eagle-exporter path/to/my_eagle.library --s5cmd /path/to/s5cmd.txt --to /tmp/output.parquet

Command-Line Arguments

  • EAGLE_DIR (positional): The path to your Eagle library directory (the folder that has images/ inside).

  • --s5cmd <FILE> (optional): Path to the s5cmd log file (with lines like cp localfile s3://bucket/...)

  • --to <DEST> (required):

    • If <DEST> ends with .parquet, will export to Parquet.
    • Otherwise, treats <DEST> as a Hugging Face dataset name (e.g. username/datasetname).
  • --hf-public (optional): If exporting to Hugging Face, mark it as public.

  • --include-images (optional): Include the actual image files in the export. This is particularly useful for Hugging Face exports. Note that binary image data will be excluded from Parquet exports.

  • --help: Show the help message.

Developer Notes

  • The core functionality resides in src/eagle_exporter/core.py.
  • The CLI is in src/eagle_exporter/cli.py.
  • The library uses click for the command line, pandas for data manipulation, and datasets for pushing to Hugging Face.
  • Image processing is handled using Pillow (PIL).

Building and publishing

To build and publish the repo, run the following commands:

python -m pip install build twine
python -m build
twine check dist/*
twine upload dist/*

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

eagle_exporter-0.1.4.tar.gz (7.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

eagle_exporter-0.1.4-py3-none-any.whl (8.4 kB view details)

Uploaded Python 3

File details

Details for the file eagle_exporter-0.1.4.tar.gz.

File metadata

  • Download URL: eagle_exporter-0.1.4.tar.gz
  • Upload date:
  • Size: 7.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.0

File hashes

Hashes for eagle_exporter-0.1.4.tar.gz
Algorithm Hash digest
SHA256 4c91f6aaa5b74bdcf068609047472849c1ac013b7acb48de98c0436e3fa45f92
MD5 192a04e9e3b82904a1bb5aa75439bed1
BLAKE2b-256 46ce7b135db8664116f5be810980bf946311e6fb91ace4960041238caaf0c1f4

See more details on using hashes here.

File details

Details for the file eagle_exporter-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: eagle_exporter-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 8.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.0

File hashes

Hashes for eagle_exporter-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 c666776d1e4e91659b03d06af529d2634d2fe7f68bfd841f65f0b86e175effae
MD5 c6fa41d05236d2a6f6b4b69572bbf167
BLAKE2b-256 e6b922e7384b635afec66fcb5a31127418f5b44dcb2655f3ed09d25609f489ac

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page