Backfill footprints and images for HiTIDE collections
Project description
hitide-backfill-tool
Tool to backfill thumbnail images and footprints for POCLOUD datasets
Some granules have been ingested without creating footprints/thumbnail images. The purpose of this tool is to trigger part of Cumulus workflow to generate footprints and images for granules that need it.
What it does in a nutshell
- You specify search parameters at command line (collection, start_date, end_date, footprint, image, etc)
- Backfill-Tool searches CMR for matching granules
- Backfill-Tool figures out if the granule needs a footprint or image
- If footprint or image generation is needed, Backfill-Tool creates a Cumulus message and sends it to an AWS SNS topic.
- From there, another service will run trigger Forge/TIG and update CMR with new images/footprints as needed
Prerequisites
- Python >= 3.12
- poetry
failed_workflow.py
- Script used to scan failed workflows and get unique errors
- Takes in three arguments
- workflow_arn: arn of aws workflow
- profile_name: aws profile name credential to use
- limit: how many of latest execution to scan if not specified will go through all failed executions
- ex: python failed_workflow.py --workflow_arn arn:aws:states:us-west-2:123456:stateMachine:podaac-services-ops-hitide-backfill-forge --profile_name service_ops --limit 1000
replay.py
- Script used to get messages off dead letter queue and back into regular queue
- Takes 1 argument
- config: configuration that has the aws_profile, dlq_url, and sqs_url
- ex: replay --config config.cfg
regression.py
- Script to run backfill tool command on all collection that has a forge-tig configuartion file
- Script can be modify to exclude or test specific collections
memory_profiler.py
- Script to run profile the memory use of lambdas, currently only tig is being profiled
- Lambdas need to be modified to include lambda request id in cloudwatch logs
- Modify script with cloudwatch lambda to profile
- Modify script to include start time and end time range where cloudwatch events were logged
ECS facility
- ECS template to start docker : ecs_cluster_instance_autoscaling_cf_template.yml.tmpl
- ECS script to execute task : task-reaper.sh
- All ECS related resources are specified in ecs_cluster.tf
- ECS is a cluster of EC2 instances. While creating the EC2 instances, a key is given to create each EC2 and the key name is specified as key_name variable within variables.tf. At this moment, the following keys are specified for each environment
- backfill-tool-sit-cluster-keypair (SIT)
- backfill-tool-uat-cluster-keypair (UAT)
- backfill-tool-ops-cluster-keypair (OPS)
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file hitide_backfill_tool-0.13.0.tar.gz.
File metadata
- Download URL: hitide_backfill_tool-0.13.0.tar.gz
- Upload date:
- Size: 27.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
93770e7c9c778f751ec61d3098b291bd2bc45d08ba9bba02f14cc073cd829e09
|
|
| MD5 |
eda91d7266bcc15374f814a332d2832c
|
|
| BLAKE2b-256 |
b85b5763245862124dde828aabbea28da5a93335e8323a3f1519d6979410c61e
|
Provenance
The following attestation bundles were made for hitide_backfill_tool-0.13.0.tar.gz:
Publisher:
build.yml on podaac/hitide-backfill-tool
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
hitide_backfill_tool-0.13.0.tar.gz -
Subject digest:
93770e7c9c778f751ec61d3098b291bd2bc45d08ba9bba02f14cc073cd829e09 - Sigstore transparency entry: 1168331359
- Sigstore integration time:
-
Permalink:
podaac/hitide-backfill-tool@c54de90a7f573c9fe88b61cff8824a9b05a69b80 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/podaac
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
build.yml@c54de90a7f573c9fe88b61cff8824a9b05a69b80 -
Trigger Event:
push
-
Statement type:
File details
Details for the file hitide_backfill_tool-0.13.0-py3-none-any.whl.
File metadata
- Download URL: hitide_backfill_tool-0.13.0-py3-none-any.whl
- Upload date:
- Size: 34.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7b94000ca4df3985c8d79de6c63054b76da970cf9185372e8f8d4ba540b508fe
|
|
| MD5 |
de1c199fd7dad4046604fb48a602fb26
|
|
| BLAKE2b-256 |
8313c85cb416dad1f46252f5356217c69c195dbeb6a5fc42c1446af7869564f9
|
Provenance
The following attestation bundles were made for hitide_backfill_tool-0.13.0-py3-none-any.whl:
Publisher:
build.yml on podaac/hitide-backfill-tool
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
hitide_backfill_tool-0.13.0-py3-none-any.whl -
Subject digest:
7b94000ca4df3985c8d79de6c63054b76da970cf9185372e8f8d4ba540b508fe - Sigstore transparency entry: 1168331397
- Sigstore integration time:
-
Permalink:
podaac/hitide-backfill-tool@c54de90a7f573c9fe88b61cff8824a9b05a69b80 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/podaac
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
build.yml@c54de90a7f573c9fe88b61cff8824a9b05a69b80 -
Trigger Event:
push
-
Statement type: