Skip to main content

Tools for creating and validating Daylily omics analysis reference buckets

Project description

daylily-omics-references

Bash utilities for creating and validating the reference S3 buckets used by daylily-ephemeral-cluster. The project now ships a single shell script that wraps AWS CLI commands to clone public reference data, validate an existing bucket, or ensure a bucket exists and matches the expected structure.

The commands shell out to the AWS CLI for recursive S3 copy operations. Ensure that the AWS CLI and jq are installed and that your environment is authenticated before running the script.

Usage

The entry point lives at scripts/daylily-omics-references.sh. Global options may be placed before the subcommand.

Usage: daylily-omics-references.sh [GLOBAL OPTIONS] <command> [ARGS]

Global options:
  --profile PROFILE   AWS profile to use
  --region REGION     AWS region to target (required for clone/ensure)

Commands:
  clone   Create a new reference bucket from the public source
  verify  Validate that a bucket matches the expected structure
  ensure  Verify a bucket or create it if missing

Clone a new reference bucket

scripts/daylily-omics-references.sh \
  --profile daylily-service \
  clone \
  --bucket-prefix myorg \
  --region us-west-2 \
  --execute

This creates myorg-omics-analysis-us-west-2, enables transfer acceleration and copies the default reference version (0.7.131c). Use --exclude-hg38, --exclude-b37, or --exclude-giab to omit large subsets. Pass --use-acceleration to copy via the S3 accelerate endpoint.

Verify an existing bucket

scripts/daylily-omics-references.sh \
  --profile daylily-service \
  verify \
  --bucket myorg-omics-analysis-us-west-2

This validates that the bucket exists, contains the expected folder structure and that its s3_reference_data_version.info marker matches the default version.

Ensure a bucket is ready for daylily-ephemeral-cluster

scripts/daylily-omics-references.sh \
  --profile daylily-service \
  ensure \
  --bucket-prefix myorg \
  --region us-west-2 \
  --execute

The command verifies the bucket when it already exists; otherwise it creates the bucket using the same cloning logic.

Development

The repository now consists solely of the bash script above. No Python dependencies are required. If you modify the script, ensure it remains POSIX shell compatible and keep the usage examples up to date.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

daylily_omics_references-0.3.3.tar.gz (21.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

daylily_omics_references-0.3.3-py3-none-any.whl (21.2 kB view details)

Uploaded Python 3

File details

Details for the file daylily_omics_references-0.3.3.tar.gz.

File metadata

File hashes

Hashes for daylily_omics_references-0.3.3.tar.gz
Algorithm Hash digest
SHA256 2a3ef41678940780729c3ffd080bd7116ee20f63012d4b1d7ec8d780f6786ec3
MD5 f8d1c962cb194f6e520c20ffe7b5a2e5
BLAKE2b-256 eec13a2932acf5d9b1d711c0e60fea54151204dd8b577fd492c5d12e8c3caccb

See more details on using hashes here.

File details

Details for the file daylily_omics_references-0.3.3-py3-none-any.whl.

File metadata

File hashes

Hashes for daylily_omics_references-0.3.3-py3-none-any.whl
Algorithm Hash digest
SHA256 2092e39330450ec9a137265b5c301e91259dcdb9de8aeebd4dd7dac88c8d522f
MD5 f6ee3d86c1199e80941cdd6f87c3fc33
BLAKE2b-256 af53f0aaf17c54543bbdbe465fccf8381704d1823c818cdfb0f1338f2a9757a7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page