Skip to main content

No project description provided

Project description

surf-archiver

The surf-archiver tool consists of two components:

  1. a CLI tool
  2. a remote client which can execute CLI installed on a remote machine

The CLI tool copies daily data from S3 supported storage, bundling it into a per experiment per day tar archives. Once completed it emits a message via RabbitMQ. The intended use case is for the CLI tool to installed on the Surf Data Archive and remotely triggered daily.

The tool copies data based on the S3 key. It is assumed that the keys have the following structure:

<SRC BUCKET>/<images|videos>/<EXPERIMENT-ID>/YYYYMMDD/*.tar

The resulting archive will be created at:

<DATA DIR>/<images|videos>/<EXPERIMENT-ID>/YYYY-MM-DD.tar

Installation of the CLI tool

The CLI tool can be installed via pipx.

In order to run the tool needs some configuration. It can be configured by via a yaml file:

bucket:             # S3 bucket data is pulled from
target_dir:         # Target directory data is stored in       
connction_url:      # RabbitMQ connection url
exchange_name:      # RabbitMQ Exchange name
log_file:           # log file path

By default the tool will look for this configuration in the ${HOME}/.surf-archiver. The tool will additionally look for these parameters in the shell environment. If they are defined here, they need to be prefixed by surf_archiver_.

In addition to the above, the following environment variables need to be set in order to connect to S3.

AWS_SECRET_KEY_ID=
AWS_ACCESS_KEY_ID=
AWS_ENDPOINT_URL=

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

surf_archiver-0.1.1a3.tar.gz (9.7 kB view details)

Uploaded Source

Built Distribution

surf_archiver-0.1.1a3-py3-none-any.whl (14.0 kB view details)

Uploaded Python 3

File details

Details for the file surf_archiver-0.1.1a3.tar.gz.

File metadata

  • Download URL: surf_archiver-0.1.1a3.tar.gz
  • Upload date:
  • Size: 9.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.4 CPython/3.10.12 Linux/6.5.0-1025-azure

File hashes

Hashes for surf_archiver-0.1.1a3.tar.gz
Algorithm Hash digest
SHA256 83e242b71a69696fb61b9122c4dff63eae33945e4289f74acb06e50929e6430a
MD5 a3c71682dd9fd093e88b04ca74e4cc1f
BLAKE2b-256 ecdc3575bccbd82d20adebeff6846d80207fb173a52165bd9c90520a34ed4eb8

See more details on using hashes here.

File details

Details for the file surf_archiver-0.1.1a3-py3-none-any.whl.

File metadata

  • Download URL: surf_archiver-0.1.1a3-py3-none-any.whl
  • Upload date:
  • Size: 14.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.4 CPython/3.10.12 Linux/6.5.0-1025-azure

File hashes

Hashes for surf_archiver-0.1.1a3-py3-none-any.whl
Algorithm Hash digest
SHA256 1eedc45f31d7cf6f6010954d7b5b68284ec3134e4be266fdb6d0c535d113f957
MD5 bd0777af4d14a405b593bace4a93a3de
BLAKE2b-256 bcf3acbe6f329dbd6cf205213f0d3d52ba9058921d4d33a298ef0442391c1502

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page