Skip to main content

git annex support for Ebrains Data Proxy

Project description

git-annex-remote-dataproxy

This git annex external remote extension uses AnnexRemote to talk to EBRAINS Data Proxy so that you can use EBRAINS Collaboratory buckets as Datalad siblings.

Install

pip install requests annexremote and put the git-annex-remote-dataproxy script somewhere on your $PATH.

Usage

create a dataset

$ datalad create pdfdata
[INFO   ] Creating a new annex repo at /home/duke/repo/pdfdata 
[INFO   ] scanning for unlocked files (this may take some time) 
create(ok): /home/duke/repo/pdfdata (dataset)
$ cd pdfdata/
$ rsync -ra /home/duke/Nextcloud/Work/PDFs/ ./
$ datalad save
add(ok): 1107.0903 Montbrio Pazo - Stuart Landau.pdf (file)                                          
  [199 similar messages have been suppressed; disable with datalad.ui.suppress-similar-results=off]  
save(ok): . (dataset)                                                                                
action summary:                                                                                      
  add (ok: 209)
  save (ok: 1)

tell the dataproxy remote our token and what bucket & prefix to use

$ export EBRAINS_TOKEN=$EBRAINS_TOKEN
$ export BUCKET_NAME=insference
$ export BUCKET_PATH=pdfs-annex

create the annex remote and datalad push --to it

$ git annex initremote insference type=external externaltype=dataproxy encryption=none
initremote insference ok
(recording state in git...)
$ datalad push --to insference
copy(ok): 15009.full.pdf (file) [to insference...]                                                   
  [193 similar messages have been suppressed; disable with datalad.ui.suppress-similar-results=off]  
action summary:                                                                                      
  copy (notneeded: 6, ok: 203)
$ which git-annex-remote-dataproxy
/home/duke/repo/datalad-dataproxy/git-annex-remote-dataproxy

Status

  • sloppy proof of concept
  • git annex testremote passes
  • PyPI package for easier install
  • better mechanism for specifying bucket & prefix

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

annex_dataproxy-1.0.0rc1.tar.gz (3.4 kB view details)

Uploaded Source

Built Distribution

annex_dataproxy-1.0.0rc1-py2.py3-none-any.whl (1.9 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file annex_dataproxy-1.0.0rc1.tar.gz.

File metadata

  • Download URL: annex_dataproxy-1.0.0rc1.tar.gz
  • Upload date:
  • Size: 3.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.6

File hashes

Hashes for annex_dataproxy-1.0.0rc1.tar.gz
Algorithm Hash digest
SHA256 91619381dca12d276d7d63bef8020b521cd1e79639dd642c567dd9f13428ed42
MD5 fff3f69075692488bbe6347206fd941f
BLAKE2b-256 0ae7b6411c986ffa324a60ece2a54f197dfdf4156b68765120733a946755089a

See more details on using hashes here.

File details

Details for the file annex_dataproxy-1.0.0rc1-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for annex_dataproxy-1.0.0rc1-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 34ed42a009947aea056d5746c4e1d41100620791c13a3779c85d443d0818b19b
MD5 a8e1e104d908d182eae78101c26fcb36
BLAKE2b-256 9f9940ea2712815c51d0647538d8bdd2c5205b10935a724330d75159e2c57f93

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page