Skip to main content

git annex support for Ebrains Data Proxy

Project description

git-annex-remote-dataproxy

This git annex external remote extension uses AnnexRemote to talk to EBRAINS Data Proxy so that you can use EBRAINS Collaboratory buckets as Datalad siblings.

Install

pip install requests annexremote and put the git-annex-remote-dataproxy script somewhere on your $PATH.

Usage

create a dataset

$ datalad create pdfdata
[INFO   ] Creating a new annex repo at /home/duke/repo/pdfdata 
[INFO   ] scanning for unlocked files (this may take some time) 
create(ok): /home/duke/repo/pdfdata (dataset)
$ cd pdfdata/
$ rsync -ra /home/duke/Nextcloud/Work/PDFs/ ./
$ datalad save
add(ok): 1107.0903 Montbrio Pazo - Stuart Landau.pdf (file)                                          
  [199 similar messages have been suppressed; disable with datalad.ui.suppress-similar-results=off]  
save(ok): . (dataset)                                                                                
action summary:                                                                                      
  add (ok: 209)
  save (ok: 1)

tell the dataproxy remote our token and what bucket & prefix to use

$ export EBRAINS_TOKEN=$EBRAINS_TOKEN
$ export BUCKET_NAME=insference
$ export BUCKET_PATH=pdfs-annex

create the annex remote and datalad push --to it

$ git annex initremote insference type=external externaltype=dataproxy encryption=none
initremote insference ok
(recording state in git...)
$ datalad push --to insference
copy(ok): 15009.full.pdf (file) [to insference...]                                                   
  [193 similar messages have been suppressed; disable with datalad.ui.suppress-similar-results=off]  
action summary:                                                                                      
  copy (notneeded: 6, ok: 203)
$ which git-annex-remote-dataproxy
/home/duke/repo/datalad-dataproxy/git-annex-remote-dataproxy

Status

  • sloppy proof of concept
  • git annex testremote passes
  • PyPI package for easier install
  • better mechanism for specifying bucket & prefix

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

annex_dataproxy-1.0.0rc2.tar.gz (3.0 kB view details)

Uploaded Source

Built Distribution

annex_dataproxy-1.0.0rc2-py2.py3-none-any.whl (4.5 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file annex_dataproxy-1.0.0rc2.tar.gz.

File metadata

  • Download URL: annex_dataproxy-1.0.0rc2.tar.gz
  • Upload date:
  • Size: 3.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.6

File hashes

Hashes for annex_dataproxy-1.0.0rc2.tar.gz
Algorithm Hash digest
SHA256 7ef341d9348005c68f7486024ad6164411941500dd9e66ee42317d53daac7518
MD5 fd2a07857ada2177cc731d57730f171b
BLAKE2b-256 f93051e0096036f02f2e5699b56ad5fd4fe01a30f7cb4e64e6e11297a828bd2b

See more details on using hashes here.

File details

Details for the file annex_dataproxy-1.0.0rc2-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for annex_dataproxy-1.0.0rc2-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 144383efab9e21993c699763567f982d8634ac8d2d2300adbb552835249112ef
MD5 770ca86258053b6ca63c45d06afe2d59
BLAKE2b-256 a6e295570e118dd0ec01c4eba262cc768b391856a3e1635061ead1f909d3dd1a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page