Download datasets from the RE-MAT Clowder
Project description
RE-MAT Data
This package provides a simple interface to download and upload datasets used in the ReMat project.
Installation
This is a python package and can be installed using pip:
pip install remat-data-downloader
Authentication
You need to obtain a Clowder API key to use this package. You can obtain this
key by logging into the
RE-MAT Clowder instance and going
to your user settings page (grey silhouette in the upper right corner of the
page). Click on the API Keys tab and create a new key. Save this key in a file
nameed clowder_key.txt in the directory where you will be running the
remat-download-data command.
Usage
The remat-data command can be run from your command line. It has commands to interact with Clowder spaces and with datasets.
List Spaces
remat-data spaces list
Clowder Spaces
┏━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━┓
┃ Name ┃ ID ┃ datasets ┃
┡━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━┩
│ Front velocities │ 6674972be4b0a2d1b9ba0228 │ 303 │
│ DSC Post Cures │ 6669d4d0e4b0a2d1b9b9a797 │ 228 │
│ DSC Cure Kinetics │ 64343b6be4b01a23c58bad90 │ 307 │
└───────────────────┴──────────────────────────┴──────────┘
Download Datasets
Now that you know the unique ID for the space you want to download, you can
download the datasets using the download command. This command will create
subdirectories for each dataset and download the metadata as a json file and the
DSC Curve as a csv file.
You can re-run this command, and it will skip downloads of datasets that already exist in the directory.
remat-data spaces download 6669d4d0e4b0a2d1b9b9a797
List Collections
Collections are another way to organize datasets in Clowder. You can list all available collections using the following command:
remat-data collections list
Clowder Collections
┏━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━┓
┃ Name ┃ ID ┃ datasets ┃
┡━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━┩
│ Example Collection│ 507f1f77bcf86cd799439011 │ 150 │
└───────────────────┴──────────────────────────┴──────────┘
Download from Collections
Similar to downloading from spaces, you can download all datasets from a specific collection using its unique ID. This command will download metadata, DSC_Curve.csv files, and any MP4 video files for each dataset in the collection.
remat-data collections download 507f1f77bcf86cd799439011
Like the space download command, this will skip datasets that have already been downloaded to avoid redundant transfers.
Upload Files to Space
Using the upload command, you can upload files to a space. The command takes
the space name as a flag, optional --name flag for the dataset name and a list
of file name to upload. If dataset name is not mentioned, A default dataset is
created and the files are uploaded under the newly created dataset
Valid values of spaces are: --Cure, --PostCure, --FrontVelocity │
For help run
remat-data spaces upload --help
Command to upload:
remat-data spaces upload --Cure --name TEST-dataset-2 test3.csv DSC_Curve.csv
Configuration
Application configuration and space name to UUID mapping is stored in config.py Add new spaces to this config.py file
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file remat_data-1.1.3.tar.gz.
File metadata
- Download URL: remat_data-1.1.3.tar.gz
- Upload date:
- Size: 3.6 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fe5bd022b975a88e6add7f2dc215a5ce85af1c5d3ed9a3fa7cfdd31ab88ff68f
|
|
| MD5 |
f800e868149c957eb7f6798c0ccc7794
|
|
| BLAKE2b-256 |
4fcab8a14fbfed24ff11035412aaebb205886214c0634c29dea088b2c57672db
|
Provenance
The following attestation bundles were made for remat_data-1.1.3.tar.gz:
Publisher:
cd.yml on re-mat/remat-data
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
remat_data-1.1.3.tar.gz -
Subject digest:
fe5bd022b975a88e6add7f2dc215a5ce85af1c5d3ed9a3fa7cfdd31ab88ff68f - Sigstore transparency entry: 911120924
- Sigstore integration time:
-
Permalink:
re-mat/remat-data@a197965bd42901f66f7e7b89094075e952e4bcdb -
Branch / Tag:
refs/tags/1.1.3 - Owner: https://github.com/re-mat
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
cd.yml@a197965bd42901f66f7e7b89094075e952e4bcdb -
Trigger Event:
release
-
Statement type:
File details
Details for the file remat_data-1.1.3-py3-none-any.whl.
File metadata
- Download URL: remat_data-1.1.3-py3-none-any.whl
- Upload date:
- Size: 10.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
36a57e705b83cdc63f9445dce4a37608eeea042663c827d2b4d950db5aa49899
|
|
| MD5 |
f229ba011826bff5db0db08e62eed9da
|
|
| BLAKE2b-256 |
f880e0509426ea9fc28879eaed4e5b5ec388231260317cb2b6dc258ab5e9316a
|
Provenance
The following attestation bundles were made for remat_data-1.1.3-py3-none-any.whl:
Publisher:
cd.yml on re-mat/remat-data
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
remat_data-1.1.3-py3-none-any.whl -
Subject digest:
36a57e705b83cdc63f9445dce4a37608eeea042663c827d2b4d950db5aa49899 - Sigstore transparency entry: 911120995
- Sigstore integration time:
-
Permalink:
re-mat/remat-data@a197965bd42901f66f7e7b89094075e952e4bcdb -
Branch / Tag:
refs/tags/1.1.3 - Owner: https://github.com/re-mat
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
cd.yml@a197965bd42901f66f7e7b89094075e952e4bcdb -
Trigger Event:
release
-
Statement type: