accession

Tool to submit genomics pipeline outputs to the ENCODE Portal

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Natural Language
- English
Operating System
- OS Independent
Programming Language

Project description

accession is a Python module and command line tool for submitting genomics pipeline analysis output files and metadata to the ENCODE Portal.

Installation

Note: installation requires Python >= 3.8

$ pip install accession

Next, provide your API keys from the ENCODE portal:

$ export DCC_API_KEY=XXXXXXXX
$ export DCC_SECRET_KEY=yyyyyyyyyyy

It is highly recommended to set the DCC_LAB and DCC_AWARD environment variables for ease of use. These correspond to the lab and award identifiers given by the ENCODE portal, e.g. /labs/foo/ and U00HG123456, respectively.

$ export DCC_LAB=/labs/foo/
$ export DCC_AWARD=U00HG123456
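
Before running the tool, it can help to confirm that the variables above are actually visible to the process. The following is a minimal sketch of such a check; it is not part of accession, and the helper name missing_vars is hypothetical.

```python
import os

# Variable names taken from the installation steps above.
REQUIRED_VARS = ("DCC_API_KEY", "DCC_SECRET_KEY")
RECOMMENDED_VARS = ("DCC_LAB", "DCC_AWARD")


def missing_vars(names, env=None):
    """Return the names that are unset or empty in the given environment.

    Hypothetical helper for illustration; accession itself may validate
    its configuration differently.
    """
    env = os.environ if env is None else env
    return [name for name in names if not env.get(name)]


if __name__ == "__main__":
    for name in missing_vars(REQUIRED_VARS):
        print(f"required variable {name} is not set")
    for name in missing_vars(RECOMMENDED_VARS):
        print(f"recommended variable {name} is not set")
```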

If you are accessioning workflows produced using the Caper local backend, then installation is complete. However, if using WDL metadata from pipeline runs on Google Cloud, you will also need to authenticate with Google Cloud. Run the following two commands and follow the prompts:

$ gcloud auth login --no-launch-browser
$ gcloud auth application-default login --no-launch-browser

If you would like to pass Caper workflow IDs or labels, you will need to configure access to the Caper server. If you are invoking accession from a machine where Caper is already set up and the Caper configuration file is available at ~/.caper/default.conf, no extra setup is required. If the Caper server is on another machine, you will need to configure HTTP access to it by setting the hostname and port values in the Caper conf file.

(Optional) Finally, to enable using Cloud Tasks to upload files from Google Cloud Storage to AWS S3, set the following two environment variables. If either of them is not set, files will be uploaded from the same machine that runs the accessioning code. For more information on how to set up Cloud Tasks and the upload service, see the docs for the gcs-s3-transfer-service.

$ export ACCESSION_CLOUD_TASKS_QUEUE_NAME=my-queue
$ export ACCESSION_CLOUD_TASKS_QUEUE_REGION=us-west1
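
The rule described above (Cloud Tasks only when both variables are set, local upload otherwise) can be sketched as follows. This illustrates the documented behavior and is not accession's internal code; the function name upload_mode and the returned labels are hypothetical.

```python
import os

# Both variables must be set for Cloud Tasks uploads, per the text above.
QUEUE_VARS = (
    "ACCESSION_CLOUD_TASKS_QUEUE_NAME",
    "ACCESSION_CLOUD_TASKS_QUEUE_REGION",
)


def upload_mode(env=None):
    """Return "cloud-tasks" only if every queue variable is set, else "local".

    The two labels are made up for this sketch.
    """
    env = os.environ if env is None else env
    if all(env.get(name) for name in QUEUE_VARS):
        return "cloud-tasks"
    return "local"


if __name__ == "__main__":
    print(f"uploads would use: {upload_mode()}")
```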

To accession workflows produced on the AWS backend, you will need to set up AWS credentials. The easiest way to do this is to install the AWS CLI and run aws configure.

Usage

$ accession -m metadata.json \
            -p mirna \
            -s dev
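
When scripting several submissions, the same command line can be assembled from Python. The sketch below uses only the three flags shown above. The parameter names metadata_path, pipeline, and server are a guess at what the flags mean, so check the docs before relying on them.

```python
import shlex


def build_accession_command(metadata_path, pipeline, server):
    """Build the argument list for the accession CLI call shown above.

    Only the -m, -p and -s flags from the usage example are used; the
    Python parameter names are assumptions made for readability.
    """
    return ["accession", "-m", metadata_path, "-p", pipeline, "-s", server]


command = build_accession_command("metadata.json", "mirna", "dev")
# Print the shell-quoted form for inspection.
print(shlex.join(command))
# To actually run it: subprocess.run(command, check=True)
```

Passing the arguments as a list rather than a single string avoids shell quoting problems when the metadata path contains spaces.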

Please see the docs for greater detail on these input parameters.

Deploying on Google Cloud

First, authenticate with Google Cloud via gcloud auth login if needed. Then install the API client with pip install google-api-python-client; doing this inside a venv is recommended. Next, create the firewall rule and deploy the instance by running python deploy.py --project $PROJECT. This also installs the accession package. Finally, SSH onto the new instance and run gcloud auth login to authenticate on the instance.

For Caper integration, once the instance is up, SSH onto it and create the Caper conf file at ~/.caper/default.conf. Use the private IP of the Caper VM instance as the hostname and 8000 as the port. For the connection to work, the Caper VM must have the tag caper-server. Note also that the deployment assumes the Cromwell server port is set to 8000.
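
As a rough illustration, the two conf entries could be generated like this. The sketch assumes the Caper conf file uses plain key=value lines, which should be checked against the Caper documentation. The IP address is a placeholder and the function name render_caper_conf is hypothetical.

```python
def render_caper_conf(hostname, port=8000):
    """Return conf text with the hostname and port entries.

    Assumes a key=value file format; 8000 matches the Cromwell server
    port that the deployment described above expects.
    """
    return f"hostname={hostname}\nport={port}\n"


# 10.0.0.2 is a placeholder for the private IP of the Caper VM.
print(render_caper_conf("10.0.0.2"), end="")
```

The sketch only prints the text. Add the lines to ~/.caper/default.conf by hand so that any existing settings are kept.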

AWS Notes

To enable S3 to S3 copy from the pipeline buckets to the ENCODE buckets, ensure that the pipeline bucket policy grants read access to the ENCODE account. Here is an example policy:

{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Sid": "DelegateS3AccessGet",
            "Effect": "Allow",
            "Principal": {
                "AWS": [
                    "arn:aws:iam::618537831167:root",
                    "arn:aws:iam::159877419961:root"
                ]
            },
            "Action": "s3:GetObject",
            "Resource": "arn:aws:s3:::PIPELINE-BUCKET/*"
        },
        {
            "Sid": "DelegateS3AccessList",
            "Effect": "Allow",
            "Principal": {
                "AWS": [
                    "arn:aws:iam::618537831167:root",
                    "arn:aws:iam::159877419961:root"
                ]
            },
            "Action": "s3:ListBucket",
            "Resource": "arn:aws:s3:::PIPELINE-BUCKET"
        }
    ]
}
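
To avoid hand-editing the bucket name in two places, the example policy can be generated for a given bucket. This is a minimal sketch that reproduces the policy above. The function name build_bucket_policy and the bucket name my-pipeline-bucket are hypothetical.

```python
import json

# Principals copied from the example policy above.
ENCODE_PRINCIPALS = (
    "arn:aws:iam::618537831167:root",
    "arn:aws:iam::159877419961:root",
)


def build_bucket_policy(bucket):
    """Return the example read-access policy for the named pipeline bucket."""

    def statement(sid, action, resource):
        return {
            "Sid": sid,
            "Effect": "Allow",
            "Principal": {"AWS": list(ENCODE_PRINCIPALS)},
            "Action": action,
            "Resource": resource,
        }

    return {
        "Version": "2012-10-17",
        "Statement": [
            # Object reads apply to keys inside the bucket.
            statement("DelegateS3AccessGet", "s3:GetObject", f"arn:aws:s3:::{bucket}/*"),
            # Listing applies to the bucket itself.
            statement("DelegateS3AccessList", "s3:ListBucket", f"arn:aws:s3:::{bucket}"),
        ],
    }


print(json.dumps(build_bucket_policy("my-pipeline-bucket"), indent=4))
```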

Project Information

accession is released under the MIT license. Documentation lives on Read the Docs, code is hosted on GitHub, and releases are published on PyPI.

Release history

Version	Release date
4.8.4 (this version)	Nov 7, 2022
4.8.3	Feb 28, 2022
4.8.2	Feb 14, 2022
4.8.1	Feb 11, 2022
4.8.0	Feb 10, 2022
4.7.1	Feb 1, 2022
4.7.0	Feb 1, 2022
4.6.0	Jan 19, 2022
4.5.0	Jan 18, 2022
4.4.0	Dec 15, 2021
4.3.0	Dec 10, 2021
4.2.0	Oct 15, 2021
4.1.0	Sep 24, 2021
4.0.3	Sep 16, 2021
4.0.2	Aug 20, 2021
4.0.1	Aug 20, 2021
4.0.0	Aug 20, 2021
3.11.0	Jul 9, 2021
3.10.0	Jun 18, 2021
3.9.1	Jun 4, 2021
3.9.0	Jun 3, 2021
3.8.2	May 7, 2021
3.8.1	Apr 22, 2021
3.8.0	Apr 8, 2021
3.7.0	Apr 7, 2021
3.6.0	Apr 7, 2021
3.5.0	Apr 2, 2021
3.4.0	Mar 10, 2021
3.3.0	Mar 1, 2021
3.2.3	Feb 10, 2021
3.2.2	Feb 8, 2021
3.2.1	Feb 8, 2021
3.2.0	Feb 4, 2021
3.1.0	Jan 28, 2021
3.0.6	Jan 27, 2021
3.0.5	Jan 26, 2021
3.0.4	Jan 23, 2021
3.0.3	Jan 7, 2021
3.0.2	Nov 25, 2020
3.0.1	Nov 25, 2020
3.0.0	Nov 24, 2020
2.8.0	Nov 13, 2020
2.7.0	Nov 10, 2020
2.6.0	Nov 4, 2020
2.5.0	Oct 9, 2020
2.4.3	Sep 25, 2020
2.4.2	Sep 14, 2020
2.4.1	Sep 11, 2020
2.4.0	Sep 9, 2020
2.3.1	Sep 4, 2020
2.3.0	Sep 4, 2020
2.2.2	Sep 2, 2020
2.2.1	Aug 6, 2020
2.2.0	Aug 6, 2020
2.1.0	Aug 3, 2020
2.0.0	Jul 28, 2020
1.9.0	Jul 22, 2020
1.8.0	Jul 21, 2020
1.7.1	Jul 14, 2020
1.7.0	Jul 14, 2020
1.6.0	Jun 24, 2020
1.5.1	Jun 9, 2020
1.5.0	Jun 4, 2020
1.4.0	May 29, 2020
1.3.1	May 8, 2020
1.3.0	Apr 28, 2020
1.2.2	Apr 21, 2020
1.2.1	Apr 17, 2020
1.2.0	Apr 15, 2020
1.1.0	Apr 6, 2020
1.0.1	Apr 3, 2020
1.0.0	Mar 31, 2020
0.1.0	Feb 14, 2020
0.0.37	Nov 25, 2019
0.0.36	Sep 10, 2019
0.0.35	Aug 23, 2019
0.0.34	Aug 14, 2019
0.0.33	Aug 12, 2019
0.0.32	Jul 3, 2019
0.0.31	Jun 26, 2019
0.0.30	Jun 26, 2019
0.0.25	Jun 24, 2019
0.0.24	Jun 24, 2019
0.0.23	Jun 24, 2019
0.0.22	Jun 24, 2019
0.0.21	Jun 21, 2019
0.0.20	Jun 21, 2019
0.0.19	Jun 21, 2019
0.0.18	Jun 14, 2019
0.0.17	Jun 14, 2019
0.0.16	Jun 14, 2019
0.0.15	Jun 14, 2019
0.0.14	Apr 3, 2019
0.0.13	Apr 3, 2019
0.0.12	Apr 2, 2019
0.0.11	Mar 28, 2019
0.0.10	Mar 22, 2019
0.0.9	Mar 22, 2019
0.0.8	Mar 22, 2019
0.0.7	Mar 22, 2019
0.0.6	Mar 22, 2019
0.0.5	Mar 22, 2019

Download files

Source Distribution

accession-4.8.4.tar.gz (102.8 kB)

Uploaded Nov 7, 2022 Source

Built Distribution

accession-4.8.4-py3-none-any.whl (134.9 kB)

Uploaded Nov 7, 2022 Python 3

File details

Details for the file accession-4.8.4.tar.gz.

File metadata

Download URL: accession-4.8.4.tar.gz
Upload date: Nov 7, 2022
Size: 102.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for accession-4.8.4.tar.gz
Algorithm	Hash digest
SHA256	`19eb62abc083a2370d62e7171c0f9bf1347b96ecbea98ebc57580cbebe2032b3`
MD5	`63ffd145adb8cd1a45db8ce53c11bf82`
BLAKE2b-256	`d05f66992f4c1298ce17e12c3b7fe7edc3c470c3c6d4aa9b8f52834884b041a4`
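
A downloaded file can be checked against the SHA256 digest listed above with the standard library. This is a minimal sketch; the function names are hypothetical, and it assumes the archive has been saved locally under its original file name.

```python
import hashlib

# SHA256 digest listed above for accession-4.8.4.tar.gz.
EXPECTED_SHA256 = "19eb62abc083a2370d62e7171c0f9bf1347b96ecbea98ebc57580cbebe2032b3"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_digest(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected value."""
    return sha256_of_file(path) == expected.lower()


# Example, with the archive in the current directory:
# matches_digest("accession-4.8.4.tar.gz")
```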

File details

Details for the file accession-4.8.4-py3-none-any.whl.

File metadata

Download URL: accession-4.8.4-py3-none-any.whl
Upload date: Nov 7, 2022
Size: 134.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for accession-4.8.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`34a0f2280a46abc39a8c765da8dc1142f2d4eadacbbf0a2a73369aea5bb9438e`
MD5	`630513d2dbb8b9543a8fefc0ddf2e5f7`
BLAKE2b-256	`b5ceb862b70742d228c2a7065aafc4289a3d777b8b9af5b0341b9d0c35f5b87f`
