Skip to main content

This script facilitates the submission of projects, samples, runs, assemblies, and other analyses to the public repository ENA (European Nucleotide Archive). It also assists in validating AMR (Antimicrobial Resistance) antibiograms before submission.

Project description

enaCLI

Introduction

ENA-CLI is a command-line tool designed to facilitate the validation and submission of data to the European Nucleotide Archive (ENA). This tool streamlines the process of preparing and uploading data files, ensuring they meet ENA's submission requirements.

Installation

pip install enaCLI

templates or templates_all

Test Data

File Upload Reminder

Before using the enaCLI for other type of analysis submission (5), ensure you have uploaded your files to ENA using the Webin file uploader. Detailed instructions on how to upload files can be found here.

Doker image

Usage

1. Project Submission

Usage

enaCLI project -h

Example

enaCLI project -u Webin-XXXX -p 'XXXXXX' -m templates/templates.xlsx -t

Options

  • -u: Webin submission account
  • -p: Password for the submission account
  • -m: Manifest file (template: templates/templates.xlsx)
  • -t: Use Webin test service (optional)

2. Sample Submission

Usage

enaCLI sample -h

Example

enaCLI sample -u Webin-XXXX -p 'XXXXXX' -m templates/templates.xlsx -t

Options

  • -u: Webin submission account
  • -p: Password for the submission account
  • -m: Manifest file (template: templates/templates.xlsx)
  • -t: Use Webin test service (optional)

3. Run Submission

Usage

enaCLI run -h

Example

enaCLI run -u Webin-XXXX -p 'XXXXXX' -m templates/templates.xlsx -i test_data/run -t

Options

  • -u: Webin submission account
  • -p: Password for the submission account
  • -m: Manifest file (template: templates/templates.xlsx)
  • -i: Input directory for files declared in the manifest file
  • -C: The center name of the submitter (mandatory for broker accounts)
  • -t: Use Webin test service (optional)

4. Genome Assembly Submissions

Usage

enaCLI genome -h

Example

enaCLI genome -u Webin-XXXX -p 'XXXXXX' -m templates/templates.xlsx -i test_data/genome -c genome -t

Options

  • -u: Webin submission account
  • -p: Password for the submission account
  • -m: Manifest file (template: templates/templates.xlsx)
  • -i: Input directory for files declared in the manifest file
  • -c: Assembly submission type (choices: genome, transcriptome)
  • -C: The center name of the submitter (mandatory for broker accounts)
  • -t: Use Webin test service (optional)

5. Targeted Command

The targeted command facilitates the submission of targeted sequences to the public repository ENA (European Nucleotide Archive). It requires the following mandatory arguments:

  • -u, --username: Webin submission account (e.g., Webin-XXX).
  • -p, --password: Password for the submission account.
  • -m, --manifestFile: Path to the manifest file specifying the details of the submission. The manifest file should follow the template provided in templates/templates.xlsx.
  • -i, --inputDir: Path to the input directory containing the files declared in the manifest file.

Additionally, the following optional arguments can be provided:

  • -C, --centerName: The center name of the submitter (mandatory for broker accounts).
  • -t, --test: Use Webin test service instead of the production service. Please note that the Webin upload area is shared between test and production services, and that test submission files will not be archived.

Usage Example:

enaCLI targeted -u Webin-XXXX -p 'XXXXXX' -m templates/templates.xlsx -i test_data/targeted -t

6. Other Submission

Usage

enaCLI other -h

Example

enaCLI other -u Webin-XXXX -p 'XXXXXX' -m templates/templates.xlsx -i test_data/other -a AMR_ANTIBIOGRAM -t

Options

  • -u: Webin submission account
  • -p: Password for the submission account
  • -m: Manifest file (template: templates/templates.xlsx)
  • -i: Input directory for files declared in the manifest file
  • -a: Analysis type (choices: GENOME_MAP, REFERENCE_ALIGNMENT, SEQUENCE_ANNOTATION, ASSEMBLY_GRAPH, PROCESSED_READ, PATHOGEN_ANALYSIS, AMR_ANTIBIOGRAM, COVID-19_FILTERED_VCF, COVID-19_CONSENSUS, PHYLOGENY_ANALYSIS)
  • -C: The center name of the submitter (mandatory for broker accounts)
  • -t: Use Webin test service (optional)

7. enaCLI all (the 🪄 option)

Description

The all command combines all submissions (projects, samples, runs, genome assemblies, targets, and other analysis objects) into a single command line. This aims to streamline submission to ENA. However, please note the following before running the package:

  1. Submit all associated data for other analysis types (different to genome or transcriptome) via Webin file uploader or other tools. More information is available here.
  2. Remove all lines or prepend '#' to each line if they are already submitted or you do not wish to submit them.
  3. You can reference projects, samples, or runs using their aliases or accession numbers. Please add the prefix 'webin-reads-' if you want to use the alias with a run. For example, if you set alias=run_alias_1 during your run submission, then to refer to this submission, you would use 'webin-reads-run_alias_1'.
  4. Ensure all data is placed in a single folder for the 'inputDir' option.
  5. Complete the template while adhering to its structure - an example template is available here.

Help

enaCLI all -h

Example

enaCLI all -u webin-XXXX -p 'XXXXXX' -m templates/templates_all.xlsx -i test_data/all/ -c genome -a AMR_ANTIBIOGRAM -t

Options

  • -u: Webin submission account: Indicates the Webin submission account.
  • -p: Password: Indicates the password for the Webin submission account.
  • -m: Manifest file: Specifies the path to the manifest file.
  • -i: Input directory for all files declared in the manifest file
  • -a: Analysis type: Specifies the type of analysis provided in the XML.
  • -c: Assembly submission type: Specifies the type of assembly submission.
  • -C: Center name: Specifies the center name of the submitter.
  • -t: Test submission: Submits the data as a test (optional).

Contact Information

For any errors or assistance, please contact the ENA helpdesk.

License

This project is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

enaCLI-1.0.6.tar.gz (69.1 MB view details)

Uploaded Source

Built Distribution

enaCLI-1.0.6-py3-none-any.whl (69.1 MB view details)

Uploaded Python 3

File details

Details for the file enaCLI-1.0.6.tar.gz.

File metadata

  • Download URL: enaCLI-1.0.6.tar.gz
  • Upload date:
  • Size: 69.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.9.16

File hashes

Hashes for enaCLI-1.0.6.tar.gz
Algorithm Hash digest
SHA256 d3a6ecbb5869b2a2522ca546cb19156ed72f0cfde596e3210e0150d71a60d828
MD5 676066c79359e3a94b35293cc47608bc
BLAKE2b-256 87391aa02ffb68f28b9b1d29b1fd99a51c7f8dabd3fe0c663501145051e41fad

See more details on using hashes here.

File details

Details for the file enaCLI-1.0.6-py3-none-any.whl.

File metadata

  • Download URL: enaCLI-1.0.6-py3-none-any.whl
  • Upload date:
  • Size: 69.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.9.16

File hashes

Hashes for enaCLI-1.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 2040ebf70d3f16e31a7668ea91d7bf4cdf5283e0bbac33bc28aca8409351e723
MD5 1fbc7091ffc605fc45bacbe5a999ebe9
BLAKE2b-256 358e67e45a6ca60d38177872dcb835f1b7c99aa3ab4ec12c25f9efc28ac7ba4b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page