Skip to main content

Converts an annotated DNA multi-sequence alignment (in NEXUS format) to an EMBL flatfile for submission to ENA via the Webin-CLI submission tool

Project description

annonex2embl

Build Status PyPI status PyPI pyversions PyPI version shields.io PyPI license

Converts an annotated DNA multi-sequence alignment (in NEXUS format) to an EMBL flatfile for submission to ENA via the Webin-CLI submission tool.

INSTALLATION

To get the most recent stable version of annonex2embl, run:

pip install annonex2embl

Or, alternatively, if you want to get the latest development version of annonex2embl, run:

pip install git+https://github.com/michaelgruenstaeudl/annonex2embl.git

INPUT, OUTPUT AND PREREQUISITES

  • Input: an annotated DNA multiple sequence alignment in NEXUS format; and a comma-delimited (CSV) metadata table
  • Output: a submission-ready, multi-record EMBL flatfile

Requirements / Input preparation

The annotations of a NEXUS file are specified via SETS-block, which is located beneath a DATA-block and defines sets of characters in the DNA alignment. In such a SETS-block, every gene and every exon charset must be accompanied by one CDS charset. Other charsets can be defined unaccompanied.

Example of a complete SETS-BLOCK

BEGIN SETS;
CHARSET matK_gene_forward = 929-2530;
CHARSET matK_CDS_forward = 929-2530;
CHARSET trnK_intron_forward = 1-928 2531-2813;
END;

Examples of corresponding DESCR variable

DESCR="tRNA-Lys (trnK) intron, partial sequence; maturase K (matK) gene, complete sequence"

EXAMPLE USAGE

cd into the annonex2embl package, then ...

On Linux / MacOS

SCRPT=$PWD/scripts/annonex2embl_launcher_CLI.py
INPUT=$PWD/examples/input/TestData1.nex
METAD=$PWD/examples/input/Metadata.csv
mkdir -p $PWD/examples/temp/
OTPUT=$PWD/examples/temp/TestData1.embl
DESCR='description of alignment here'  # Do not use double-quotes
EMAIL=your_email_here@yourmailserver.com
AUTHR='your name here'  # Do not use double-quotes
MNFTS=PRJEB00000
MNFTD=${DESCR//[^[:alnum:]]/_}

python3 $SCRPT -n $INPUT -c $METAD -d "$DESCR" -e $EMAIL -a "$AUTHR" -o $OTPUT --qualifiername "note" --productlookup --manifeststudy $MNFTS --manifestdescr $MNFTD --compress

On Windows

SET SCRPT=$PWD\scripts\annonex2embl_launcher_CLI.py
SET INPUT=$PWD\examples\input\TestData1.nex
SET METAD=$PWD\examples\input\Metadata.csv
mkdir $PWD\examples\temp\
SET OTPUT=$PWD\examples\temp\TestData1.embl
SET DESCR='description of alignment here'
SET EMAIL=your_email_here@yourmailserver.com
SET AUTHR='your name here'
SET MNFTS=PRJEB00000
SET MNFTD=a_unique_description_here

python %SCRPT% -n %INPUT% -c %METAD% -d %DESCR% -e %EMAIL% -a %AUTHR% -o %OTPUT% --productlookup --manifeststudy %MNFTS% --manifestdescr %MNFTD% --compress

CHANGELOG

See CHANGELOG.md for a list of recent changes to the software.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

annonex2embl-1.0.3.tar.gz (29.0 kB view details)

Uploaded Source

Built Distribution

annonex2embl-1.0.3-py3-none-any.whl (33.3 kB view details)

Uploaded Python 3

File details

Details for the file annonex2embl-1.0.3.tar.gz.

File metadata

  • Download URL: annonex2embl-1.0.3.tar.gz
  • Upload date:
  • Size: 29.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/51.1.1 requests-toolbelt/0.9.1 tqdm/4.55.0 CPython/3.9.1

File hashes

Hashes for annonex2embl-1.0.3.tar.gz
Algorithm Hash digest
SHA256 bfcb87a7b014b5b46735b2d36d8276ce623ef617a1c849d07030e70e511ac77f
MD5 cbd654e4209d3781f0e5478928d67103
BLAKE2b-256 3854c3435c3d192127cc38104d0cc13b4c9f305afa8bd2fdd7fefea14de64021

See more details on using hashes here.

File details

Details for the file annonex2embl-1.0.3-py3-none-any.whl.

File metadata

  • Download URL: annonex2embl-1.0.3-py3-none-any.whl
  • Upload date:
  • Size: 33.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/51.1.1 requests-toolbelt/0.9.1 tqdm/4.55.0 CPython/3.9.1

File hashes

Hashes for annonex2embl-1.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 57c611e6269569a2e0366e4fd2656f2e1d2a502af19c2b46e659fc2b480ba4aa
MD5 8ddf27f3fcac4498b42ba6b6fdb2a07a
BLAKE2b-256 e6e5d550da86efb1cce67f00ec52f713e355e81f993629352cf0037093de9642

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page