annonex2embl
Converts an annotated DNA multiple sequence alignment (in NEXUS format) to an EMBL flatfile for submission to ENA via the Webin-CLI submission tool.
INSTALLATION
To get the most recent stable version of annonex2embl, run:
pip install annonex2embl
Alternatively, to get the latest development version of annonex2embl, run:
pip install git+https://github.com/michaelgruenstaeudl/annonex2embl.git
INPUT, OUTPUT AND PREREQUISITES
- Input: an annotated DNA multiple sequence alignment in NEXUS format; and a comma-delimited (CSV) metadata table
- Output: a submission-ready, multi-record EMBL flatfile
Requirements / Input preparation
The annotations of a NEXUS file are specified via a SETS-block, which is located beneath the DATA-block and defines sets of characters in the DNA alignment. In such a SETS-block, every gene charset and every exon charset must be accompanied by one CDS charset. Other charsets can be defined unaccompanied.
Example of a complete SETS-block
BEGIN SETS;
CHARSET matK_gene_forward = 929-2530;
CHARSET matK_CDS_forward = 929-2530;
CHARSET trnK_intron_forward = 1-928 2531-2813;
END;
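The pairing rule above can be checked before running the conversion. The following is a minimal sketch, not part of annonex2embl: the helper names `parse_charsets` and `genes_missing_cds` are hypothetical, charset names are assumed to follow the `<name>_<type>_<orientation>` pattern seen in the example, and only plain `start-end` ranges are handled.

```python
import re

# Matches lines such as "CHARSET matK_gene_forward = 929-2530;"
CHARSET_RE = re.compile(
    r"^\s*CHARSET\s+(\S+)\s*=\s*([^;]+);", re.IGNORECASE | re.MULTILINE
)


def parse_charsets(sets_block):
    """Return {charset_name: [(start, end), ...]} for a SETS-block string."""
    charsets = {}
    for name, ranges in CHARSET_RE.findall(sets_block):
        spans = []
        for token in ranges.split():
            start, _, end = token.partition("-")
            # A single position such as "5" becomes the span (5, 5)
            spans.append((int(start), int(end or start)))
        charsets[name] = spans
    return charsets


def genes_missing_cds(charsets):
    """List names that have a gene or exon charset but no CDS charset.

    Assumption: names look like <name>_<type>_<orientation>.
    """
    by_type = {}
    for charset_name in charsets:
        parts = charset_name.split("_")
        if len(parts) < 3:
            continue  # name does not follow the assumed convention
        gene, feature = parts[0], parts[1].lower()
        by_type.setdefault(feature, set()).add(gene)
    needs_cds = by_type.get("gene", set()) | by_type.get("exon", set())
    return sorted(needs_cds - by_type.get("cds", set()))


example = """
BEGIN SETS;
CHARSET matK_gene_forward = 929-2530;
CHARSET matK_CDS_forward = 929-2530;
CHARSET trnK_intron_forward = 1-928 2531-2813;
END;
"""

charsets = parse_charsets(example)
print(charsets["trnK_intron_forward"])  # [(1, 928), (2531, 2813)]
print(genes_missing_cds(charsets))      # []
```

An empty list means every gene or exon charset has a CDS partner; the intron charset is ignored because it may stand alone.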
Example of the corresponding DESCR variable
DESCR="tRNA-Lys (trnK) intron, partial sequence; maturase K (matK) gene, complete sequence"
EXAMPLE USAGE
Change (cd) into the annonex2embl package directory, then run the commands for your platform:
On Linux / macOS
SCRPT=$PWD/scripts/annonex2embl_launcher_CLI.py
INPUT=$PWD/examples/input/TestData1.nex
METAD=$PWD/examples/input/Metadata.csv
mkdir -p $PWD/examples/temp/
OTPUT=$PWD/examples/temp/TestData1.embl
DESCR='description of alignment here' # Do not use double-quotes
EMAIL=your_email_here@yourmailserver.com
AUTHR='your name here' # Do not use double-quotes
MNFTS=PRJEB00000
MNFTD=${DESCR//[^[:alnum:]]/_}
python3 $SCRPT -n $INPUT -c $METAD -d "$DESCR" -e $EMAIL -a "$AUTHR" -o $OTPUT --qualifiername "note" --productlookup --manifeststudy $MNFTS --manifestdescr $MNFTD --compress
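In the commands above, the bash expansion `${DESCR//[^[:alnum:]]/_}` derives MNFTD from DESCR by replacing every non-alphanumeric character with an underscore. For shells without that syntax, the same value could be produced with a few lines of Python. This is a sketch only: the function name `manifest_description` is hypothetical, and it treats only ASCII letters and digits as alphanumeric, whereas `[:alnum:]` in bash depends on the locale.

```python
import re


def manifest_description(descr):
    """Replace every character that is not an ASCII letter or digit with "_"."""
    return re.sub(r"[^0-9A-Za-z]", "_", descr)


print(manifest_description("description of alignment here"))
# description_of_alignment_here
```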
On Windows
REM %CD% expands to the current directory in cmd.exe ($PWD is not available)
SET SCRPT=%CD%\scripts\annonex2embl_launcher_CLI.py
SET INPUT=%CD%\examples\input\TestData1.nex
SET METAD=%CD%\examples\input\Metadata.csv
mkdir %CD%\examples\temp\
SET OTPUT=%CD%\examples\temp\TestData1.embl
REM Set values without quotes; values containing spaces are quoted in the call below
SET DESCR=description of alignment here
SET EMAIL=your_email_here@yourmailserver.com
SET AUTHR=your name here
SET MNFTS=PRJEB00000
SET MNFTD=a_unique_description_here
python %SCRPT% -n %INPUT% -c %METAD% -d "%DESCR%" -e %EMAIL% -a "%AUTHR%" -o %OTPUT% --productlookup --manifeststudy %MNFTS% --manifestdescr %MNFTD% --compress
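As a platform-independent alternative to the two shell variants, the same call could be assembled in Python and passed to `subprocess`. The sketch below uses only the flags that appear in both examples above; `build_command` and `run_conversion` are hypothetical helpers, not part of annonex2embl, and the paths assume the current directory is the package directory.

```python
import subprocess
import sys
from pathlib import Path


def build_command(base_dir, descr, email, author, study, manifest_descr):
    """Assemble the launcher call from the examples as an argument list."""
    base = Path(base_dir)
    return [
        sys.executable,
        str(base / "scripts" / "annonex2embl_launcher_CLI.py"),
        "-n", str(base / "examples" / "input" / "TestData1.nex"),
        "-c", str(base / "examples" / "input" / "Metadata.csv"),
        "-d", descr,
        "-e", email,
        "-a", author,
        "-o", str(base / "examples" / "temp" / "TestData1.embl"),
        "--productlookup",
        "--manifeststudy", study,
        "--manifestdescr", manifest_descr,
        "--compress",
    ]


def run_conversion(cmd):
    """Create the output directory, then run the launcher (hypothetical helper)."""
    output_path = Path(cmd[cmd.index("-o") + 1])
    output_path.parent.mkdir(parents=True, exist_ok=True)
    subprocess.run(cmd, check=True)


cmd = build_command(
    Path.cwd(),
    descr="description of alignment here",
    email="your_email_here@yourmailserver.com",
    author="your name here",
    study="PRJEB00000",
    manifest_descr="a_unique_description_here",
)
print(" ".join(cmd))
# run_conversion(cmd)  # uncomment to execute inside the package directory
```

Passing the arguments as a list avoids shell quoting altogether, so descriptions and author names containing spaces need no special handling.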
CHANGELOG
See CHANGELOG.md for a list of recent changes to the software.