No project description provided
Project description
synthaser
Process
synthaser
parses the results of a batch NCBI conserved domain search and determines
the domain architecture of secondary metabolite synthases.
Installation
Install from PyPI using pip:
$ pip install --user synthaser
or clone the repo and install locally:
$ git clone https://www.github.com/gamcil/synthaser
$ cd synthaser
$ pip install -e .
Dependencies
synthaser
is written for Python 3.6+ and its only external Python dependency is
requests
, which is used for querying the NCBI's APIs. To perform local
searches, the programs RPS-BLAST
as well as the CD-Search post-processing
utility, rpsbproc
, should be installed (available from ...)
Usage
A full synthaser
search can be performed as simply as:
$ synthaser -qf sequences.fasta
Where sequences.fasta
is a FASTA format file containing the protein sequences
that you would like to search.
For a full listing of available arguments, enter:
$ synthaser -h
Visualising your results
synthaser
is capable of generating fully-interactive, annotated visualisations
so you can easily explore your results. All that is required is one
extra argument:
$ synthaser -qf sequences.fasta -p
This will generate a figure like so:
Click here to play around with the full version of this example.
Saving your search session
synthaser
allows you to save your search results such that they can be easily
reloaded for further visualisation or exploration without having to fully re-do
the search.
To do this, use the --json_file
command:
$ synthaser -qf sequences.fasta --json_file sequences.json
This will save all of your results, in JSON format, to the file
sequences.json
. Then, loading this session back into synthaser
, is as easy
as:
$ synthaser --json_file sequences.json ...
Using your own rules
Though synthaser
was originally designed to analyse secondary metabolite synthases,
it can easily be repurposed to analyse the domain architectures of
any type of protein sequence.
Under the hood, synthaser
uses two files to determine 1) which domains to save
from a CD-Search run (domains.json
) and 2) the rules used to classify sequences based on their
domain architectures (rules.json
). By default, synthaser
will use the copies
of these files distributed with the program. However, providing your own rules
is as simple as:
$ synthaser -qf sequences.fasta -df my_domains.json -cf my_rules.json
For a detailed explanation of how to create your own synthaser
rule files,
as well as API
please refer to the documentation
.
Citations
If you found synthaser
helpful, please cite:
1. <pending>
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file synthaser-1.1.4.tar.gz
.
File metadata
- Download URL: synthaser-1.1.4.tar.gz
- Upload date:
- Size: 130.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.54.0 CPython/3.9.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3e90757f99de7b9fa1df37c7c0172cf9cf6f0e025332526f8a64dfdd63076e7e |
|
MD5 | f8022f750aec60a6c67b0cf4d21bc314 |
|
BLAKE2b-256 | f16a2e46acbc1a617a757235147fa4082afd52bf432cd78ac91cd5023c5722c2 |
File details
Details for the file synthaser-1.1.4-py3-none-any.whl
.
File metadata
- Download URL: synthaser-1.1.4-py3-none-any.whl
- Upload date:
- Size: 135.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.54.0 CPython/3.9.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e391d16420d3aab8811d97e01176b5fdbd7451e8f98c4bcef98c9e26c469b0d5 |
|
MD5 | 337d6fcbcd61ebab93dee8ec107ec797 |
|
BLAKE2b-256 | e0ccd92db943ab71fe5f1fc7e5073ccfa9f12af99199d5de23537fc0e889639f |