A small tool to download, match and save sequences from MIBiG.
Project description
pyMIBiG
A small tool to download, match and save sequences from MIBiG.
pyMIBiG can search by "organism name", "compound / product",
"biosynthetic class" and "entry quality" as intersections of every argument added.
Which means that the more arguments you add more restrictive your search becomes.
It uses the available MIBiG download files which have less information then
those returned when using their web search. So, for very specific queries,
that yield fewer results, you will be better using the web interface.
Usage
Download the available package of pyMIBiG and execute pymibig -<target>
where target is the term you wanto to search in MIBiG database.
You can also install it using pip. In a virtual environment execute:
pip install pymibig
By default pyMIBiG will fetch all entry data and information of a given target.
You may change that using optional aguments passed along with the <target>:
usage: pyMIBiG [-h] [-o ORGANISM] [-p PRODUCT] [-b BIOSYNT] [-c {complete,incomplete,unknown,all}] [-q {low,medium,high,questionable,all}]
A small tool to download, match and save targeted sequences from MIBiG.
options:
-h, --help show this help message and exit
-o ORGANISM, --organism ORGANISM
Organism name to query in database.
-p PRODUCT, --product PRODUCT
Compound to query in database.
-b BIOSYNT, --biosynt BIOSYNT
Biosynthetic class to query in database.
-c {complete,incomplete,unknown,all}, --completeness {complete,incomplete,unknown,all}
Loci completeness.
-q {low,medium,high,questionable,all}, --quality {low,medium,high,questionable,all}
Entry quality level.
You have to use at least one of the following arguments: organism, product or biosynt. The others are optional.
On first execution pyMIBiG will download the database files from
MIBiG and save locally,
so an internet connection will be needed, after that it can be used offline.
Latest release of pyMIBiG will download from MIBiG
Version 4.0 (November 15, 2024) the:
- Metadata in compressed format, including several JSON files;
- Nucleotide sequences of the biosynthetic gene clusters in compressed format, including several GBK files;
- Amino acid sequence translations of all genes from MIBiG entries are also available in a single compressed FASTa file.
Version 1.2.7 uses MIBiG Version 3.1 (October 7, 2022).
Output
pyMIBiG will create three files:
- a FASTa containing nucleotide sequences
- a FASTa containing aminoacid sequences
- a tab-separated value table with information on the selected sequences
The filenames will reflect the parameters used when searching the database.
Ps.: Retired entries will be presented in the table, but there will be no sequences for them.
Reference
MIBiG 4.0: Advancing Biosynthetic Gene Cluster Curation through Global Collaboration.
License
pyMiBiG is distributed under the terms of the LGPL 3.0 license.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pymibig-1.4.0.tar.gz.
File metadata
- Download URL: pymibig-1.4.0.tar.gz
- Upload date:
- Size: 10.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: python-httpx/0.27.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
489b3fca57558b20d52a6e3c8e8729f3961f21cea928d31c054e286cda8b5720
|
|
| MD5 |
e6c808733e9e13750c84373018a1538e
|
|
| BLAKE2b-256 |
c5d61f0a15c8cb458dcf2e6e3bb60399ca0cbee8f58654328809f90a60ea5649
|
File details
Details for the file pymibig-1.4.0-py3-none-any.whl.
File metadata
- Download URL: pymibig-1.4.0-py3-none-any.whl
- Upload date:
- Size: 14.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: python-httpx/0.27.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5d45098d7a1eeff988f76564f2ccd0610ec1d7fee5ea91a6f7b6e441bda64b05
|
|
| MD5 |
2f3d43018ec2b166f7d25b240b2cc4bf
|
|
| BLAKE2b-256 |
602c90ec3781adc7f3742e0cdcb702d3922e0b629ef2ef859a295666f47df2a3
|