No project description provided
Project description
MGE Masker
This package finds MGEs based on annotations in a rich sequence file (Genbank or EMBL). There are 3 subcommands
- find_mges Search a rich sequence file for features annotated with text that suggests a MGE-associated element
- mask_mges Mask regions from a pseudogenome alignment with the regions in a GFF file produced using the find_mges command
- default_matches Show the default regex patterns used when searching for MGEs. This can be overridden by providing a similarly formatted file using the -m parameter with the find_mges command
Deafult patterns matched
.*\b[tT]ranspos
.*\b[pP]hage
.*\b[rR]epeat
.*\b[rR]eptitive
.*\b[iI]nsertion sequence
.*\bIS
.*\b[tT]n
.*\b[iI]ntegr
.*\b[Cc]onjug
.*\b[Pp]lasmid
Installation
Python3 only
pip install MGEmasker
or
pip3 install MGEmasker
Usage
usage: mgemasker [-h] {find_mges,mask_mges,default_matches} ...
A module to find MGEs in a rich sequence file and mask regions corresponding to the MGEs in a pseudogenome alignment.
The find_mges command searches a gbk or embl file for features that have MGE-associated annotations.
It writes a GFF file containing the positions of the matched features.
The mask_mges command takes a GFF file produced using the find_mges command and masks those regions in all sequences of a pseudogenome alignment based on the reference sequence used to find MGEs.
positional arguments:
{find_mges,mask_mges,default_matches}
The following commands are available. Type mge_masker
<COMMAND> -h for more help on a specific commands
find_mges Search a rich sequence file for features annotated
with text that suggests a MGE-associated element
mask_mges Mask regions from a pseudogenome alignment with the
regions in a GFF file produced using the find_mges
command
default_matches Show the default regex patterns used when searching
for MGEs
optional arguments:
-h, --help show this help message and exit
find_mges usage
usage: mgemasker find_mges [-h] -g GENOME_FILE_PATH [-f {genbank,embl}]
[-i MERGE_INTERVAL] [-m MGE_FILE_PATH]
optional arguments:
-h, --help show this help message and exit
-g GENOME_FILE_PATH, --genome_file_path GENOME_FILE_PATH
path to a genome file
-f {genbank,embl}, --file_format {genbank,embl}
genome file format
-i MERGE_INTERVAL, --merge_interval MERGE_INTERVAL
The maximum distance between MGEs when performing the
merging step (Default 1000bp)
-m MGE_FILE_PATH, --mge_file_path MGE_FILE_PATH
path to a file containing regex MGE annotations
mask_mges usage
usage: mgemasker mask_mges [-h] -f FASTA_PATH -g GFF_FILE_PATH
[-m MASKING_CHARACTER]
optional arguments:
-h, --help show this help message and exit
-f FASTA_PATH, --fasta_path FASTA_PATH
path to either a single fasta reference or a
pseudogenome alignment file
-g GFF_FILE_PATH, --gff_file_path GFF_FILE_PATH
path to a gff file containing MGE regions to be masked
-m MASKING_CHARACTER, --masking_character MASKING_CHARACTER
character used to mask (default: N)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
MGEmasker-0.1.13.tar.gz
(8.9 kB
view details)
Built Distribution
File details
Details for the file MGEmasker-0.1.13.tar.gz
.
File metadata
- Download URL: MGEmasker-0.1.13.tar.gz
- Upload date:
- Size: 8.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.4.2 requests/2.25.1 setuptools/57.0.0 requests-toolbelt/0.8.0 tqdm/4.24.0 CPython/3.7.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 45a1dd58e23061d7e81cd7a2f9f65cd06d0b38e580e65dc032ef0f7fb0228fd2 |
|
MD5 | 1541553af239d281828322963f904a40 |
|
BLAKE2b-256 | 3f0ba6f655a5427e08ff6bf8401c4a6835e54ec11c892d4b1f2fea888068d892 |
File details
Details for the file MGEmasker-0.1.13-py3-none-any.whl
.
File metadata
- Download URL: MGEmasker-0.1.13-py3-none-any.whl
- Upload date:
- Size: 13.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.4.2 requests/2.25.1 setuptools/57.0.0 requests-toolbelt/0.8.0 tqdm/4.24.0 CPython/3.7.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d87ad7cfb4553fa7537375f7c89ff8abb10f2cbf1fa8ba7991d1c960db24cff1 |
|
MD5 | acc8393bc94d10175bb55aeebe219f8f |
|
BLAKE2b-256 | e498fe51004d82d17dd60f7464c44ed655ed90dca0760fb50c6ed7304c73981c |