Skip to main content

No project description provided

Project description

MGE Masker

This package finds MGEs based on annotations in a rich sequence file (Genbank or EMBL). There are 3 subcommands

  1. find_mges Search a rich sequence file for features annotated with text that suggests a MGE-associated element
  2. mask_mges Mask regions from a pseudogenome alignment with the regions in a GFF file produced using the find_mges command
  3. default_matches Show the default regex patterns used when searching for MGEs. This can be overridden by providing a similarly formatted file using the -m parameter with the find_mges command

Deafult patterns matched

.*\b[tT]ranspos
.*\b[pP]hage
.*\b[rR]epeat
.*\b[rR]eptitive
.*\b[iI]nsertion sequence
.*\bIS
.*\b[tT]n
.*\b[iI]ntegr
.*\b[Cc]onjug
.*\b[Pp]lasmid

Installation

Python3 only

pip install MGEmasker

or

pip3 install MGEmasker

Usage

usage: mgemasker [-h] {find_mges,mask_mges,default_matches} ...

A module to find MGEs in a rich sequence file and mask regions corresponding to the MGEs in a pseudogenome alignment.

The find_mges command searches a gbk or embl file for features that have MGE-associated annotations.
It writes a GFF file containing the positions of the matched features.

The mask_mges command takes a GFF file produced using the find_mges command and masks those regions in all sequences of a pseudogenome alignment based on the reference sequence used to find MGEs.

positional arguments:
  {find_mges,mask_mges,default_matches}
                        The following commands are available. Type mge_masker
                        <COMMAND> -h for more help on a specific commands
    find_mges           Search a rich sequence file for features annotated
                        with text that suggests a MGE-associated element
    mask_mges           Mask regions from a pseudogenome alignment with the
                        regions in a GFF file produced using the find_mges
                        command
    default_matches     Show the default regex patterns used when searching
                        for MGEs

optional arguments:
  -h, --help            show this help message and exit

find_mges usage

usage: mgemasker find_mges [-h] -g GENOME_FILE_PATH [-f {genbank,embl}]
                            [-i MERGE_INTERVAL] [-m MGE_FILE_PATH]

optional arguments:
  -h, --help            show this help message and exit
  -g GENOME_FILE_PATH, --genome_file_path GENOME_FILE_PATH
                        path to a genome file
  -f {genbank,embl}, --file_format {genbank,embl}
                        genome file format
  -i MERGE_INTERVAL, --merge_interval MERGE_INTERVAL
                        The maximum distance between MGEs when performing the
                        merging step (Default 1000bp)
  -m MGE_FILE_PATH, --mge_file_path MGE_FILE_PATH
                        path to a file containing regex MGE annotations

mask_mges usage

usage: mgemasker mask_mges [-h] -f FASTA_PATH -g GFF_FILE_PATH
                            [-m MASKING_CHARACTER]

optional arguments:
  -h, --help            show this help message and exit
  -f FASTA_PATH, --fasta_path FASTA_PATH
                        path to either a single fasta reference or a
                        pseudogenome alignment file
  -g GFF_FILE_PATH, --gff_file_path GFF_FILE_PATH
                        path to a gff file containing MGE regions to be masked
  -m MASKING_CHARACTER, --masking_character MASKING_CHARACTER
                        character used to mask (default: N)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

MGEmasker-0.1.13.tar.gz (8.9 kB view details)

Uploaded Source

Built Distribution

MGEmasker-0.1.13-py3-none-any.whl (13.2 kB view details)

Uploaded Python 3

File details

Details for the file MGEmasker-0.1.13.tar.gz.

File metadata

  • Download URL: MGEmasker-0.1.13.tar.gz
  • Upload date:
  • Size: 8.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.4.2 requests/2.25.1 setuptools/57.0.0 requests-toolbelt/0.8.0 tqdm/4.24.0 CPython/3.7.0

File hashes

Hashes for MGEmasker-0.1.13.tar.gz
Algorithm Hash digest
SHA256 45a1dd58e23061d7e81cd7a2f9f65cd06d0b38e580e65dc032ef0f7fb0228fd2
MD5 1541553af239d281828322963f904a40
BLAKE2b-256 3f0ba6f655a5427e08ff6bf8401c4a6835e54ec11c892d4b1f2fea888068d892

See more details on using hashes here.

File details

Details for the file MGEmasker-0.1.13-py3-none-any.whl.

File metadata

  • Download URL: MGEmasker-0.1.13-py3-none-any.whl
  • Upload date:
  • Size: 13.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.4.2 requests/2.25.1 setuptools/57.0.0 requests-toolbelt/0.8.0 tqdm/4.24.0 CPython/3.7.0

File hashes

Hashes for MGEmasker-0.1.13-py3-none-any.whl
Algorithm Hash digest
SHA256 d87ad7cfb4553fa7537375f7c89ff8abb10f2cbf1fa8ba7991d1c960db24cff1
MD5 acc8393bc94d10175bb55aeebe219f8f
BLAKE2b-256 e498fe51004d82d17dd60f7464c44ed655ed90dca0760fb50c6ed7304c73981c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page