
Screens for the presence of genes of interest (GOI) in bacterial assemblies

Project description

Screen assemblies

A pipeline that screens for the presence of genes of interest (GOIs) in bacterial assemblies. It generates multiple CSV files and plots that describe which genes are present and how variable their sequences are. The query sequences (GOIs) can be DNA or protein, and the database (db) to search in can be DNA contigs/fastas or protein fastas.

If you use this script in a scientific publication, please cite Davies et al., 2019, Nature Genetics, http://dx.doi.org/10.1038/s41588-019-0417-8

Getting Started

You need one fasta file containing all GOIs as the query and a folder of db contigs/fastas. Db file names can contain only one '.' (e.g., sample_1.fa, NOT sample.1.fa).
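The file-name rule is easy to check before a run. The following is a minimal sketch, not part of screen_assembly itself; the function names has_extra_dots and find_bad_db_names are hypothetical helpers introduced here, and the rule is assumed to mean "no more than one '.' in the file name".

```python
from pathlib import Path


def has_extra_dots(filename):
    """Return True if the file name contains more than one '.'."""
    return filename.count(".") > 1


def find_bad_db_names(db_dir):
    """Return the sorted names of files in db_dir that break the one-dot rule.

    Hypothetical helper: db_dir is the folder holding the db contigs/fastas.
    """
    return sorted(
        p.name
        for p in Path(db_dir).iterdir()
        if p.is_file() and has_extra_dots(p.name)
    )


if __name__ == "__main__":
    import sys

    # Usage: python check_db_names.py path/to/db_folder
    for name in find_bad_db_names(sys.argv[1] if len(sys.argv) > 1 else "."):
        print("Rename before running:", name)
```

Any file the script lists (for example sample.1.fa) would need renaming (for example to sample_1.fa) before it is used as a db file.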

Prerequisites

Required

Python 3

Command line blast

ftp://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/LATEST/

Optional

Clustal Omega

http://www.clustal.org/omega/

IQtree

http://www.iqtree.org/doc/Quickstart

Installing

pip3 install --user screen_assembly

Make sure screen_assembly3.py is in your PATH.

If tkinter is missing, run 'sudo apt-get install python3-tk' (on Ubuntu).

Check for updates

pip3 install --user --upgrade screen_assembly

Running the tests

Once screen_assembly3.py is in your PATH, type screen_assembly3.py -h. If all dependencies are installed, the help menu will display. Otherwise, read the error and install whichever dependency is missing.
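If the help menu does not appear, it can help to check which executables are visible on your PATH. The sketch below is an illustration only, not part of screen_assembly. The executable names are assumptions: blastn and makeblastdb are standard BLAST+ programs, clustalo is the usual Clustal Omega binary, and iqtree is the usual IQ-TREE binary (some installs name it iqtree2). Which of these the pipeline actually calls is not stated here.

```python
import shutil

# screen_assembly3.py comes from the install step; the other names are assumed.
REQUIRED = ["screen_assembly3.py", "blastn", "makeblastdb"]
OPTIONAL = ["clustalo", "iqtree"]


def missing_tools(names):
    """Return the executables from names that are not found on PATH."""
    return [name for name in names if shutil.which(name) is None]


if __name__ == "__main__":
    for label, tools in (("required", REQUIRED), ("optional", OPTIONAL)):
        missing = missing_tools(tools)
        if missing:
            print("Missing " + label + " tools: " + ", ".join(missing))
        else:
            print("All " + label + " tools found on PATH")
```

A tool reported as missing may simply be installed under a different name or in a directory that is not on PATH.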

Running the program

Please see the WIKI

Authors

License

This project is licensed under the MIT License; see the LICENSE file (https://github.com/shimbalama/screen_assembly/blob/master/LICENSE) for details.

Acknowledgments

  • Mark Davies lab and Jake for testing
