
Screens for the presence of genes of interest (GOIs) in bacterial assemblies

Project description

Screen assemblies

A pipeline that screens for the presence of genes of interest (GOIs) in bacterial assemblies. It generates multiple CSVs and plots that describe which genes are present and how variable their sequences are. The queries (GOIs) can be DNA or protein sequences, and the database (db) to search in can be DNA contigs/fastas or protein fastas.

Getting Started

You need one fasta file containing all GOIs as the query and a folder of db contigs/fastas. Db file names can contain only one '.' (e.g., sample_1.fa, NOT sample.1.fa).
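The one-dot naming rule is easy to check before a run. The following is a minimal sketch, not part of the pipeline; the helper names bad_db_names and check_db_folder are hypothetical, and it assumes the rule means "no more than one '.' in the file name".

```python
from pathlib import Path


def bad_db_names(names):
    """Return the file names that contain more than one '.'."""
    return [name for name in names if name.count(".") > 1]


def check_db_folder(folder):
    """List files in `folder` whose names break the one-dot rule.

    Hypothetical helper: it only inspects names, it does not open the files.
    """
    names = sorted(p.name for p in Path(folder).iterdir() if p.is_file())
    return bad_db_names(names)


# Example with the two names used in the text above.
print(bad_db_names(["sample_1.fa", "sample.1.fa"]))
```

Calling check_db_folder on your db folder before screening lists the files that would need renaming, for example sample.1.fa to sample_1.fa.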

Prerequisites

Required

Python 3 with SciPy/Biopython, and command-line BLAST

Optional

Clustal Omega, RAxML and/or IQ-TREE

Installing

  • Download the screen_assembly3.py script and place it in your PATH:
    • git clone https://github.com/shimbalama/screen_assembly.git
    • Make sure it's executable (chmod +x screen_assembly/screen_assembly3.py)
    • export PATH="your_path:$PATH" (your_path is the directory containing the script; running pwd inside that directory prints it)
    • Best to add it to your PATH permanently by putting the export line in .bash_profile (mac) or .profile (unix)
  • Download lab_modules.py:
    • git clone https://github.com/shimbalama/common_modules.git
    • Make sure it's executable (chmod +x common_modules/lab_modules.py)
    • export PYTHONPATH="your_path:$PYTHONPATH"
    • Best to add it to your PYTHONPATH permanently by putting the export line in .bash_profile (mac) or .profile (unix)
  • Place the common_modules folder next to the screen_assembly folder (that's where the script looks by default), OR use a text editor to change this line in screen_assembly3.py so that it points at the directory containing lab_modules.py: sys.path.append('../common_modules') becomes sys.path.append('your_path/common_modules')
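To illustrate what the sys.path.append line does, here is a small sketch of adding a module directory to Python's import search path. The function add_module_dir is hypothetical and is not taken from screen_assembly3.py; it only shows the mechanism, using an absolute path so the result does not depend on the directory you run from.

```python
import sys
from pathlib import Path


def add_module_dir(module_dir):
    """Append `module_dir` to sys.path as an absolute path, once.

    Hypothetical helper for illustration; the script itself uses a plain
    sys.path.append('../common_modules').
    """
    path = str(Path(module_dir).expanduser().resolve())
    if path not in sys.path:
        sys.path.append(path)
    return path


# Example: register the current directory; replace "." with the folder
# that holds lab_modules.py (e.g. 'your_path/common_modules').
print(add_module_dir("."))
```

A relative entry such as '../common_modules' is resolved against the current working directory at import time, which is one reason an absolute path or a PYTHONPATH entry can be more predictable.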

Check for updates

  • git pull

Running the tests

Once screen_assembly3.py is in your PATH, type screen_assembly3.py -h. If all dependencies are installed, the help menu will display. Otherwise, read the error and install whichever dependency is missing.
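If the help menu does not appear, a quick way to see what is missing is to probe for the Python modules and command-line tools directly. This is a rough sketch, not part of the pipeline: the function name missing_dependencies is hypothetical, and the default tool names (blastn, makeblastdb) are an assumption about which BLAST+ executables are needed, so adjust them to whatever the error message mentions.

```python
import importlib.util
import shutil


def missing_dependencies(modules=("Bio", "scipy"),
                         tools=("blastn", "makeblastdb")):
    """Return the names of modules and executables that cannot be found.

    `modules` are Python import names (Biopython imports as `Bio`);
    `tools` are executables looked up on PATH. Both defaults are assumptions.
    """
    missing = [m for m in modules if importlib.util.find_spec(m) is None]
    missing += [t for t in tools if shutil.which(t) is None]
    return missing


# Prints an empty list when everything in the default set is available.
print(missing_dependencies())
```

The optional tools (Clustal Omega, RAxML, IQ-TREE) can be probed the same way by passing their executable names in `tools`.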

Running the program

Please see the WIKI

Authors

License

This project is licensed under the MIT License; see the LICENSE file (https://github.com/shimbalama/screen_assembly/blob/master/LICENSE) for details.

Acknowledgments

  • Mark Davies lab and Jake for testing
