screens for presence of genes of interest (GOI) in bacterial assemblies
Project description
Screen assemblies
Pipeline that screens for presence of genes of interest (GOI) in bacterial assemblies. Generates multiple CSVs and plots that describe which genes are present and how variable their sequence is. Can use DNA or protein query sequences (GOIs) and DNA contigs/fastas or protein fastas as database (db) to search in.
Getting Started
You need one fasta file with all GOIs as the query and a folder with db contigs/fastas. Db files can only have one '.' in the name (i.e., sample_1.fa NOT sample.1.fa)
Prerequisites
Required
Python 3 and scypi/Biopython Command line blast
Optional
Clustal Omega, RAXML and or IQtree
Installing
- Download the screen_assembly3.py script and place it in your PATH:
- git clone https://github.com/shimbalama/screen_assembly.git
- Make sure its executable (chmod +x screen_assembly/screen_assembly3.py)
- Export PATH="your_path:$PATH" (the command pwd will give you your PATH)
- Best to permanently add it to you path by adding it to .bash_profile (mac) or .profile (unix)
- Download lab_modules.py:
- git clone https://github.com/shimbalama/common_modules.git
- Make sure its executable (chmod +x common_modules/lab_modules.py)
- Export PYTHONPATH="your_path:$PYTHONPATH"
- Best to permanently add it to you path by adding it to .bash_profile (mac) or .profile (unix)
- Place the common_modules folder next to the screen_assembly folder (as thats where it looks by default). OR use a text editor to set this line in screen_assembly3.py to point at the dir you put lab_modules.py in: sys.path.append('../common_modules') becomes sys.path.append('your_path/common_modules')
Check for updates
- git pull
Running the tests
Once screen_assembly3.py is in your PATH type screen_assembly3.py -h . If you have all dependencies then the help menu will display. Otherwise read the erorr and install whichever dependency is missing.
Running the program
Please see the WIKI
Authors
- Liam McIntyre - Initial work - https://github.com/shimbalama/
License
This project is licensed under the MIT License - see the LICENSE https://github.com/shimbalama/screen_assembly/blob/master/LICENSE file for details
Acknowledgments
- Mark Davies lab and Jake for testing
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file read_overlap-1.0.0.tar.gz
.
File metadata
- Download URL: read_overlap-1.0.0.tar.gz
- Upload date:
- Size: 18.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 73da5dd68b87b3185b966f7432d028bf816890da891eaa0766e246ab752888c7 |
|
MD5 | 48917b91b0102f61667bb70115f6f3c7 |
|
BLAKE2b-256 | 9e8dc9d79f5c924dddd51b2c63c86ed844e48316b98a157e51af70e6fa49b978 |
File details
Details for the file read_overlap-1.0.0-py3-none-any.whl
.
File metadata
- Download URL: read_overlap-1.0.0-py3-none-any.whl
- Upload date:
- Size: 22.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9f2cf4df0777bca57dfd33887b6084722727ba14487b8c910f6a09c6476e386e |
|
MD5 | 5ecac75d1024f5ab73aa9d63110043a8 |
|
BLAKE2b-256 | fa2f86cb2e9b29455c0724b57d3318b96ffda84cda626226da13e9d58d7ebe5c |