Skip to main content

Welcome to Open Biopipeline, an open source gene exploration tool!

Project description



Open Biopipeline is an open source bioinformatics tool for general and broad purposes, specifically the goal of this project is to develop a tool for researchers or students to use in order to investigate genes. This tool combines several popular existing bioinformatic tools, such as BLAST, KEGG, GO, and Protein Atlas, by consolidating them into one singular location. By using this tool you are able to retrieve: function gene annotation, protein function, clinical relevance, specific patient case information, and much more.

This project was inspired by my intro to biomedical engineering lab course. This fully automates and streamlines the process of analyzing unknown sequence data. You can view the python notebook below to see a working version.

Packages

  • Biopython
  • NCBI BLAST 2.10.0+
  • KEGG
  • UniProt
  • Protein Atlas

Installation

Windows and MacOS

Go to NCBI website here and download the installer. Install as you would any .exe program

Linux dependencies

uname -i
sudo apt-get install lftp
lftp -e "cd blast/executables/LATEST; dir; quit" ftp.ncbi.nlm.nih.gov | awk '{print $NF}'
wget ftp://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/LATEST/ncbi-blast-2.10.0+-x64-linux.tar.gz

tar -xzvf ncbi-blast-2.10.0+-x64-linux.tar.gz

# remove tar file
rm ncbi-blast-2.10.0+-x64-linux.tar.gz

cd ncbi-blast-2.10.0+
# so you can run the bin commands without specifying directory
export PATH = $PATH:$PWD

# or

export PATH = $PATH:$HOME/content/ncbi-blast-2.10.0+/bin

BLAST commands should work now, the following command should return an output other than "Unknown command".

blastn -version

Python Setup

pip install virtualenv

virtualenv bio_pipeline

bio_pipeline\Scripts\activate

pip install open-biopipeline

Example Pipeline Flow Chart

Figure 2. Flow chart diagram of bioinformatic pipeline, displaying flow of input/outputs

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to update tests as appropriate.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

open_biopipeline-0.0.4-py3-none-any.whl (8.1 kB view details)

Uploaded Python 3

File details

Details for the file open_biopipeline-0.0.4-py3-none-any.whl.

File metadata

  • Download URL: open_biopipeline-0.0.4-py3-none-any.whl
  • Upload date:
  • Size: 8.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.6.1 requests/2.24.0 setuptools/51.1.1 requests-toolbelt/0.9.1 tqdm/4.55.0 CPython/3.7.9

File hashes

Hashes for open_biopipeline-0.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 a49b0cfc5ff921a56acd0fac6452bb2253049b95e6fe8cc0a2212f81e3de89b8
MD5 5b103846ea7396ef0df5c2e6619c99e6
BLAKE2b-256 3d281903e65b1094f68f38e6b60fa2e411570b1e8314477cc7fb897247a9582a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page