Skip to main content

Biosynthetic Gene Cluster finder with Graph Neural Network

Project description

# BGCfinder : Biosynthetic Gene Cluster detection with Graph Neural Network

BGCfinder detects biosynthetic gene clusters in bacterial genomes using deep learning. BGCfinder takes a fasta file containing bacterial protein coding sequences and embed each protein sequence into a graph. Graph Neural Network takes the graphs to detect biosynthetic gene cluster.

Author : Jihun Jeung, jihun@gm.gist.ac.kr, jeung4705@gmail.com, https://github.com/jihunni/BGCfinder

To construct the conda environment,

`bash $ conda create --name BGCfinder --clone base $ conda init bash $ conda activate BGCfinder $ conda install pytorch cudatoolkit=11.3 -c pytorch $ conda install pyg -c pyg $ pip install BGCfinder `

To download the BGCfinder model and test files, `bash $ bgc-download `

To run BGCfinder with a fasta file containing amino acid sequence with CPU, `bash bgcfinder bacterial_genome.fasta -o output_filename.tsv -l log_record.log -d False `

To run BGCfinder with a fasta file containing amino acid sequence with GPU, `bash bgcfinder bacterial_genome.fasta -o output_filename.tsv -l log_record.log -d True `

The development environment of BGCfinder : `bash 'torch==1.10.0', 'torch-geometric==2.0.2', 'torch-scatter==2.0.9', 'torch-sparse==0.6.12' `

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

BGCfinder-0.0.23-py3-none-any.whl (9.7 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page