Skip to main content

No project description provided

Project description

presented by Institute of Food Safety and Health, Illinois Institute of Technology

PlasmidHunter: Accurate and Fast Plasmid Prediction Based on Gene Content Using Machine Learning

Plasmids are extrachromosomal DNA found in microorganisms. They often carry beneficial genes that help bacteria adapt to harsh conditions. Plasmids are also important tools in genetic engineering, gene therapy, and drug production. However, it can be difficult to identify plasmid sequences from chromosomal sequences in genomic and metagenomic data. Here, we have developed a new tool called PlasmidHunter, which uses machine learning to predict plasmid sequences based on gene content profile. PlasmidHunter achieved high accuracies (up to 97.6%) and high speeds in benchmark tests including both simulated contigs and real metagenomic plasmidome data, outperforming other existing tools.

Keywords: artificial intelligence (AI), machine learning (ML), plasmid prediction, genomic sequencing

Installation and run

conda create -n plasmidhunter -c bioconda -y python=3.10 diamond=2.1.8 prodigal

conda activate plasmidhunter

pip install plasmidhunter

plasmidhunter -h

Result Interpretation

The result is a tab-delimited table showing the prediction of each sequence. The columns include Prediction (0: chromosome, 1: plasmid), Probability of 0 (chromosome), and Probability of 1 (plasmid).

Citation

PlasmidHunter: Accurate and fast prediction of plasmid sequences using gene content profile and machine learning

Renmao Tian, Jizhong Zhou, Behzad Imanian

bioRxiv 2023.02.01.526640; doi: https://doi.org/10.1101/2023.02.01.526640

Contact

If you have any questions, please contact Renmao Tian (tianrenmao[at]gmail.com) or Behzad Imanian (bimanian[at]iit.edu).

License

Educational Community License, Version 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

plasmidhunter-1.3.tar.gz (4.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

plasmidhunter-1.3-py3-none-any.whl (4.2 MB view details)

Uploaded Python 3

File details

Details for the file plasmidhunter-1.3.tar.gz.

File metadata

  • Download URL: plasmidhunter-1.3.tar.gz
  • Upload date:
  • Size: 4.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.13

File hashes

Hashes for plasmidhunter-1.3.tar.gz
Algorithm Hash digest
SHA256 93f24af03645d947487dc4f950deba02c40562634e53e31482ef87cbe743ee98
MD5 41e3f97a96734672ec178828ca4b8179
BLAKE2b-256 b06c29a5db1d3f4ac0ed8a7781862c076937d7936dd323cb34e95bac058c8bd6

See more details on using hashes here.

File details

Details for the file plasmidhunter-1.3-py3-none-any.whl.

File metadata

  • Download URL: plasmidhunter-1.3-py3-none-any.whl
  • Upload date:
  • Size: 4.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.13

File hashes

Hashes for plasmidhunter-1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 96af6a6220d1be62eae71df73e97b049afade7a44b87fdf25e7f5d1984e614e1
MD5 497042ca9bd9c78c90f571ec07b0cb51
BLAKE2b-256 492771b690c296a9e77e19cd640ae50f9f78ca0ce82ffda6f3acce95d7aaca24

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page