Feature-based prediction of genome quality indices

CoCoPyE

CoCoPyE is a fast tool for quality assessment of microbial genomes. It reliably predicts the completeness and contamination of bacterial and archaeal genomes and can additionally provide a taxonomic classification of the input.

Background: The classical approach to estimating quality indices relies solely on a relatively small number of universal single-copy genes. Because these classical markers cover only a small fraction of the whole genome, the quality assessment can be rather unreliable. Our method is based on a novel two-stage feature extraction and transformation scheme. It first performs a flexible extraction of genomic markers and then refines the marker-based estimates with a machine learning approach based on count-ratio histograms. In our simulation studies, CoCoPyE predicted quality indices more accurately than existing tools.
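
To make the classical marker-based approach concrete, the following minimal Python sketch estimates completeness and contamination from the copy counts of universal single-copy marker genes. It is a simplified illustration of the classical idea only, not CoCoPyE's two-stage method; the function name `marker_based_estimate`, the marker names and the counts are all hypothetical.

```python
from typing import Dict, Tuple


def marker_based_estimate(marker_counts: Dict[str, int]) -> Tuple[float, float]:
    """Estimate (completeness, contamination) from single-copy marker counts.

    marker_counts maps each expected marker to the number of copies found
    in the query genome. Hypothetical helper, for illustration only.
    """
    n_markers = len(marker_counts)
    if n_markers == 0:
        raise ValueError("at least one marker is required")

    # Completeness: fraction of expected markers observed at least once.
    found = sum(1 for count in marker_counts.values() if count >= 1)
    # Contamination: surplus copies of markers that should occur exactly once.
    surplus = sum(count - 1 for count in marker_counts.values() if count > 1)

    return found / n_markers, surplus / n_markers


# Made-up counts for four hypothetical markers.
counts = {"marker_a": 1, "marker_b": 0, "marker_c": 2, "marker_d": 1}
completeness, contamination = marker_based_estimate(counts)
print(f"completeness={completeness:.2f}, contamination={contamination:.2f}")
```

With so few markers, a single missing or duplicated gene shifts both estimates substantially, which is the source of unreliability described above.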

Getting started

CoCoPyE is available via pip and conda (conda-forge channel). See the project wiki for installation and usage instructions.

Online Demo

You can test CoCoPyE without installation on our project homepage. Please note that the online demo can process only one query genome per request and is slower than a local installation. We therefore recommend using the online version only for evaluation purposes and installing CoCoPyE on your own machine for production use.

Additional notes

Contact

For bug reports, suggestions or questions, please open an issue on GitHub or send an email to pmeinic@gwdg.de.

API documentation

You can find the API documentation of the CoCoPyE package at https://gobics.github.io/cocopye.

License

CoCoPyE is available under the terms of the GNU General Public License, version 3 or later. See COPYING for the full license text.

Download files

Source Distribution

cocopye-0.5.0.tar.gz (16.1 MB)

Uploaded Source

Built Distribution

cocopye-0.5.0-py3-none-any.whl (16.1 MB)

Uploaded Python 3

File details

Details for the file cocopye-0.5.0.tar.gz.

File metadata

  • Download URL: cocopye-0.5.0.tar.gz
  • Upload date:
  • Size: 16.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.0 CPython/3.12.4

File hashes

Hashes for cocopye-0.5.0.tar.gz
Algorithm Hash digest
SHA256 aef29373a987bb5492f1ffba83cf22e25c4008a0ded57864f00ced45b23af67a
MD5 f8864c06b7b0cba9d09db807682bac16
BLAKE2b-256 a8880020528d3957271b9a5c1f4434558553720afa1eb1586b4727dd42955f57
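
A downloaded archive can be checked against the SHA256 digest listed above. The following is a minimal sketch using only the Python standard library; the helper name `sha256_of_file` and the assumption that the archive sits in the current working directory are illustrative, not part of CoCoPyE.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for cocopye-0.5.0.tar.gz.
EXPECTED_SHA256 = "aef29373a987bb5492f1ffba83cf22e25c4008a0ded57864f00ced45b23af67a"


def sha256_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read fixed-size chunks so large archives are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


if __name__ == "__main__":
    # Assumed location of the downloaded archive; adjust the path as needed.
    archive = Path("cocopye-0.5.0.tar.gz")
    if archive.exists():
        ok = sha256_of_file(archive) == EXPECTED_SHA256
        print("checksum OK" if ok else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

The same helper works for the wheel if `EXPECTED_SHA256` is replaced with the digest listed for that file.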

File details

Details for the file cocopye-0.5.0-py3-none-any.whl.

File metadata

  • Download URL: cocopye-0.5.0-py3-none-any.whl
  • Upload date:
  • Size: 16.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.0 CPython/3.12.4

File hashes

Hashes for cocopye-0.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6219ae7d49d4a8fd49075931b75f5340f4c384955830c2f0ab1a0cc15adf26fb
MD5 bfa9555fea040c3f8b748b927472c5bf
BLAKE2b-256 a404a8857b3aec96005a2c5db5b5fa84939af3dc14d605db168a3e2700ed21c8
