
Assess the quality of putative genome bins.

Project description

CheckM

Unmaintained

CheckM v2 has been released

CheckM v2 has been released and can be found here. CheckM v1 (this repository) estimates genome completeness and contamination from the presence or absence of marker genes, i.e., genes that are typically ubiquitous and single copy. In contrast, CheckM v2 uses machine learning models to estimate completeness and contamination. Our assessment indicates that CheckM v2 is generally more accurate. However, because the two versions use independent methodologies for estimating genome quality, you may want to run both in some cases. It can also be insightful to explore which marker genes are missing from your genomes, especially if you have multiple related genomes and are interested in the systematic loss of an otherwise ubiquitous gene.
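To make the marker-gene idea concrete, here is a minimal sketch. It is not CheckM's actual implementation: it assumes a hypothetical mapping from marker name to the copy count observed in a bin, weights every marker equally, and uses made-up marker names. Completeness is taken as the share of expected markers found at least once, and contamination as the number of extra copies relative to the number of expected markers.

```python
from typing import Dict, Iterable, List, Tuple


def estimate_quality(
    expected_markers: Iterable[str],
    observed_counts: Dict[str, int],
) -> Tuple[float, float, List[str]]:
    """Return (completeness %, contamination %, missing markers).

    Simplified illustration only: every marker is weighted equally,
    which is an assumption of this sketch and not taken from CheckM.
    """
    markers = list(expected_markers)
    if not markers:
        raise ValueError("expected_markers must not be empty")

    present = 0        # markers seen at least once
    extra_copies = 0   # copies beyond the first, summed over markers
    missing = []       # markers with no copy in the bin
    for marker in markers:
        count = observed_counts.get(marker, 0)
        if count == 0:
            missing.append(marker)
        else:
            present += 1
            extra_copies += count - 1

    completeness = 100.0 * present / len(markers)
    contamination = 100.0 * extra_copies / len(markers)
    return completeness, contamination, missing


if __name__ == "__main__":
    # Hypothetical marker names and counts, for illustration only.
    expected = ["markerA", "markerB", "markerC", "markerD"]
    observed = {"markerA": 1, "markerB": 2, "markerD": 1}
    comp, cont, missing = estimate_quality(expected, observed)
    print(f"completeness={comp:.1f}% contamination={cont:.1f}% missing={missing}")
```

Returning the list of missing markers alongside the two percentages makes it easy to compare related genomes and spot a marker that is absent from all of them.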

Outside of minor updates to fix compatibility issues with external dependencies, no maintenance is intended for CheckM v1.

Installing and using CheckM

Please see the project home page for usage details and installation instructions: https://github.com/Ecogenomics/CheckM/wiki

We do not recommend installing CheckM from the master branch, as it may be unstable. Please install an official release of CheckM or use pip.

Estimating quality of CPR genomes

Information about obtaining improved quality estimates for CPR (Patescibacteria) genomes can be found here: https://github.com/Ecogenomics/CheckM/wiki/Workflows#using-cpr-marker-set

Migration to Python 3

CheckM has been ported to Python 3 to accommodate Python 2 reaching end of life on January 1, 2020. CheckM >=1.1.0 requires Python 3. Python 2 will no longer be actively supported. Apologies for any issues this may cause.

Massive thanks to baudrly, Vini Salazar, and Asaf Peer for initial Python 2 to 3 porting.

Python 2 to 3 Validation

The port of CheckM to Python 3 was validated on a set of 1,000 genomes randomly selected from the GTDB R89 representative genomes. Results were compared to those generated with CheckM v1.0.18, the last Python 2 version of CheckM. Identical results were obtained for the 'lineage_wf', 'taxonomy_wf', and 'ssu_finder' methods across this set of test genomes. Other CheckM methods have been executed on a small set of 3 genomes to verify that they run to completion under Python 3.
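A comparison of this kind can be sketched as follows. This is an illustration only, not the script used for the validation: it assumes the results of each version have already been loaded into a hypothetical dictionary keyed by genome identifier, with one dictionary of metric values per genome.

```python
from typing import Dict, List, Tuple

# Hypothetical in-memory layout: genome id -> {metric name -> value}.
Results = Dict[str, Dict[str, float]]


def diff_results(old: Results, new: Results) -> List[Tuple[str, str]]:
    """Return (genome_id, reason) pairs for every genome whose results differ."""
    differences = []
    for genome_id in sorted(set(old) | set(new)):
        if genome_id not in old:
            differences.append((genome_id, "missing from old results"))
        elif genome_id not in new:
            differences.append((genome_id, "missing from new results"))
        elif old[genome_id] != new[genome_id]:
            differences.append((genome_id, "values differ"))
    return differences


if __name__ == "__main__":
    # Made-up values, for illustration only.
    py2 = {"genome_1": {"completeness": 98.5, "contamination": 1.2}}
    py3 = {"genome_1": {"completeness": 98.5, "contamination": 1.2}}
    print(diff_results(py2, py3))  # an empty list means the runs agree
```

An empty list corresponds to the "identical results" outcome described above; any entry points at a genome worth inspecting by hand.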

Removed Functionality

The following features have been removed from CheckM v1.1.x in order to simplify the code base and focus CheckM and support requests on critical functionality:

  • bin_qa_plot: non-critical, rarely used plot which does not scale to the large numbers of MAGs now being recovered
  • par_plot: non-critical plot; the same information is better presented in the reference distribution plots
  • cov_pca, tetra_pca: alternatives to these static plots exist in tools such as Anvi'o
  • len_plot: rarely used plot which is largely redundant with the len_hist and nx_plot plots
  • bin_union, bin_compare: feature-rich alternatives now exist, such as DAS Tool and UniteM

Bug Reports

Please report bugs through the GitHub issues system.

Copyright © 2014 Donovan Parks, Connor Skennerton, Michael Imelfort. See LICENSE for further details.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

checkm_genome-1.2.4.tar.gz (923.1 kB)

Uploaded Source

Built Distribution

checkm_genome-1.2.4-py3-none-any.whl (164.2 kB)

Uploaded Python 3

File details

Details for the file checkm_genome-1.2.4.tar.gz.

File metadata

  • Download URL: checkm_genome-1.2.4.tar.gz
  • Size: 923.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.6

File hashes

Hashes for checkm_genome-1.2.4.tar.gz
Algorithm Hash digest
SHA256 7da9c42b05d144c0237aa949894277cefb04fad8a634968d7266b1b2d3cfb271
MD5 145e09d9d82f3aa152a811a05b14bc87
BLAKE2b-256 a0c4c8eaba0f2a7154cc6039d53b7c11331b39cec486ff2bec354a52c6657ce5
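A downloaded file can be checked against the SHA256 digest listed above. The following is a minimal sketch using only the Python standard library; the helper names are hypothetical, and the archive is assumed to sit in the current working directory.

```python
import hashlib
from pathlib import Path

# SHA256 digest published above for checkm_genome-1.2.4.tar.gz.
PUBLISHED_SHA256 = "7da9c42b05d144c0237aa949894277cefb04fad8a634968d7266b1b2d3cfb271"


def sha256_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Compute the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: Path, expected_hex: str) -> bool:
    """Return True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of_file(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Assumed location of the download; adjust to wherever the file was saved.
    archive = Path("checkm_genome-1.2.4.tar.gz")
    if archive.exists():
        print(matches_published_hash(archive, PUBLISHED_SHA256))
    else:
        print(f"{archive} not found; download it first")
```

Reading in chunks keeps memory use constant regardless of file size. The same function works for the wheel if the expected digest is swapped for the one listed in its own hash table.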

File details

Details for the file checkm_genome-1.2.4-py3-none-any.whl.

File metadata

  • Download URL: checkm_genome-1.2.4-py3-none-any.whl
  • Size: 164.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.6

File hashes

Hashes for checkm_genome-1.2.4-py3-none-any.whl
Algorithm Hash digest
SHA256 80c8d174ea50491132ac711ab50eec6353a1411d0e2e319e4192deba27332038
MD5 1480444043f79e83ea7abbe56d7a0ba1
BLAKE2b-256 dd7087284a911130516afc4381e29a479299c3c262d76632ab1e3aafd4d4d677
