Skip to main content

Assess the quality of putative genome bins.

Project description

CheckM

Unmaintained
version status Bioconda Downloads BioConda Install

CheckM v2 has been released

CheckM v2 has been released and can be found here. CheckM v1 (this repository) estimates genome completeness and contamination based on the presence or absence of marker genes, i.e., genes that are typically ubiquitous and single copy. In contrast, CheckM v2 uses machine learning models to estimate completeness and contamination. Our assessment indicates CheckM v2 will be more accurate in general. However, you may consider running both CheckM v1 and v2 in some cases since these are independent methodologies for estimating genome quality. It can also be insightful to explore the marker genes that are missing in your genomes, especially if you have multiple related genomes and are interested in systematic loss of an otherwise ubiquitious gene.

Outside of minor updates to fix compatibility issues with external dependencies no maintanence is intended for CheckM v1.

Installing and using CheckM

Please see the project home page for usage details and installation instructions: https://github.com/Ecogenomics/CheckM/wiki

We do not recommend installing CheckM from the master branch. This may be unstable. Please install an official release of CheckM or use pip.

Estimating quality of CPR genomes

Information about obtaining improved quality estimates for CPR (Patescibacteria) genomes can be found here: https://github.com/Ecogenomics/CheckM/wiki/Workflows#using-cpr-marker-set

Migration to Python 3

CheckM has been ported to Python 3 to accomodate Python 2 reaching end of life on January 1, 2020. CheckM >=1.1.0 requires Python 3. Python 2 will no longer be actively supported. Apologies for any issues this may cause.

Massive thanks to baudrly, Vini Salazar, and Asaf Peer for initial Python 2 to 3 porting.

Python 2 to 3 Validation

Porting of CheckM to Python 3 was validation on a set of 1,000 genomes randomly select from the GTDB R89 representative genomes. Results were compared to those generated with CheckM v1.0.18, the last Python 2 version of CheckM. Identical results were obtained for the 'lineage_wf', 'taxonomy_wf', and 'ssu_finder' methods across this set of test genomes. Other CheckM methods have been executed on a small set of 3 genomes to verify they run to completion under Python 3.

Removed Functionality

The following features have been removed from CheckM v1.1.x in order to simplify the code base and focus CheckM and support requests on critical functionality:

  • bin_qa_plot: non-critical, rarely used plot which does not scale to the large numbers of MAGs now being recovered
  • par_plot: non-critical plot and the same information is better presented in the reference distribution plots
  • cov_pca, tetra_pca: alternatives to these static plots exist in tools such as Anvi'o
  • len_plot: rarely used plot which is largely redundant with the len_hist and nx_plot plots
  • bin_union, bin_compare: feature rich alternative now exist such as DAS Tool and UniteM

Bug Reports

Please report bugs through the GitHub issues system.

Copyright © 2014 Donovan Parks, Connor Skennerton, Michael Imelfort. See LICENSE for further details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

checkm_genome-1.2.5.tar.gz (922.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

checkm_genome-1.2.5-py3-none-any.whl (164.4 kB view details)

Uploaded Python 3

File details

Details for the file checkm_genome-1.2.5.tar.gz.

File metadata

  • Download URL: checkm_genome-1.2.5.tar.gz
  • Upload date:
  • Size: 922.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for checkm_genome-1.2.5.tar.gz
Algorithm Hash digest
SHA256 b91333fc6e1d86244895def447cdcffa918177e6cc839501ba2586052edaa5cf
MD5 1a7148fcb45736d0e8d6d2b8cf30967d
BLAKE2b-256 03ff07ee79823f040ae51176f605ab70218ff778b0709bb9a484946690ffd969

See more details on using hashes here.

File details

Details for the file checkm_genome-1.2.5-py3-none-any.whl.

File metadata

  • Download URL: checkm_genome-1.2.5-py3-none-any.whl
  • Upload date:
  • Size: 164.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for checkm_genome-1.2.5-py3-none-any.whl
Algorithm Hash digest
SHA256 12e160ed41086eef8c3456efcd87cb54665c6952d12f529e335497725678da76
MD5 e53c11f069078a7fbaff4b84699ffa3b
BLAKE2b-256 8dde0088ececb41baeaad59e1c20380ad5981f02d68e6d2e17220841f442bff9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page