Novelty-inclusive microbial community profiling of shotgun metagenomes
Project description
Welcome.
At heart, SingleM is a tool for profiling shotgun (both short and long-read) metagenomes. It shows good accuracy in estimating the relative abundances of community members, and has a particular strength in dealing with novel lineages.
It was originally designed to determine the relative abundance of bacterial and archaeal taxa in a sample. Microbial SingleM has been applied to ~700,000 public metagenomes. The resulting data are available at the Sandpiper companion website.
Recent versions have added features:
- Long-read input support (v0.20.0). Either Nanopore >= R10.4.1 or PacBio HiFi reads are recommended to ensure reliable taxonomic profiling.
- Profiling of dsDNA phages (v0.19.0, updated DB in v0.20.0). See Lyrebird.
The method it uses also it suitable for some related tasks, such as assessing eukaryotic contamination, finding bias in genome recovery, and lineage-targeted MAG recovery. It can also be used as the basis for choosing metagenomes which, when coassembled, maximise the recovery of novel MAGs (see Bin Chicken).
Documentation can be found at https://wwood.github.io/singlem/.
Citations
Profiling microbial communities with SingleM / Sandpiper
Ben J. Woodcroft, Samuel T. N. Aroney, Rossen Zhao, Mitchell Cunningham, Joshua A. M. Mitchell, Rizky Nurdiansyah, Linda Blackall & Gene W. Tyson. Comprehensive taxonomic identification of microbial species in metagenomic data using SingleM and Sandpiper. Nat Biotechnol (2025). https://doi.org/10.1038/s41587-025-02738-1.
SingleM prokaryotic_fraction
Raphael Eisenhofer, Antton Alberdi, Ben J. Woodcroft, 2024. Large-scale estimation of bacterial and archaeal DNA prevalence in metagenomes reveals biome-specific patterns. bioRxiv, pp.2024-05; https://doi.org/10.1101/2024.05.16.594470.
SingleM-powered coassembly with Bin Chicken
Samuel T. N. Aroney, Rhys J. Newell, Gene W. Tyson and Ben J. Woodcroft, 2024. Bin Chicken: targeted metagenomic coassembly for the efficient recovery of novel genomes. bioRxiv, pp.2024-11. https://doi.org/10.1101/2024.11.24.625082.
Lyrebird
Rossen Zhao, Gene W. Tyson, Ben J. Woodcroft. Lyrebird: a tool for profiling dsDNA phage communities in metagenomic data. (in preparation).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file singlem-0.20.3.tar.gz.
File metadata
- Download URL: singlem-0.20.3.tar.gz
- Upload date:
- Size: 1.5 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4492c2065f3466db6d4f1924bff7a6af108e717eb63e8777bb0ad490c8c8f150
|
|
| MD5 |
dbaa708fb2398e8767fd5b730f32818c
|
|
| BLAKE2b-256 |
6b58d939d5b83b6acd12892b08703c4db09afa507a1341050ca8d55f8a78a908
|
File details
Details for the file singlem-0.20.3-py3-none-any.whl.
File metadata
- Download URL: singlem-0.20.3-py3-none-any.whl
- Upload date:
- Size: 217.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
50a7aa9abc60e7eacc81d7af204f43b40c77b5752483212d752bba57b5472cbb
|
|
| MD5 |
6d1b75e5dcdbba0d0665c714a881e2dd
|
|
| BLAKE2b-256 |
91066fe37fdd8614b8e7a5b8b45bbc66f5731aa275cce08e15dfbdbc775906f8
|