A pipeline for binning metagenomic datasets from 3C data.
Project description
# metaTOR
[![PyPI version](https://badge.fury.io/py/metator.svg)](https://badge.fury.io/py/metator)
[![Read the docs](https://readthedocs.org/projects/metator/badge)](https://metator.readthedocs.io)
[![License: Artistic-2.0](https://img.shields.io/badge/License-GPL%203-0298c3.svg)](https://opensource.org/licenses/GPL-3.0)
[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/ambv/black)
Metagenomic Tridimensional Organisation-based Reassembly - A set of scripts that streamline the processing and binning of metagenomic 3C datasets.
## Installation
```sh
pip3 install metator
```
or, to use the latest version:
```sh
pip3 install -e git+https://github.com/koszullab/metator.git@master#egg=metator
```
Python 3.4 or later is required. A [standalone
version](https://github.com/koszullab/metaTOR/tree/python3-standalone) (no
installation, just download/unzip/run) is also available, as well as a [Python
2 version](https://github.com/koszullab/metaTOR/tree/python2), but keep
in mind that development will focus on the current branch.
## Usage
metator {align|partition|annotation|binning} [parameters]
A metaTOR command takes the form ```metator action --param1 arg1 --param2
arg2 #etc.```
There are four actions/steps in the metaTOR pipeline, which must be run in the following order:
* ```align``` : map paired-end reads on a preliminary assembly, then generate a network from
detected contacts between DNA chunks.
* ```partition``` : perform the Louvain community detection algorithm many times to isolate
chunks that consistently cluster together for binning purposes.
* ```annotation``` : run standard annotation software on the assembly (namely gene prediction
and database comparison) to match with the bins.
* ```binning``` : match annotations to bins, extract bin genomes and subnetworks, build bin-local
and global contact maps.
After the last step is completed there should be a set of bins, their relative
enrichments in various gene categories, and the contact map of each bin.
There are a number of other, optional, miscellaneous actions:
* ```pipeline``` : check the environment is right, then run all four of the above actioins sequentially.
This can take a while.
* ```dependencies``` : download third party dependencies that are not
available in most package managers.
* ```deploy``` : set up the environment and all dependencies for Ubuntu 14.04
and higher (run as root).
* ```version``` : display current version number.
* ```help``` : display this help message.
Please refer to the [metaTOR manual](https://github.com/koszullab/metaTOR/meta3c_manual.pdf) for detailed explanations on the parameters.
## Requirements
* Python 3 with ```numpy```, ```scipy```, ```matplotlib```, ```biopython``` and ```pysam``` libraries.
* [bowtie2](http://bowtie-bio.sourceforge.net/bowtie2/index.shtml)
* [samtools](http://www.htslib.org/)
* [hmmer](http://hmmer.org/) and some HMM databases (such as [these](http://dl.pasteur.fr/fop/LItxiFe9/hmm_databases.tgz))
* [prodigal](https://github.com/hyattpd/Prodigal)
* [louvain](https://sourceforge.net/projects/louvain/) (original
implementation)
Requirements can usually be installed with an OS's package manager. The requirements
that can not (namely ```prodigal```, ```louvain``` and HMM databases) can be
fetched with the following (The package may need to be run as a root
depending on where the package is installed):
```sh
metator dependencies
```
A dockerfile is also available if that is of interest.
## References
* [Metagenomic chromosome conformation capture (meta3C) unveils the diversity of chromosome organization in microorganisms](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4381813/), Martial Marbouty, Axel Cournac, Jean-François Flot, Hervé Marie-Nelly, Julien Mozziconacci, and Romain Koszul, eLife, 2014
* [Scaffolding bacterial genomes and probing host-virus interactions in gut microbiome by proximity ligation (chromosome capture) assay](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5315449/), Martial Marbouty, Lyam Baudry, Axel Cournac, and Romain Koszul, Science Advances, 2017
## Contact
### Authors
* lyam.baudry@pasteur.fr
* thfoutel@pasteur.fr
* martial.marbouty@pasteur.fr
* romain.koszul@pasteur.fr
### Research lab
[Spatial Regulation of Genomes](https://research.pasteur.fr/en/team/spatial-regulation-of-genomes/) (Institut Pasteur, Paris)
[![PyPI version](https://badge.fury.io/py/metator.svg)](https://badge.fury.io/py/metator)
[![Read the docs](https://readthedocs.org/projects/metator/badge)](https://metator.readthedocs.io)
[![License: Artistic-2.0](https://img.shields.io/badge/License-GPL%203-0298c3.svg)](https://opensource.org/licenses/GPL-3.0)
[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/ambv/black)
Metagenomic Tridimensional Organisation-based Reassembly - A set of scripts that streamline the processing and binning of metagenomic 3C datasets.
## Installation
```sh
pip3 install metator
```
or, to use the latest version:
```sh
pip3 install -e git+https://github.com/koszullab/metator.git@master#egg=metator
```
Python 3.4 or later is required. A [standalone
version](https://github.com/koszullab/metaTOR/tree/python3-standalone) (no
installation, just download/unzip/run) is also available, as well as a [Python
2 version](https://github.com/koszullab/metaTOR/tree/python2), but keep
in mind that development will focus on the current branch.
## Usage
metator {align|partition|annotation|binning} [parameters]
A metaTOR command takes the form ```metator action --param1 arg1 --param2
arg2 #etc.```
There are four actions/steps in the metaTOR pipeline, which must be run in the following order:
* ```align``` : map paired-end reads on a preliminary assembly, then generate a network from
detected contacts between DNA chunks.
* ```partition``` : perform the Louvain community detection algorithm many times to isolate
chunks that consistently cluster together for binning purposes.
* ```annotation``` : run standard annotation software on the assembly (namely gene prediction
and database comparison) to match with the bins.
* ```binning``` : match annotations to bins, extract bin genomes and subnetworks, build bin-local
and global contact maps.
After the last step is completed there should be a set of bins, their relative
enrichments in various gene categories, and the contact map of each bin.
There are a number of other, optional, miscellaneous actions:
* ```pipeline``` : check the environment is right, then run all four of the above actioins sequentially.
This can take a while.
* ```dependencies``` : download third party dependencies that are not
available in most package managers.
* ```deploy``` : set up the environment and all dependencies for Ubuntu 14.04
and higher (run as root).
* ```version``` : display current version number.
* ```help``` : display this help message.
Please refer to the [metaTOR manual](https://github.com/koszullab/metaTOR/meta3c_manual.pdf) for detailed explanations on the parameters.
## Requirements
* Python 3 with ```numpy```, ```scipy```, ```matplotlib```, ```biopython``` and ```pysam``` libraries.
* [bowtie2](http://bowtie-bio.sourceforge.net/bowtie2/index.shtml)
* [samtools](http://www.htslib.org/)
* [hmmer](http://hmmer.org/) and some HMM databases (such as [these](http://dl.pasteur.fr/fop/LItxiFe9/hmm_databases.tgz))
* [prodigal](https://github.com/hyattpd/Prodigal)
* [louvain](https://sourceforge.net/projects/louvain/) (original
implementation)
Requirements can usually be installed with an OS's package manager. The requirements
that can not (namely ```prodigal```, ```louvain``` and HMM databases) can be
fetched with the following (The package may need to be run as a root
depending on where the package is installed):
```sh
metator dependencies
```
A dockerfile is also available if that is of interest.
## References
* [Metagenomic chromosome conformation capture (meta3C) unveils the diversity of chromosome organization in microorganisms](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4381813/), Martial Marbouty, Axel Cournac, Jean-François Flot, Hervé Marie-Nelly, Julien Mozziconacci, and Romain Koszul, eLife, 2014
* [Scaffolding bacterial genomes and probing host-virus interactions in gut microbiome by proximity ligation (chromosome capture) assay](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5315449/), Martial Marbouty, Lyam Baudry, Axel Cournac, and Romain Koszul, Science Advances, 2017
## Contact
### Authors
* lyam.baudry@pasteur.fr
* thfoutel@pasteur.fr
* martial.marbouty@pasteur.fr
* romain.koszul@pasteur.fr
### Research lab
[Spatial Regulation of Genomes](https://research.pasteur.fr/en/team/spatial-regulation-of-genomes/) (Institut Pasteur, Paris)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
metator-0.1.3.tar.gz
(3.0 MB
view hashes)