A pipeline for binning metagenomic datasets from 3C data.
Project description
# metaTOR
[](https://badge.fury.io/py/metator)
[](https://metator.readthedocs.io)
[](https://opensource.org/licenses/GPL-3.0)
[](https://github.com/ambv/black)
Metagenomic Tridimensional Organisation-based Reassembly - A set of scripts that streamline the processing and binning of metagenomic 3C datasets.
## Installation
```sh
pip3 install metator
```
or, to use the latest version:
```sh
pip3 install -e git+https://github.com/koszullab/metator.git@master#egg=metator
```
Python 3.4 or later is required. A [standalone
version](https://github.com/koszullab/metaTOR/tree/python3-standalone) (no
installation, just download/unzip/run) is also available, as well as a [Python
2 version](https://github.com/koszullab/metaTOR/tree/python2), but keep
in mind that development will focus on the current branch.
## Usage
metator {align|partition|annotation|binning} [parameters]
A metaTOR command takes the form ```metator action --param1 arg1 --param2
arg2 #etc.```
There are four actions/steps in the metaTOR pipeline, which must be run in the following order:
* ```align``` : map paired-end reads on a preliminary assembly, then generate a network from
detected contacts between DNA chunks.
* ```partition``` : perform the Louvain community detection algorithm many times to isolate
chunks that consistently cluster together for binning purposes.
* ```annotation``` : run standard annotation software on the assembly (namely gene prediction
and database comparison) to match with the bins.
* ```binning``` : match annotations to bins, extract bin genomes and subnetworks, build bin-local
and global contact maps.
After the last step is completed there should be a set of bins, their relative
enrichments in various gene categories, and the contact map of each bin.
There are a number of other, optional, miscellaneous actions:
* ```pipeline``` : check the environment is right, then run all four of the above actioins sequentially.
This can take a while.
* ```dependencies``` : download third party dependencies that are not
available in most package managers.
* ```deploy``` : set up the environment and all dependencies for Ubuntu 14.04
and higher (run as root).
* ```version``` : display current version number.
* ```help``` : display this help message.
Please refer to the [metaTOR manual](https://github.com/koszullab/metaTOR/meta3c_manual.pdf) for detailed explanations on the parameters.
## Requirements
* Python 3 with ```numpy```, ```scipy```, ```matplotlib```, ```biopython``` and ```pysam``` libraries.
* [bowtie2](http://bowtie-bio.sourceforge.net/bowtie2/index.shtml)
* [samtools](http://www.htslib.org/)
* [hmmer](http://hmmer.org/) and some HMM databases (such as [these](http://dl.pasteur.fr/fop/LItxiFe9/hmm_databases.tgz))
* [prodigal](https://github.com/hyattpd/Prodigal)
* [louvain](https://sourceforge.net/projects/louvain/) (original
implementation)
Requirements can usually be installed with an OS's package manager. The requirements
that can not (namely ```prodigal```, ```louvain``` and HMM databases) can be
fetched with the following (The package may need to be run as a root
depending on where the package is installed):
```sh
metator dependencies
```
A dockerfile is also available if that is of interest.
## References
* [Metagenomic chromosome conformation capture (meta3C) unveils the diversity of chromosome organization in microorganisms](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4381813/), Martial Marbouty, Axel Cournac, Jean-François Flot, Hervé Marie-Nelly, Julien Mozziconacci, and Romain Koszul, eLife, 2014
* [Scaffolding bacterial genomes and probing host-virus interactions in gut microbiome by proximity ligation (chromosome capture) assay](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5315449/), Martial Marbouty, Lyam Baudry, Axel Cournac, and Romain Koszul, Science Advances, 2017
## Contact
### Authors
* lyam.baudry@pasteur.fr
* thfoutel@pasteur.fr
* martial.marbouty@pasteur.fr
* romain.koszul@pasteur.fr
### Research lab
[Spatial Regulation of Genomes](https://research.pasteur.fr/en/team/spatial-regulation-of-genomes/) (Institut Pasteur, Paris)
[](https://badge.fury.io/py/metator)
[](https://metator.readthedocs.io)
[](https://opensource.org/licenses/GPL-3.0)
[](https://github.com/ambv/black)
Metagenomic Tridimensional Organisation-based Reassembly - A set of scripts that streamline the processing and binning of metagenomic 3C datasets.
## Installation
```sh
pip3 install metator
```
or, to use the latest version:
```sh
pip3 install -e git+https://github.com/koszullab/metator.git@master#egg=metator
```
Python 3.4 or later is required. A [standalone
version](https://github.com/koszullab/metaTOR/tree/python3-standalone) (no
installation, just download/unzip/run) is also available, as well as a [Python
2 version](https://github.com/koszullab/metaTOR/tree/python2), but keep
in mind that development will focus on the current branch.
## Usage
metator {align|partition|annotation|binning} [parameters]
A metaTOR command takes the form ```metator action --param1 arg1 --param2
arg2 #etc.```
There are four actions/steps in the metaTOR pipeline, which must be run in the following order:
* ```align``` : map paired-end reads on a preliminary assembly, then generate a network from
detected contacts between DNA chunks.
* ```partition``` : perform the Louvain community detection algorithm many times to isolate
chunks that consistently cluster together for binning purposes.
* ```annotation``` : run standard annotation software on the assembly (namely gene prediction
and database comparison) to match with the bins.
* ```binning``` : match annotations to bins, extract bin genomes and subnetworks, build bin-local
and global contact maps.
After the last step is completed there should be a set of bins, their relative
enrichments in various gene categories, and the contact map of each bin.
There are a number of other, optional, miscellaneous actions:
* ```pipeline``` : check the environment is right, then run all four of the above actioins sequentially.
This can take a while.
* ```dependencies``` : download third party dependencies that are not
available in most package managers.
* ```deploy``` : set up the environment and all dependencies for Ubuntu 14.04
and higher (run as root).
* ```version``` : display current version number.
* ```help``` : display this help message.
Please refer to the [metaTOR manual](https://github.com/koszullab/metaTOR/meta3c_manual.pdf) for detailed explanations on the parameters.
## Requirements
* Python 3 with ```numpy```, ```scipy```, ```matplotlib```, ```biopython``` and ```pysam``` libraries.
* [bowtie2](http://bowtie-bio.sourceforge.net/bowtie2/index.shtml)
* [samtools](http://www.htslib.org/)
* [hmmer](http://hmmer.org/) and some HMM databases (such as [these](http://dl.pasteur.fr/fop/LItxiFe9/hmm_databases.tgz))
* [prodigal](https://github.com/hyattpd/Prodigal)
* [louvain](https://sourceforge.net/projects/louvain/) (original
implementation)
Requirements can usually be installed with an OS's package manager. The requirements
that can not (namely ```prodigal```, ```louvain``` and HMM databases) can be
fetched with the following (The package may need to be run as a root
depending on where the package is installed):
```sh
metator dependencies
```
A dockerfile is also available if that is of interest.
## References
* [Metagenomic chromosome conformation capture (meta3C) unveils the diversity of chromosome organization in microorganisms](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4381813/), Martial Marbouty, Axel Cournac, Jean-François Flot, Hervé Marie-Nelly, Julien Mozziconacci, and Romain Koszul, eLife, 2014
* [Scaffolding bacterial genomes and probing host-virus interactions in gut microbiome by proximity ligation (chromosome capture) assay](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5315449/), Martial Marbouty, Lyam Baudry, Axel Cournac, and Romain Koszul, Science Advances, 2017
## Contact
### Authors
* lyam.baudry@pasteur.fr
* thfoutel@pasteur.fr
* martial.marbouty@pasteur.fr
* romain.koszul@pasteur.fr
### Research lab
[Spatial Regulation of Genomes](https://research.pasteur.fr/en/team/spatial-regulation-of-genomes/) (Institut Pasteur, Paris)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
metator-0.1.3.tar.gz
(3.0 MB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file metator-0.1.3.tar.gz.
File metadata
- Download URL: metator-0.1.3.tar.gz
- Upload date:
- Size: 3.0 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/40.5.0 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e807b7d2797660db7863d3fcbed0fc9adc558f4c5f5387a0b1dd8e32ad3e0ca4
|
|
| MD5 |
55f36802e386459f94d14df081e8105b
|
|
| BLAKE2b-256 |
dddddf8fe6145a79688c102242e41ad61e0cd5a123b011e4bc62cf353c6a4be1
|
File details
Details for the file metator-0.1.3-py2-none-any.whl.
File metadata
- Download URL: metator-0.1.3-py2-none-any.whl
- Upload date:
- Size: 3.0 MB
- Tags: Python 2
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/40.5.0 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f016d632aa9b8fa65dec5b43b895aec0b48952603119d7e911150836ca9e61f0
|
|
| MD5 |
28f2b04bea19ba4f4d14820b23d4f08f
|
|
| BLAKE2b-256 |
95dfa01e3ae29761dfea9f13c16e17ec14eb39cb91c3831f46523c43ef8256f6
|