Skip to main content

A pipeline for binning metagenomic datasets from 3C data.

Project description

# metaTOR

[![PyPI version](https://badge.fury.io/py/metator.svg)](https://badge.fury.io/py/metator)
[![Read the docs](https://readthedocs.org/projects/metator/badge)](https://metator.readthedocs.io)
[![License: Artistic-2.0](https://img.shields.io/badge/License-GPL%203-0298c3.svg)](https://opensource.org/licenses/GPL-3.0)
[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/ambv/black)

Metagenomic Tridimensional Organisation-based Reassembly - A set of scripts that streamline the processing and binning of metagenomic 3C datasets.

## Installation

```sh
pip3 install metator
```

or, to use the latest version:

```sh
pip3 install -e git+https://github.com/koszullab/metator.git@master#egg=metator
```

Python 3.4 or later is required. A [standalone
version](https://github.com/koszullab/metaTOR/tree/python3-standalone) (no
installation, just download/unzip/run) is also available, as well as a [Python
2 version](https://github.com/koszullab/metaTOR/tree/python2), but keep
in mind that development will focus on the current branch.

## Usage

metator {align|partition|annotation|binning} [parameters]

A metaTOR command takes the form ```metator action --param1 arg1 --param2
arg2 #etc.```

There are four actions/steps in the metaTOR pipeline, which must be run in the following order:

* ```align``` : map paired-end reads on a preliminary assembly, then generate a network from
detected contacts between DNA chunks.
* ```partition``` : perform the Louvain community detection algorithm many times to isolate
chunks that consistently cluster together for binning purposes.
* ```annotation``` : run standard annotation software on the assembly (namely gene prediction
and database comparison) to match with the bins.
* ```binning``` : match annotations to bins, extract bin genomes and subnetworks, build bin-local
and global contact maps.

After the last step is completed there should be a set of bins, their relative
enrichments in various gene categories, and the contact map of each bin.

There are a number of other, optional, miscellaneous actions:

* ```pipeline``` : check the environment is right, then run all four of the above actioins sequentially.
This can take a while.
* ```dependencies``` : download third party dependencies that are not
available in most package managers.
* ```deploy``` : set up the environment and all dependencies for Ubuntu 14.04
and higher (run as root).
* ```version``` : display current version number.
* ```help``` : display this help message.

Please refer to the [metaTOR manual](https://github.com/koszullab/metaTOR/meta3c_manual.pdf) for detailed explanations on the parameters.

## Requirements

* Python 3 with ```numpy```, ```scipy```, ```matplotlib```, ```biopython``` and ```pysam``` libraries.
* [bowtie2](http://bowtie-bio.sourceforge.net/bowtie2/index.shtml)
* [samtools](http://www.htslib.org/)
* [hmmer](http://hmmer.org/) and some HMM databases (such as [these](http://dl.pasteur.fr/fop/LItxiFe9/hmm_databases.tgz))
* [prodigal](https://github.com/hyattpd/Prodigal)
* [louvain](https://sourceforge.net/projects/louvain/) (original
implementation)

Requirements can usually be installed with an OS's package manager. The requirements
that can not (namely ```prodigal```, ```louvain``` and HMM databases) can be
fetched with the following (The package may need to be run as a root
depending on where the package is installed):

```sh
metator dependencies
```

A dockerfile is also available if that is of interest.

## References

* [Metagenomic chromosome conformation capture (meta3C) unveils the diversity of chromosome organization in microorganisms](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4381813/), Martial Marbouty, Axel Cournac, Jean-François Flot, Hervé Marie-Nelly, Julien Mozziconacci, and Romain Koszul, eLife, 2014
* [Scaffolding bacterial genomes and probing host-virus interactions in gut microbiome by proximity ligation (chromosome capture) assay](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5315449/), Martial Marbouty, Lyam Baudry, Axel Cournac, and Romain Koszul, Science Advances, 2017

## Contact

### Authors

* lyam.baudry@pasteur.fr
* thfoutel@pasteur.fr
* martial.marbouty@pasteur.fr
* romain.koszul@pasteur.fr

### Research lab

[Spatial Regulation of Genomes](https://research.pasteur.fr/en/team/spatial-regulation-of-genomes/) (Institut Pasteur, Paris)


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

metator-0.1.3.tar.gz (3.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

metator-0.1.3-py2-none-any.whl (3.0 MB view details)

Uploaded Python 2

File details

Details for the file metator-0.1.3.tar.gz.

File metadata

  • Download URL: metator-0.1.3.tar.gz
  • Upload date:
  • Size: 3.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/40.5.0 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for metator-0.1.3.tar.gz
Algorithm Hash digest
SHA256 e807b7d2797660db7863d3fcbed0fc9adc558f4c5f5387a0b1dd8e32ad3e0ca4
MD5 55f36802e386459f94d14df081e8105b
BLAKE2b-256 dddddf8fe6145a79688c102242e41ad61e0cd5a123b011e4bc62cf353c6a4be1

See more details on using hashes here.

File details

Details for the file metator-0.1.3-py2-none-any.whl.

File metadata

  • Download URL: metator-0.1.3-py2-none-any.whl
  • Upload date:
  • Size: 3.0 MB
  • Tags: Python 2
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/40.5.0 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for metator-0.1.3-py2-none-any.whl
Algorithm Hash digest
SHA256 f016d632aa9b8fa65dec5b43b895aec0b48952603119d7e911150836ca9e61f0
MD5 28f2b04bea19ba4f4d14820b23d4f08f
BLAKE2b-256 95dfa01e3ae29761dfea9f13c16e17ec14eb39cb91c3831f46523c43ef8256f6

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page