Skip to main content

Estimate abundances of genomic features from read densities

Project description

Welcome!

Rnacounter estimates abundances of genes and their different transcripts from read densities. Exons and introns can also be quantified. It requires a genome-level BAM file and a GTF/GFF file describing the exon structure, such as those provided by Ensembl or GenRep. The method used is described in [<ref>].

It is not meant to be used as a library, but through its command-line tool “rnacounter”.

The code is under GPL-2 license.

Usage:

See “rnacounter –help” and the tutorial at http://bbcf.epfl.ch/bbcflib/tutorial_rnacounter.html, also available in the doc/ folder.

Minimal example:

rnacounter test.bam test.gtf        # Python 2.7 version
rnacounter3 test.bam test.gtf       # Python 3 version

Installation:

First ensure that you have setuptools>=7.0 and numpy installed.

With easy_install:

sudo easy_install rnacounter

or better yet, with pip:

sudo pip install rnacounter

Use “easy_install3”, “pip3” respectively to install the python3 version and run it in command-line with “rnacounter3”. The code is fully compatible with Python 2.7 and Python 3.

It installs as a standard Python library but includes the executable and puts it somewhere in your $PATH. Dependencies should be added automatically.

To uninstall with pip:

sudo pip uninstall rnacounter

Building from source:

This allows to modify the Cython source code (rnacounter.pyx) before rebuilding.

Clone or download the repository from https://github.com/delafont/rnacounter .

You need cython installed (pip install cython).

From where rnacounter.pyx lies (rnacounter/rnacounter/), run:

sudo python setup.py build_ext

It will recompile to create rnacounter.c, and build it. Then add the executable (rnacounter/bin/rnacounter[3]) to your $PATH, or install from the package root (rnacounter/) with:

sudo python setup.py install        # Python 2.7 version
sudo python3 setup.py install       # Python 3 version

Dependencies:

Tests run with the library versions below, but may work with earlier versions.

  • setuptools 7.0+

  • pysam 0.7.5+

  • numpy 1.6.2+

  • scipy 0.9.0+

  • docopt 0.6.1+

Testing:

Testing files in the testfiles/ folder: - gapdhKO.bam: alignment on mm9 with only Gapdh covered. - mm9_3genes_renamed.gtf: extract of the Ensembl GTF with Gapdh, the gene before and the gene after it. - mm9_Gapdh_renamed.gtf: extract of the Ensembl GTF with Gapdh only.

Example:

rnacounter testfiles/gapdhKO.bam testfiles/mm9_3genes_renamed.gtf

The BAM contains 4041 reads all aligning perfectly on Gapdh (ENSMUSG00000057666) exons, mostly on ENSMUSE00000487077 but also ENSMUSE00000751942 and ENSMUSE00000886744. Nothing on other exons, which makes it a good example of badly conditioned input data…

The least squares method returns counts on the following transcripts: ENSMUST00000117757, ENSMUST00000118875, ENSMUST00000147954 and nothing on ENSMUST00000073605, ENSMUST00000144205, ENSMUST00000144588 .

Returns a count of 2459.62 (1091.71 RPK) for the gene.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rnacounter-1.1.3.1.tar.gz (289.7 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page