Skip to main content

A utility to annotate genomic intervals.

Project description

***************
region_analysis
***************

Region_analysis is a package derived and extended from region_analysis.pl in diffReps package. It is a utility to annotate the genomic intervals like the peak list of ChIP-seq or other interval lists from the genomic research. Now it supports human (hg19), mouse (mm9, mm10), and rat (rn4). New genomes will be added. Any question or suggestion is welcome!

Dependency:
###########

bedtools: https://code.google.com/p/bedtools/

pybedtools: https://github.com/daler/pybedtools

::

If easy_install or pip is available, then:

easy_install pybedtools

or:

pip isntall pybedtools

Usage:
######

region\_analysis.py [options]

Options:
########

-h, --help show this help message and exit

-i INPUT\_FILE\_NAME, --input=INPUT\_FILE\_NAME

::

Input region file must assume the first 3 columns contain (chr, start, end)

-d ANNO\_DB, --database=ANNO\_DB

::

Choose database: refseq(default) or ensembl

-r, --rhead Whether the input file contains column header

-g GENOME, --genome=GENOME

::

Choose genome: mm10(default), mm9, hg19, rn4

-v, --version

::

Version of Region_Analysis package
Output:
#######

*-.annotated: the one-to-one output list, only the annotation entry whose TSS is nearest to the inquiry interval kept.

*-.full.annotated: all hit entries are kept.

*-.full.annotated.json: the json format output of -.full.annotated.

Features:
#########

*ProximalPromoter: +/- 250bp of TSS

*Promoter1k: +/- 1kbp of TSS

*Promoter3k: +/- 3kbp of TSS

*Genebody: Anywhere between a gene's promoter and up to 1kbp downstream of the TES.

*Genedeserts: Genomic regions that are depleted with genes and are at least 1Mbp long.

*Pericentromere: Between the boundary of a centromere and the closest gene minus 10kbp of that gene's regulatory region.

*Subtelomere: Similary defined as pericentromere.

*OtherIntergenic: Any region that does not belong to the above categories.

Testing with examples:
######################

region\_analysis.py -i example/test\_without\_header.bed -g mm10 -d ensembl

region\_analysis.py -i example/test\_with\_header.bed -g mm10 -d ensembl -r

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

regionanalysis-0.1.1.tar.gz (15.9 MB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page