Skip to main content

A utility to annotate genomic intervals.

Project description

***************
region_analysis
***************

Region_analysis is a package derived and extended from region_analysis.pl in diffReps package. It is a utility to annotate the genomic intervals like the peak list of ChIP-seq or other interval lists from the genomic research. Now it supports human (hg19), mouse (mm9, mm10), and rat (rn4). New genomes will be added. Any question or suggestion is welcome!

Dependency:
###########

bedtools: https://code.google.com/p/bedtools/

pybedtools: https://github.com/daler/pybedtools

::

If easy_install or pip is available, then:

easy_install pybedtools

or:

pip isntall pybedtools

Usage:
######

region\_analysis.py [options]

Options:
########

-h, --help show this help message and exit

-i INPUT\_FILE\_NAME, --input=INPUT\_FILE\_NAME

::

Input region file must assume the first 3 columns contain (chr, start, end)

-d ANNO\_DB, --database=ANNO\_DB

::

Choose database: refseq(default) or ensembl

-r, --rhead Whether the input file contains column header

-g GENOME, --genome=GENOME

::

Choose genome: mm10(default), mm9, hg19, rn4

-v, --version

::

Version of Region_Analysis package
Output:
#######

*-.annotated: the one-to-one output list, only the annotation entry whose TSS is nearest to the inquiry interval kept.

*-.full.annotated: all hit entries are kept.

*-.full.annotated.json: the json format output of -.full.annotated.

Features:
#########

*ProximalPromoter: +/- 250bp of TSS

*Promoter1k: +/- 1kbp of TSS

*Promoter3k: +/- 3kbp of TSS

*Genebody: Anywhere between a gene's promoter and up to 1kbp downstream of the TES.

*Genedeserts: Genomic regions that are depleted with genes and are at least 1Mbp long.

*Pericentromere: Between the boundary of a centromere and the closest gene minus 10kbp of that gene's regulatory region.

*Subtelomere: Similary defined as pericentromere.

*OtherIntergenic: Any region that does not belong to the above categories.

Testing with examples:
######################

region\_analysis.py -i example/test\_without\_header.bed -g mm10 -d ensembl

region\_analysis.py -i example/test\_with\_header.bed -g mm10 -d ensembl -r

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

regionanalysis-0.1.1.tar.gz (15.9 MB view details)

Uploaded Source

File details

Details for the file regionanalysis-0.1.1.tar.gz.

File metadata

File hashes

Hashes for regionanalysis-0.1.1.tar.gz
Algorithm Hash digest
SHA256 3938f7a8fc575e7c1a13c3c57fdb067bc3f1c610a1cca56ed19712e6e460b455
MD5 296d027d073a885a9da6412523f190e3
BLAKE2b-256 904f816d41b75df94504390942cf3271a4eca631f0cc13d1abe2ebcf03638544

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page