Skip to main content

UROPA is a command line based tool, intended for genomic region annotation

Project description

Universal RObust Peak Annotator
=======================================

[![Anaconda-Server Badge](https://img.shields.io/badge/Install%20with-conda-green.svg?style=plastic&logoWidth=40)](https://conda.anaconda.org/bioconda) [![Docker Badge](https://img.shields.io/badge/Container-ready-green.svg?style=plastic&logoWidth=40)](http://biocontainers.pro/registry/#/) [![PyPI version](https://badge.fury.io/py/uropa.png)](https://badge.fury.io/py/uropa)

The **Universal RObust Peak Annotator** (UROPA) is a command line based tool, intended for genomic region
annotation. Based on a configuration file, different target features can be prioritized with multiple integrated queries.
These can be sensitive for feature type, distance, strand specificity, feature attributes (eg. protein_coding) or the anchor position relative to the feature.
UROPA can incorporate reference annotation files (GTF) from different sources, like Gencode, Ensembl, or RefSeq,
as well as custom reference files produced by the user.

Features
--------

- Detect the most appropriate annotation with flexible parameter keys that allow
robustness and simple customization, such as

- feature type
- feature anchor
- feature direction relative to peak location
- filter for attribute values, e.g. “protein\_coding”
- strand specificity

- Utilization of all available GTF files as annotation database
- One run with variable sets of parameters by multiple queries
- Graduated annotation due to priorization
- Different easily-readable output tables (allhits, finalfits, besthits).
- Visual summary for annotation evaluation
- Preparation of custom annotation files

Documentation
--------------
A detailed description of how to apply UROPA to your data can be found [here](http://uropa-manual.readthedocs.io/).

Installation and Command-line usage
------------------------------------

### Conda package manager

We recommend to install UROPA using the conda package manager. Make sure to have `conda` installed, e.g. via

- [Miniconda](https://conda.io/miniconda.html)
- download the Miniconda installer for **Python 3**
- run ```bash Miniconda3-latest-Linux-x86_64.sh``` to install Miniconda
- Answer the question "Do you wish the installer to prepend the Miniconda install location to PATH in your /home/.../.bashrc ?" with yes
OR do ```PATH=dir/to/miniconda3:$PATH``` after installation process

The UROPA installation is now as easy as

```conda create --name uropa```

```conda activate uropa```

```conda install python uropa```.

### Biocontainers / Docker

If you have a running [Docker](docker.com) environment, you can pull a biocontainer with UROPA and all dependencies via

- `docker pull quay.io/biocontainers/uropa:latest_tag` using the latest tag from the [taglist](https://quay.io/repository/biocontainers/uropa?tab=tags), e.g. `1.2.1--py27r3.3.2_0`
- `docker pull loosolab/uropa`

### Installation from source

You can also install UROPA from the source PyPI package. Note that this comes without the R dependencies for auxillary scripts:

`pip install uropa`

To fulfill all other dependencies, follow the instructions below:

- [R/Rscript](http://www.r-project.org/) (v3.3.0 or higher; follow instructions on url)
- install required packages step by step:
```bash
install.packages(c("ggplot2", "devtools", "gplots", "gridExtra", "jsonlite", "VennDiagram", "getopt", "tidyr", "UpSetR"))
source("https://bioconductor.org/biocLite.R")
biocLite(c("RBGL", "graph"))
```

In order to plot the Chow-Ruskey plot with uropa_summary.R, install the modified Vennerable package from our fork:

```
library(devtools)
install_github("jenzopr/Vennerable")
```

### Usage

To effectively use UROPA, make yourself familiar with the command-line options:

## Command-line usage

```bash
$ uropa
Usage: uropa [options]

Available options:

-h, --help print this help message and further details on the configuration file
-i, --input filename of configuration file [mandatory]
-p, --prefix prefix for output files, can include subdirectories [basename of --input]
-r, --reformat create an additional compact and line-reduced table as result file
-s, --summary additional visualisation of results in graphical format will be created
-t n, --threads n multiprocessed run: n = number of threads to run annotation process
-add-comments show comment lines in output files explaining the columns
-l, --log log file name for messages and warnings
-d, --debug print verbose messages (for debugging purposes)
-v, --version print the version and exit
```

## Biocontainer usage

Running UROPA from a docker container can be done using the following command:

```bash
sudo docker run --rm -v <path-to-input-files-on-HOST>:<path-to-container-mnt> UROPA:LATEST uropa <UROPA-Paramters> -p <path-to-container-mnt>/'your-file-prefix'
```

*-v parameter mounts a HOST folder into your docker CONTAINER. This folder should contain the input files for UROPA and also the result files will be stored here. No files will be stored in the container!*

*--rm removes/closes the container after the run*

Make sure to use the uropa -p option specifying the output directory and prefix, otherwise results are lost in the container environment.

How to cite
-----------

Kondili M, Fust A, Preussner J, Kuenne C, Braun T, and Looso M. UROPA: a tool for Universal RObust Peak Annotation. *Scientific Reports* **7** (2017), doi: [10.1038/s41598-017-02464-y](https://www.nature.com/articles/s41598-017-02464-y)

Contribute
----------

* Source Code [here](https://github.molgen.mpg.de/loosolab/UROPA)
* Issue Tracker [here](https://github.molgen.mpg.de/loosolab/UROPA/issues)

Support
-------

If you have any questions please feel free to contact Mario Looso (mario.looso@mpi-bn.mpg.de).

License
-------

The project is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

uropa-3.0.0.tar.gz (27.2 kB view details)

Uploaded Source

File details

Details for the file uropa-3.0.0.tar.gz.

File metadata

  • Download URL: uropa-3.0.0.tar.gz
  • Upload date:
  • Size: 27.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.7.1

File hashes

Hashes for uropa-3.0.0.tar.gz
Algorithm Hash digest
SHA256 a01faf576355699c5958d8cd0e74eec7ac4b02b0816b8c5bf868042ee94862dd
MD5 361e82bba0a1f4b569a221e3827ea48f
BLAKE2b-256 51b587a848f9dce9b42053a5a5710ca1b8d6b08dee0f408cf9622f852b5cad4a

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page