omniCLIP

omniCLIP is a Bayesian peak caller that can be applied to data from CLIP-Seq experiments to detect regulatory elements in RNAs.

Overview

Introduction

Dependencies and Requirements

Installation

Usage

Contributors

License

Introduction

omniCLIP calls peaks in CLIP-Seq data while accounting for confounding factors such as gene expression, and it automatically learns relevant diagnostic events from the data. Furthermore, it can leverage replicate information and model technical and biological variance.

Dependencies and Requirements

omniCLIP requires Python (v3.7) and the libraries listed in the environment.yml file. All required dependencies can be installed with conda by running the following in the main project directory:

$ conda env create -f environment.yml

The environment must then be activated before running omniCLIP:

$ conda activate omniEnv
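
Before starting an analysis it can help to confirm that the environment is really active. The following is a minimal sketch, not part of omniCLIP: `check_env` is a hypothetical helper, and it relies on conda setting the `CONDA_DEFAULT_ENV` variable to the name of the active environment.

```shell
# Hypothetical helper: report whether the expected conda environment is active.
# conda sets CONDA_DEFAULT_ENV to the name of the active environment.
check_env() {
    expected=$1
    if [ "${CONDA_DEFAULT_ENV:-}" = "$expected" ]; then
        echo "ok: environment '$expected' is active"
    else
        echo "warning: expected '$expected', found '${CONDA_DEFAULT_ENV:-none}'"
        return 1
    fi
}

# omniEnv is the environment name used in the activation command above.
check_env omniEnv || true
```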

Installation

Manual installation

The latest stable release in the master branch can be downloaded by executing:

$ git clone -b master https://github.com/simojoe/omniCLIP.git

Then run the following command from the cloned directory:

$ python3 setup.py

This compiles the Cython code for the Viterbi algorithm.

Usage

An omniCLIP analysis runs in four steps:

  • Generating the annotation database
  • Parsing the background RNA-seq files
  • Parsing the CLIP files
  • Running the omniCLIP algorithm

The example commands below show only the required arguments of each step. The following input files are needed to run an analysis.

File name             Description
$GFF_file             Genome annotation file
$GENOME_dir           Directory containing FASTA files with the sequence of each chromosome
$BG_file[1,2,...]     BAM files for the background library. The alignments must have the MD and NM tags.
$CLIP_file[1,2,...]   BAM files for the CLIP library. The alignments must have the MD and NM tags.
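
Since both BAM inputs must carry the MD and NM tags, a quick check before parsing can save a failed run. The sketch below is not part of omniCLIP: `count_missing_tags` is a hypothetical helper that reads SAM text on standard input and counts records lacking either tag. It assumes samtools is installed for the conversion from BAM; if tags are missing, `samtools calmd` can usually add them from the reference sequence.

```shell
# Hypothetical helper: count SAM records on stdin that lack the MD or NM tag.
count_missing_tags() {
    awk -F '\t' '
        /^@/ { next }                      # skip SAM header lines
        {
            has_md = 0; has_nm = 0
            for (i = 12; i <= NF; i++) {   # optional tags start at column 12
                if ($i ~ /^MD:Z:/) has_md = 1
                if ($i ~ /^NM:i:/) has_nm = 1
            }
            if (!(has_md && has_nm)) missing++
        }
        END { print missing + 0 }
    '
}

# Example (assumes samtools is installed; -F 4 drops unmapped reads,
# which carry no MD tag; only the first 10000 records are inspected):
#   samtools view -F 4 "$CLIP_file1" | head -n 10000 | count_missing_tags
```

A result of 0 means every inspected alignment has both tags.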

The following files will be created.

File name    Description
$DB_file     SQL database of the genome annotation.
$BG_dat      H5PY file of the parsed background library.
$CLIP_dat    H5PY file of the parsed CLIP library.
$OUT_dir     Directory containing the final results.

1. Generating the annotation database

$ python3 omniCLIP.py generateDB \
--gff-file $GFF_file --db-file $DB_file

2. Parsing the background RNA-seq files

$ python3 omniCLIP.py parsingBG \
--db-file $DB_file --genome-dir $GENOME_dir \
--bg-files $BG_file1 --bg-files $BG_file2 \
--out-file $BG_dat

3. Parsing the CLIP files

$ python3 omniCLIP.py parsingCLIP \
--db-file $DB_file --genome-dir $GENOME_dir \
--clip-files $CLIP_file1 --clip-files $CLIP_file2 \
--out-file $CLIP_dat

4. Running the omniCLIP algorithm

$ python3 omniCLIP.py run_omniCLIP \
--db-file $DB_file --bg-dat $BG_dat --clip-dat $CLIP_dat \
--out-dir $OUT_dir
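
The four steps can also be chained in one script so that a failure in any step stops the analysis. The following is a sketch under stated assumptions, not an official driver: every path is a hypothetical placeholder, and the `run` wrapper and `DRY_RUN` variable are introduced here for illustration. By default the script only prints the commands.

```shell
#!/bin/sh
# Sketch of a driver for the four omniCLIP steps.
# Every path below is a hypothetical placeholder; override it via the environment.
set -eu

GFF_file=${GFF_file:-annotation.gff3}
GENOME_dir=${GENOME_dir:-genome}
BG_file1=${BG_file1:-bg_rep1.bam}
BG_file2=${BG_file2:-bg_rep2.bam}
CLIP_file1=${CLIP_file1:-clip_rep1.bam}
CLIP_file2=${CLIP_file2:-clip_rep2.bam}
DB_file=${DB_file:-annotation.db}
BG_dat=${BG_dat:-bg.dat}
CLIP_dat=${CLIP_dat:-clip.dat}
OUT_dir=${OUT_dir:-results}

# DRY_RUN=1 (the default here) only prints each command; DRY_RUN=0 executes it.
DRY_RUN=${DRY_RUN:-1}

run() {
    echo "+ $*"
    if [ "$DRY_RUN" -eq 0 ]; then
        "$@"
    fi
}

# 1. Annotation database
run python3 omniCLIP.py generateDB \
    --gff-file "$GFF_file" --db-file "$DB_file"

# 2. Background RNA-seq files
run python3 omniCLIP.py parsingBG \
    --db-file "$DB_file" --genome-dir "$GENOME_dir" \
    --bg-files "$BG_file1" --bg-files "$BG_file2" \
    --out-file "$BG_dat"

# 3. CLIP files
run python3 omniCLIP.py parsingCLIP \
    --db-file "$DB_file" --genome-dir "$GENOME_dir" \
    --clip-files "$CLIP_file1" --clip-files "$CLIP_file2" \
    --out-file "$CLIP_dat"

# 4. Peak calling
run python3 omniCLIP.py run_omniCLIP \
    --db-file "$DB_file" --bg-dat "$BG_dat" --clip-dat "$CLIP_dat" \
    --out-dir "$OUT_dir"
```

Running the script with `DRY_RUN=0` executes the steps. Because of `set -e`, the script stops at the first step that exits with an error.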

Optional arguments

For the full list of optional arguments of each step, consult the help of the corresponding command, for example:

$ python3 omniCLIP.py parsingBG --help

Contributors

License

GNU GPL license (v3)
