A robust, parallelized Python CLI for annotating three_prime_UTR
Project description
peaks2utr: a robust, parallelized Python CLI for annotating 3' UTR
peaks2utr is a Python command-line tool that annotates 3' untranslated regions (UTR) for a given set of aligned sequencing reads in BAM format, and canonical annotation in GFF or GTF format. peaks2utr uses MACS3 (https://pypi.org/project/MACS3/) to call broad "peaks" of significant read coverage in the BAM file, and uses those peaks that pass a set of criteria as a basis to annotate novel 3' UTRs. This favours BAM files from the likes of 10x Chromium runs, where signal is inherently concentrated at the distal ends of the 3' or 5' UTRs. Reads containing soft-clipped bases and polyA-tails of a given length are detected, and their end bases tallied as "truncation points". When piled up, each co-occurring truncation point is used to determine the precise end base of a given UTR. peaks2utr can be tuned to extend, override or ignore any pre-existing 3' UTR annotations in the input GFF file.Installation
Install latest release with:
pip install peaks2utr
Alternatively, to install from source:
git clone https://github.com/haessar/peaks2utr.git
cd peaks2utr
python3 -m build
python3 -m pip install dist/*.tar.gz
Dependencies
Installation instructions assume a Debian / Ubuntu system with root privileges. Follow the links for instructions for other systems.
Required
apt-get install bedtools
Optional
GenomeTools (for post-processing of output gff3)
apt-get install genometools
Quick start
Download demo reference annotations Tb927_01_v5.1.gff and bam file Tb927_01_v5.1.slice.bam and run
peaks2utr Tb927_01_v5.1.gff Tb927_01_v5.1.slice.bam
When complete, you should see a file Tb927_01_v5.1.new.gff
which contains original annotations as well as 3' UTRs with source "peaks2utr".
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file peaks2utr-0.4.tar.gz
.
File metadata
- Download URL: peaks2utr-0.4.tar.gz
- Upload date:
- Size: 10.6 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.9.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0ed8b52c35229d736c971cefb41121b1d75d5840ca3c2edd5d02b63b021f238c |
|
MD5 | 15b6441ba4dcdd2bfaaec5e2137f975b |
|
BLAKE2b-256 | 85f4bc695d7b69bfa18379333a93abb98e6605ff9fb29a949355dd6650ed6aff |
File details
Details for the file peaks2utr-0.4-py3-none-any.whl
.
File metadata
- Download URL: peaks2utr-0.4-py3-none-any.whl
- Upload date:
- Size: 16.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.9.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c73edc432c072acc3093bc396c8cb85d187d2d66b8d7a3754463942dfbebd510 |
|
MD5 | cb51922809d007ac417aab1a4f099dfc |
|
BLAKE2b-256 | 313b93d47e9ad2768284f2f5074de498e6485a3e75c9b1faf7b582c818a0a754 |