A robust, parallelized Python CLI for annotating three_prime_UTR

peaks2utr: a robust, parallelized Python CLI for annotating 3' UTR

License: GPL v3

peaks2utr is a Python command-line tool that annotates 3' untranslated regions (UTRs) from a set of aligned sequencing reads in BAM format and a canonical annotation in GFF or GTF format. It uses MACS3 (https://pypi.org/project/MACS3/) to call broad "peaks" of significant read coverage in the BAM file, and uses the peaks that pass a set of criteria as the basis for annotating novel 3' UTRs. This approach favours BAM files from protocols such as 10x Chromium, where signal is inherently concentrated at the distal ends of the 3' or 5' UTRs. Reads containing soft-clipped bases and polyA-tails of a given length are detected, and their end bases are tallied as "truncation points". When piled up, co-occurring truncation points are used to determine the precise end base of a given UTR. peaks2utr can be tuned to extend, override or ignore any pre-existing 3' UTR annotations in the input GFF file.
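The truncation-point idea described above can be illustrated with a small, self-contained sketch. This is not the peaks2utr implementation: the read tuples, the function names and the `min_polya` threshold are all hypothetical, only forward-strand reads are considered, and real code would read records from the BAM file rather than from a hard-coded list.

```python
from collections import Counter


def has_polya_tail(seq, min_len):
    """True if the read sequence ends in at least min_len consecutive 'A' bases."""
    return len(seq) - len(seq.rstrip("A")) >= min_len


def tally_truncation_points(reads, min_polya=10):
    """Count the aligned end base of every soft-clipped or polyA-tailed read.

    reads: iterable of (aligned_end, soft_clipped_bases, sequence) tuples,
    a simplified, hypothetical stand-in for forward-strand BAM records.
    """
    tally = Counter()
    for aligned_end, soft_clipped, seq in reads:
        if soft_clipped > 0 or has_polya_tail(seq, min_polya):
            tally[aligned_end] += 1
    return tally


def most_supported_end(tally):
    """Return the position with the highest pile-up, or None if nothing was tallied."""
    if not tally:
        return None
    return tally.most_common(1)[0][0]


# Made-up reads for illustration only.
reads = [
    (1500, 12, "ACGTACGT" + "A" * 12),  # soft-clipped and polyA-tailed
    (1500, 0, "GGCATTCA" + "A" * 10),   # polyA-tailed only
    (1500, 5, "TTGACCGT"),              # soft-clipped only
    (1497, 3, "CCGTAGGA"),              # soft-clipped, different end base
    (1480, 0, "ACGTTGCA"),              # neither: ignored
]
tally = tally_truncation_points(reads)
print(most_supported_end(tally))
```

In this toy data three reads agree on the same end base, so that position would be the candidate UTR end; how peaks2utr weighs such evidence against the called peaks is not covered by this sketch.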

Installation

Install the latest release with:

pip install peaks2utr

Alternatively, to install from source:

git clone https://github.com/haessar/peaks2utr.git
cd peaks2utr
python3 -m pip install build
python3 -m build
python3 -m pip install dist/*.tar.gz
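After either installation route, one way to confirm that the package is visible to the current Python environment is to query its installed metadata. The sketch below uses only the standard library; the helper name `installed_version` is hypothetical, and the distribution name is assumed to match the `pip install peaks2utr` name.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


found = installed_version("peaks2utr")
if found:
    print(f"peaks2utr {found} is installed")
else:
    print("peaks2utr is not installed in this environment")
```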

Dependencies

Installation instructions assume a Debian / Ubuntu system with root privileges. Follow the links for instructions for other systems.

Required

bedtools

apt-get install bedtools

Optional

GenomeTools (for post-processing of output GFF3)

apt-get install genometools
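Before a long run it can be useful to check that the external tools are on the PATH. The following is a minimal sketch using the standard library; the helper name `missing_tools` is hypothetical, and it assumes the executables are called `bedtools` and `gt` (the GenomeTools binary), which may differ on some systems.

```python
import shutil


def missing_tools(names):
    """Return the subset of executable names that cannot be found on PATH."""
    return [name for name in names if shutil.which(name) is None]


required = ["bedtools"]
optional = ["gt"]  # GenomeTools executable name, assumed here

for label, tools in (("required", required), ("optional", optional)):
    absent = missing_tools(tools)
    if absent:
        print(f"Missing {label} tools: {', '.join(absent)}")
    else:
        print(f"All {label} tools found")
```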

Quick start

Download the demo reference annotation Tb927_01_v5.1.gff and the BAM file Tb927_01_v5.1.slice.bam, then run:

peaks2utr Tb927_01_v5.1.gff Tb927_01_v5.1.slice.bam

When complete, you should see a file Tb927_01_v5.1.new.gff which contains original annotations as well as 3' UTRs with source "peaks2utr".
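To inspect the result, the new annotations can be picked out by their source column. The sketch below is a plain-text GFF filter, not part of peaks2utr: the function name is hypothetical, the feature type `three_prime_UTR` is assumed from the project summary, and the sample lines are invented so that the example runs without the demo files.

```python
def peaks2utr_features(lines, feature_type="three_prime_UTR"):
    """Yield (seqid, start, end, strand) for features whose source is 'peaks2utr'."""
    for line in lines:
        if line.startswith("#") or not line.strip():
            continue  # skip directives, comments and blank lines
        cols = line.rstrip("\n").split("\t")
        if len(cols) != 9:
            continue  # not a standard nine-column GFF record
        seqid, source, ftype, start, end = cols[:5]
        strand = cols[6]
        if source == "peaks2utr" and ftype == feature_type:
            yield seqid, int(start), int(end), strand


# Invented records for illustration; real output will differ.
sample = [
    "##gff-version 3\n",
    "chr1\tExampleDB\tgene\t100\t900\t.\t+\t.\tID=gene1\n",
    "chr1\tpeaks2utr\tthree_prime_UTR\t901\t1500\t.\t+\t.\tID=utr1;Parent=gene1\n",
    "chr1\tExampleDB\tthree_prime_UTR\t2000\t2100\t.\t-\t.\tID=utr2\n",
]
utrs = list(peaks2utr_features(sample))
print(len(utrs), "new 3' UTR feature(s)")
```

To run the same filter on the quick-start output, pass an open file handle instead of the sample list, for example `with open("Tb927_01_v5.1.new.gff") as fh: utrs = list(peaks2utr_features(fh))`.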
