Skip to main content

Papolarity is a tool to analyze polarity of transcriptomic alignments such as Ribo-seq and RNA-seq.

Project description

Papolarity

Papolarity is a Python package for analysis of transcript-level short read coverage profiles.

For a single sample, for each transcript papolarity allows for computing the classic polarity metric which, in the case of Ribo-Seq, reflects ribosome positional preferences.

For comparison versus a control sample, papolarity estimates an improved metric, the relative linear regression slope of coverage along transcript length. This involves de-noising by profile segmentation with a Poisson model (using pasio), and aggregation of Ribo-Seq coverage within segments, thus achieving reliable estimates of the regression slope.

Toolkit

Papolarity provide a toolkit to perform different tasks necessary for processing transcriptomic data such as Ribo-Seq alignments. The package is organized as a single entry point for a set of subcommands.

You can run it with: papolarity [arguments] or python -m papolarity [arguments] if you need to specify a certain version of python to run a package.

Note that at a moment only Python 3.7 is supported; probably this restriction will be relaxed later. Python 3.8 is not supported yet because pysam (dependency of pybedtools which is necessary for some steps) can't be compiled for that version of python.

There are no conventions about a structure of folders and file names. All files that are used by tool are always specified in command line arguments.

Papolarity have a few conventions about file extensions: all files with .gz extension are treated as gzip archives. Input files with names ending with .gz will be automatically unpacked, output files will be automatically packed. Character - instead of filename will be treated as stdin or stdout. It can be useful to use papolarity in pipelined commands.

You can follow the protocol to get the idea how these tools are supposed to be used. If you need to customize pipelines, please reference to help for corresponding tools: papolarity --help lists all available tools. papolarity <cmd> --help shows description of all arguments and options for a specified tool.

Protocol

In our paper "Estimating positional distribution of ribosomes along transcripts: from polarity to regression slope" (doi:will be soon) we describe a protocol for Ribo-Seq analysis. In a file protocol-paper.sh you can find a script we used in a paper to process our datasets. It's slightly modified for better readability compared to a paper, and is more easily customizable. Also it has a few additional commands to generate plots which are absent in paper. Steps are named after paper sections.

You can use this protocol as is or change any parts you wish. As long as you comply with data formats and use consistent data (e.g. all files should be clipped in the same manner, or non-clipped at all), papolarity will work, order of commands, folder names, filenames and so on doesn't matter.

To run this pipeline, you should have several auxiliary tools installed: csvtk, GNU parallel, and python package pasio.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

papolarity-0.0.3.post9.tar.gz (26.4 kB view hashes)

Uploaded Source

Built Distribution

papolarity-0.0.3.post9-py2.py3-none-any.whl (35.3 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page