smrnatk

This project implements a de novo clustering methodology for analyzing and interpreting smRNA-Seq data.

Introduction

The small RNA toolkit (smRNAtk) is a comprehensive set of utilities for cleaning and analyzing raw RNA NGS data.

Documentation

Full documentation is in progress. For the time being, here are the command line options that are currently implemented:

Quality Report

Produce read quality plots for several different Phred score thresholds.

smrnatk-quality_report [options] <fastq>

Quality Filter

Filter out fastq reads that contain more than N bases with a Phred score below a given threshold.

smrnatk-quality_filter [options] <fastq>
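The filtering rule described above can be sketched in a few lines of Python. This is only an illustration of the rule, not the code behind smrnatk-quality_filter: the function names, the tuple record layout, and the Phred+33 quality encoding are all assumptions made for the example.

```python
from typing import Iterable, Iterator, Tuple

# Hypothetical record layout for this sketch: (header, sequence, quality string).
FastqRecord = Tuple[str, str, str]


def count_low_quality(quality: str, min_phred: int, offset: int = 33) -> int:
    """Count bases whose Phred score is below min_phred.

    Assumes Phred+33 encoding (score = ord(char) - 33).
    """
    return sum(1 for ch in quality if ord(ch) - offset < min_phred)


def quality_filter(
    records: Iterable[FastqRecord], min_phred: int, max_low_bases: int
) -> Iterator[FastqRecord]:
    """Yield only reads with at most max_low_bases bases below min_phred."""
    for header, seq, qual in records:
        if count_low_quality(qual, min_phred) <= max_low_bases:
            yield header, seq, qual


if __name__ == "__main__":
    reads = [
        ("r1", "ACGT", "IIII"),  # every base has Phred 40
        ("r2", "ACGT", "!!!I"),  # three bases have Phred 0
    ]
    for header, _, _ in quality_filter(reads, min_phred=20, max_low_bases=2):
        print(header)
```

Here N corresponds to max_low_bases: a read is dropped only when the number of low-scoring bases exceeds it.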

Collapse

Collapse fastq reads into a fasta file. The number of original reads collapsed per sequence is recorded in the fasta header.

smrnatk-collapse <fastq>
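As a rough illustration of collapsing, the sketch below counts identical read sequences and writes one fasta record per unique sequence. The header format (`seq<rank>_x<count>`) is invented for this example; the format actually written by smrnatk-collapse is not documented here and may differ.

```python
from collections import Counter
from typing import Iterable


def collapse(sequences: Iterable[str]) -> str:
    """Collapse identical reads into fasta text, most abundant first.

    The header layout '>seq<rank>_x<count>' is a hypothetical choice
    made for this sketch.
    """
    counts = Counter(sequences)
    lines = []
    for rank, (seq, n) in enumerate(counts.most_common(), start=1):
        lines.append(f">seq{rank}_x{n}")
        lines.append(seq)
    return "".join(f"{line}\n" for line in lines)


if __name__ == "__main__":
    print(collapse(["ACGT", "TTTT", "ACGT"]), end="")
```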

Align

Align the sequences in the fasta file provided against a reference genome. The default behavior assumes bowtie is installed, but the user may provide their own alignment command as an optional argument.

smrnatk-align <fasta> [--align-cmd=CMD]

Compute metaMir matrices

Using the given alignments, compute metaMir matrices for every annotation in the provided GTF file, e.g. the one provided by miRBase.

smrnatk-metamir <annotation_gtf> <bam> --output=<metamir_data>

The output is a JSON-formatted file that contains all of the metaMir matrices for a single BAM file (i.e. one sample). These JSON files are used in the merge and aggregate steps.
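This README does not define the metaMir matrix itself. One plausible reading, based on the (0,0) and (0,*) coordinates used by the quantification step, is a table of read counts indexed by how far each read's start and end lie from the annotated start and end. The sketch below builds such a table; the function name, the coordinate tuples, and the offset range are all assumptions, and the real implementation may define the matrix differently.

```python
from collections import Counter
from typing import Iterable, Tuple

Interval = Tuple[int, int]  # hypothetical (start, end) coordinates


def offset_matrix(
    annotation: Interval,
    reads: Iterable[Interval],
    offsets: range = range(-5, 5),  # assumed window; gives a 10x10 table
) -> Counter:
    """Count reads by (start offset, end offset) relative to an annotation."""
    ann_start, ann_end = annotation
    matrix: Counter = Counter()
    for read_start, read_end in reads:
        d_start = read_start - ann_start
        d_end = read_end - ann_end
        # Reads falling outside the window are ignored.
        if d_start in offsets and d_end in offsets:
            matrix[(d_start, d_end)] += 1
    return matrix


if __name__ == "__main__":
    reads = [(100, 121), (100, 121), (100, 122), (101, 121), (200, 221)]
    for key, count in sorted(offset_matrix((100, 121), reads).items()):
        print(key, count)
```

Under this reading, cell (0,0) holds reads that match the annotated coordinates exactly, and all cells with a start offset of 0 share the annotation's 5' end.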

Merge metaMir matrices

Concatenate individual results from the metamir command into a single file. The resulting JSON file contains the individual metaMir matrices for each sample for each annotation. This is useful for grouping samples together for later aggregation, e.g. into disease and control conditions.

smrnatk-merge <metamir_data>...

Aggregate metaMir matrices

Aggregate metaMir matrices that were merged into a single file with merge. This operation sums the metaMir matrices for every sample in a given JSON output file and produces a single result per annotation.

smrnatk-aggregate <metamir_data>
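The summing operation can be sketched as follows. The nested layout used here (sample, then annotation, then a mapping from matrix cell to count) is a stand-in chosen for the example; the actual JSON structure written by the merge step is not documented in this README.

```python
from collections import Counter
from typing import Dict, Mapping, Tuple

Cell = Tuple[int, int]  # hypothetical (row, column) index of a matrix cell


def aggregate(
    merged: Mapping[str, Mapping[str, Mapping[Cell, int]]]
) -> Dict[str, Counter]:
    """Sum matrices over all samples, giving one matrix per annotation.

    merged is assumed to look like {sample: {annotation: {cell: count}}}.
    """
    totals: Dict[str, Counter] = {}
    for per_annotation in merged.values():
        for annotation, matrix in per_annotation.items():
            # Counter.update adds counts cell by cell.
            totals.setdefault(annotation, Counter()).update(matrix)
    return totals


if __name__ == "__main__":
    merged = {
        "sample1": {"mir-1": {(0, 0): 5, (0, 1): 1}},
        "sample2": {"mir-1": {(0, 0): 3}, "mir-2": {(1, 0): 2}},
    }
    for annotation, matrix in aggregate(merged).items():
        print(annotation, dict(matrix))
```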

Quantify metaMir matrices into a counts file

Sum different portions of the metaMir matrices to produce a traditional counts matrix with samples (i.e. merged metaMir matrices) as columns and annotations as rows. The optional --window argument offers three options for summing matrices:

  • canonical: default, use only the count in the canonical annotation coordinates (0,0)
  • seed: sum the column (0,*), which includes all species that have the same seed sequence as the annotation
  • whole: sum the entire matrix, which includes all alignments in a 10x10 window around the annotated start positions

smrnatk-quant [options] <merged_metamir_data>
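The three window options can be illustrated with a small function. It assumes a matrix stored as a mapping from (start offset, end offset) to a count, so that (0,0) is the canonical cell and (0,*) covers every cell whose first index is 0. That layout and the function name are assumptions made for this sketch, not the package's internal representation.

```python
from typing import Mapping, Tuple

Cell = Tuple[int, int]  # assumed (start offset, end offset)


def quantify(matrix: Mapping[Cell, int], window: str = "canonical") -> int:
    """Reduce one matrix to a single count according to the window option."""
    if window == "canonical":
        # Only reads at the annotated coordinates.
        return matrix.get((0, 0), 0)
    if window == "seed":
        # Every cell with first index 0, i.e. (0, *).
        return sum(n for (d_start, _), n in matrix.items() if d_start == 0)
    if window == "whole":
        # Every cell in the matrix.
        return sum(matrix.values())
    raise ValueError(f"unknown window: {window!r}")


if __name__ == "__main__":
    matrix = {(0, 0): 8, (0, 1): 1, (1, 0): 2}
    for window in ("canonical", "seed", "whole"):
        print(window, quantify(matrix, window))
```

Applying this to every annotation in every sample yields one cell of the counts matrix per (annotation, sample) pair.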

Installing

Currently the only way to install this software is as follows:

$ git clone https://bitbucket.org/rtingram/smrnatk.git
$ cd smrnatk
$ python setup.py install

We recommend using conda to create an isolated environment when installing:

$ conda create -n my_smallrna_project python
$ conda activate my_smallrna_project
$ python setup.py install
