smrnatk
This project implements a de novo clustering methodology for analyzing and interpreting smRNA-Seq data.
Introduction
The small RNA toolkit (smRNAtk) is a comprehensive set of utilities for cleaning and analyzing raw RNA NGS data.
Documentation
Full documentation is in progress. For the time being, here are the command-line options that are currently implemented:
Quality Report
Produce plots of quality for a number of different phred score thresholds.
smrnatk-quality_report [options] <fastq>
Quality Filter
Filter fastq reads if they contain more than N bases with a quality below a given phred score.
smrnatk-quality_filter [options] <fastq>
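The filtering rule described above can be sketched in a few lines of Python. This is only an illustration and not the smrnatk implementation: the function names, the default thresholds, the Phred+33 encoding, and the choice to discard (rather than keep) failing reads are all assumptions.

```python
def phred_scores(quality_line, offset=33):
    """Decode a FASTQ quality string into integer Phred scores.

    Assumes Phred+33 (Sanger / Illumina 1.8+) encoding.
    """
    return [ord(ch) - offset for ch in quality_line]


def passes_filter(quality_line, min_phred=20, max_low_bases=3):
    """Return True if at most `max_low_bases` bases score below `min_phred`."""
    low = sum(1 for q in phred_scores(quality_line) if q < min_phred)
    return low <= max_low_bases


def filter_fastq(lines, min_phred=20, max_low_bases=3):
    """Yield 4-line FASTQ records (as tuples) that pass the filter.

    `lines` is any iterable of FASTQ lines in the usual
    header / sequence / plus / quality order.
    """
    records = zip(*[iter(lines)] * 4)  # group the lines four at a time
    for header, seq, plus, qual in records:
        if passes_filter(qual.strip(), min_phred, max_low_bases):
            yield header, seq, plus, qual
```

With `max_low_bases=3`, a read with exactly three low-quality bases is kept and a read with four is dropped, matching the "more than N" wording.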
Collapse
Collapse fastq reads into a fasta file. The number of original reads collapsed per sequence is recorded in the fasta header.
smrnatk-collapse <fastq>
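As a rough illustration of what collapsing means, the sketch below counts identical read sequences and writes one fasta record per unique sequence. The header format `>seqN_xCOUNT` is a hypothetical choice made for this example; the format that smrnatk-collapse actually writes is not documented here.

```python
from collections import Counter


def collapse_reads(fastq_lines):
    """Count identical sequences in an iterable of FASTQ lines."""
    counts = Counter()
    for i, line in enumerate(fastq_lines):
        if i % 4 == 1:  # the second line of each 4-line record is the sequence
            counts[line.strip()] += 1
    return counts


def to_fasta(counts):
    """Render the counts as fasta text, most abundant sequence first.

    The header layout (>seq<rank>_x<count>) is an assumption for illustration.
    """
    return "".join(
        f">seq{rank}_x{n}\n{seq}\n"
        for rank, (seq, n) in enumerate(counts.most_common(), start=1)
    )
```

Collapsing before alignment means each distinct sequence is aligned once, while the count in the header preserves the original read abundance.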
Align
Align the sequences in the fasta file provided against a reference genome. The default behavior assumes bowtie is installed, but the user may provide their own alignment command as an optional argument.
smrnatk-align <fasta> --align-cmd=CMD
Compute metaMir matrices
Using the given alignments, compute metaMir matrices for every annotation in the provided GTF file, e.g. the one provided by miRBase.
smrnatk-metamir <annotation_gtf> <bam> --output=<metamir_data>
The output is a JSON-formatted file that contains all of the metaMir matrices for a single BAM file (i.e. one sample). These JSON files are used in the merge and aggregate steps.
Merge metaMir matrices
Concatenate individual results from the metamir command into a single file. The resulting JSON file contains the individual metaMir matrices for each sample for each annotation. This is useful for grouping samples together for later aggregation, e.g. into disease and control conditions.
smrnatk-merge <metamir_data>...
Aggregate metaMir matrices
Aggregate metaMir matrices that were merged into a single file with merge. This operation sums the metaMir matrices for every sample in a given JSON output file and produces a single result per annotation.
smrnatk-aggregate <metamir_data>
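The summing step can be pictured with the following minimal sketch. It assumes a merged layout of `{sample: {annotation: matrix}}` with each matrix stored as a list of rows; that layout is a guess made for illustration, and the real JSON schema may differ.

```python
def aggregate(merged):
    """Sum per-sample matrices element-wise, giving one matrix per annotation.

    `merged` is assumed to map sample name -> annotation name -> matrix,
    where a matrix is a list of equally sized rows of counts.
    """
    totals = {}
    for annotations in merged.values():
        for name, matrix in annotations.items():
            if name not in totals:
                # copy the rows so the input data is never modified
                totals[name] = [row[:] for row in matrix]
            else:
                totals[name] = [
                    [a + b for a, b in zip(total_row, row)]
                    for total_row, row in zip(totals[name], matrix)
                ]
    return totals
```

Under this layout, aggregating a file that holds all control samples yields a single summed matrix per annotation for the control condition.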
Quantify metaMir matrices into a counts file
Sum different portions of the metaMir matrices to produce a traditional counts matrix with samples (i.e. merged metaMir matrices) as columns and annotations as rows. The optional --window argument offers three options for summing matrices:
- canonical: default, use only the count in the canonical annotation coordinates (0,0)
- seed: sum the column (0,*), which includes all species that have the same seed sequence as the annotation
- whole: sum the entire matrix, which includes all alignments in a 10x10 window around the annotated start positions
smrnatk-quant [options] <merged_metamir_data>
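The three window modes can be illustrated with a small sketch. Here a metaMir matrix is represented as a dictionary mapping offset pairs `(i, j)` to counts, with `(0, 0)` being the canonical annotation coordinates. This representation, and the function name `quantify`, are assumptions made for the example and not the toolkit's internal format.

```python
def quantify(matrix, window="canonical"):
    """Reduce one metaMir matrix to a single count.

    `matrix` maps (i, j) offset pairs to counts; (0, 0) is the
    canonical annotation position (assumed representation).
    """
    if window == "canonical":
        # only reads matching the annotated coordinates exactly
        return matrix.get((0, 0), 0)
    if window == "seed":
        # every entry whose first offset is 0, i.e. the (0, *) column
        return sum(count for (i, _j), count in matrix.items() if i == 0)
    if window == "whole":
        # every alignment recorded in the window
        return sum(matrix.values())
    raise ValueError(f"unknown window: {window!r}")


# Hypothetical matrix with four populated cells
example = {(0, 0): 10, (0, 1): 3, (-1, 0): 2, (1, 2): 1}
canonical_count = quantify(example)           # 10
seed_count = quantify(example, "seed")        # 10 + 3 = 13
whole_count = quantify(example, "whole")      # 10 + 3 + 2 + 1 = 16
```

The counts grow from canonical to seed to whole because each mode includes the cells of the previous one.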
Installing
Currently the only way to install this software is as follows:
$ git clone https://bitbucket.org/rtingram/smrnatk.git
$ cd smrnatk
$ python setup.py install
We recommend using conda to create an isolated environment when installing:
$ conda create -n my_smallrna_project
$ source activate my_smallrna_project
$ python setup.py install