Simulator for ddRADseq (double digest restriction site associated DNA sequencing) datasets. Generates reads (FASTQ format) that can be analyzed and validated using a ground truth file (YAML).
Project description
RAGE (ddRAD Data Generator) is a software tool that simulates double digest restriction site associated DNA sequencing (ddRADseq) reads. The generated datasets can be used to test ddRAD analysis tools and validate their results.
The documentation, including a tutorial, can be found here.
System Requirements
python >= 3.5
numpy
scipy
matplotlib
pyyaml
numba
For the docs:
sphinx
sphinx_rtd_theme
For parameter visualization:
bokeh
Installation
We recommend installing with conda:
$ conda create -c bioconda -n rage python rage
$ source activate rage
Alternatively, you can download the source code from Bitbucket and install it with the setup script:
$ git clone https://bitbucket.org/genomeinformatics/rage.git
$ cd rage
$ python setup.py install
In this case, you have to install the requirements listed above yourself.
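When installing from source, it can help to confirm that the requirements are available before running the setup script. The following is a minimal sketch, not part of RAGE itself: the helper name missing_modules is hypothetical, and the list only covers the core requirements named above (pyyaml is imported under the module name yaml).

```python
import importlib.util
import sys

# Import names of the core requirements listed above.
# Note: the pyyaml package is imported as "yaml".
REQUIRED_MODULES = ["numpy", "scipy", "matplotlib", "yaml", "numba"]


def missing_modules(names):
    """Return the entries of `names` that cannot be located for import.

    Hypothetical helper for illustration; it only checks that a module
    can be found, not which version is installed.
    """
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # The requirements list asks for Python 3.5 or newer.
    if sys.version_info < (3, 5):
        print("Python 3.5 or newer is required, found", sys.version.split()[0])
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All core requirements can be imported.")
```

The optional packages for the docs (sphinx, sphinx_rtd_theme) and for parameter visualization (bokeh) could be checked the same way by adding their names to the list.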
Usage
To simulate a ddRAD dataset, call rage from the command line:
$ rage
You can pass options to change dataset properties such as the number of individuals (-n), the number of loci (-l), and the coverage (--coverage):
$ rage -n 6 -l 10000 --coverage 30
This creates a dataset with reads from 6 individuals at 10000 loci with an expected coverage of 30.
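A quick sanity check on a simulated dataset is to count the FASTQ records and compare the result with a rough expectation. The sketch below is illustrative only: both function names are hypothetical, the example records are made up, and the estimate assumes that coverage is counted per individual and per locus and that no other reads are added or dropped, which may not match what RAGE actually writes.

```python
import io


def count_fastq_records(handle):
    """Count records in a FASTQ text stream with four lines per record."""
    records = 0
    while True:
        header = handle.readline()
        if not header:
            break  # end of stream
        sequence = handle.readline().rstrip("\n")
        separator = handle.readline()
        quality = handle.readline().rstrip("\n")
        if not header.startswith("@") or not separator.startswith("+"):
            raise ValueError("malformed FASTQ record number {}".format(records + 1))
        if len(sequence) != len(quality):
            raise ValueError("sequence and quality lengths differ in record {}".format(records + 1))
        records += 1
    return records


def rough_read_estimate(individuals, loci, coverage):
    """Back-of-envelope read count.

    ASSUMPTION: coverage is per individual and per locus, with nothing
    added or removed. Treat the result as an order of magnitude only.
    """
    return individuals * loci * coverage


# Two made-up records, standing in for a generated FASTQ file.
EXAMPLE_FASTQ = "@read_1\nACGT\n+\nIIII\n@read_2\nTTGA\n+\nIIII\n"

if __name__ == "__main__":
    print(count_fastq_records(io.StringIO(EXAMPLE_FASTQ)))
    print(rough_read_estimate(6, 10000, 30))
```

To apply this to a real dataset, pass an open text file handle for one of the generated FASTQ files instead of the in-memory example.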
Download files
Hashes for ddrage-1.1.2.linux-x86_64.tar.gz
Algorithm | Hash digest
---|---
SHA256 | 1ba7531e1058d1d46ce8d09eb3717d0596ef0e2d43db84748b9daab845e6b6ed
MD5 | 15cf2a920a1c3624c8cdf74e923efd70
BLAKE2b-256 | 9925a1cf282968a8971d1670f832fe0280aa6402a8360973a9dc278354e4d090