Simulator for ddRADseq (double digest restriction site associated DNA sequencing) datasets. Generates reads (FASTQ format) that can be analyzed and validated using a ground truth file (YAML).

Project description

RAGE (ddRAD Data Generator) is a tool for simulating double digest restriction site associated DNA sequencing (ddRAD) reads. The generated datasets can be used to test ddRAD analysis tools and validate their results.

The documentation, including a tutorial, can be found here.

System requirements

  • Python >= 3.5

  • numpy

  • scipy

  • matplotlib

  • pyyaml

  • numba

For the docs:

  • sphinx

  • sphinx_rtd_theme

For parameter visualization:

  • bokeh

Installation

We recommend the installation using conda:

$ conda create -c bioconda -n rage python rage
$ source activate rage

Alternatively, you can download the source code from Bitbucket and install it using the setup script:

$ git clone https://bitbucket.org/genomeinformatics/rage.git
$ cd rage
$ python setup.py install

In this case, you have to install the requirements listed above yourself.
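When installing from source, it can help to check which of the requirements listed above are still missing before running the setup script. The following is a minimal sketch, not part of RAGE itself: the helper name missing_modules is made up for illustration, and the list uses the import names of the packages (pyyaml is imported as yaml). It relies only on the standard library.

```python
import importlib.util

# Import names of the requirements listed above.
# Assumption: these are the top-level module names; note that the
# pyyaml package is imported as "yaml".
REQUIRED_MODULES = ["numpy", "scipy", "matplotlib", "yaml", "numba"]


def missing_modules(names):
    """Return the module names from `names` that cannot be found for import.

    Hypothetical helper for illustration; it is not shipped with RAGE.
    """
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules: " + ", ".join(missing))
    else:
        print("All required modules are importable.")
```

Any module reported as missing can then be installed with the package manager of your choice before running the setup script.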

Usage

To simulate a ddRAD dataset, call rage from the command line:

$ rage

You can pass options to change dataset parameters such as the number of individuals (-n), the number of loci (-l), and the coverage (--coverage):

$ rage -n 6 -l 10000 --coverage 30

This creates a dataset with reads from 6 individuals at 10000 loci with an expected coverage of 30.
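Since the simulated reads are written in FASTQ format, a quick sanity check on a generated dataset is to count its records. The sketch below is an illustration under stated assumptions, not part of RAGE: it assumes plain-text FASTQ with exactly four lines per record, the function name read_fastq is hypothetical, and the sample data is made up so that no output file names of rage are assumed.

```python
import io
from itertools import islice


def read_fastq(handle):
    """Yield (name, sequence, qualities) tuples from a FASTQ stream.

    Assumes the common four-lines-per-record layout:
    '@name', sequence, '+', quality string.
    """
    while True:
        record = [line.rstrip("\n") for line in islice(handle, 4)]
        if not record:
            return  # end of input
        if (len(record) != 4
                or not record[0].startswith("@")
                or not record[2].startswith("+")):
            raise ValueError("malformed FASTQ record: {!r}".format(record))
        name, seq, _, qual = record
        if len(seq) != len(qual):
            raise ValueError("sequence/quality length mismatch in " + name)
        yield name[1:], seq, qual


# Made-up sample data standing in for a simulated read file.
SAMPLE = "@read_1\nACGT\n+\nIIII\n@read_2\nTTGA\n+\nIIII\n"

records = list(read_fastq(io.StringIO(SAMPLE)))
print("number of reads:", len(records))
```

For a real dataset, the same function can be applied to an opened read file in place of the in-memory sample; compressed files would need to be opened in text mode with the matching standard-library module first.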

Download files

Source Distribution

ddrage-1.0.0.tar.gz (139.0 kB)

Built Distribution

ddrage-1.0.0-py3-none-any.whl (148.0 kB, Python 3)
