Skip to main content

Sequencing pipeline

Project description

Usage

Execute main.py as follows:

$ python main.py --help
Usage: seqpipe [OPTIONS]

  Create read-genome matrix and compute all read alignments. Subsequently,
  apply various scripts and aggregate results.

Options:
  -r, --read PATH             Path to read file/directory.  [required]
  -g, --genome PATH           Path to genome file/directory.  [required]
  -o, --output DIRECTORY      Directory to save results to.
  --scripts / --no-scripts    Whether to execute scripts or not.
  -m, --min-read-len INTEGER  Minimal read length.
  -M, --max-read-len INTEGER  Maximal read length.
  -b, --bowtie-args TEXT      Extra arguments for bowtie.
  -t, --threads INTEGER       How many threads to run in.
  --help                      Show this message and exit.

This will create a mapping_results_* directory which contains two directories: * runs stores all data related to each individual read file * results contains data generated by scripts from the scripts folder

Extras

Additional useful scripts are contained in extra. The entry point is main.py (check python ./extra/main.py --help for help).

The respective individual files are: * sequential_pipeline.sh * map length-filtered reads against multiple genomes in succession * plot_sequential_data.py * visualize data obtained from sequential pipeline * plot_expression_differences.py * visualize differences in RNAseq expression levels over pairs of samples * utils.py * Various helper methods * mapping_overview.py * Plot various statistics

Dependencies

Tools: * cutadapt * fastqc * more info * bowtie2 * samtools * specifications * bedtools * moreutils

Languages: * bash * python * numpy * pandas * seaborn * matplotlib * tqdm * biopython * pysam * joblib * click * sh * colorama

Development notes

Tests

Run tests using:

$ tox

Release package

This guide assumes a properly setup ~/.pypirc.

Build package:

$ python setup.py sdist

Register it (only once):

$ twine register dist/seqpipe-X.Y.Z.tar.gz

Try installation locally:

$ rm -rf /tmp/seqpipe_tmp
$ virtualenv /tmp/seqpipe_tmp
$ /tmp/seqpipe_tmp/bin/pip install dist/seqpipe-X.Y.Z.tar.gz
$ /tmp/seqpipe_tmp/bin/seqpipe --help

Try installation using test-servers:

$ twine upload -r test dist/seqpipe-X.Y.Z.tar.gz
$ pip install -i https://testpypi.python.org/pypi seqpipe
$ seqpipe --help

Check testpypi-page.

Finally, install it on actual server:

$ twine upload dist/seqpipe-X.Y.Z.tar.gz
$ pip install -U seqpipe
$ seqpipe --help

Check actual pypi-page.

Misc

Create dev-builds with:

$ pip install --user -e .

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

seqpipe-0.0.4.tar.gz (7.4 kB view details)

Uploaded Source

File details

Details for the file seqpipe-0.0.4.tar.gz.

File metadata

  • Download URL: seqpipe-0.0.4.tar.gz
  • Upload date:
  • Size: 7.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for seqpipe-0.0.4.tar.gz
Algorithm Hash digest
SHA256 3774fd0637e80e3256bc2125dbbcb6b1f5ce1a4dd98b43fd05949be6f36b29d0
MD5 3021450accf95923b0a99cfc443c60f8
BLAKE2b-256 d13f74d8e31739eaa5c51bf9c1f9fd624483fa8ee8feab3f04335ba24f1e3b2c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page