Skip to main content

Bioinformatics tools for Janis; the Pipeline creation helper

Project description

Bioinformatics Tools (Pipelines)

Documentation Status Build Status PyPI version

This repository contains tools and data types for Janis directly related to the bioinformatics field.

Refer to the documentation for a full list of tools.

Data types

The data types are a way of encapsulating information about the file (including secondary files), and it allows clarity when connecting inputs and steps together (as you know a BAM file should be connected to BAM input).

This documentation will be updated with a full (generated) list of data types, however for now here's an approximate list (nested means inherited):

Non-exhaustive of bioinformatics data types

  • Sam
  • Bam
  • Bai
    • BamBai (.bam + .bai)
  • Bed
  • Vcf
    • VcfIdx (.vcf + .vcf.idx)
  • CompressedVcf
    • VcfTabix
  • Fastq
  • FastaFai (.fasta + .fasta.fai)
    • Fasta (FastaFai + fasta.amb, fasta.ann, fasta.bwt, fasta.pac, fasta.sa)
      • FastaWithDict (Fasta + .dict)

* Nested means inherited.

Testing

I'm not sure how to unit tests these tools, they're mostly definitions and as long as the syntax is fine, it's hard to logically test them without functionally running them with test data.

As Janis contains unit tests, there should be sufficient coverage for the base of the command tools.

There are plans to build a module to functionally test these tools (with some test data), however that will probably be a little while away.

Documentation

Documentation is generated on Janis. To generate new documentation you will need to:

  1. Commit your changes here,
  2. Update the submodule pointer on Janis,
  3. Checkout Janis (recursively),
  4. Run the regenerate script janis/docs/regeneratedocumentation.py,
  5. Commit these changes and the documentation will autobuild on ReadTheDocs.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

janis-pipelines.bioinformatics-0.0.7.tar.gz (84.6 kB view hashes)

Uploaded Source

Built Distribution

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page