Tools to make pipelines easier to run and distribute for large biological datasets

ModuleUltra
===========

.. image:: https://img.shields.io/circleci/project/github/dcdanko/ModuleUltra/master.svg
    :alt: CircleCI master

.. image:: https://img.shields.io/pypi/v/ModuleUltra.svg
    :target: https://pypi.python.org/pypi/ModuleUltra
    :alt: Latest PyPI version

.. image:: https://img.shields.io/pypi/pyversions/ModuleUltra.svg
    :alt: PyPI - Python version

.. image:: https://img.shields.io/pypi/dm/ModuleUltra.svg
    :alt: PyPI - Downloads

.. image:: https://img.shields.io/github/license/dcdanko/ModuleUltra.svg
    :alt: Github license

Easy to use pipelines for large biological datasets.

Goals
-----

Bioinformatics pipelines often involve large numbers of files with complex organization, metadata, and paths, along with many dependencies. ModuleUltra is intended to make it easier to run and distribute complex pipelines. It is built on top of DataSuper and SnakeMake, and it defines both an API and a CLI.

ModuleUltra is probably overkill for small projects. It was designed in particular for the MetaSUB project, which has thousands of samples and complex analysis pipelines. ModuleUltra makes it easier to run consistent analyses on thousands of samples across many sites.

MetaSUB is also developing a program called DataSuper, which tracks complex data and metadata.

In summary:

- ModuleUltra makes it easy to install complex pipelines
- ModuleUltra makes it easy to run pipelines on only selected subsets of the data
- ModuleUltra makes it easy to run parts of large, complex pipelines
- ModuleUltra automatically tracks the output of pipelines in DataSuper
- ModuleUltra allows you to write most of your pipeline's logic in SnakeMake, a popular pipeline system
- ModuleUltra reduces the amount of boilerplate code necessary to make a pipeline

ModuleUltra is intended to improve reproducibility; features are being added to improve file provenance.

Installation
------------

Be aware that ModuleUltra is still in alpha. There are still bugs and some unimplemented features.

ModuleUltra is currently being used on Ubuntu and RHEL systems. It should work on any Unix-like system.

To install:


.. code-block:: bash

    git clone <url>
    python setup.py develop


Licence
-------
MIT License

Authors
-------

`ModuleUltra` was written by `David C. Danko <dcdanko@gmail.com>`_.

