IPython notebook generator for GGR CWL processing pipelines of genomic data

ggr-cwl-ipynb-gen

Jupyter notebook generator to download and execute the processing files for GGR-related datasets. At this point, it is not intended to cover all use cases, but to serve as a quick generator of all the files and scripts needed to pre-process genomic data generated at the Duke-GCB Sequencing Core in HARDAC.

Example of usage:

$ python ggr_cwl_ipynb_gen.py \
  --root-dir /path/to/rootdir \
  --metadata examples/Hong_3979_170316B1.xlsx \
  --out /path/to/output_dir \
  --force

The example metadata and configuration files show what information is needed to download and pre-process the samples.
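
When several metadata files need processing, the same invocation can be assembled from Python. The following is a minimal sketch, not part of the project: the helper name `build_command` is hypothetical, and the paths are the placeholders from the example above. Only flags shown in the example are used.

```python
import shlex
import sys


def build_command(metadata, root_dir, out, force=False,
                  script="ggr_cwl_ipynb_gen.py"):
    """Assemble the argument list for one generator run (hypothetical helper)."""
    cmd = [
        sys.executable, script,
        "--root-dir", str(root_dir),
        "--metadata", str(metadata),
        "--out", str(out),
    ]
    if force:
        # --force overwrites an existing output file
        cmd.append("--force")
    return cmd


if __name__ == "__main__":
    cmd = build_command(
        metadata="examples/Hong_3979_170316B1.xlsx",
        root_dir="/path/to/rootdir",
        out="/path/to/output_dir",
        force=True,
    )
    # Print the command for inspection; to execute it, pass the list to
    # subprocess.run(cmd, check=True) instead.
    print(shlex.join(cmd))
```

Building the command as a list, rather than a single string, avoids shell quoting problems when paths contain spaces.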

For a full list of options:

$ python ggr_cwl_ipynb_gen.py -h
usage: Generator of Jupyter notebooks to execute CWL pre-processing pipelines [-h] -o OUT -m METADATA [-f] [-n] [--metadata-sep SEP] [--project-name PROJECT_NAME]
                                                                              [--data-from {sftp,miseq,other,dukeds,local}] [-c CONF_FILE] [-u USER] [-e USER_DUKE_EMAIL] [-r ROOT_DIR]

optional arguments:
  -h, --help            show this help message and exit
  -o OUT, --out OUT     Jupyter notebook output file name
  -m METADATA, --metadata METADATA
                        Metadata file with samples information
  -f, --force           Force to overwrite output file
  -n, --no-upload       Avoids uploading generated data to database when specified
  --metadata-sep SEP    Separator for metadata file (when different than Excel spread sheet)
  --project-name PROJECT_NAME
                        Project name (by default, basename of metadata file name)
  --data-from {sftp,miseq,other,dukeds,local}
                        Choices: sftp, miseq, other, dukeds, local
  -c CONF_FILE, --conf-file CONF_FILE
                        YAML configuration file (see examples)
  -u USER, --user USER  HARDAC User used in SLURM (default: ${USER})
  -e USER_DUKE_EMAIL, --user-duke-email USER_DUKE_EMAIL
                        Email(s) notified when execution is finished (default: ${USER}@duke.edu)
  -r ROOT_DIR, --root-dir ROOT_DIR
                        Root directory where all subfolders and files will be created (semi-required: either defined here or in conf-file)
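
To make the defaults in the help text concrete, here is a minimal sketch of an `argparse` parser with the same flags, plus a helper that fills in the documented defaults. This is an illustration reconstructed from the help output, not the project's actual source. The function names, the `root_dir` key in the configuration dictionary, the choice to strip the file extension from the project name, and the precedence of the command line over the configuration file are all assumptions.

```python
import argparse
import os


def build_parser():
    """Build a parser that mirrors the flags in the help text (sketch only)."""
    parser = argparse.ArgumentParser(
        description="Generator of Jupyter notebooks to execute CWL "
                    "pre-processing pipelines")
    parser.add_argument("-o", "--out", required=True,
                        help="Jupyter notebook output file name")
    parser.add_argument("-m", "--metadata", required=True,
                        help="Metadata file with samples information")
    parser.add_argument("-f", "--force", action="store_true")
    parser.add_argument("-n", "--no-upload", action="store_true")
    parser.add_argument("--metadata-sep", dest="sep", metavar="SEP")
    parser.add_argument("--project-name")
    parser.add_argument("--data-from",
                        choices=["sftp", "miseq", "other", "dukeds", "local"])
    parser.add_argument("-c", "--conf-file")
    parser.add_argument("-u", "--user")
    parser.add_argument("-e", "--user-duke-email")
    parser.add_argument("-r", "--root-dir")
    return parser


def resolve_defaults(args, conf=None, environ=None):
    """Apply the defaults described in the help text (hypothetical helper)."""
    conf = conf or {}
    environ = os.environ if environ is None else environ

    # -u defaults to ${USER}; -e defaults to ${USER}@duke.edu
    user = args.user or environ.get("USER", "")
    email = args.user_duke_email or f"{user}@duke.edu"

    # --project-name defaults to the basename of the metadata file
    # (assumption: without the file extension)
    basename = os.path.basename(args.metadata)
    project = args.project_name or os.path.splitext(basename)[0]

    # --root-dir is semi-required: command line or conf file
    # (assumption: the command line wins, and the conf key is "root_dir")
    root_dir = args.root_dir or conf.get("root_dir")
    if not root_dir:
        raise ValueError(
            "root directory must be set with --root-dir or in the conf file")

    return {"user": user, "user_duke_email": email,
            "project_name": project, "root_dir": root_dir}
```

For example, parsing `-o nb.ipynb -m examples/Hong_3979_170316B1.xlsx -r /path/to/rootdir` and passing the result to `resolve_defaults` yields the project name `Hong_3979_170316B1` under these assumptions.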

Download files

Source Distribution

ggr-cwl-ipynb-gen-0.5.0.tar.gz (17.2 kB)

Uploaded Source

Built Distribution

ggr_cwl_ipynb_gen-0.5.0-py3-none-any.whl (28.8 kB)

Uploaded Python 3

File details

Details for the file ggr-cwl-ipynb-gen-0.5.0.tar.gz.

File metadata

  • Download URL: ggr-cwl-ipynb-gen-0.5.0.tar.gz
  • Size: 17.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/0.20 pkginfo/1.7.1 requests/2.21.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.7.6

File hashes

Hashes for ggr-cwl-ipynb-gen-0.5.0.tar.gz
Algorithm    Hash digest
SHA256       46d860c8f5395bc269baf7eb8fed112271f3989985404b6ca12c22d3adc6db94
MD5          ec192d7258286eea3c0f82ef71f69360
BLAKE2b-256  049184205fc51055889f11af1641b6491ce462eea713d1b438e1ffb68bec4bbe

File details

Details for the file ggr_cwl_ipynb_gen-0.5.0-py3-none-any.whl.

File metadata

  • Download URL: ggr_cwl_ipynb_gen-0.5.0-py3-none-any.whl
  • Size: 28.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/0.20 pkginfo/1.7.1 requests/2.21.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.7.6

File hashes

Hashes for ggr_cwl_ipynb_gen-0.5.0-py3-none-any.whl
Algorithm    Hash digest
SHA256       61d0352f6b424a831d32838a1dbbad08ebc9f27a2f8ccfe1b7396a1fb0a71b42
MD5          68677de1563cae6e24324b079851cbee
BLAKE2b-256  b5dbe3a56f2a3e4c69c9a5d2587a26e92eacd6277853fbc2ba51c362ae255872
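
The SHA256 digests listed for the two distribution files can be checked locally after download. The sketch below uses only the standard library `hashlib` module. The function names are hypothetical, and it assumes the files were saved under their original names in the current directory.

```python
import hashlib

# SHA256 digests copied from the file listings above
EXPECTED_SHA256 = {
    "ggr-cwl-ipynb-gen-0.5.0.tar.gz":
        "46d860c8f5395bc269baf7eb8fed112271f3989985404b6ca12c22d3adc6db94",
    "ggr_cwl_ipynb_gen-0.5.0-py3-none-any.whl":
        "61d0352f6b424a831d32838a1dbbad08ebc9f27a2f8ccfe1b7396a1fb0a71b42",
}


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected_hex):
    """True if the file's SHA256 digest matches the expected hex string."""
    return sha256_of(path) == expected_hex.lower()
```

A call such as `verify(name, EXPECTED_SHA256[name])` for each downloaded file returns `False` if the file was corrupted or altered in transit.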
