IPython notebook generator for GGR CWL processing pipelines of genomic data
ggr-cwl-ipynb-gen
Jupyter notebook generator to download and execute the processing files for GGR-related datasets. At this point, it is not intended to cover all use cases, but to serve as a quick generator of all the related files and scripts needed to pre-process genomic data generated at the Duke-GCB Sequencing Core in HARDAC.
Example of usage:
$ python ggr_cwl_ipynb_gen.py \
--root-dir /path/to/rootdir \
--metadata examples/Hong_3979_170316B1.xlsx \
--out /path/to/output_dir \
--force
The example metadata and configuration files show what information is needed to download and pre-process the samples.
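The same invocation can be scripted from Python, for instance when generating notebooks for several metadata files. The following is a minimal sketch, assuming the script `ggr_cwl_ipynb_gen.py` is in the current working directory; the helper names `build_command` and `run_generator` are hypothetical and are not part of the package. Only flags shown in the example above are used.

```python
import subprocess
import sys


def build_command(root_dir, metadata, out, force=False):
    """Assemble the argument list for the example invocation shown above."""
    cmd = [
        sys.executable, "ggr_cwl_ipynb_gen.py",  # assumes the script is in the CWD
        "--root-dir", root_dir,
        "--metadata", metadata,
        "--out", out,
    ]
    if force:
        cmd.append("--force")  # overwrite existing output
    return cmd


def run_generator(root_dir, metadata, out, force=False):
    """Run the generator and raise CalledProcessError on a non-zero exit code."""
    subprocess.run(build_command(root_dir, metadata, out, force), check=True)
```

Passing the arguments as a list avoids shell quoting problems with paths that contain spaces.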
For a full list of options:
$ python ggr_cwl_ipynb_gen.py -h
usage: Generator of Jupyter notebooks to execute CWL pre-processing pipelines [-h] -o OUT -m METADATA [-f] [-n] [--metadata-sep SEP] [--project-name PROJECT_NAME]
       [--data-from {sftp,miseq,other,dukeds,local}] [-c CONF_FILE] [-u USER] [-e USER_DUKE_EMAIL] [-r ROOT_DIR]

optional arguments:
  -h, --help            show this help message and exit
  -o OUT, --out OUT     Jupyter notebook output file name
  -m METADATA, --metadata METADATA
                        Metadata file with samples information
  -f, --force           Force to overwrite output file
  -n, --no-upload       Avoids uploading generated data to database when specified
  --metadata-sep SEP    Separator for metadata file (when different than Excel spread sheet)
  --project-name PROJECT_NAME
                        Project name (by default, basename of metadata file name)
  --data-from {sftp,miseq,other,dukeds,local}
                        Choices: sftp, miseq, other, dukeds, local
  -c CONF_FILE, --conf-file CONF_FILE
                        YAML configuration file (see examples)
  -u USER, --user USER  HARDAC User used in SLURM (default: ${USER})
  -e USER_DUKE_EMAIL, --user-duke-email USER_DUKE_EMAIL
                        Email(s) notified when execution is finished (default: ${USER}@duke.edu)
  -r ROOT_DIR, --root-dir ROOT_DIR
                        Root directory where all subfolders and files will be created (semi-required: either defined here or in conf-file)
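The help text describes a few fallbacks: the root directory may come from the command line or from the configuration file, the project name defaults to the basename of the metadata file, and the notification email defaults to `${USER}@duke.edu`. The sketch below illustrates how such fallbacks could be resolved. It is not the package's implementation. Several details are assumptions: the configuration key `root_dir`, the precedence of the command-line value over the configuration file, and stripping the file extension from the project name. All function names are hypothetical.

```python
import os


def resolve_root_dir(cli_root_dir, conf):
    """Pick the root directory from the CLI value or an already-loaded conf dict.

    Assumptions: the conf key is named "root_dir" and the CLI value wins.
    """
    root_dir = cli_root_dir or conf.get("root_dir")
    if not root_dir:
        raise ValueError("root directory must be set with --root-dir or in the conf file")
    return root_dir


def default_project_name(metadata_path):
    """Basename of the metadata file; dropping the extension is an assumption."""
    return os.path.splitext(os.path.basename(metadata_path))[0]


def default_email(user):
    """Default notification address as documented: ${USER}@duke.edu."""
    return f"{user}@duke.edu"
```

For the example metadata file, `default_project_name("examples/Hong_3979_170316B1.xlsx")` yields `Hong_3979_170316B1` under these assumptions.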
Download files
File details
Details for the file ggr-cwl-ipynb-gen-0.5.0.tar.gz.
File metadata
- Download URL: ggr-cwl-ipynb-gen-0.5.0.tar.gz
- Upload date:
- Size: 17.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/0.20 pkginfo/1.7.1 requests/2.21.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.7.6
File hashes
Algorithm | Hash digest
---|---
SHA256 | 46d860c8f5395bc269baf7eb8fed112271f3989985404b6ca12c22d3adc6db94
MD5 | ec192d7258286eea3c0f82ef71f69360
BLAKE2b-256 | 049184205fc51055889f11af1641b6491ce462eea713d1b438e1ffb68bec4bbe
File details
Details for the file ggr_cwl_ipynb_gen-0.5.0-py3-none-any.whl.
File metadata
- Download URL: ggr_cwl_ipynb_gen-0.5.0-py3-none-any.whl
- Upload date:
- Size: 28.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/0.20 pkginfo/1.7.1 requests/2.21.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.7.6
File hashes
Algorithm | Hash digest
---|---
SHA256 | 61d0352f6b424a831d32838a1dbbad08ebc9f27a2f8ccfe1b7396a1fb0a71b42
MD5 | 68677de1563cae6e24324b079851cbee
BLAKE2b-256 | b5dbe3a56f2a3e4c69c9a5d2587a26e92eacd6277853fbc2ba51c362ae255872
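A downloaded file can be checked against the SHA256 digests listed above. The following is a minimal sketch using only the Python standard library; the helper names `sha256_of_file` and `verify_download` are hypothetical, and the check assumes the file keeps its original name on disk.

```python
import hashlib
import os

# SHA256 digests copied from the file details listed above.
EXPECTED_SHA256 = {
    "ggr-cwl-ipynb-gen-0.5.0.tar.gz":
        "46d860c8f5395bc269baf7eb8fed112271f3989985404b6ca12c22d3adc6db94",
    "ggr_cwl_ipynb_gen-0.5.0-py3-none-any.whl":
        "61d0352f6b424a831d32838a1dbbad08ebc9f27a2f8ccfe1b7396a1fb0a71b42",
}


def sha256_of_file(path, chunk_size=1 << 20):
    """Hash the file in chunks so large files are not read into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path):
    """Return True if the file's SHA256 matches the digest listed for its name.

    Raises KeyError when the file name is not one of the two listed files.
    """
    expected = EXPECTED_SHA256[os.path.basename(path)]
    return sha256_of_file(path) == expected
```

A `False` result means the file content differs from the published release and the download should be repeated.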