Skip to main content

Batch processing for Wormcat data

Project description

WormCat Batch

Overview

WormCat Batch is a command line tool that allows you to batch multiple runs of WormCat from a Microsoft Excel File or a directory path with WormCat formatted csv files.

Note: You can also run WormCat Batch as a docker container. Find more information on running WormCat Batch as a Docker container here.

Prerequisites

WormCat Batch requires Python 3.5+ and R 3.4.1+ with a WormCat package installed.

If you are unsure if you have WormCat installed, you can run find.package("wormcat") from an R command prompt.

If WormCat is not installed, you can follow the directions here to install WormCat.

Note: WormCat can be installed as an R package you; do NOT need to checkout the source unless you intend to modify WormCat. The readme file explains how to install WormCat as an R package.

Excel spreadsheet Naming Conventions

Once you have the R package installed, you will create an Excel Spreadsheet with the required data for batch execution.

See the file Example/Murphy_TS.xsl for details on formating the Excel.

Note:

  • The Spreadsheet Name should ONLY be composed of Letters, Numbers and Underscores (_) and has an extension .xlsx, .xlt, .xls
  • The Sheet Names within the spreadsheet should ONLY be composed of Letters, Numbers, and Underscores (_) other characters may cause the batch process to fail!
  • Each sheet requires a column name 'Sequence ID' or 'Wormbase ID' (This column name is case-sensitive)

To Run the Batch Process

To run the batch process, open a terminal window, change the directory to your project directory.

$pip install wormcat_batch
$wormcat_cli --help

After execution, the Output Directory will contain all the WormCat run data and a summary Excel spreadsheet.

Sample Output

Local development /test hints

Setup to test

  • conda activate <appropriate_env>
  • pip install .

Test

  • cd /Users/dan/delme #some working directory
  • wormcat_cli --input-excel ${PROJ_HOME}/Example/Murphy_TS.xlsx --output-path ./output

Deploy

  • Advance the version number in setup.py
  • conda deactivate # twine is installed in base env
  • increment setup.py version
  • cd <project directory>
  • rm -rf ./dist
  • rm -rf ./wormcat_batch.egg-info
  • python setup.py sdist
  • twine check dist/*
  • twine upload --repository pypi dist/*
  • git add .
  • git commit -m "some comment"
  • git push

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

wormcat_batch-1.1.9.tar.gz (87.7 kB view details)

Uploaded Source

File details

Details for the file wormcat_batch-1.1.9.tar.gz.

File metadata

  • Download URL: wormcat_batch-1.1.9.tar.gz
  • Upload date:
  • Size: 87.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.16

File hashes

Hashes for wormcat_batch-1.1.9.tar.gz
Algorithm Hash digest
SHA256 7775edf80c73e1fb83ce58abc642fe73684863ea0ee270145ed47b45897933f6
MD5 801c2c810d422317ad4c6497a645a9af
BLAKE2b-256 76953d6d2b967125fff6c88fdf9a20508e0a1e0bddf9b0cdd83e27fe048f3b0c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page