PoreRefiner

To help you manage your pores

Introduction

PoreRefiner is a software tool that watches Nanopore runs in progress and attaches sample information to them. It also provides interfaces for integration with LIMS services and other online systems. It supports both push and pull modes of data exchange with those systems: push, via a series of configurable notifiers, and pull, via a simple Flask web service and a Protobuf RPC service. It also includes a command-line interface for working with the run database.

Installation

PoreRefiner is available as a Python package:

pip install porerefiner

Copy the files porerefiner.service and porerefiner.app.service from the package to systemd:

cp /usr/local/lib/python3.7/dist-packages/porerefiner.service /lib/systemd/system
cp /usr/local/lib/python3.7/dist-packages/porerefiner.app.service /lib/systemd/system
systemctl enable porerefiner.service
systemctl enable porerefiner.app.service

Once the package is installed, porerefinerd and prfr should be on your path. Use porerefinerd init to set up the config file for the porerefiner service. It will prompt you for the locations of the database, the local socket, and Nanopore's output directory, and for where the config file should be saved:

$ porerefinerd init
create PoreRefiner config at /etc/porerefiner/config.yaml? [y/N]: y
location of porerefiner RPC socket? [/etc/porerefiner/porerefiner.sock]:
location of database? [/etc/porerefiner/database.db]:
nanopore data output location?: /data
export POREREFINER_CONFIG="/etc/porerefiner/config.yaml"
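
The last line of the transcript suggests that the config location is communicated through the POREREFINER_CONFIG environment variable. As a minimal sketch of how a script might locate the same file, the following reads that variable and falls back to the default path shown in the prompt. The function name and the fallback order are assumptions for illustration, not PoreRefiner's own lookup logic.

```python
import os
from pathlib import Path

# Default path taken from the init transcript; using it as a fallback is an assumption.
DEFAULT_CONFIG = Path("/etc/porerefiner/config.yaml")


def config_path(environ=None):
    """Return the path named by POREREFINER_CONFIG, or the default if it is unset."""
    env = os.environ if environ is None else environ
    value = env.get("POREREFINER_CONFIG")
    return Path(value) if value else DEFAULT_CONFIG
```

Passing a dictionary as environ makes the function easy to exercise without touching the real environment.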

Add the following submitters section to the end of config.yaml:

submitters:
- class: HpcSubmitter
  config:
    login_host: login1-raven2.fda.gov
    username: nanopore
    private_key_path: /root/.ssh/nanopore
    known_hosts_path: /root/.ssh/known_hosts
    scheduler: uge
    queue: service.q
  jobs:
  - class: FdaRunJob
    config:
      command: module load nanopore-lims/0.1.0 && nanopore_HPC {remote_json} &
      platform: GridION sequence
      closure_status_recipients:
      - justin.payne@fda.hhs.gov
      import_ready_recipients:
      - justin.payne@fda.hhs.gov

This configures PoreRefiner for the FDA Raven integration. Then you can start the porerefiner services:

systemctl start porerefiner.service
systemctl start porerefiner.app.service

If you wish to enable the PoreRefiner web interface, you should ensure that port 8844 is reachable from remote hosts.
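
One way to confirm that the port is reachable is to attempt a TCP connection from another machine. The sketch below uses only the Python standard library. The function name is hypothetical, and the check only shows that something accepts connections on the port, not that the PoreRefiner web interface is the thing answering.

```python
import socket


def port_is_reachable(host, port=8844, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False
```

Call it from a remote host with the sequencer's hostname, for example port_is_reachable("gridion.example.org"), where the hostname is a placeholder.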

Using this software

prfr is the end-user client. MinION users should use this tool to monitor runs in progress, load sample sheets, and tag runs and samples.

$ prfr --help
Usage: prfr [OPTIONS] COMMAND [ARGS]...

  Command line interface for PoreRefiner, a Nanopore run manager.

Options:
--help  Show this message and exit.

Commands:
info      Return information about a run, historical or in progress.
load      Load a sample sheet to be attached to a run, or to the next run...
ps        Show runs in progress, or every tracked run (--all), or with a...
tag       Add one or more tags to a run.
template  Write a sample sheet template to STDOUT.
untag     Remove one or more tags from a run.
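
Because prfr is an ordinary command-line program, it can also be driven from a script. The following is a minimal sketch that wraps the ps command and its --all flag, both of which appear in the help text above. The function names are hypothetical, and since the output format of prfr ps is not documented here, the text is returned unparsed.

```python
import subprocess


def prfr_ps_command(all_runs=False):
    """Build the argument list for `prfr ps`, adding --all when requested."""
    cmd = ["prfr", "ps"]
    if all_runs:
        cmd.append("--all")
    return cmd


def list_runs(all_runs=False):
    """Run `prfr ps` and return its raw standard output as text.

    Raises subprocess.CalledProcessError if prfr exits with a non-zero status.
    """
    result = subprocess.run(
        prfr_ps_command(all_runs),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

Keeping the command construction separate from the subprocess call lets the argument list be checked on a machine where prfr is not installed.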

Administration

While the PoreRefiner service is stopped, porerefinerd offers a number of administrative functions:

$ porerefinerd --help
Usage: porerefiner.py [OPTIONS] COMMAND [ARGS]...

Options:
--help  Show this message and exit.

Commands:
init    Find the Nanopore output directory and create the config file.
list    List job system stuff.
reset   Utility function to reset various state.
start   Start the PoreRefiner service.
verify  Run various checks.
$ porerefinerd init --help
Usage: porerefiner.py init [OPTIONS]

Find the Nanopore output directory and create the config file.

Options:
--config TEXT
--nanopore_dir TEXT
--help               Show this message and exit.
$ porerefinerd list --help
Usage: porerefiner.py list [OPTIONS] COMMAND [ARGS]...

List job system stuff.

Options:
--help  Show this message and exit.

Commands:
jobs        List the configurable and configured jobs.
notifiers   List the configurable and configured notifiers.
submitters  List the configureable and configured submitters.
$ porerefinerd reset --help
Usage: porerefiner.py reset [OPTIONS] COMMAND [ARGS]...

Utility function to reset various state.

Options:
--help  Show this message and exit.

Commands:
config        Reset config to defaults.
database      Reset database to empty state.
jobs          Reset all jobs to a particular status.
runs          Reset all runs to in-progress status.
samplesheets  Clear samplesheets that aren't attached to any run.
$ porerefinerd verify --help
Usage: porerefiner.py verify [OPTIONS] COMMAND [ARGS]...

Run various checks.

Options:
--help  Show this message and exit.

Commands:
notifiers   Verify notifiers by sending notifications.
submitters  Verify configuration of job submitters by running their tests.

Features

Automatic detection of runs in progress

Sample sheet and sample tracking through the flowcell/run context, and beyond

Schedule automatic analysis of runs and files in AWS or your HPC

How it works

PoreRefiner uses fsevents to detect filesystem events during a Nanopore run, including the creation of new directories in the Nanopore output folder. Flowcells, runs, and run files can be detected this way. PoreRefiner updates a SQLite database with run information, including what it is able to pull out of MinKNOW.
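
The detect-and-record cycle described above can be illustrated with the standard library alone. Note that this sketch swaps filesystem event notification for simple polling with os.scandir, and that the table name, schema, and function names are hypothetical. It is not PoreRefiner's implementation, only a small model of "notice a new directory, write a row to SQLite".

```python
import os
import sqlite3


def open_database(path=":memory:"):
    """Open a SQLite database with a hypothetical one-column runs table."""
    conn = sqlite3.connect(path)
    conn.execute("CREATE TABLE IF NOT EXISTS runs (path TEXT PRIMARY KEY)")
    return conn


def scan_directories(root):
    """Return the set of immediate subdirectory paths under root."""
    with os.scandir(root) as entries:
        return {entry.path for entry in entries if entry.is_dir()}


def record_new_directories(conn, root, seen):
    """Insert directories that were not seen on earlier scans; return them sorted.

    `seen` is a set owned by the caller and is updated in place.
    """
    current = scan_directories(root)
    new = sorted(current - seen)
    with conn:  # commits the inserts on success, rolls back on error
        conn.executemany(
            "INSERT OR IGNORE INTO runs (path) VALUES (?)",
            [(path,) for path in new],
        )
    seen.update(current)
    return new
```

Calling record_new_directories in a loop with a short sleep approximates what an event-driven watcher does, at the cost of some latency and repeated directory scans.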

If none of the files of a run has been modified in an hour, PoreRefiner marks a completion time for that run. Any individual file in a run that has not been modified in an hour may be picked up by the job runner for subsequent processing.
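
The one-hour rule can be expressed as two small checks over file modification times. This is a sketch under stated assumptions: the function names are hypothetical, a file exactly one hour old is treated as idle, and a run with no files is treated as not complete. None of those details is confirmed by PoreRefiner itself.

```python
import time

IDLE_SECONDS = 60 * 60  # the one-hour threshold described above


def idle_files(mtimes, now=None, threshold=IDLE_SECONDS):
    """Return the sorted names of files last modified at least `threshold` seconds ago.

    `mtimes` maps a file name to its modification time in seconds since the epoch.
    """
    now = time.time() if now is None else now
    return sorted(name for name, mtime in mtimes.items() if now - mtime >= threshold)


def run_is_complete(mtimes, now=None, threshold=IDLE_SECONDS):
    """Return True when the run has at least one file and every file is idle."""
    now = time.time() if now is None else now
    return bool(mtimes) and len(idle_files(mtimes, now, threshold)) == len(mtimes)
```

With one file untouched for two hours and another written a minute ago, idle_files returns only the first, and run_is_complete returns False until the second file has also been idle for an hour.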

PoreRefiner presents several interfaces to address integration challenges:

A command-line interface for both human use and simple scripting

A simple HTTP service for communication with LIMS and other services

A Protobuf-RPC service for inter-process communication (Protobuf bindings are available in Python, C, JavaScript, Java, and many other languages)

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

History

0.8.4 (2020-03-23)

  • New plugin architecture, combined with a cookiecutter definition for creating new ones
  • Moved several in-progress job/submitter types to plugins

0.8.3 (2020-03-13)

  • prfr now recognizes site config if user config doesn’t exist

0.8.2 (2020-03-12)

  • Fixed tests, general bugfixes

0.8.1 (2020-03-11)

  • Improved service files.

0.8.0 (2020-03-09)

  • First release on PyPI.
