
Create interactive comparative sequencing coverage plots from virus sequencing data.


wgscovplot

wgscovplot generates interactive comparative sequencing coverage plots in self-contained, offline-friendly HTML files with optional annotation of variant calling results, PCR amplicon coverage and genetic features.

Installation

PyPI

Install from PyPI with pip

pip install wgscovplot

If the installation was successful, you should be able to type wgscovplot --help and get a help message on how to use the tool.
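The same check can be scripted, for example in a CI job. This is a minimal sketch that assumes only that pip placed the wgscovplot executable on your PATH; the helper name cli_available is made up for illustration and is not part of wgscovplot.

```python
import shutil
import subprocess


def cli_available(command: str = "wgscovplot") -> bool:
    """Return True if `command` is on PATH and `command --help` exits with status 0."""
    executable = shutil.which(command)
    if executable is None:
        # Not installed, or the virtual environment is not activated
        return False
    result = subprocess.run(
        [executable, "--help"],
        capture_output=True,
        text=True,
        check=False,
    )
    return result.returncode == 0


if __name__ == "__main__":
    print("wgscovplot available:", cli_available())
```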

Source

Use pip to install from source:

# optionally, create a virtual environment
python -m venv venv
source venv/bin/activate
# install from GitHub repo
pip install git+https://github.com/nhhaidee/wgscovplot.git
# run wgscovplot
wgscovplot --help
wgscovplot /path/to/results_folder

Features

  • Easy-to-use: Simply provide a Nextflow output directory and wgscovplot will figure out which files it needs to generate its interactive sequencing coverage plots
  • Fully-interactive plots featuring:
    • Zoom, scroll, pan, select regions of interest
    • Informative tooltips highlighting variant calling results and coverage statistics across all samples being shown
    • Change the y-axis scale to linear or log scale
    • Select which samples to show (and which Influenza gene segments to show)
    • Highlight regions of interest (e.g. genetic features, primer/probe binding sites, low coverage regions)
  • Annotate coverage plots with variant calling results from multiple different variant callers and variant effect results from SnpEff/SnpSift
  • Compare sequencing coverage across multiple samples
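
To make the "low coverage regions" idea concrete, here is a small sketch of how such regions could be derived from per-position depths. It is an illustration only, not wgscovplot's actual implementation: the function name, the 1-based inclusive coordinates and the strict "less than" comparison are all assumptions.

```python
from typing import List, Sequence, Tuple


def low_coverage_regions(
    depths: Sequence[int], threshold: int = 10
) -> List[Tuple[int, int]]:
    """Return 1-based inclusive (start, end) runs where depth < threshold."""
    regions: List[Tuple[int, int]] = []
    start = None  # 0-based index where the current low-coverage run began
    for index, depth in enumerate(depths):
        if depth < threshold:
            if start is None:
                start = index
        elif start is not None:
            # Run ended at the previous position (index - 1, i.e. `index` in 1-based terms)
            regions.append((start + 1, index))
            start = None
    if start is not None:
        # Run extends to the end of the reference
        regions.append((start + 1, len(depths)))
    return regions


if __name__ == "__main__":
    print(low_coverage_regions([0, 5, 12, 30, 9, 9, 50, 3]))
```

With the depths above and the default threshold of 10, positions 1-2, 5-6 and 8 fall below the threshold.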

Usage

Basic usage will output a wgscovplot.html file in the current directory:

wgscovplot /path/to/results_folder
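
If you want to call wgscovplot from a Python script or pipeline step, one option is a thin subprocess wrapper. The sketch below uses only options that appear in the --help output (--output-html, --low-coverage-threshold, --primers-fasta, --force); the function names build_command and run_wgscovplot are hypothetical helpers, not part of the package.

```python
import subprocess
from typing import List, Optional


def build_command(
    input_dir: str,
    output_html: str = "wgscovplot.html",
    low_coverage_threshold: int = 10,
    primers_fasta: Optional[str] = None,
    force: bool = False,
) -> List[str]:
    """Assemble the wgscovplot command line as an argument list."""
    cmd = [
        "wgscovplot",
        "--output-html", output_html,
        "--low-coverage-threshold", str(low_coverage_threshold),
    ]
    if primers_fasta is not None:
        cmd += ["--primers-fasta", primers_fasta]
    if force:
        cmd.append("--force")
    cmd.append(input_dir)  # positional INPUT_DIR argument goes last
    return cmd


def run_wgscovplot(input_dir: str, **options) -> None:
    """Run wgscovplot and raise CalledProcessError if it exits non-zero."""
    subprocess.run(build_command(input_dir, **options), check=True)


if __name__ == "__main__":
    print(" ".join(build_command("/path/to/results_folder", force=True)))
```

Passing the arguments as a list avoids shell quoting problems with paths that contain spaces.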

Show help info with $ wgscovplot --help:

 Usage: wgscovplot [OPTIONS] INPUT_DIR

╭─ Arguments ──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
│ *    input_dir      PATH  Directory containing Mosdepth and variant calling results from sequence analysis. For example, the output directory from execution of the nf-core/viralrecon or CFIA-NCFAD/nf-flu Nextflow workflow                │
│                           [default: None]                                                                                                                                                                                                    │
│                           [required]                                                                                                                                                                                                         │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Options ────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
│ --output-html             -o                          PATH                             wgscovplot HTML output file [default: wgscovplot.html]                                                                                                │
│ --primers-fasta           -p                          PATH                             FASTA file containing real-time PCR primer/probe sequences. [default: None]                                                                           │
│ --low-coverage-threshold  -l                          INTEGER                          Low sequencing coverage threshold. [default: 10]                                                                                                      │
│ --edit-distance           -d                          INTEGER                          The maximum differences or 'edits' allowed between real-time PCR primer/probe sequences and the sample sequences. [default: 0]                        │
│ --compress-depths             --no-compress-depths                                     Compress coverage depth arrays? [default: compress-depths]                                                                                            │
│ --verbose                 -v                                                           Verbose logs                                                                                                                                          │
│ --force                   -f                                                           Force overwrite of existing output files                                                                                                              │
│ --version                     --no-version                                             Print wgscovplot version and exit [default: no-version]                                                                                               │
│ --install-completion                                  [bash|zsh|fish|powershell|pwsh]  Install completion for the specified shell. [default: None]                                                                                           │
│ --show-completion                                     [bash|zsh|fish|powershell|pwsh]  Show completion for the specified shell, to copy it or customize the installation. [default: None]                                                    │
│ --help                                                                                 Show this message and exit.                                                                                                                           │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯

 wgscovplot version 0.3.0; Python 3.11.6
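
The --edit-distance option limits how many differences are tolerated when locating primer/probe sequences in a sample sequence. As a rough illustration of what that means, the sketch below finds the best approximate occurrence of a primer inside a longer sequence using a standard dynamic-programming approach (semi-global edit distance). This is not wgscovplot's code: the function names are hypothetical, the tool may well use a dedicated alignment library, and reverse-complement matching is not handled here.

```python
from typing import Tuple


def best_match(primer: str, sequence: str) -> Tuple[int, int]:
    """Return (edit distance, end) of the best match of `primer` in `sequence`.

    `end` is the 0-based exclusive end position of the first best match.
    """
    # previous[j]: fewest edits aligning the primer prefix seen so far
    # to some substring of `sequence` that ends at position j.
    previous = [0] * (len(sequence) + 1)  # an empty primer matches anywhere for free
    for i, p in enumerate(primer, start=1):
        current = [i] + [0] * len(sequence)
        for j, s in enumerate(sequence, start=1):
            current[j] = min(
                previous[j - 1] + (p != s),  # match or substitution
                previous[j] + 1,             # primer base absent from the sequence
                current[j - 1] + 1,          # extra base in the sequence
            )
        previous = current
    distance = min(previous)
    return distance, previous.index(distance)


def primer_matches(primer: str, sequence: str, max_edits: int = 0) -> bool:
    """True if the primer occurs in the sequence with at most `max_edits` edits."""
    return best_match(primer, sequence)[0] <= max_edits


if __name__ == "__main__":
    print(primer_matches("ACGT", "TTACCTTT", max_edits=0))
    print(primer_matches("ACGT", "TTACCTTT", max_edits=1))
```

With the default of 0, only exact occurrences count; raising the limit to 1 also accepts a single substitution, insertion or deletion.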

Dependencies

  • Python (>=3.9)
  • TypeScript/JavaScript
    • ECharts for generating performant interactive plots
    • SolidJS for reactive UI components
    • Vite for TS/JS dev and building bundle
    • Tailwind CSS for styling

Development

This project has two main components: a Python "backend" (a CLI that outputs a templated HTML file with the built JS embedded) and a JavaScript frontend.
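
The general pattern of a self-contained HTML report, with the data and the built JS bundle embedded in one file, can be sketched as follows. This is a simplified illustration using only the standard library; the template, the window.db variable and the render_html function are assumptions made for this example and do not reflect the project's actual template or data layout.

```python
import html
import json
from string import Template

PAGE_TEMPLATE = Template("""<!DOCTYPE html>
<html>
<head><meta charset="utf-8"><title>$title</title></head>
<body>
<div id="app"></div>
<script>window.db = $data_json;</script>
<script>$bundle_js</script>
</body>
</html>
""")


def render_html(title: str, data: dict, bundle_js: str) -> str:
    """Embed JSON data and a prebuilt JS bundle into a single HTML page."""
    # Escape "</" so that embedded data cannot close the <script> element early
    data_json = json.dumps(data).replace("</", "<\\/")
    return PAGE_TEMPLATE.substitute(
        title=html.escape(title),
        data_json=data_json,
        bundle_js=bundle_js,
    )


if __name__ == "__main__":
    page = render_html("demo", {"samples": ["s1", "s2"]}, "console.log(window.db);")
    print(page)
```

Because everything lives in one file, the resulting page can be opened offline without a web server.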

Python backend development is done in the wgscovplot directory.

Web frontend development is done in the web directory. The frontend is built with Vite, SolidJS and ECharts.

Environment

PyCharm is recommended for Python development. JetBrains IDEs have great support for Python development and virtual environments.

JetBrains IDEs work great for TypeScript/JavaScript development as well, but any editor will do if you have Vite live reload enabled.

Setup

# clone repo
git clone https://github.com/nhhaidee/wgscovplot.git
cd wgscovplot
# optionally, create a virtual environment
python -m venv venv
source venv/bin/activate
# install dev dependencies
pip install hatch

# start shell with Hatch
hatch shell

# run linting with Hatch
hatch run lint:all

# run tests with Hatch
hatch run cov

Frontend development

See web/README.md for more details.

Authors

License

Copyright 2024 Canadian Food Inspection Agency of Canada, Government of Canada.

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this work except in compliance with the License. You may obtain a copy of the License at:

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
