Python module to manipulate the minimap2's CS tag
Project description
cstag
cstag
is a Python module to manipulate minimap2's CS tag.
cstag.shorten()
: to convert a cs tag from long to short formatcstag.lengthen()
: to convert a cs tag from short to long formatcstag.consensus()
: to generate a consensus cs tag from multiple cs tagscstag.mask()
: to mask low-quality bases in a cs tagcstag.to_html()
: to output html report
See documentation for more information.
Installation
From PyPI:
pip install cstag
From Bioconda:
conda install -c bioconda cstag
Examples
Shorten/Lengthen
import cstag
# Convert a cs tag from long to short
cs = "cs:Z:=ACGT*ag=CGT"
cstag.shorten(cs)
# => cs:Z::4*ag:3
# Convert a cs tag from short to long
cs = "cs:Z::4*ag:3"
cigar = "8M"
seq = "ACGTACGT"
cstag.lengthen(cs, cigar, seq)
# => cs:Z:=ACGT*ag=CGT
Call consensus
import cstag
cs_list = ["cs:Z:=ACGT", "cs:Z:=AC*gt=T", "cs:Z:=C*gt=T", "cs:Z:=C*gt=T", "cs:Z:=ACT+ccc=T"]
cigar_list = ["4M", "4M", "1S3M", "3M", "3M3I1M"]
pos_list = [1, 1, 1, 2, 1]
cstag.consensus(cs_list, cigar_list, pos_list)
# => cs:Z:=AC*gt*T
Mask low-quality bases in a cs tag
import cstag
cs = "cs:Z:=ACGT*ac+gg-cc=T"
cigar = "5M2I2D1M"
qual = "AA!!!!AA"
phred_threshold = 10
cstag.mask(cs, cigar, qual, phred_threshold)
# => cs:Z:=ACNN*an+ng-cc=T
Output HTML report
import cstag
from pathlib import Path
cs = "cs:Z:=AC+GGG=T-ACGT*at~gt10cg=GNNN"
description = "Example"
cstag_html = cstag.to_html(cs, description)
Path("report.html").write_text(cstag_html)
# => Output "report.html"
The report.html
is :point_down:
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
cstag-0.3.1.zip
(12.1 kB
view hashes)