Skip to main content

A package to generate QC metrics for smMIP libraries

Project description

# smMIPs #

smMIPs is tool for generating QC metrics in sequenced Single Molecule Molecular Inversion Probes libraries.

## Installation ## ### From PyPi ### smmips is available from PyPi

`pip install smmips`

## Alignment to reference genome ##

Fastqs containing smMIP probes and UMIs need first to be aligned to a reference genome. This can be done outside of smmips.py or with the `align` command. Paired fastqs are then aligned with `bwa mem` to a reference genome.

## smMIP assignment ##

Each aligned read pair is then assigned to a smMIP listed in the input panel. The panel should be designed with [MIPGEN](http://shendurelab.github.io/MIPGEN/) and have the same columns and header.

This step generates 2 coordinate-sorted and indexed bam with reads assigned to smMIPs and tagged with the smMIP name and the UMI sequence in the `out` subdirectory: - out/prefix.assigned_reads.sorted.bam: assigned reads with capture target - out/prefix.empty_reads.sorted.bam: assigned reads missing target smmip

QC metrics with read counts are reported in two json files in the `stats` subdirectory:

  • stats/prefix_extraction_metrics.json: read counts, including total reads, assigned and unassigned reads, empty smMIPs

  • stats/prefix_smmip_counts.json: read counts without and with target for each smMIP in the panel

By default all the reads in the alignment file are assessed against the smmips in the panel. However, one can examine reads mapped to a single chromosome or a region within a chromosome. This can potentially be used for parallelizing smmip assignments and a job for each chromosome or region. Command `merge` can be used to merge alignment files and stats files generated for different regions. Script smmipRegionFinder.py can be used to defined the regions to examine.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

smmips-1.0.9.tar.gz (19.9 kB view details)

Uploaded Source

Built Distribution

smmips-1.0.9-py3-none-any.whl (21.1 kB view details)

Uploaded Python 3

File details

Details for the file smmips-1.0.9.tar.gz.

File metadata

  • Download URL: smmips-1.0.9.tar.gz
  • Upload date:
  • Size: 19.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.3

File hashes

Hashes for smmips-1.0.9.tar.gz
Algorithm Hash digest
SHA256 91a75f4227e20931201c6bb1f140a08c9dcf45a4d6d6c7deacfede415c645f44
MD5 62c36dc6242253f02015f988697a45c8
BLAKE2b-256 650965cc5341dad59afca445f786b1dd77484ae431646fcc944e778953457973

See more details on using hashes here.

File details

Details for the file smmips-1.0.9-py3-none-any.whl.

File metadata

  • Download URL: smmips-1.0.9-py3-none-any.whl
  • Upload date:
  • Size: 21.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.1 CPython/3.7.3

File hashes

Hashes for smmips-1.0.9-py3-none-any.whl
Algorithm Hash digest
SHA256 5e9655def3f181e4655f7ce16928bb30b50bc97fde28ec229725bdd6de63dec6
MD5 f0390eefe6d9724811b4ad536fa3c128
BLAKE2b-256 f6dfb13442f22e8485e0daa32eb08aa5a6237023881fe0f0a4fd9c3ab2cdf43b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page