Analyze deep sequencing of complex libraries

ngs-analysis

Intended for analysis of sequencing reads that span multiple DNA or protein parts. For instance, given a library of protein variants linked to DNA barcodes, it can answer questions like:

  • How accurate are the variant sequences, at the DNA or protein level?
  • How frequently is the same barcode linked to two different variants?
  • Which reads contain parts required for function (e.g., a Kozak start sequence or a fused protein tag)?
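
As an illustration of the second question, here is a minimal sketch in plain Python. It is not the package's API. It assumes reads have already been reduced to (barcode, variant) pairs; the function name `barcode_collisions` and the example sequences are hypothetical.

```python
from collections import defaultdict


def barcode_collisions(pairs):
    """Return the barcodes observed with more than one distinct variant.

    `pairs` is an iterable of (barcode, variant) tuples, one per read.
    """
    variants_by_barcode = defaultdict(set)
    for barcode, variant in pairs:
        variants_by_barcode[barcode].add(variant)
    # Keep only barcodes linked to two or more different variants.
    return {
        barcode: sorted(variants)
        for barcode, variants in variants_by_barcode.items()
        if len(variants) > 1
    }


# Hypothetical example data: one tuple per parsed read.
observed = [
    ("ACGT", "MKT"),
    ("ACGT", "MKT"),
    ("TTGA", "MKV"),
    ("TTGA", "MRV"),
    ("GGCC", "MAA"),
]

collisions = barcode_collisions(observed)
unique_barcodes = {barcode for barcode, _ in observed}
collision_rate = len(collisions) / len(unique_barcodes)
```

Here only the barcode TTGA is linked to two variants, so one of the three distinct barcodes is flagged. In real data, sequencing errors would inflate this count unless variants are first mapped to a reference.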

This kind of analysis often involves parsing raw sequencing reads for DNA and/or protein sub-sequences (parts), then mapping the parts to a reference of anticipated part combinations. This package offers a simple workflow:

  1. Define how to parse reads into parts using plain text expressions (no code)
  2. Test the parser on simulated DNA sequences (e.g., your vector map)
  3. Parse a batch of sequencing samples
  4. Map the (combination of) parts found in each read to your reference
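
The idea behind steps 1, 2, and 4 can be sketched with Python's standard `re` module. This is only an illustration under stated assumptions: the package's own expression syntax is not shown here, and the read layout, flanking sequences, function names, and reference table below are all hypothetical.

```python
import re

# Hypothetical read layout: constant flank, 8-nt barcode, constant linker,
# variable-length variant, constant flank. Not the package's expression syntax.
READ_PATTERN = re.compile(
    r"GATTACA(?P<barcode>[ACGT]{8})CCGG(?P<variant>[ACGT]+?)TTTAAA"
)


def parse_read(read):
    """Split a read into named parts, or return None if the layout is absent."""
    match = READ_PATTERN.search(read)
    return match.groupdict() if match else None


def map_to_reference(parts, reference):
    """Classify parsed parts against a {barcode: expected_variant} reference."""
    if parts is None:
        return "unparsed"
    expected = reference.get(parts["barcode"])
    if expected is None:
        return "unknown_barcode"
    return "match" if expected == parts["variant"] else "mismatch"


# Hypothetical reference of anticipated barcode-variant combinations.
reference = {"AAAACCCC": "ATGGCTTGG"}

# Simulated read assembled from the expected parts, as a parser test (step 2).
simulated = "NN" + "GATTACA" + "AAAACCCC" + "CCGG" + "ATGGCTTGG" + "TTTAAA" + "NN"

parts = parse_read(simulated)
status = map_to_reference(parts, reference)
```

Testing the pattern on a simulated read before running a full batch catches layout mistakes early. Exact string comparison is used here for brevity; a real mapping step would need to tolerate sequencing errors.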

It’s been tested with Illumina paired-end reads and Oxford Nanopore long reads. Under the hood it uses NGmerge to merge paired reads and MMseqs2 for sequence mapping. It is moderately performant: 1 million paired-end reads can be mapped to a reference of 100,000 variant-barcode pairs in ~1 minute.

Installation

pip install ngs-analysis

Tested on Linux and macOS (Apple Silicon).
