Skip to main content

Parallel processing of sequencing reads

Project description

# Pararead: parallel processing of sequencing reads

Pararead is a python package that simplifies parallel processing of DNA sequencing reads (BAM or SAM files), by parallelizing across chromosomes. Pararead is built for developers of python scripts that process data read-by-read. It enables you to quickly and easily parallelize your script.

## Install

Pararead is hosted on pypi. Install with:

`console pip install pararead `

## Minimum working example

In the [examples](https://github.com/databio/pararead/tree/master/examples) folder you can find count_reads.py, which will count the number of reads in a sam/bam file in parallel.

Run this on your bam file like this:

` python examples/count_reads.py file.bam -O output.txt --cores 2 `

Look at the code to see how this is implemented.

## Developing tools that use pararead

The main model provided is an abstract class called ParaReadProcessor, for which concrete children are created by implementing a __call__ method. This creates a callable instance that is then mapped over chromosomes.

The concept is generally described in this early [blog post](http://databio.org/posts/tabix_files.html), which initiated the project that eventually became pararead.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pararead-0.8.1.tar.gz (19.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pararead-0.8.1-py3-none-any.whl (14.5 kB view details)

Uploaded Python 3

File details

Details for the file pararead-0.8.1.tar.gz.

File metadata

  • Download URL: pararead-0.8.1.tar.gz
  • Upload date:
  • Size: 19.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pararead-0.8.1.tar.gz
Algorithm Hash digest
SHA256 6738b3c3dc5489fee34cfee52fb1c38db4c0ed4b8c25b92ff0add8f41b006de0
MD5 90d2020c1d0b6bb0f60ce823263d7008
BLAKE2b-256 9078fbebdd305c950b5d8ab3003c26c2b85576d8b680c664fc2ac0dbaa9609c7

See more details on using hashes here.

Provenance

The following attestation bundles were made for pararead-0.8.1.tar.gz:

Publisher: python-publish.yml on databio/pararead

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pararead-0.8.1-py3-none-any.whl.

File metadata

  • Download URL: pararead-0.8.1-py3-none-any.whl
  • Upload date:
  • Size: 14.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pararead-0.8.1-py3-none-any.whl
Algorithm Hash digest
SHA256 3a88a7dc1c42651e563256fbf35a999f30534ef2a36ddc7cf50a3f479dcba3a1
MD5 93ea17fdda3aa6576d17d8b2a7bf0f57
BLAKE2b-256 815aa7a8a1d422bedbfaf3d63be2799446745fbea4b2395e2d0efa02160ad708

See more details on using hashes here.

Provenance

The following attestation bundles were made for pararead-0.8.1-py3-none-any.whl:

Publisher: python-publish.yml on databio/pararead

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page