A bioinformatics toolkit for processing high-throughput lymphocyte receptor sequencing data.
Project description
pRESTO - The REpertoire Sequencing TOolkit
pRESTO is a toolkit for processing raw reads from high-throughput sequencing of B cell and T cell repertoires.
Dramatic improvements in high-throughput sequencing technologies now enable large-scale characterization of lymphocyte repertoires, defined as the collection of transmembrane antigen-receptor proteins located on the surface of B cells and T cells. The REpertoire Sequencing TOolkit (pRESTO) is a suite of utilities that handles all stages of sequence processing prior to germline segment assignment. pRESTO is designed to handle either single reads or paired-end reads. It includes features for quality control, primer masking, annotation of reads with sequence-embedded barcodes, generation of unique molecular identifier (UMI) consensus sequences, assembly of paired-end reads, and identification of duplicate sequences. Numerous options for sequence sorting, sampling, and conversion operations are also included.
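To make the idea of duplicate identification concrete, here is a minimal Python sketch that collapses exactly identical reads and counts how often each occurs. It is an illustration only: the function name `collapse_duplicates` is hypothetical, it treats sequences as plain strings, and it makes no claim to match pRESTO's own logic or command-line tools.

```python
from collections import Counter


def collapse_duplicates(sequences):
    """Collapse exact duplicate sequences (case-insensitive).

    Hypothetical helper for illustration; not part of pRESTO.
    Returns (sequence, count) pairs in first-seen order.
    """
    counts = Counter(seq.upper() for seq in sequences)
    # Counter keeps insertion order, so the first-seen order is preserved.
    return list(counts.items())


reads = ["ACGTTG", "acgttg", "TTGACC", "ACGTTG"]
for seq, count in collapse_duplicates(reads):
    print(seq, count)
```

A real repertoire pipeline would also need to account for ambiguous bases and read annotations, which this sketch ignores.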
Download files
Source Distribution
File details
Details for the file presto-0.7.2.tar.gz.
File metadata
- Download URL: presto-0.7.2.tar.gz
- Upload date:
- Size: 584.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.6
File hashes
Algorithm | Hash digest
---|---
SHA256 | b4f4b34413af4207eb2052316d31d7bc2067b864286498476d89013ad5423dd9
MD5 | e01144b735a6cba7d210ef7fe91525da
BLAKE2b-256 | 55dfb9ef6ef83741f82a24cb2426c1186a3d1df8d553d168bb14d2fbbe532e17
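
The digests listed under File hashes can be used to check that a downloaded archive is intact. The following is a minimal sketch using Python's standard `hashlib` module. The helper names `sha256_of_file` and `matches_expected` are illustrative, and the local path `presto-0.7.2.tar.gz` is an assumption about where the file was saved.

```python
import hashlib
from pathlib import Path

# SHA256 digest copied from the File hashes listing for presto-0.7.2.tar.gz.
EXPECTED_SHA256 = "b4f4b34413af4207eb2052316d31d7bc2067b864286498476d89013ad5423dd9"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected value."""
    return sha256_of_file(path) == expected.lower()


if __name__ == "__main__":
    archive = Path("presto-0.7.2.tar.gz")  # assumed download location
    if archive.exists():
        print("OK" if matches_expected(archive) else "MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

Reading in chunks keeps memory use constant regardless of archive size. A mismatch means the file should be discarded and downloaded again.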