
Optimized TCRDist calculation for TCR repertoire data analysis


fast_tcrdist

fast_tcrdist is an optimized version of the TCRDist algorithm published by Dash et al., Nature (2017), doi:10.1038/nature22383.

fast_tcrdist improves on the original TCRDist implementation in two ways: it aligns TCR sequences with the Needleman-Wunsch algorithm, and it builds the TCRDist matrix in Cython. To integrate with other common single-cell analysis tools, it stores the TCRDist matrix and associated metadata in the AnnData data structure. So far it has been tested only on TCR/gene expression output from Cell Ranger (https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/what-is-cell-ranger); future releases aim to support other file formats.
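As a rough illustration of the alignment step, here is a minimal pure-Python sketch of Needleman-Wunsch global alignment. It is not the package's Cython code: the function name `needleman_wunsch`, the flat match/mismatch scores (used here instead of BLOSUM62), and the linear gap penalty are all assumptions chosen to keep the example short.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Globally align strings a and b; return (score, aligned_a, aligned_b).

    Scoring values are illustrative assumptions, not fast_tcrdist defaults.
    """
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,      # gap in b
                score[i][j - 1] + gap,      # gap in a
            )

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                out_a.append(a[i - 1])
                out_b.append(b[j - 1])
                i -= 1
                j -= 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1

    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))


# Toy CDR3-like strings, made up for the example.
score, top, bottom = needleman_wunsch("CASSL", "CASL")
print(score)
print(top)
print(bottom)
```

Swapping the flat match/mismatch rule for a lookup in a substitution matrix such as BLOSUM62 would only change how `sub` is computed; the dynamic-programming recurrence stays the same.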

In addition to running the TCRDist algorithm, fast_tcrdist can aggregate TCR information with single-cell gene expression data into a single AnnData object for integrated downstream analyses.
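The aggregation idea can be sketched without any of the package's own functions: TCR records and expression data are joined on the cell barcode. The example below uses only the standard library. The column names follow the contig annotation CSV that Cell Ranger produces (`barcode`, `chain`, `v_gene`, `j_gene`, `cdr3`), but the rows, barcodes, counts, and the helper names `tcr_by_barcode` and `merge_by_barcode` are made up for illustration; in an AnnData object, per-cell TCR fields of this kind would most naturally sit alongside other per-cell annotations in `.obs`.

```python
import csv
import io
from collections import defaultdict

# Toy contig annotations (hypothetical values, Cell Ranger style columns).
CONTIGS_CSV = """\
barcode,chain,v_gene,j_gene,cdr3
CELL1-1,TRA,TRAV1,TRAJ1,CAVSNYQLIW
CELL1-1,TRB,TRBV1,TRBJ1,CASSLGQYF
CELL2-1,TRB,TRBV2,TRBJ2,CASSPDRGYF
"""

# Toy per-cell expression counts (hypothetical), keyed by the same barcodes.
EXPRESSION = {
    "CELL1-1": {"Cd3e": 12, "Cd8a": 7},
    "CELL2-1": {"Cd3e": 9, "Cd8a": 0},
    "CELL3-1": {"Cd3e": 0, "Cd8a": 0},
}


def tcr_by_barcode(csv_text):
    """Group contig rows by barcode, then by chain (TRA/TRB)."""
    chains = defaultdict(dict)
    for row in csv.DictReader(io.StringIO(csv_text)):
        chains[row["barcode"]][row["chain"]] = {
            "v_gene": row["v_gene"],
            "j_gene": row["j_gene"],
            "cdr3": row["cdr3"],
        }
    return dict(chains)


def merge_by_barcode(expression, tcr):
    """Attach TCR info to every cell that has expression data.

    Cells without a detected TCR get an empty dict rather than being dropped.
    """
    return {
        barcode: {"counts": counts, "tcr": tcr.get(barcode, {})}
        for barcode, counts in expression.items()
    }


merged = merge_by_barcode(EXPRESSION, tcr_by_barcode(CONTIGS_CSV))
for barcode, record in merged.items():
    print(barcode, sorted(record["tcr"]))
```

Keeping cells that lack a TCR, instead of doing an inner join, is a design choice in this sketch: it preserves the full expression matrix so that downstream analyses can still compare T cells with and without recovered receptors.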

Outside files

The BLOSUM62 matrix used for alignments came from NCBI: https://www.ncbi.nlm.nih.gov/IEB/ToolBox/C_DOC/lxr/source/data/BLOSUM62
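For readers who want to load such a matrix themselves, the sketch below parses the plain-text layout used by NCBI matrix files: `#` comment lines, a header row of residue letters, then one row per residue. The function name `parse_substitution_matrix` is hypothetical and this is not necessarily how fast_tcrdist reads the file; the inline text is a three-residue excerpt of BLOSUM62 rather than the full matrix.

```python
def parse_substitution_matrix(text):
    """Parse an NCBI-style matrix into a {(row_residue, col_residue): score} dict."""
    scores = {}
    columns = None
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        fields = line.split()
        if columns is None:
            columns = fields  # first non-comment line is the header row
            continue
        row_residue, values = fields[0], fields[1:]
        for col_residue, value in zip(columns, values):
            scores[(row_residue, col_residue)] = int(value)
    return scores


# Excerpt of BLOSUM62 covering only A, R and N.
EXCERPT = """\
# Excerpt for illustration only
   A  R  N
A  4 -1 -2
R -1  5  0
N -2  0  6
"""

blosum = parse_substitution_matrix(EXCERPT)
print(blosum[("A", "A")], blosum[("A", "R")], blosum[("R", "N")])
```

A dictionary keyed by residue pairs is convenient for readability; a performance-oriented implementation would more likely map residues to integer indices and store the scores in a 2D array.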

CDR amino acid information was taken from the TCRDist database file "alphabeta_db.tsv" (https://www.dropbox.com/s/kivfp27gbz2m2st/tcrdist_extras_v2.tgz) and reformatted into JSON (mouse_CDRs_for_10X.json and human_CDRs_for_10X.json).
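A conversion of this kind can be sketched with the standard library alone. Everything specific in the example is an assumption made for illustration: the column names (`id`, `organism`, `cdrs`), the semicolon-separated CDR field, the placeholder gene ids and sequences, and the nested output layout are not taken from the actual "alphabeta_db.tsv" or the shipped JSON files.

```python
import csv
import io
import json

# Hypothetical TSV rows; real column names and contents may differ.
ROWS = [
    ["id", "organism", "cdrs"],
    ["GENE_A*01", "mouse", "AAA;BBB;CCC"],
    ["GENE_B*01", "mouse", "DDD;EEE;FFF"],
    ["GENE_C*01", "human", "GGG;HHH;III"],
]
TSV_TEXT = "\n".join("\t".join(row) for row in ROWS) + "\n"


def tsv_to_cdr_dict(tsv_text):
    """Build {organism: {gene_id: [cdr, ...]}} from tab-separated text."""
    result = {}
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    for row in reader:
        per_organism = result.setdefault(row["organism"], {})
        per_organism[row["id"]] = row["cdrs"].split(";")
    return result


cdrs = tsv_to_cdr_dict(TSV_TEXT)

# One JSON document per organism, mirroring the idea of separate
# mouse and human files.
mouse_json = json.dumps(cdrs["mouse"], indent=2, sort_keys=True)
print(mouse_json)
```

Writing `mouse_json` to disk with an ordinary `open(..., "w")` call would produce a per-organism lookup file that can be loaded at import time with `json.load`.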

