Skip to main content

Standardise TCR/MHC gene symbols.

Project description

tidytcells is a lightweight Python package written for bioinformaticians who work with T cell receptor (TCR) data. The main purpose of the package is to solve the problem of parsing and collating together non-standardised TCR datasets. It is often difficult to compile TCR data from multiple sources because the formats/nomenclature of how each dataset encodes TCR and MHC gene names are slightly different, or even inconsistent within themselves. tidytcells attempts to ameliorate this issue by providing simple functions that can standardise TCR and MHC gene symbols to their officially accepted versions, as defined by IMGT, HGNC, or other authorities on gene nomenclature.

Installation

From source

The source code for the package is available on Github. To install from source, clone the git repository, and run:

$ pip install .

from inside the project root directory.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tidytcells-0.0.5.tar.gz (39.0 kB view hashes)

Uploaded Source

Built Distribution

tidytcells-0.0.5-py3-none-any.whl (40.0 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page