Skip to main content

Sandbox for Computational Protein Design

Project description

                          _____________________.___.____    .____     
                          \__    ___/\______   \   |    |   |    |    
                            |    |    |       _/   |    |   |    |    
                            |    |    |    |   \   |    |___|    |___ 
                            |____|    |____|_  /___|_______ \_______ \
                                             \/            \/       \/

pypi version downloads license Documentation Status

Intro

TRILL (TRaining and Inference using the Language of Life) is a sandbox for creative protein engineering and discovery. As a bioengineer myself, deep-learning based approaches for protein design and analysis are of great interest to me. However, many of these deep-learning models are rather unwieldy, especially for non ML-practitioners due to their sheer size. Not only does TRILL allow researchers to perform inference on their proteins of interest using a variety of models, but it also democratizes the efficient fine-tuning of large-language models. Whether using Google Colab with one GPU or a supercomputer with many, TRILL empowers scientists to leverage models with millions to billions of parameters without worrying (too much) about hardware constraints. Currently, TRILL supports using these models as of v1.5.0:

Breakdown of TRILL's Commands

Command Function Available Models
Embed Generates numerical representations or "embeddings" of protein sequences for quantitative analysis and comparison. ESM2, ProtT5-XL, ProstT5
Visualize Creates interactive 2D visualizations of embeddings for exploratory data analysis. PCA, t-SNE, UMAP
Finetune Finetunes protein language models for specific tasks. ESM2, ProtGPT2, ZymCTRL
Language Model Protein Generation Generates proteins using pretrained language models. ESM2, ProtGPT2, ZymCTRL
Inverse Folding Protein Generation Designs proteins to fold into specific 3D structures. ESM-IF1, ProteinMPNN, ProstT5
Diffusion Based Protein Generation Uses denoising diffusion models to generate proteins. RFDiffusion
Fold Predicts 3D protein structures. ESMFold, ProstT5
Dock Simulates protein-ligand interactions. DiffDock, Smina, Autodock Vina, Lightdock
Classify Predicts protein properties at high throughput. TemStaPro, EpHod
Simulate Uses molecular dynamics with the AMBER force field to relax structures. OpenMM

Documentation

Check out the documentation and examples at https://trill.readthedocs.io/en/latest/index.html

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

trill_proteins-1.5.1.tar.gz (11.0 MB view details)

Uploaded Source

Built Distribution

trill_proteins-1.5.1-py3-none-any.whl (11.0 MB view details)

Uploaded Python 3

File details

Details for the file trill_proteins-1.5.1.tar.gz.

File metadata

  • Download URL: trill_proteins-1.5.1.tar.gz
  • Upload date:
  • Size: 11.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.5.1 CPython/3.10.13 Linux/6.2.0-1014-azure

File hashes

Hashes for trill_proteins-1.5.1.tar.gz
Algorithm Hash digest
SHA256 0dbe47a5779738fd3acf1dd36e00470d8c4b689989e7a753833b1d7c67ac0496
MD5 d50451cfcb631b19aefa1de669c087c3
BLAKE2b-256 e713658685eff86fbcb8be7b66387ced88378b3b7c4e1b8bb9231d1403ba694b

See more details on using hashes here.

File details

Details for the file trill_proteins-1.5.1-py3-none-any.whl.

File metadata

  • Download URL: trill_proteins-1.5.1-py3-none-any.whl
  • Upload date:
  • Size: 11.0 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.5.1 CPython/3.10.13 Linux/6.2.0-1014-azure

File hashes

Hashes for trill_proteins-1.5.1-py3-none-any.whl
Algorithm Hash digest
SHA256 cbee6b25521937f15eb9edff3ed32e02cd2e20b099eec1dc6ff97dd046937bcc
MD5 252859e8bd5b565097bd97f498f4fcdf
BLAKE2b-256 1e9d1cf3d4b42f64bcbbaa0e21c43da38eb57304c03d1a1bee94de1b0d2e9f77

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page