Sandbox for Computational Protein Design

Project description

                          _____________________.___.____    .____     
                          \__    ___/\______   \   |    |   |    |    
                            |    |    |       _/   |    |   |    |    
                            |    |    |    |   \   |    |___|    |___ 
                            |____|    |____|_  /___|_______ \_______ \
                                             \/            \/       \/

Intro

TRILL (TRaining and Inference using the Language of Life) is a sandbox for creative protein engineering and discovery. As a bioengineer myself, I find deep-learning-based approaches for protein design and analysis of great interest. However, many of these deep-learning models are rather unwieldy because of their sheer size, especially for non-ML practitioners. TRILL not only allows researchers to perform inference on their proteins of interest using a variety of models, but also democratizes the efficient fine-tuning of large language models. Whether using Google Colab with one GPU or a supercomputer with many, TRILL empowers scientists to leverage models with millions to billions of parameters without worrying (too much) about hardware constraints. As of v1.4.3, TRILL supports the following models:

  • ESM2 (Embed and Finetune all sizes, depending on hardware constraints doi. Can also generate synthetic proteins from finetuned ESM2 models using Gibbs sampling doi)
  • ESM-IF1 (Generate synthetic proteins from .pdb backbone doi)
  • ESMFold (Predict 3D protein structure doi)
  • ProtGPT2 (Finetune and generate synthetic proteins from seed sequence doi)
  • ProteinMPNN (Generate synthetic proteins from .pdb backbone doi)
  • RFDiffusion (Diffusion-based model for generating synthetic proteins doi)
  • DiffDock (Find best poses for protein-ligand binding doi)
  • ProtT5-XL (Embed proteins into high-dimensional space doi)
  • ProstT5 (Embed proteins or protein structures using the 3Di alphabet. Can also perform inverse folding doi)
  • TemStaPro (Predict thermostability of proteins doi)
  • EpHod (Predict optimal pH for enzymes doi)
  • ZymCTRL (Conditional language model for the generation of artificial functional enzymes link)

Documentation

Check out the documentation and examples at https://trill.readthedocs.io/en/latest/index.html

Download files

Source Distribution

trill_proteins-1.4.3.tar.gz (11.0 MB)

Uploaded Source

Built Distribution

trill_proteins-1.4.3-py3-none-any.whl (11.0 MB)

Uploaded Python 3

File details

Details for the file trill_proteins-1.4.3.tar.gz.

File metadata

  • Download URL: trill_proteins-1.4.3.tar.gz
  • Size: 11.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.5.1 CPython/3.10.13 Linux/5.15.0-1041-azure

File hashes

Hashes for trill_proteins-1.4.3.tar.gz
Algorithm Hash digest
SHA256 01a4c61f73739ee90dbd0b4dc2459c8c94ac6874ecc4cd4d9eddb66682145078
MD5 2e62455f49ef84fa604fcb604040a12c
BLAKE2b-256 6395f11a6e04a9889c28c79d3a0c26169e8bc0b6c50b8c778cf40ec729dcdcc3


File details

Details for the file trill_proteins-1.4.3-py3-none-any.whl.

File metadata

  • Download URL: trill_proteins-1.4.3-py3-none-any.whl
  • Size: 11.0 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.5.1 CPython/3.10.13 Linux/5.15.0-1041-azure

File hashes

Hashes for trill_proteins-1.4.3-py3-none-any.whl
Algorithm Hash digest
SHA256 f613126b6998d653fe63192a4ff5a595ad0398234960ac3c4566ce15722287dd
MD5 e05814966c83a4cd7026cf995fdcc1dc
BLAKE2b-256 d8463f456cc3ee419a4a7f8766ac6e059751106f14c1132b2f145e74298963a5
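
The published digests can be used to check a downloaded file before installing it. The following is a minimal sketch using only Python's standard `hashlib` module; the helper names `sha256_of` and `verify_download` are hypothetical and not part of TRILL, and the sketch assumes the file was saved under its original name.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the "File hashes" listings on this page.
EXPECTED_SHA256 = {
    "trill_proteins-1.4.3.tar.gz":
        "01a4c61f73739ee90dbd0b4dc2459c8c94ac6874ecc4cd4d9eddb66682145078",
    "trill_proteins-1.4.3-py3-none-any.whl":
        "f613126b6998d653fe63192a4ff5a595ad0398234960ac3c4566ce15722287dd",
}


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large archives are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path):
    """Return True if the file's SHA256 matches the digest listed for its name.

    Hypothetical helper for illustration; raises KeyError for unknown file names.
    """
    path = Path(path)
    expected = EXPECTED_SHA256.get(path.name)
    if expected is None:
        raise KeyError(f"no published digest for {path.name}")
    return sha256_of(path) == expected
```

Calling `verify_download("trill_proteins-1.4.3.tar.gz")` on a local copy returns `True` only if the file's digest matches the listed one; a `False` result suggests a corrupted or altered download.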

