Sandbox for Computational Protein Design
TRILL
Intro
TRILL (TRaining and Inference using the Language of Life) is a sandbox for creative protein engineering and discovery. As a bioengineer, I find deep-learning approaches to protein design and analysis of great interest. However, many of these models are unwieldy because of their sheer size, especially for non-ML practitioners. TRILL not only lets researchers run inference on their proteins of interest with a variety of models, it also democratizes the efficient fine-tuning of large language models. Whether using Google Colab with one GPU or a supercomputer with many, TRILL empowers scientists to leverage models with millions to billions of parameters without worrying (too much) about hardware constraints. As of v1.3.0, TRILL supports the following models:
- ESM2 (Embed and finetune all model sizes, subject to hardware constraints. Can also generate synthetic proteins from finetuned ESM2 models using Gibbs sampling)
- ESM-IF1 (Generate synthetic proteins from a .pdb backbone)
- ESMFold (Predict 3D protein structure)
- ProtGPT2 (Finetune and generate synthetic proteins from a seed sequence)
- ProteinMPNN (Generate synthetic proteins from a .pdb backbone)
- RFDiffusion (Diffusion-based model for generating synthetic proteins)
- DiffDock (Find the best poses for protein-ligand binding)
- ProtT5-XL (Embed proteins into high-dimensional space)
- TemStaPro (Predict the thermostability of proteins)
- ZymCTRL (Conditional language model for generating artificial functional enzymes)
Documentation
Check out the documentation and examples at https://trill.readthedocs.io/en/latest/index.html
Download files
File details
Details for the file trill_proteins-1.3.8.tar.gz.
File metadata
- Download URL: trill_proteins-1.3.8.tar.gz
- Upload date:
- Size: 17.3 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.4.2 CPython/3.10.6 Linux/6.2.6-76060206-generic
File hashes
Algorithm | Hash digest
---|---
SHA256 | b9bdca85333add832eca6a3d59328752023a9ccb6aae89fbed3cca02009fc647
MD5 | f26bce1519636bf64e0247fc45014927
BLAKE2b-256 | 851aff8f25cfcf75d377567bb2829a2f9bdb653096043d6b2d68482fea2f60f1
File details
Details for the file trill_proteins-1.3.8-py3-none-any.whl.
File metadata
- Download URL: trill_proteins-1.3.8-py3-none-any.whl
- Upload date:
- Size: 17.3 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.4.2 CPython/3.10.6 Linux/6.2.6-76060206-generic
File hashes
Algorithm | Hash digest
---|---
SHA256 | d1d34e2641c98b40b2eacdda846f7a3f91c772b87ce15edc8b0347d8e6e9c62b
MD5 | fcfb58009ac31c80ceb026d71fe9d07e
BLAKE2b-256 | 2d2430e1ea5fd8309947128cc9314b400224d768461ca6b74d47d1948436cd72
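
The SHA256 digests listed above can be used to check that a downloaded file is intact. The following is a minimal sketch using only the Python standard library. The helper names `sha256_of` and `matches_published_digest` are hypothetical and are not part of TRILL, and the sketch assumes the file has already been downloaded under its original file name.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the file details above.
EXPECTED_SHA256 = {
    "trill_proteins-1.3.8.tar.gz":
        "b9bdca85333add832eca6a3d59328752023a9ccb6aae89fbed3cca02009fc647",
    "trill_proteins-1.3.8-py3-none-any.whl":
        "d1d34e2641c98b40b2eacdda846f7a3f91c772b87ce15edc8b0347d8e6e9c62b",
}


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path):
    """Return True only if the file name is listed and its digest matches."""
    path = Path(path)
    expected = EXPECTED_SHA256.get(path.name)
    return expected is not None and sha256_of(path) == expected
```

For example, `matches_published_digest("trill_proteins-1.3.8.tar.gz")` would return `True` only if the local file is byte-for-byte identical to the published source distribution. Reading in chunks keeps memory use flat, which is worth having for a 17.3 MB archive.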