Skip to main content

Low code chemcial property prediction

Project description

Contributors Forks Stargazers Issues LinkedIn


Logo

Protify

A low code solution for computationally predicting the properties of chemicals.
Explore the docs »

View Demo · Report Bug · Request Feature

Table of Contents
  1. About The Project
  2. Getting Started
  3. Usage
  4. Contributing
  5. Built With
  6. License
  7. Contact
  8. Cite

About The Project

Protify is an open source platform designed to simplify and democratize workflows for chemical language models. With Protify, deep learning models can be trained to predict chemical properties at the click of a button, without requiring extensive coding knowledge or computational resources.

Why Protify?

  • Benchmark multiple models efficiently: Need to evaluate 10 different protein language models against 15 diverse datasets with publication-ready figures? Protify makes this possible without writing a single line of code.
  • Flexible for all skill levels: Build custom pipelines with code or use our no-code interface depending on your needs and expertise.
  • Accessible computing: No GPU? No problem. Synthyra offers precomputed embeddings for many popular datasets, which Protify can download for analysis with scikit-learn on your laptop.
  • Cost-effective solutions: The upcoming Synthyra API integration will offer affordable GPU training options, while our Colab notebook provides an accessible entry point for GPU-reliant analysis.

Protify is currently in beta. We're actively working to enhance features and documentation to meet our ambitious goals.

Current Key Features

  • Multiple interfaces: Run experiments via an intuitive GUI, CLI, or prepared YAML files
  • Efficient embeddings: Leverage fast and efficient embeddings from ESM2 and ESMC via FastPLMs
    • Coming soon: Additional protein, SMILES, SELFIES, codon, and nucleotide language models
  • Flexible model probing: Use efficient MLPs for sequence-wise tasks or transformer probes for token-wise tasks
    • Coming soon: Full model fine-tuning, hybrid probing, and LoRA
  • Automated model selection: Find optimal scikit-learn models for your data with LazyPredict, enhanced by automatic hyperparameter optimization
    • Coming soon: GPU acceleration
  • Complete reproducibility: Every session generates a detailed log that can be used to reproduce your entire workflow
  • Publication-ready visualizations: Generate cross-model and dataset comparisons with radar and bar plots, embedding analysis with PCA, t-SNE, and UMAP, and statistically sound confidence interval plots
  • Extensive dataset support: Access 25 protein datasets by default, or easily integrate your own local or private datasets
    • Coming soon: Additional protein, SMILES, SELFIES, codon, and nucleotide property datasets
  • Advanced interaction modeling: Support for protein-protein interaction datasets
    • Coming soon: Protein-small molecule interaction capabilities

Support Protify's Development

Help us grow by sharing online, starring our repository, or contributing through our bounty program.

(back to top)

Getting Started

Installation

From pip pip install Protify

To get started locally

git clone https://@github.com/Synthyra/Protify.git
cd Protify
python -m pip install -r docs/requirements.txt
git submodule update --init --remote --recursive
cd src/protify

(back to top)

Usage

Examples coming soon.

(back to top)

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

We work with a bounty system. You can find bounties on this page. Contributing bounties will get you listed on the Protify consortium and potentially coauthorship on published papers involving the framework.

Simply open a pull request with the bounty ID in the title to claim one. For additional features not on the bounty list simply use a descriptive title.

For bugs and general suggestions please use GitHub issues.

Top contributors:

contrib.rocks image

(back to top)

Built With

  • PyTorch
  • Transformers
  • Datasets
  • PEFT
  • scikit-learn
  • NumPy
  • SciPy
  • Einops
  • PAUC
  • LazyPredict

(back to top)

License

Distributed under the Protify License. See LICENSE.md for more information.

(back to top)

Contact

Email: info@synthyra.com
Website: https://synthyra.com

(back to top)

Cite

If you use this package, please cite the following papers. (Coming soon)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

protify-0.0.2.tar.gz (129.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

protify-0.0.2-py3-none-any.whl (162.5 kB view details)

Uploaded Python 3

File details

Details for the file protify-0.0.2.tar.gz.

File metadata

  • Download URL: protify-0.0.2.tar.gz
  • Upload date:
  • Size: 129.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.22

File hashes

Hashes for protify-0.0.2.tar.gz
Algorithm Hash digest
SHA256 cda4aed0990f65657169191bf00fca293ba5063c441d04ad4b4fe0bdfc5b8358
MD5 0b19f8ab3fac29468a923fa98958f236
BLAKE2b-256 723bdfc1f2c31ebb24f7fbd7926e6a6cab94fbda9c2910f3d5b52668bf0cbb67

See more details on using hashes here.

File details

Details for the file protify-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: protify-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 162.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.22

File hashes

Hashes for protify-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 0a65217ea22c0f056ef79285a465b1d4bdd938708771b127dc5509703c3b4c9e
MD5 0c0cf2d031964bd538b332043b280141
BLAKE2b-256 8e6f7b6ff9b33c275694cbbaad0ce9bc17d63db69e2c7ec37e3914567c7a4d79

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page