Skip to main content

Graphium: Scaling molecular GNNs to infinity.

Project description

Scaling molecular GNNs to infinity


test release code-check doc

A deep learning library focused on graph representation learning for real-world chemical tasks.

  • ✅ State-of-the-art GNN architectures.
  • 🐍 Extensible API: build your own GNN model and train it with ease.
  • ⚗️ Rich featurization: powerful and flexible built-in molecular featurization.
  • 🧠 Pretrained models: for fast and easy inference or transfer learning.
  • ⮔ Read-to-use training loop based on Pytorch Lightning.
  • 🔌 Have a new dataset? Graphium provides a simple plug-and-play interface. Change the path, the name of the columns to predict, the atomic featurization, and you’re ready to play!

Documentation

Visit https://graphium-docs.datamol.io/.

Installation for developers

For CPU and GPU developers

Use mamba:

# Install Graphium's dependencies in a new environment named `graphium`
mamba env create -f env.yml -n graphium

# Install Graphium in dev mode
mamba activate graphium
pip install --no-deps -e .

For IPU developers

mkdir ~/.venv                               # Create the folder for the environment
python3 -m venv ~/.venv/graphium_ipu        # Create the environment
source ~/.venv/graphium_ipu/bin/activate    # Activate the environment

# Installing the poptorch SDK. Make sure to change the path
pip install PATH_TO_SDK/poptorch-3.2.0+109946_bb50ce43ab_ubuntu_20_04-cp38-cp38-linux_x86_64.whl

# Activate poplar SDK.
source PATH_TO_SDK/enable

# Install the IPU specific and graphium requirements
PACKAGE_NAME=pytorch pip install -r requirements_ipu.txt

# Install Graphium in dev mode
pip install -e .

Training a model

To learn how to train a model, we invite you to look at the documentation, or the jupyter notebooks available here.

If you are not familiar with PyTorch or PyTorch-Lightning, we highly recommend going through their tutorial first.

License

Under the Apache-2.0 license. See LICENSE.

Documentation

  • Diagram for data processing in molGPS.
Data Processing Chart
  • Diagram for Muti-task network in molGPS
Full Graph Multi-task Network

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

graphium-2.0.0.tar.gz (15.3 MB view details)

Uploaded Source

Built Distribution

graphium-2.0.0-py3-none-any.whl (13.3 MB view details)

Uploaded Python 3

File details

Details for the file graphium-2.0.0.tar.gz.

File metadata

  • Download URL: graphium-2.0.0.tar.gz
  • Upload date:
  • Size: 15.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.3

File hashes

Hashes for graphium-2.0.0.tar.gz
Algorithm Hash digest
SHA256 fbb337f403fbbf23e7d3eea8ea6b7934723c4d7ae056e730239bce0713f9c19c
MD5 c6676fc0d7db1387aa983c0af38b6172
BLAKE2b-256 2f89f12de86a54d777ef714de9af6f907372edc5bba019a7b821161aef1a6b4e

See more details on using hashes here.

File details

Details for the file graphium-2.0.0-py3-none-any.whl.

File metadata

  • Download URL: graphium-2.0.0-py3-none-any.whl
  • Upload date:
  • Size: 13.3 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.3

File hashes

Hashes for graphium-2.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 26311e34e5e230cce6e171f985c5d175dedfa00a69331e4f5a613ef3241724be
MD5 b9ca215da12fe54b522589aa34008db4
BLAKE2b-256 15a4fb1c607e72d83c2e3075ab27a737443e9d749ca7733bf3445e67e09826fa

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page