Skip to main content

Graph Neural Networks for Molecular Machine Learning

Project description

molcraft-logo

Deep Learning on Molecules: Graph Neural Networks for Molecular Machine Learning.

[!IMPORTANT] Under active development.

Examples

Context-Aware Graph Neural Network

Implement a context-aware graph neural network by embedding context features in the super node. The super node is a virtual node bidirectionally linked to all atomic nodes, allowing both efficient information propagation and inclusion of context features. Context features may be continuous or discrete (categorical); for discrete context features, specify the number of categories expected via num_categories of the AddContext layer.

from molcraft import features
from molcraft import featurizers 
from molcraft import layers
from molcraft import models

import keras
import pandas as pd

featurizer = featurizers.MolGraphFeaturizer(
    atom_features=[
        features.AtomType(),
        features.NumHydrogens(),
        features.Degree(),
    ],
    bond_features=[
        features.BondType(),
        features.IsRotatable(),
    ],
    super_node=True,
    self_loops=True,
)

df = pd.DataFrame({
    'smiles': [
        'N[C@@H](C)C(=O)O', 'N[C@@H](CS)C(=O)O' 
    ],
    'label': [3.5, -1.5],
    'ph': [7.2, 4.5],
    'temperature': [35., 45.],
})

graph = featurizer(df)

model = models.GraphModel.from_layers(
    [
        layers.Input(graph.spec),
        layers.NodeEmbedding(dim=128),
        layers.EdgeEmbedding(dim=128),
        layers.AddContext(field='ph'),
        layers.AddContext(field='temperature'),
        layers.GraphConv(units=128),
        layers.GraphConv(units=128),
        layers.GraphConv(units=128),
        layers.GraphConv(units=128),
        layers.Readout(mode='mean'),
        keras.layers.Dense(units=1024, activation='elu'),
        keras.layers.Dense(units=1024, activation='elu'),
        keras.layers.Dense(1)
    ]
)

model.compile(
    keras.optimizers.Adam(1e-4), keras.losses.MeanSquaredError()
)
model.fit(graph, epochs=30)
pred = model.predict(graph)

# Uncomment below to save and load model (including featurizer)
# featurizers.save_featurizer(featurizer, '/tmp/featurizer.json')
# models.save_model(model, '/tmp/model.keras')

# loaded_featurizer = featurizers.load_featurizer('/tmp/featurizer.json')
# loaded_model = models.load_model('/tmp/model.keras')

Hybrid Model for Peptides

Implement a GNN-RNN hybrid model for peptides.

from molcraft import features
from molcraft import featurizers 
from molcraft import layers
from molcraft import models

import keras
import pandas as pd

featurizer = featurizers.PeptideGraphFeaturizer(
    atom_features=[
        features.AtomType(),
        features.NumHydrogens(),
        features.Degree(),
    ],
    bond_features=[
        features.BondType(),
        features.IsRotatable(),
    ],
)

# Allow modified amino acids:
# featurizer.monomers.update({
#     "C[Carbamidomethyl]": "N[C@@H](CSCC(=O)N)C(=O)O"
# })

df = pd.DataFrame({
    'sequence': [
        'CYIQNCPLG', 'KTTKS' 
    ],
    'label': [1.0, 0.0],
})

graph = featurizer(df)

model = models.GraphModel.from_layers(
    [
        layers.Input(graph.spec),
        layers.NodeEmbedding(dim=128),
        layers.EdgeEmbedding(dim=128),
        layers.GraphConv(units=128),
        layers.GraphConv(units=128),
        layers.GraphConv(units=128),
        layers.GraphConv(units=128),
        layers.PeptideReadout(),
        keras.layers.Masking(),
        keras.layers.Bidirectional(
            keras.layers.LSTM(units=128, return_sequences=True)
        ),
        keras.layers.GlobalAveragePooling1D(),
        keras.layers.Dense(units=1024, activation='elu'),
        keras.layers.Dense(units=1024, activation='elu'),
        keras.layers.Dense(1, activation='sigmoid')
    ]
)

model.compile(
    keras.optimizers.Adam(1e-4), keras.losses.BinaryCrossentropy()
)
model.fit(graph, epochs=30)
pred = model.predict(graph)

# Uncomment below to save and load model (including featurizer)
# featurizers.save_featurizer(featurizer, '/tmp/featurizer.json')
# models.save_model(model, '/tmp/model.keras')

# loaded_featurizer = featurizers.load_featurizer('/tmp/featurizer.json')
# loaded_model = models.load_model('/tmp/model.keras')

Installation

For CPU users:

pip install molcraft

For GPU users:

pip install molcraft[gpu]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

molcraft-0.10.0.tar.gz (59.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

molcraft-0.10.0-py3-none-any.whl (58.8 kB view details)

Uploaded Python 3

File details

Details for the file molcraft-0.10.0.tar.gz.

File metadata

  • Download URL: molcraft-0.10.0.tar.gz
  • Upload date:
  • Size: 59.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for molcraft-0.10.0.tar.gz
Algorithm Hash digest
SHA256 62766c41ed75ec66cdc41a8c24c2da263887e0651d6f9520a0210e07ad67f259
MD5 8e98d0bc303cff00f238e5ffdccf1ccc
BLAKE2b-256 e2311196234ead3ab8eaa738a13afbd1247a0014e01cf53e636b6526d4bfcc41

See more details on using hashes here.

File details

Details for the file molcraft-0.10.0-py3-none-any.whl.

File metadata

  • Download URL: molcraft-0.10.0-py3-none-any.whl
  • Upload date:
  • Size: 58.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for molcraft-0.10.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a2ffe87c874fbcf3eb4caf0d0fd91a124eee586de4765b24ccfc29a9884a7bde
MD5 54a71978ba550ee22a0f15af16f4d25b
BLAKE2b-256 db4e37b68e212cfb3e0b5025c6fa36c88180df8d244a132046f790746168d4d2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page