Skip to main content

Unified drug discovery ML toolkit

Project description

Refua

Refua is a general drug discovery ML toolkit that brings together structure prediction, affinity modeling, and generative design in one package. It unifies the Boltz inference stack with the BoltzGen design pipeline so you can move from target definition to candidate generation using a shared, programmatic API.

Why use Refua?

  • Fluent Python API: Build specs, run predictions, and post-process results with a readable, composable API—no need to stitch together CLI calls, YAML, or intermediate files.
  • No per-call model reload: Keep a predictor/model instance alive and run many inferences in a loop without paying initialization and checkpoint-loading cost each time.
  • All-in-memory workflows: Prepare inputs, run inference, and perform analysis entirely in memory (e.g., passing objects/arrays between steps) without writing temporary files to disk.
  • Improved build: We are able to support more up to date dependencies. Over time we expect to widen this support.

Quickstart

Install:

pip install refua

Install with NVIDIA GPU support:

pip install "refua[cuda]"

Unified API (same Complex flow for ligands or binders):

from pathlib import Path

from refua import Binder, Complex, Protein, SM

target = Protein(
    "MSEQNNTEMTFQIQRIYTKDISFEAPNAPHVFQQLAGKYTPEEIRNVLSTLQKAD",
    ids="A",
)

# Protein + ligand -> Boltz2 structure + affinity
result = (
    Complex([target, SM("Cn1cnc2n(C)c(=O)n(C)c(=O)c12")], name="demo")
    .request_affinity()
    .fold()
)
Path("complex.bcif").write_bytes(result.to_bcif())
print(result.affinity.ic50, result.affinity.binding_probability)

# Protein + binder placeholder -> Boltz2 structure + BoltzGen design inputs
binder = Binder(length=12, ids="P")
result = Complex([target, binder], name="design").fold()
Path("design.bcif").write_bytes(result.to_bcif())
print("binder spec:", binder.sequence)

For template-based designs, add .file(...) to the same Complex before fold().

Antibody design with BinderDesigns + Complex:

from refua import Binder, BinderDesigns, Complex, Protein

antigen_seq = "MSEQNNTEMTFQIQRIYTKDISFEAPNAPHVFQQLAGKYTPEEIRNVLSTLQKAD"
antigen = Protein(antigen_seq, ids="A", binding_types={"binding": "10..30"})
binder_pair = BinderDesigns.antibody(
    heavy_cdr_lengths=(12, 10, 14),
    light_cdr_lengths=(10, 9, 9),
    heavy_id="H",
    light_id="L",
)
design = Complex([antigen, *binder_pair], name="antibody_design")

# Optional explicit overrides
# design = Complex([antigen, Binder("8C6", ids="H"), Binder("7C5", ids="L")], name="antibody_design")

result = design.fold()
print(result.binder_specs)
print(result.chain_design_summary())

# Peptide presets
linear_peptide = BinderDesigns.peptide(length=14, ids="P")
disulfide_peptide = BinderDesigns.disulfide_peptide(
    segment_lengths=(10, 6, 3),
    ids="Q",
)

Low-level Binder API for arbitrary macromolecule design:

from refua import Binder, Complex, Protein

target = Protein("MSEQNNTEMTFQIQRIYTKDISFEAPNAPHVFQQLAGKYTPEEIRNVLSTLQKAD", ids="A")
macro = Binder(
    spec="{core}C{loop}C{tail}",  # Any BoltzGen-compatible sequence-spec pattern.
    template_values={"core": 10, "loop": 6, "tail": 4},
    ids="M",
    binding_types="3..12",
    cyclic=True,
)
result = Complex([target, macro], name="custom_macro_design").fold()
print(result.binder_specs)

Small molecule properties:

from refua import SM

props = SM("Cn1cnc2n(C)c(=O)n(C)c(=O)c12", lazy=True)
print(props.mol_wt(), props.logp())
print(props.to_dict())

Pass lazy=False to compute all properties eagerly.

Protein properties:

from refua import Protein, protein_property_specs

target = Protein("MSEQNNTEMTFQIQRIYTKDISFEAPNAPHVFQQLAGKYTPEEIRNVLSTLQKAD", ids="A")
print(target.length(), target.pi(), target.gravy())
print(target.antibody_liability_score(), target.peptide_linear_liability_score())
print(target.to_dict(groups=["basic"]))
print(protein_property_specs()["antibody_liability_score"].description)

Model-based ADMET predictions (optional; requires refua[admet]):

props = SM("Cn1cnc2n(C)c(=O)n(C)c(=O)c12")
profile = props.admet_profile()
print(profile["admet_score"], profile["assessment"])
print(props.herg(), props.ames())

BoltzGen defaults to the bundled molecule library in the Hugging Face cache. Set BOLTZGEN_MOLDIR or pass mol_dir to override.

CLI entrypoints are still available:

boltz --help
boltzgen --help

Documentation

  • Boltz docs live in docs/boltz.
  • BoltzGen docs live in docs/boltzgen.
  • API reference source lives in docs/api (build with sphinx-build -b html docs/api docs/api/_build/html).

Examples

  • examples/antibody_design.py shows antibody setup with BinderDesigns.
  • examples/protein_ligand_affinity.py shows a protein-ligand affinity spec.
  • examples/boltz2_kras_mrtx1133.py folds KRAS G12D with the MRTX-1133 inhibitor and prints affinity.
  • examples/boltzgen_peptide_binder.py shows a peptide binder spec with optional cyclic peptides.
  • examples/boltz_constraints.py shows a complex with pocket/contact constraints and an optional MSA.
  • examples/boltz_multichain_msa.py shows multi-chain MSAs with cross-chain constraints.
  • examples/boltzgen_template_insertions.py shows template structure groups and design insertions.
  • examples/boltz_multi_pocket_complex.py shows multi-ligand pocket constraints with contacts.
  • examples/boltzgen_template_masks.py shows template masks with design/not-design ranges and structure groups.

Notes

This repository consolidates the two stacks into a single build and dependency set. If you need the legacy standalone documentation, it has been preserved under docs/.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

refua-0.6.0.tar.gz (538.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

refua-0.6.0-py3-none-any.whl (674.0 kB view details)

Uploaded Python 3

File details

Details for the file refua-0.6.0.tar.gz.

File metadata

  • Download URL: refua-0.6.0.tar.gz
  • Upload date:
  • Size: 538.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.3.2 CPython/3.14.3 Darwin/25.2.0

File hashes

Hashes for refua-0.6.0.tar.gz
Algorithm Hash digest
SHA256 df8b9eba881f0b0d35a2d4dd77b59c20aaae0396fc92499a5b78fedd088d2e07
MD5 5f999479efb133f36090b663b937cf78
BLAKE2b-256 310924904006c96142de5edca8d5ce1eb8d4e6742da3397c3b6d2e40e876a15c

See more details on using hashes here.

File details

Details for the file refua-0.6.0-py3-none-any.whl.

File metadata

  • Download URL: refua-0.6.0-py3-none-any.whl
  • Upload date:
  • Size: 674.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.3.2 CPython/3.14.3 Darwin/25.2.0

File hashes

Hashes for refua-0.6.0-py3-none-any.whl
Algorithm Hash digest
SHA256 fd28069467d600a0a806f535db4664fcec176746c892ac5a1349663f41d805e9
MD5 0bcda71cb251bbcbc9b8b64cecc8af2c
BLAKE2b-256 670e95961554ac99d154074a7ac682cf2005c8e68777242648ef04640b38cd46

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page