Skip to main content

A teaching platform for computer-aided drug design (CADD) using open source packages and data.

Project description

TeachOpenCADD

A teaching platform for computer-aided drug design (CADD) using open source packages and data.

TOC DOI

License: CC BY 4.0 GitHub tag (latest by date) Test talktorials Anaconda-Server Badge

GitHub closed pr GitHub open pr GitHub closed issues GitHub open issues

If you use TeachOpenCADD in a publication, please cite us! If you use TeachOpenCADD in class, please include a link back to our repository.

In any case, please star (and tell your students to star) those repositories you consider useful for your learning/teaching activities.

Description

TeachOpenCADD topics
Figure adapted from Figure 1 in the TeachOpenCADD publication (D. Sydow et al., J. Cheminformatics, 2019).

Open source programming packages for cheminformatics and structural bioinformatics are powerful tools to build modular, reproducible, and reusable pipelines for computer-aided drug design (CADD). While documentation for such tools is available, only few freely accessible examples teach underlying concepts focused on CADD applications, addressing especially users new to the field.

TeachOpenCADD is a teaching platform developed by students for students, which provides teaching material for central CADD topics. Since we cover both the theoretical as well as practical aspect of these topics, the platform addresses students and researchers with a biological/chemical as well as a computational background.

Each topic is covered in an interactive Jupyter Notebook, using open source packages such as the Python packages rdkit, pypdb, biopandas, nglview, and mdanalysis (find the full list here). Topics are continuously expanded and open for contributions from the community. Beyond their teaching purpose, the TeachOpenCADD material can serve as starting point for users’ project-directed modifications and extensions.

New edition: we have extended the TeachOpenCADD platform with 6 notebooks introducing deep learning and its application to CADD related topics.

Get started

If you can't wait and just want to read through the materials, please go to the read-only version here.

You can run the TeachOpenCADD talktorials either in the cloud for an instant start or locally for a full development experience.

Run Online

The fastest way to explore is via Google Colab. No installation is required.

  • Navigate to the Talktorials list below in Open in Google Colab section.
  • Click the notebook URL on the title column to launch the tutorial directly in your browser.

Run Locally

To set up the project on your machine you can use the TeachOpenCADD runner. This takes care of downloading the talktorial and necessary data and setting up a virtual environment for talktorials.

Using pip

You can install TeachOpenCADD easily via its pip package.

pip install teachopencadd
teachopencadd -h

To start a notebook, you simply call the runner with the talktorial ID:

teachopencadd 6  # change the ID to whichever talktorial you are interested in

You can also use uv to directly run a notebook. There is no need to download the notebook by hand.

uv run --with teachopencadd teachopencadd -h

Open in Google Colab

Talktorial Title
T001 Compound data acquisition (ChEMBL)
T002 Molecular filtering: ADME and lead-likeness criteria
T003 Molecular filtering: unwanted substructures
T004 Ligand-based screening: compound similarity
T005 Compound clustering
T006 Maximum common substructure
T007 Ligand-based screening: machine learning
T008 Protein data acquisition: Protein Data Bank (PDB)
T009 Ligand-based pharmacophores
T010 Binding site similarity and off-target prediction
T011 Querying online API webservices
T012 Data acquisition from KLIFS
T013 Data acquisition from PubChem
T014 Binding site detection
T015 Protein ligand docking
T016 Protein-ligand interactions
T017 Advanced NGLview usage
T018 Automated pipeline for lead optimization
T019 Molecular dynamics simulation
T020 Analyzing molecular dynamics simulations
T021 One-Hot Encoding
T022 Ligand-based screening: neural networks
T023 What is a kinase?
T024 Kinase similarity: Sequence
T025 Kinase similarity: Kinase pocket (KiSSim fingerprint)
T026 Kinase similarity: Interaction fingerprints
T027 Kinase similarity: Ligand profile
T028 Kinase similarity: Compare different perspectives
T033 Molecular representations
T034 RNN-based molecular property prediction
T035 GNN-based molecular property prediction
T036 An introduction to E(3)-invariant graph neural networks
T037 Uncertainty estimation
T038 Protein Ligand Interaction Prediction

TeachOpenCADD KNIME workflows

DOI DOI KNIME Hub

If you prefer to work in the context of a graphical interface, talktorials T001-T008 are also available as KNIME workflows. Questions regarding this version should be addressed using the "Discussion section" available at this post. You need to create a KNIME account to use the forum.

About TeachOpenCADD

External resources

Please refer to our TeachOpenCADD website to find a list of external resources:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

teachopencadd-2023.5.2.post431.tar.gz (200.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

teachopencadd-2023.5.2.post431-py3-none-any.whl (22.1 kB view details)

Uploaded Python 3

File details

Details for the file teachopencadd-2023.5.2.post431.tar.gz.

File metadata

File hashes

Hashes for teachopencadd-2023.5.2.post431.tar.gz
Algorithm Hash digest
SHA256 0b9d696ffd4e4e7c8a27c755686a1b52685b6c6858d3b99af90c7c9d9f607404
MD5 1a8b5f1b15c58d7c8534355fcf770cd4
BLAKE2b-256 dcaeaa96efffe9460ac587c39745fcbdacc8acaf5f49b36f2571a43663552c6b

See more details on using hashes here.

File details

Details for the file teachopencadd-2023.5.2.post431-py3-none-any.whl.

File metadata

File hashes

Hashes for teachopencadd-2023.5.2.post431-py3-none-any.whl
Algorithm Hash digest
SHA256 8651dd3b2e681711397d75bd6e908730e19a07e951d0943caf5d183f9bd1ccb4
MD5 9713306cffad32e4cd1d5fe6f45cfbf1
BLAKE2b-256 3ee57d61cca4aa7c6c6534bbe35d2bbc558918dcc2b6acf596fb38b41cd7b6e2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page