Skip to main content

A Chemistry-Focused Predictor of Toxicity Risks in Late-Stage Drug Development

Project description

Badges

(Customize these badges with your own links, and check https://shields.io/ or https://badgen.net/ to see which other badges are available.)

fair-software.eu recommendations
(1/5) code repository github repo badge
(2/5) license github license badge
(3/5) community registry RSD workflow pypi badge
(4/5) citation
(5/5) checklist workflow cii badge
howfairis fair-software badge
Other best practices  
Documentation Documentation Status
Build build

Data

You can download the data, including training_and_test_data, precalculated_data_for_trialblazer_model and precomputed_data_for_reproduction_with_notebooks, from: https://doi.org/10.5281/zenodo.17311675

To download the data automatically, see below the description of the Command Line Interface.

Reproduce experiments

To reproduce the experiments in the paper, you can check the notebooks here: https://github.com/molinfo-vienna/trialblazer_notebooks

How to use Trialblazer

A Chemistry-Focused Predictor of Toxicity Risks in Late-Stage Drug Development

Via Command Line

Several commands are made available:

Downloading the model

# Default model and default folder ($HOME/.trialblazer/models/base_model)
trialblazer-download

# Use other URL/folder
trialblazer-download --url=<MODEL-URL> --model-folder=<FOLDER>

Running the algorithm

The input data should be a CSV file with headers and a column named "SMILES". If present, the column "your_id" will also be used for the output.

The command trialblazer --help outputs:

Options:
  --input_file TEXT    Input File  [required]
  --output_file TEXT   Output File
  --model_folder TEXT  Model Folder
  --help               Show this message and exit.

The default output file is names trialblazer.csv.

As a Python library

The library containers 2 main classes:

Trialblazer

This class loads and runs the model.

from trialblazer import Trialblazer

tb = Trialblazer(input_file=<INPUT_FILE>)
tb.run()  # Includes loading of the model, creation of the classifier, and running the algorithm

df = tb.get_dataframe() # This dataframe is augmented with RDKit Mol objects, and displaying it shows the visual representation of each molecule.

tb.write(output_file=<OUTPUT_FILE>)

Trialtrainer

This class is meant to preprocess training data to recreate a model from a single CSV file (training_target_features.csv). It downloads the Chembl database, extracts relevant info, preprocesses data for active and inactive targets, and creates fingerprints files for the 3 sets of molecules (training, active, inactive).

Simply put your training_target_features.csv in your MODEL_FOLDER and run:

from trialblazer import Trialtrainer

tt = Trialtrainer(model_folder=<MODEL_FOLDER>)
tt.build_model_data()

Then you can run the algorithm using:

from trialblazer import Trialblazer

tb = Trialblazer(input_file=<INPUT_FILE>, model_folder=<MODEL_FOLDER>)
tb.run()  # Includes loading of the model, creation of the classifier, and running the algorithm

Installation

To install via PyPI, simply run:

pip install trialblazer

To install trialblazer from GitHub repository through SSH, do:

git clone git@github.com:molinfo-vienna/trialblazer.git
cd trialblazer
python -m pip install .

or through HTTPS:

git clone https://github.com/molinfo-vienna/trialblazer_notebooks.git
cd trialblazer
python -m pip install .

Credits

This package was created with Copier and the NLeSC/python-template.

Citation

Zhang, H., Welsch, M., Schueller, W., & Kirchmair, J. (2025). Trialblazer: A Chemistry-Focused Predictor of Toxicity Risks in Late-Stage Drug Development [Data set]. Zenodo. https://doi.org/10.5281/zenodo.17311675

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

trialblazer-0.2.0.tar.gz (82.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

trialblazer-0.2.0-py3-none-any.whl (73.1 kB view details)

Uploaded Python 3

File details

Details for the file trialblazer-0.2.0.tar.gz.

File metadata

  • Download URL: trialblazer-0.2.0.tar.gz
  • Upload date:
  • Size: 82.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for trialblazer-0.2.0.tar.gz
Algorithm Hash digest
SHA256 68d9e521bd1e326de267fae33cfd12adf3624e60c225cca62b04d97d5c614f7a
MD5 9f3ada319bf223e67ca1e593ac1e3560
BLAKE2b-256 8389f5ad89b871905fd3cf7777aae038702972ff9f61080e217ccc852982f3b6

See more details on using hashes here.

Provenance

The following attestation bundles were made for trialblazer-0.2.0.tar.gz:

Publisher: publish.yml on molinfo-vienna/trialblazer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file trialblazer-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: trialblazer-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 73.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for trialblazer-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 5fd76d1468deda12d87003232b53707574f560f6b967a6362797c284c7a7cbfb
MD5 3aaabc9cc6de0232ede40d5a27e71190
BLAKE2b-256 88eed203599a63816eb3512bd649589dbb9e5537d4fcefe003171db29d960a31

See more details on using hashes here.

Provenance

The following attestation bundles were made for trialblazer-0.2.0-py3-none-any.whl:

Publisher: publish.yml on molinfo-vienna/trialblazer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page