Skip to main content

A Chemistry-Focused Predictor of Toxicity Risks in Late-Stage Drug Development

Project description

Badges

(Customize these badges with your own links, and check https://shields.io/ or https://badgen.net/ to see which other badges are available.)

fair-software.eu recommendations
(1/5) code repository github repo badge
(2/5) license github license badge
(3/5) community registry RSD workflow pypi badge
(4/5) citation
(5/5) checklist workflow cii badge
howfairis fair-software badge
Other best practices  
Documentation Documentation Status
Build build

Data

You can download the data, including training_and_test_data, precalculated_data_for_trialblazer_model and precomputed_data_for_reproduction_with_notebooks, from: https://doi.org/10.5281/zenodo.15783346

To download the data automatically, see below the description of the Command Line Interface.

Reproduce experiments

To reproduce the experiments in the paper, you can check the notebooks here: https://github.com/molinfo-vienna/trialblazer_notebooks

How to use Trialblazer

A Chemistry-Focused Predictor of Toxicity Risks in Late-Stage Drug Development

Via Command Line

Several commands are made available:

Downloading the model

# Default model and default folder ($HOME/.trialblazer/models/base_model)
trialblazer-download

# Use other URL/folder
trialblazer-download --url=<MODEL-URL> --model-folder=<FOLDER>

Running the algorithm

The input data should be a CSV file with headers and a column named "SMILES". If present, the column "chembl_id" will also be used for the output.

The command trialblazer --help outputs:

Options:
  --input_file TEXT    Input File  [required]
  --output_file TEXT   Output File
  --model_folder TEXT  Model Folder
  --help               Show this message and exit.

The default output file is names trialblazer.csv.

As a Python library

The library containers 2 main classes:

Trialblazer

This class loads and runs the model.

from trialblazer import Trialblazer

tb = Trialblazer(input_file=<INPUT_FILE>)
tb.run()  # Includes loading of the model, creation of the classifier, and running the algorithm

df = tb.get_dataframe() # This dataframe is augmented with RDKit Mol objects, and displaying it shows the visual representation of each molecule.

tb.write(output_file=<OUTPUT_FILE>)

Trialtrainer

This class is meant to preprocess training data to recreate a model from a single CSV file (training_target_features.csv). It downloads the Chembl database, extracts relevant info, preprocesses data for active and inactive targets, and creates fingerprints files for the 3 sets of molecules (training, active, inactive).

Simply put your training_target_features.csv in your MODEL_FOLDER and run:

from trialblazer import Trialtrainer

tt = Trialtrainer(model_folder=<MODEL_FOLDER>)
tt.build_model_data()

Then you can run the algorithm using:

from trialblazer import Trialblazer

tb = Trialblazer(input_file=<INPUT_FILE>, model_folder=<MODEL_FOLDER>)
tb.run()  # Includes loading of the model, creation of the classifier, and running the algorithm

Installation

To install via PyPI, simply run:

pip install trialblazer

To install trialblazer from GitHub repository through SSH, do:

git clone git@github.com:molinfo-vienna/trialblazer.git
cd trialblazer
python -m pip install .

or through HTTPS:

git clone https://github.com/molinfo-vienna/trialblazer_notebooks.git
cd trialblazer
python -m pip install .

Credits

This package was created with Copier and the NLeSC/python-template.

Citation

Zhang, H., Welsch, M., Schueller, W., & Kirchmair, J. (2025). Trialblazer: A Chemistry-Focused Predictor of Toxicity Risks in Late-Stage Drug Development [Data set]. Zenodo. https://doi.org/10.5281/zenodo.15783346

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

trialblazer-0.1.0.tar.gz (45.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

trialblazer-0.1.0-py3-none-any.whl (40.1 kB view details)

Uploaded Python 3

File details

Details for the file trialblazer-0.1.0.tar.gz.

File metadata

  • Download URL: trialblazer-0.1.0.tar.gz
  • Upload date:
  • Size: 45.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for trialblazer-0.1.0.tar.gz
Algorithm Hash digest
SHA256 6f2a22ea7458bcc9d0addc76480a46dd2412312cdf4c3c32ddb6c56c75dcceee
MD5 68501334059cdbfcdd1e9fdc961f7631
BLAKE2b-256 3deb2ab78a82afd17a8cc3cd09da112c44a07c5c471d06e99b302c6bc229fc62

See more details on using hashes here.

Provenance

The following attestation bundles were made for trialblazer-0.1.0.tar.gz:

Publisher: publish.yml on molinfo-vienna/trialblazer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file trialblazer-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: trialblazer-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 40.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for trialblazer-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 928723a678faaf5a2dff1f394865a2a044a18f1f090adac1488fceedd3851e5f
MD5 cbb59c12b2a551efd84ff8179540b52e
BLAKE2b-256 2772bffc703037843ab720a6bd6ccd91b7ae04d8d3cacfaba7f6cfef74a0e461

See more details on using hashes here.

Provenance

The following attestation bundles were made for trialblazer-0.1.0-py3-none-any.whl:

Publisher: publish.yml on molinfo-vienna/trialblazer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page