A Chemistry-Focused Predictor of Toxicity Risks in Late-Stage Drug Development
Project description
Badges
(Customize these badges with your own links, and check https://shields.io/ or https://badgen.net/ to see which other badges are available.)
| fair-software.eu recommendations | |
|---|---|
| (1/5) code repository | |
| (2/5) license | |
| (3/5) community registry | |
| (4/5) citation | |
| (5/5) checklist | |
| howfairis | |
| Other best practices | |
| Documentation | |
| Build |
Data
You can download the data, including training_and_test_data, precalculated_data_for_trialblazer_model and precomputed_data_for_reproduction_with_notebooks, from: https://doi.org/10.5281/zenodo.15783346
To download the data automatically, see below the description of the Command Line Interface.
Reproduce experiments
To reproduce the experiments in the paper, you can check the notebooks here: https://github.com/molinfo-vienna/trialblazer_notebooks
How to use Trialblazer
A Chemistry-Focused Predictor of Toxicity Risks in Late-Stage Drug Development
Via Command Line
Several commands are made available:
Downloading the model
# Default model and default folder ($HOME/.trialblazer/models/base_model)
trialblazer-download
# Use other URL/folder
trialblazer-download --url=<MODEL-URL> --model-folder=<FOLDER>
Running the algorithm
The input data should be a CSV file with headers and a column named "SMILES". If present, the column "chembl_id" will also be used for the output.
The command trialblazer --help outputs:
Options:
--input_file TEXT Input File [required]
--output_file TEXT Output File
--model_folder TEXT Model Folder
--help Show this message and exit.
The default output file is names trialblazer.csv.
As a Python library
The library containers 2 main classes:
Trialblazer
This class loads and runs the model.
from trialblazer import Trialblazer
tb = Trialblazer(input_file=<INPUT_FILE>)
tb.run() # Includes loading of the model, creation of the classifier, and running the algorithm
df = tb.get_dataframe() # This dataframe is augmented with RDKit Mol objects, and displaying it shows the visual representation of each molecule.
tb.write(output_file=<OUTPUT_FILE>)
Trialtrainer
This class is meant to preprocess training data to recreate a model from a single CSV file (training_target_features.csv). It downloads the Chembl database, extracts relevant info, preprocesses data for active and inactive targets, and creates fingerprints files for the 3 sets of molecules (training, active, inactive).
Simply put your training_target_features.csv in your MODEL_FOLDER and run:
from trialblazer import Trialtrainer
tt = Trialtrainer(model_folder=<MODEL_FOLDER>)
tt.build_model_data()
Then you can run the algorithm using:
from trialblazer import Trialblazer
tb = Trialblazer(input_file=<INPUT_FILE>, model_folder=<MODEL_FOLDER>)
tb.run() # Includes loading of the model, creation of the classifier, and running the algorithm
Installation
To install via PyPI, simply run:
pip install trialblazer
To install trialblazer from GitHub repository through SSH, do:
git clone git@github.com:molinfo-vienna/trialblazer.git
cd trialblazer
python -m pip install .
or through HTTPS:
git clone https://github.com/molinfo-vienna/trialblazer_notebooks.git
cd trialblazer
python -m pip install .
Credits
This package was created with Copier and the NLeSC/python-template.
Citation
Zhang, H., Welsch, M., Schueller, W., & Kirchmair, J. (2025). Trialblazer: A Chemistry-Focused Predictor of Toxicity Risks in Late-Stage Drug Development [Data set]. Zenodo. https://doi.org/10.5281/zenodo.15783346
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file trialblazer-0.1.0.tar.gz.
File metadata
- Download URL: trialblazer-0.1.0.tar.gz
- Upload date:
- Size: 45.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6f2a22ea7458bcc9d0addc76480a46dd2412312cdf4c3c32ddb6c56c75dcceee
|
|
| MD5 |
68501334059cdbfcdd1e9fdc961f7631
|
|
| BLAKE2b-256 |
3deb2ab78a82afd17a8cc3cd09da112c44a07c5c471d06e99b302c6bc229fc62
|
Provenance
The following attestation bundles were made for trialblazer-0.1.0.tar.gz:
Publisher:
publish.yml on molinfo-vienna/trialblazer
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
trialblazer-0.1.0.tar.gz -
Subject digest:
6f2a22ea7458bcc9d0addc76480a46dd2412312cdf4c3c32ddb6c56c75dcceee - Sigstore transparency entry: 258171157
- Sigstore integration time:
-
Permalink:
molinfo-vienna/trialblazer@87b6360ebfd919426b9483daf727b1658753498e -
Branch / Tag:
refs/heads/release - Owner: https://github.com/molinfo-vienna
-
Access:
private
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@87b6360ebfd919426b9483daf727b1658753498e -
Trigger Event:
push
-
Statement type:
File details
Details for the file trialblazer-0.1.0-py3-none-any.whl.
File metadata
- Download URL: trialblazer-0.1.0-py3-none-any.whl
- Upload date:
- Size: 40.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
928723a678faaf5a2dff1f394865a2a044a18f1f090adac1488fceedd3851e5f
|
|
| MD5 |
cbb59c12b2a551efd84ff8179540b52e
|
|
| BLAKE2b-256 |
2772bffc703037843ab720a6bd6ccd91b7ae04d8d3cacfaba7f6cfef74a0e461
|
Provenance
The following attestation bundles were made for trialblazer-0.1.0-py3-none-any.whl:
Publisher:
publish.yml on molinfo-vienna/trialblazer
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
trialblazer-0.1.0-py3-none-any.whl -
Subject digest:
928723a678faaf5a2dff1f394865a2a044a18f1f090adac1488fceedd3851e5f - Sigstore transparency entry: 258171164
- Sigstore integration time:
-
Permalink:
molinfo-vienna/trialblazer@87b6360ebfd919426b9483daf727b1658753498e -
Branch / Tag:
refs/heads/release - Owner: https://github.com/molinfo-vienna
-
Access:
private
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@87b6360ebfd919426b9483daf727b1658753498e -
Trigger Event:
push
-
Statement type: