ProQSAR — automation utilities for QSAR workflows (standardization, featurization, model building & evaluation).
Project description
ProQSAR
ProQSAR — automatic pipeline for quantitative structure–activity relationship (QSAR) modeling
A reproducible toolkit for end-to-end QSAR: data standardization, featurization, splitting, model training, uncertainty estimation, and evaluation. Designed for reproducible experiments, continuous integration, and easy integration into ML/CADD pipelines. Full documentation for ProQSAR is available at ReadTheDocs.
Key features
- Data standardization and sanitization (SMILES normalization, valence checks, tautomer/charge handling).
- Modular featurizers: fingerprints, descriptors, learned featurizers (pluggable API).
- Flexible dataset splitting: random, scaffold, stratified.
- Built-in pipelines for training and evaluation with uncertainty estimation.
- Simple CLI and Python API for reproducible experiments and batch processing.
- CI-tested with unit tests and example notebooks.
Installation
Choose the preferred installation method.
From PyPI
pip install proqsar
From conda (anaconda.org/tieulongphan)
conda install -c tieulongphan proqsar
Docker (containerized)
docker pull tieulongphan/proqsar:latest
# run an example container (bind-mount your project directory)
docker run --rm -v $(pwd):/workspace -w /workspace tieulongphan/proqsar:latest proqsar --help
From source (developer)
git clone https://github.com/Medicine-Artificial-Intelligence/proqsar.git
cd proqsar
pip install -e .[dev]
Development & contributing
Thanks for your interest in contributing! A quick checklist:
- Fork the repository and create a feature branch.
- Implement your changes and include unit tests.
- Run linting and tests locally (
pre-commit,flake8,pytest). - Open a Pull Request describing the change and add tests/examples.
Citation / Publication
If you use ProQSAR in research, please cite the project. Example BibTeX placeholder:
@misc{proqsar2025,
title = {ProQSAR: Automatic pipeline for QSAR modeling},
author = {Tuyet-Minh Phan and Tieu-Long Phan and Phuoc-Chung Nguyen Van and contributors},
year = {2025},
howpublished = {\url{https://github.com/Medicine-Artificial-Intelligence/proqsar}}
}
Authors & Contributors
License
This project is licensed under MIT License - see the License file for details.
Acknowledgments
This work has received support from the Korea International Cooperation Agency (KOICA) under the project entitled “Education and Research Capacity Building Project at University of Medicine and Pharmacy at Ho Chi Minh City”, conducted from 2024 to 2025 (Project No. 2021-00020-3).
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file proqsar-0.0.6.tar.gz.
File metadata
- Download URL: proqsar-0.0.6.tar.gz
- Upload date:
- Size: 3.7 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7df7f6e49e261bc4fe844fbd00e1ddd8608ca5602020a89d59f0b092f29524f2
|
|
| MD5 |
e1336a2fd6152fb74e44a5026f2c7584
|
|
| BLAKE2b-256 |
245bad37d48d82392b79157116626cedf14605c0f7571def5d34f19c4e3c4be1
|
Provenance
The following attestation bundles were made for proqsar-0.0.6.tar.gz:
Publisher:
publish-package.yml on Medicine-Artificial-Intelligence/ProQSAR
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
proqsar-0.0.6.tar.gz -
Subject digest:
7df7f6e49e261bc4fe844fbd00e1ddd8608ca5602020a89d59f0b092f29524f2 - Sigstore transparency entry: 555194556
- Sigstore integration time:
-
Permalink:
Medicine-Artificial-Intelligence/ProQSAR@ac536860f6279b0bbbcffc6879ccc42ebae1dce7 -
Branch / Tag:
refs/tags/v0.0.6 - Owner: https://github.com/Medicine-Artificial-Intelligence
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-package.yml@ac536860f6279b0bbbcffc6879ccc42ebae1dce7 -
Trigger Event:
release
-
Statement type:
File details
Details for the file proqsar-0.0.6-py3-none-any.whl.
File metadata
- Download URL: proqsar-0.0.6-py3-none-any.whl
- Upload date:
- Size: 136.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
06692e902ac76b6699817ea96eaeddd7a230b7021307d1210b1ad801b564d87c
|
|
| MD5 |
92d786a6c0a67a8aa474862d06534be9
|
|
| BLAKE2b-256 |
c69687772e8ae098a585cda80e1304c039c1221da0108ff98304e16ec6b1c4af
|
Provenance
The following attestation bundles were made for proqsar-0.0.6-py3-none-any.whl:
Publisher:
publish-package.yml on Medicine-Artificial-Intelligence/ProQSAR
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
proqsar-0.0.6-py3-none-any.whl -
Subject digest:
06692e902ac76b6699817ea96eaeddd7a230b7021307d1210b1ad801b564d87c - Sigstore transparency entry: 555194582
- Sigstore integration time:
-
Permalink:
Medicine-Artificial-Intelligence/ProQSAR@ac536860f6279b0bbbcffc6879ccc42ebae1dce7 -
Branch / Tag:
refs/tags/v0.0.6 - Owner: https://github.com/Medicine-Artificial-Intelligence
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-package.yml@ac536860f6279b0bbbcffc6879ccc42ebae1dce7 -
Trigger Event:
release
-
Statement type: