Skip to main content

Package for STANdard drug Screening by COllaborative FIltering. Performs benchmarks against datasets and SotA algorithms, and implements training, validation and testing procedures.

Project description

funding logo

STANdard for drug Screening with COllaborative FIltering (stanscofi) Python Package

This repository is a part of the EU-funded RECeSS project (#101102016), and hosts the code for the open-source Python package stanscofi for the development of collaborative filtering-based drug repurposing algorithms.

Python Version PyPI version Anaconda version Zenodo version GitHub Build Status Codecov Codefactor JOSS

Statement of need

As of 2022, current drug development pipelines last around 10 years, costing $2billion in average, while drug commercialization failure rates go up to 90%. These issues can be mitigated by drug repurposing, where chemical compounds are screened for new therapeutic indications in a systematic fashion. In prior works, this approach has been implemented through collaborative filtering. This semi-supervised learning framework leverages known drug-disease matchings in order to recommend new ones.

The stanscofi package comprises method-agnostic training, validation, preprocessing and visualization procedures on several published drug repurposing datasets. The proper implementation of these steps is crucial in order to avoid data leakage, i.e., the model is learnt over information that should be unavailable at prediction time. Indeed, data leakage is the source of a major reproducibility crisis in machine learning. This will be avoided by building training and validation sets which are weakly correlated with respect to the drug and disease feature vectors. The main performance metric will be the area under the curve (AUC), which estimates the diagnostic ability of a recommender system better than accuracy in imbalanced datasets.

Medium-term outcomes to this package will significantly alleviate the economic burden of drug discovery pipelines, and will help find treatments in a more sustainable manner, especially for rare or tropical neglected diseases.

For more information about the datasets accessible in stanscofi, please refer to the following repository.

Install the latest release

Run one of the following commands:

## Using pip: install in Python env
pip install stanscofi
## Using Anaconda: install in Python env
conda install -c recess stanscofi
## Using the Docker image: will open a container
docker push recessproject/stanscofi:2.0.0

Documentation about stanscofi can be found at this page. The complete list of dependencies for stanscofi can be found at requirements.txt (pip) or meta.yaml (conda).

Licence

This repository is under an OSI-approved MIT license.

Citation

If you use stanscofi in academic research, please cite it as follows

Réda et al., (2024). stanscofi and benchscofi: a new standard for drug repurposing by collaborative filtering. 
Journal of Open Source Software, 9(93), 5973, https://doi.org/10.21105/joss.05973

Community guidelines with respect to contributions, issue reporting, and support

Pull requests and issue flagging are welcome, and can be made through the GitHub interface. Support can be provided by reaching out to recess-project[at]proton.me. However, please note that contributors and users must abide by the Code of Conduct.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

stanscofi-2.0.1.tar.gz (38.2 kB view details)

Uploaded Source

Built Distribution

stanscofi-2.0.1-py3-none-any.whl (33.2 kB view details)

Uploaded Python 3

File details

Details for the file stanscofi-2.0.1.tar.gz.

File metadata

  • Download URL: stanscofi-2.0.1.tar.gz
  • Upload date:
  • Size: 38.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.2

File hashes

Hashes for stanscofi-2.0.1.tar.gz
Algorithm Hash digest
SHA256 f0e305813594583d562f9dac6c0d4a2f9f025cf9b58e63c5899affe4b9676aec
MD5 9d01e40a5302f7d8471e58e3618fd4aa
BLAKE2b-256 70d31db1402750e32d90ca00074812f7d9a8e776d5b52a70a62058c2a870ac82

See more details on using hashes here.

File details

Details for the file stanscofi-2.0.1-py3-none-any.whl.

File metadata

  • Download URL: stanscofi-2.0.1-py3-none-any.whl
  • Upload date:
  • Size: 33.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.2

File hashes

Hashes for stanscofi-2.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 e5d3ee411f34b375ca0699fc023c4a939cbb55749cf27dddc55b04c0966d795b
MD5 cbb44261d2054723f85cce44b8139eff
BLAKE2b-256 5088e2485615a0d9776093e4335c9d559fb4b8617afb9f8009ca564e194de557

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page