Library for effective molecular fingerprints calculation
Project description
scikit-fingerprints
scikit-fingerprints is a Python library for efficient computation of molecular fingerprints.
Table of Contents
- Description
- Supported platforms
- Installation
- Basic Usage
- General Project Vision
- Contributing
- License
Description
Molecular fingerprints are crucial in various scientific fields, including drug discovery, materials science, and chemical analysis. However, existing Python libraries for computing molecular fingerprints often lack performance, user-friendliness, and support for modern programming standards. This project aims to address these shortcomings by creating an efficient and accessible Python library for molecular fingerprint computation.
You can find the documentation HERE
- The library offers various functions that accept molecule descriptors (e.g., SMILES) and fingerprint parameters, returning the specified fingerprints.
- It's open-source and available for installation via pip.
- The library has been designed for ease of use, minimizing the need for extensive training.
- Compatibility with the standard Python ML stack, based on scikit-learn interfaces, has been a top priority.
Supported platforms
python3.9 |
python3.10 |
python3.11 |
python3.12 |
|
---|---|---|---|---|
Ubuntu - latest | ✅ | ✅ | ✅ | ✅ |
Windows - latest | ✅ | ✅ | ✅ | ✅ |
macOS - latest | only macOS 13 | ✅ | ✅ | ✅ |
Installation
You can install the library using pip:
pip install scikit-fingerprints
Basic Usage
from skfp.fingerprints import AtomPairFingerprint
smiles_list = ['O=S(=O)(O)CCS(=O)(=O)O', 'O=C(O)c1ccccc1O']
atom_pair_fingerprint = AtomPairFingerprint()
X_skfp = atom_pair_fingerprint.transform(smiles_list)
print(X_skfp)
General Project Vision
The primary goal of this project was to develop a Python library that simplifies the computation of widely-used molecular fingerprints, such as Morgan's fingerprint, MACCS fingerprint, and others. This library has the following key features:
-
User-Friendly Interface: The library was designed to provide an intuitive interface, making it easy to integrate into machine learning workflows.
-
Performance Optimization: We implemented molecular fingerprint computation algorithms using concurrent programming techniques to maximize performance. Large datasets of molecules are processed in parallel for improved efficiency.
-
Compatibility: The library's interface was inspired by popular data science libraries like Scikit-Learn, ensuring compatibility and familiarity for users familiar with these tools.
-
Extensibility: Users should be able to customize and extend the library to suit their specific needs.
Contributing
Please read CONTRIBUTING.md and CODE_OF_CONDUCT.md for details on our code of conduct, and the process for submitting pull requests to us.
License
This project is licensed under the MIT License - see the LICENSE.md file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for scikit_fingerprints-1.5.0.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | bda647135c502f7ec70033fe5e645ab54165782bbc1172da6d502acb74f57cb5 |
|
MD5 | b51db0a7fd66f223a9b630b7b8ea17e8 |
|
BLAKE2b-256 | e5994e48cca596703802897a13c6705384e4ec5831edb368689d29630ae7b239 |
Hashes for scikit_fingerprints-1.5.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2ae75859bd978bf1a95bec32fa1f67abb50b50790b4f604ad3db50827f9f15d6 |
|
MD5 | ea11b8f696657651d19113bf6f528305 |
|
BLAKE2b-256 | bc4957e933fa8b0a771a04c6b5a26cd8d3ea76158c62bb9130c9b2fe5a58abf2 |