Library for effective molecular fingerprints calculation
Project description
Effective Molecule Fingerprints library
Table of Contents
- Description
- General Project Vision
- Library Description
- Installation
- Usage
- Technologies Used
- Contributing
- License
Description
Molecular fingerprints are crucial in various scientific fields, including drug discovery, materials science, and chemical analysis. However, existing Python libraries for computing molecular fingerprints often lack performance, user-friendliness, and support for modern programming standards. This project aims to address these shortcomings by creating an efficient and accessible Python library for molecular fingerprint computation.
General Project Vision
The primary goal of this project is to develop a Python library that simplifies the computation of widely-used molecular fingerprints, such as Morgan's fingerprint, MACCS fingerprint, and others. This library will have the following key features:
-
User-Friendly Interface: The library will provide an intuitive interface, making it easy to integrate into machine learning workflows.
-
Performance Optimization: We will implement molecular fingerprint computation algorithms using concurrent programming techniques to maximize performance. Large datasets of molecules will be processed in parallel for improved efficiency.
-
Compatibility: The library's interface will be inspired by popular data science libraries like Scikit-Learn, ensuring compatibility and familiarity for users familiar with these tools.
-
Extensibility: Users will be able to customize and extend the library to suit their specific needs.
Library Description
- The library will offer various functions that accept molecule descriptors (e.g., SMILES) and fingerprint parameters, returning the specified fingerprints.
- It will be open-source and available for installation via pip.
- Automated testing will be implemented to support continuous development and integration.
- The library will be designed for ease of use, minimizing the need for extensive training.
- Compatibility with the standard Python ML stack, based on Scikit-Learn interfaces, will be a top priority.
Installation
You can install the library using pip:
pip install skfp
Technologies Used
Our project leverages the following technologies:
-
Scikit-learn: A Python library for machine learning and data analysis.
-
NumPy: A library for numerical and scientific computing.
-
Joblib: Used for parallel computing.
-
Rdkit: Open-Source Cheminformatics Software.
By contributing to this project, you can help advance the fields of chemistry and cheminformatics by providing scientists with a powerful tool for molecular structure analysis. We welcome your collaboration and feedback.
Contributing
Please read CONTRIBUTING.md and CODE_OF_CONDUCT.md for details on our code of conduct, and the process for submitting pull requests to us.
License
This project is licensed under the MIT License - see the LICENSE.md file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for scikit_fingerprints-1.0.0.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | f66604fe7941685dc2437d729204dd05312fdefcf0a7027196e8b5fb578a97b9 |
|
MD5 | 70d94de1d8b995489204c0881cbb569e |
|
BLAKE2b-256 | 8cce88d756ced0467b966e52de31c2b0460724e245ac4f63c5d29a05aa2eb628 |
Hashes for scikit_fingerprints-1.0.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3fc222817f8a7568ceeb17400796e3ed61d4a58d3caae7a3f2be9905402a48d5 |
|
MD5 | 877fc41936bc321bbc87eee328b802b2 |
|
BLAKE2b-256 | ecce153636b0aa8a5db3c06c4a94f8c21108bb2a74f84f2f733c671db039cda2 |