A comprehensive chemical compound analysis platform for drug discovery research, developed by Suneel Kumar BVS, Ph.D., ATOMICAS AI SOLUTIONS PRIVATE LIMITED (https://theatomicas.io)
Project description
DataBuster
A sophisticated chemical compound analysis platform for drug discovery research and molecular data pre=processing through advanced computational chemistry tools.
Developed By
Suneel Kumar BVS, Ph.D.
ATOMICAS AI SOLUTIONS PRIVATE LIMITED
Website: https://theatomicas.io
Contact: suneel@theatomicas.io, suneelkumar.bvs@gmail.com
Official Repo
https://github.com/suneelbvs/databuster
Overview
DataBuster is a powerful platform designed to deep-dive into the datasets and to provide comprehensive analysis of chemical datasets. It provides advanced cheminformatics capabilities to provide an intuitive and powerful analysis which ultimately helps the users to understand the dataset better, to preprocess the dataset for modelling.
Features
- Structure analysis
- Molecular descriptors calculation
- Duplicate detection
- Chirality analysis
- Salt detection
- Structure standardization
- Batch processing
- Command-line interface
Installation
pip install databuster
Usage
Command Line Interface (CLI)
The tool provides a powerful command-line interface for batch processing and automation.
Basic Usage
Usage Examples
1. Basic analysis with all features:
python databuster.py analyze input.csv --smiles-column "SMILES"
2. Specific analysis types using short aliases:
python databuster.py analyze input.csv --smiles-column "SMILES" --analysis-types d m
(d: duplicates, m: molecular descriptors)
3. Export results to custom location:
python databuster.py analyze input.csv --smiles-column "SMILES" --output results.csv
4. Run all available analyses:
python databuster.py analyze input.csv --smiles-column "SMILES" --analysis-types all
License
This project is licensed under the MIT License - see the LICENSE file for details.
Future Developments, and Feedback
Write to suneel@theatomicas.io for any feedback, ideas to implement, and collaborate.
Citation
In progress, will update the details by Jan 15, 2025
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file databuster-0.2.2.tar.gz.
File metadata
- Download URL: databuster-0.2.2.tar.gz
- Upload date:
- Size: 168.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.11.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5ad1063ca7501516a552d1b8cf91562164e912d4b5d4dbad8924625e558767c4
|
|
| MD5 |
313ebd4ed36723804996231a9d16a685
|
|
| BLAKE2b-256 |
b6be357d7b30322ef93965def786e1300162e54ac7f189405ae5e22399527344
|
File details
Details for the file databuster-0.2.2-py3-none-any.whl.
File metadata
- Download URL: databuster-0.2.2-py3-none-any.whl
- Upload date:
- Size: 4.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.11.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
49094000fcea76b7a7f629e5790a6866cb11e8edf22bb41374c06693da206392
|
|
| MD5 |
f3db83e869bf16ee06ffbcd32d6b2dde
|
|
| BLAKE2b-256 |
cf4e61fa887d45342e645d4719948dd9fb6d33cf2e9b798b079f6c1035195259
|