Skip to main content

A comprehensive chemical compound analysis platform for drug discovery research, developed by Suneel Kumar BVS, Ph.D., ATOMICAS AI SOLUTIONS PRIVATE LIMITED (https://theatomicas.io)

Project description

DataBuster

A sophisticated chemical compound analysis platform for drug discovery research and molecular data pre=processing through advanced computational chemistry tools.

Developed By

Suneel Kumar BVS, Ph.D.
ATOMICAS AI SOLUTIONS PRIVATE LIMITED
Website: https://theatomicas.io
Contact: suneel@theatomicas.io, suneelkumar.bvs@gmail.com

Official Repo

https://github.com/suneelbvs/databuster

Overview

DataBuster is a powerful platform designed to deep-dive into the datasets and to provide comprehensive analysis of chemical datasets. It provides advanced cheminformatics capabilities to provide an intuitive and powerful analysis which ultimately helps the users to understand the dataset better, to preprocess the dataset for modelling.

Features

  • Structure analysis
  • Molecular descriptors calculation
  • Duplicate detection
  • Chirality analysis
  • Salt detection
  • Structure standardization
  • Batch processing
  • Command-line interface

Installation

pip install databuster

Usage

Command Line Interface (CLI)

The tool provides a powerful command-line interface for batch processing and automation.

Basic Usage

Usage Examples

1. Basic analysis with all features:
   python databuster.py analyze input.csv --smiles-column "SMILES"

2. Specific analysis types using short aliases:
   python databuster.py analyze input.csv --smiles-column "SMILES" --analysis-types d m
   (d: duplicates, m: molecular descriptors)

3. Export results to custom location:
   python databuster.py analyze input.csv --smiles-column "SMILES" --output results.csv

4. Run all available analyses:
   python databuster.py analyze input.csv --smiles-column "SMILES" --analysis-types all

License

This project is licensed under the MIT License - see the LICENSE file for details.

Future Developments, and Feedback

Write to suneel@theatomicas.io for any feedback, ideas to implement, and collaborate.

Citation

In progress, will update the details by Jan 15, 2025

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

databuster-0.2.2.tar.gz (168.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

databuster-0.2.2-py3-none-any.whl (4.6 kB view details)

Uploaded Python 3

File details

Details for the file databuster-0.2.2.tar.gz.

File metadata

  • Download URL: databuster-0.2.2.tar.gz
  • Upload date:
  • Size: 168.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.10

File hashes

Hashes for databuster-0.2.2.tar.gz
Algorithm Hash digest
SHA256 5ad1063ca7501516a552d1b8cf91562164e912d4b5d4dbad8924625e558767c4
MD5 313ebd4ed36723804996231a9d16a685
BLAKE2b-256 b6be357d7b30322ef93965def786e1300162e54ac7f189405ae5e22399527344

See more details on using hashes here.

File details

Details for the file databuster-0.2.2-py3-none-any.whl.

File metadata

  • Download URL: databuster-0.2.2-py3-none-any.whl
  • Upload date:
  • Size: 4.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.10

File hashes

Hashes for databuster-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 49094000fcea76b7a7f629e5790a6866cb11e8edf22bb41374c06693da206392
MD5 f3db83e869bf16ee06ffbcd32d6b2dde
BLAKE2b-256 cf4e61fa887d45342e645d4719948dd9fb6d33cf2e9b798b079f6c1035195259

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page