Skip to main content

BDTrinity - An open source, low-code machine learning library in Python.

Project description

What is BDTrinity?

BDTrinity is an open-source, low-code machine learning library in Python that automates machine learning workflows. It is an end-to-end machine learning and model management tool that speeds up the experiment cycle exponentially and makes you more productive.

In comparison with the other open-source machine learning libraries, BDTrinity is an alternate low-code library that can be used to replace hundreds of lines of code with few words only. This makes experiments exponentially fast and efficient. BDTrinity is essentially a Python wrapper around several machine learning libraries and frameworks such as scikit-learn, XGBoost, LightGBM, CatBoost, spaCy, Optuna, Hyperopt, Ray, and many more.

The design and simplicity of BDTrinity is inspired by the emerging role of citizen data scientists, a term first used by Gartner. Citizen Data Scientists are power users who can perform both simple and moderately sophisticated analytical tasks that would previously have required more expertise. Seasoned data scientists are often difficult to find and expensive to hire but citizen data scientists can be an effective way to mitigate this gap and address data-related challenges in the business setting.

BDTrinity is a great library which not only simplifies the machine learning tasks for citizen data scientists but also helps new startups to reduce the cost of investing in a team of data scientists. Therefore, this library has not only helped the citizen data scientists but has also helped individuals who want to start exploring the field of data science, having no prior knowledge in this field.

Guide to Install and usage of BDTrinity library

Current Release

bdtrinity 2.0.0 is now available. The easiest way to install bdtrinity is using pip.

pip install bdtrinity
      (or)
pip install bdtrinity==2.0

BDTrinity on GPU

bdtrinity = 2.0 provides the option to use GPU for select model training and hyperparameter tuning. There is no change in the use of the API, however, in some cases, additional libraries have to be installed as they are not installed with the default slim version or the full version. The following estimators can be trained on GPU.

  • Extreme Gradient Boosting (requires no further installation)

  • CatBoost (requires no further installation)

  • Light Gradient Boosting Machine (requires GPU installation: https://lightgbm.readthedocs.io/en/latest/GPU-Tutorial.html)

  • Logistic Regression, Ridge Classifier, Random Forest, K Neighbors Classifier, K Neighbors Regressor, Support Vector Machine, Linear Regression, Ridge Regression, Lasso Regression (requires cuML >= 0.15 https://github.com/rapidsai/cuml)

If you are using Google Colab you can install Light Gradient Boosting Machine for GPU but first you have to uninstall LightGBM on CPU. Use the below command to do that:

pip uninstall lightgbm -y

# install lightgbm GPU
pip install lightgbm --install-option=--gpu --install-option="--opencl-include-dir=/usr/local/cuda/include/" --install-option="--opencl-library=/usr/local/cuda/lib64/libOpenCL.so"

CatBoost is only enabled on GPU when dataset has > 50,000 rows.

cuML >= 0.15 cannot be installed on Google Colab. Instead use blazingSQL (https://blazingsql.com/) which comes pre-installed with cuML 0.15. Use following command to install BDTrinity:

# install BDTrinity on blazingSQL
!/opt/conda-environments/rapids-stable/bin/python -m pip install --upgrade bdtrinity

Who should use BDTrinity?

BDTrinity is an open source library that anybody can use. In our view the ideal target audience of BDTrinity is:

  • Data Science Students.
  • Data Science Professionals who wants to build rapid prototypes.

License

Copyright 2024-2025 S Satish Kumar sathishsriram999@gmail.com

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE. © 2024 GitHub, Inc.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bdtrinity-2.0.0.tar.gz (24.4 kB view details)

Uploaded Source

Built Distribution

bdtrinity-2.0.0-py3-none-any.whl (24.2 kB view details)

Uploaded Python 3

File details

Details for the file bdtrinity-2.0.0.tar.gz.

File metadata

  • Download URL: bdtrinity-2.0.0.tar.gz
  • Upload date:
  • Size: 24.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.8.20

File hashes

Hashes for bdtrinity-2.0.0.tar.gz
Algorithm Hash digest
SHA256 7dd92e3c8a79b905dbd36b96aa0d38bb83fa16935dadbd46c5de471f43a01e01
MD5 859a7a2f25476d3713183131e3091140
BLAKE2b-256 2d83910a0701c044c38d339a80c231bfcceabf027b695c6c1a67c4585ec6f5d1

See more details on using hashes here.

File details

Details for the file bdtrinity-2.0.0-py3-none-any.whl.

File metadata

  • Download URL: bdtrinity-2.0.0-py3-none-any.whl
  • Upload date:
  • Size: 24.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.8.20

File hashes

Hashes for bdtrinity-2.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 deb3c9408c0aa20a80c2ce78084f952cf1b7b1140f45bb9ae976758256aba744
MD5 b1b630c6da0be20e46d61bb6195194aa
BLAKE2b-256 f75045cd4e14c5f3197970dcf44fe8c81ada95bdc5019a3d7e6e24618614f8e8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page