Skip to main content

Python modules for data analytics applications with dask

Project description

Tests Coverage Test Status 313 Test Status 312 Test Status 311 Test Status 310

TestPyPI Build Status

PyPI Build Status PyPI Version PyPI Downloads

DOI

Buy Me A Coffee

pybear-dask is a Python computing library that supplements the pybear library with analogous modules that have dask capability.

Website: https://github.com/PylarBear/pybear-dask

License

BSD 3-Clause License. See License File.


Installation

Dependencies

pybear-dask requires:

  • Python (>=3.10, <3.13)

  • dask (<2025.1.0)

  • dask-ml (<2025.1.0)

  • distributed (<2025.1.0)

  • pybear (>=0.2.0)

User installation

Install pybear-dask from the online PyPI package repository using pip:

(your-env) $ pip install pybear-dask

Conda distributions are expected to be made available sometime after release to PyPI.


Usage

The folder structure of pybear-dask is nearly identical to scikit-learn. This is so those that are familiar with the scikit layout and have experience with writing the associated import statements have an easy transition to pybear-dask. The pybear-dask subfolders are base and model_selection.

You can import pybear-dask’s packages in the same way you would with scikit. Here are a few examples of how you could import and use pybear-dask modules:

from pybear-dask.model_selection import GSTCVDask

search = GSTCVDask()
search.fit(X, y)

from pybear-dask import model_selection as ms

search = ms.AutoGridSearchCVDask()
search.fit(X, y)

Major Modules

AutoGridSearchCVDask

Perform multiple uninterrupted passes of grid search with dask_ml GridSearchCV and dask objects utilizing progressively narrower search grids.

  • Access via pybear-dask.model_selection.AutoGridSearchCVDask.

GSTCVDask (GridSearchThresholdCV for Dask)

Perform conventional grid search on a classifier with concurrent threshold search using dask objects in parallel and distributed environments. Finds the global optima for the passed parameters and thresholds. Fully compliant with the dask_ml/scikit-learn GridSearchCV API.

  • Access via pybear-dask.model_selection.GSTCVDask.

AutoGSTCVDask (AutoGridSearchThresholdCV for Dask)

Perform multiple uninterrupted passes of grid search with pybear-dask GSTCVDask utilizing progressively narrower search grids.

  • Access via pybear-dask.model_selection.AutoGSTCVDask.


Changelog

See the changelog for a history of notable changes to pybear-dask.


Development

Source code

You can clone the latest source code with the command:

git clone https://github.com/PylarBear/pybear-dask.git

Contributing

pybear-dask is not ready for contributions at this time!

Testing

pybear-dask 0.2 is tested via GitHub Actions to run on Linux, Windows, and MacOS, with Python versions 3.10, 3.11, and 3.12. pybear-dask is not tested on earlier versions, but some features may work.

If you want to test pybear-dask yourself, you will need:

  • pytest (>=7.0.0)

The tests are not available in the PyPI pip installation. You can get the tests by downloading the tarball from the pybear-dask project page on pypi.org or cloning the pybear-dask repo from GitHub. Once you have the source files in a local project folder, create a poetry environment for the project and install the test dependencies. After installation, launch the poetry environment shell and you can launch the test suite from the root of your pybear-dask project folder with:

(your-pybear-dask-env) you@your_computer:/path/to/pybear-dask/project$ pytest tests/

Project History

This project was spun off the main pybear project just prior to the first public release of both. pybear-dask was spun off to ensure maximum stability for the main pybear project, while keeping these modules available.

Help and Support

Documentation

Documentation is not expected to be made available via a website for this package. Use the documentation for similar packages in the main pybear package. See the repo for pybear: https://github.com/PylarBear/pybear/

Communication

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pybear_dask-0.2.3.tar.gz (72.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pybear_dask-0.2.3-py3-none-any.whl (40.0 kB view details)

Uploaded Python 3

File details

Details for the file pybear_dask-0.2.3.tar.gz.

File metadata

  • Download URL: pybear_dask-0.2.3.tar.gz
  • Upload date:
  • Size: 72.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pybear_dask-0.2.3.tar.gz
Algorithm Hash digest
SHA256 d39507e1e96e61bccddf0416767c1fb1af6e15e4440a7ad764c1d9cf9decc9b6
MD5 ce3c4520ff37f948b430c660febaabae
BLAKE2b-256 88e498442516c38bd5b806b373a415f237068eabf3b036fc79566c2e7aa44173

See more details on using hashes here.

Provenance

The following attestation bundles were made for pybear_dask-0.2.3.tar.gz:

Publisher: pypi-publish.yml on PylarBear/pybear-dask

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pybear_dask-0.2.3-py3-none-any.whl.

File metadata

  • Download URL: pybear_dask-0.2.3-py3-none-any.whl
  • Upload date:
  • Size: 40.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pybear_dask-0.2.3-py3-none-any.whl
Algorithm Hash digest
SHA256 f7aba36c2869407025238cef4c6de80ab4e1eb957ad5254971775d91c92c67aa
MD5 349d6dd03a6d967f538df8a8dc4fd134
BLAKE2b-256 0893bfa3adfefe89f82a807c3e647051c312a5e6c93c427328f9e11c62cbb1d3

See more details on using hashes here.

Provenance

The following attestation bundles were made for pybear_dask-0.2.3-py3-none-any.whl:

Publisher: pypi-publish.yml on PylarBear/pybear-dask

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page