Python modules for data analytics applications with dask
Project description
pybear-dask is a Python computing library that supplements the pybear library with analogous modules that have dask capability.
Website: https://github.com/PylarBear/pybear-dask
License
BSD 3-Clause License. See License File.
Installation
Dependencies
pybear-dask requires:
Python (>=3.10, <3.13)
dask (<2025.1.0)
dask-ml (<2025.1.0)
distributed (<2025.1.0)
pybear (>=0.2.0)
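As a quick sanity check before installing, the Python constraint listed above can be tested from the interpreter. This is a minimal sketch using only the standard library; the function name python_supported and the two constants are illustrative and not part of pybear-dask.

```python
import sys

# Supported range taken from the dependency list above: >=3.10, <3.13.
MIN_PYTHON = (3, 10)
MAX_PYTHON_EXCLUSIVE = (3, 13)


def python_supported(version_info=None):
    """Return True if the (major, minor) version is inside the supported range."""
    if version_info is None:
        version_info = sys.version_info
    major_minor = tuple(version_info[:2])
    return MIN_PYTHON <= major_minor < MAX_PYTHON_EXCLUSIVE


if __name__ == "__main__":
    print("Python supported:", python_supported())
```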
User installation
Install pybear-dask from the online PyPI package repository using pip:
(your-env) $ pip install pybear-dask
Conda distributions are expected to be made available sometime after release to PyPI.
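After installing, you may want to confirm which versions ended up in your environment. The sketch below uses the standard library's importlib.metadata and looks packages up by their distribution names as listed under Dependencies; the helper name installed_version is hypothetical.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Distribution names as they appear in the dependency list.
    for name in ("pybear-dask", "pybear", "dask", "dask-ml", "distributed"):
        print(f"{name}: {installed_version(name) or 'not installed'}")
```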
Usage
The folder structure of pybear-dask is nearly identical to that of scikit-learn, so users who are familiar with the scikit-learn layout and its import statements can transition easily to pybear-dask. The pybear-dask subfolders are base and model_selection.
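You can import pybear-dask’s packages the same way you would scikit-learn’s. The import name is pybear_dask, with an underscore, because a hyphen is not valid in a Python module name. Here are a few examples of how you could import and use pybear-dask modules:
# X and y are your dask data objects; constructor arguments are omitted here for brevity
from pybear_dask.model_selection import GSTCVDask
search = GSTCVDask()
search.fit(X, y)
# alternatively, import the subpackage and access the classes through it
from pybear_dask import model_selection as ms
search = ms.AutoGridSearchCVDask()
search.fit(X, y)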
Major Modules
AutoGridSearchCVDask
Perform multiple uninterrupted passes of grid search with dask_ml GridSearchCV and dask objects, using progressively narrower search grids.
Access via pybear_dask.model_selection.AutoGridSearchCVDask.
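To illustrate what "progressively narrower search grids" can mean, here is a toy sketch in plain Python. It is not pybear-dask's algorithm: the narrowing rule (re-gridding evenly between the neighbors of the best value) and the names narrow_grid and auto_search are assumptions made for illustration only.

```python
def narrow_grid(grid, best, points=5):
    """Return a finer, evenly spaced grid around `best`.

    The new grid spans from the neighbor below `best` to the neighbor above
    it (or starts/ends at `best` itself at the edges of the old grid).
    """
    grid = sorted(grid)
    i = grid.index(best)
    low = grid[max(i - 1, 0)]
    high = grid[min(i + 1, len(grid) - 1)]
    step = (high - low) / (points - 1)
    return [low + k * step for k in range(points)]


def auto_search(score, grid, passes=3, points=5):
    """Run several passes, narrowing the grid around the best value each time."""
    best = max(grid, key=score)
    for _ in range(passes - 1):
        grid = narrow_grid(grid, best, points)
        best = max(grid, key=score)
    return best


if __name__ == "__main__":
    # Toy objective with its maximum at x = 3.2.
    print(auto_search(lambda x: -(x - 3.2) ** 2, [0, 5, 10], passes=4))
```

Each pass only evaluates a handful of points, yet the best value moves closer to the optimum than the initial coarse grid allows.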
GSTCVDask (GridSearchThresholdCV for Dask)
Perform conventional grid search on a classifier with concurrent threshold search, using dask objects in parallel and distributed environments. Finds the global optimum over the passed parameters and thresholds. Fully compliant with the dask_ml/scikit-learn GridSearchCV API.
Access via pybear_dask.model_selection.GSTCVDask.
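The idea of a threshold search can be sketched in plain Python: given predicted probabilities for the positive class, score each candidate decision threshold and keep the best one. This is only a conceptual illustration for a single set of predictions, not the GSTCVDask implementation; the metric (balanced accuracy), the function names, and the sample data are all assumptions.

```python
def balanced_accuracy(y_true, y_pred):
    """Mean of recall on the positive class and recall on the negative class."""
    pos = [p for t, p in zip(y_true, y_pred) if t == 1]
    neg = [p for t, p in zip(y_true, y_pred) if t == 0]
    tpr = sum(1 for p in pos if p == 1) / len(pos)
    tnr = sum(1 for p in neg if p == 0) / len(neg)
    return (tpr + tnr) / 2


def best_threshold(y_true, proba, thresholds):
    """Return (threshold, score) with the highest score; ties keep the first."""
    best = None
    for th in thresholds:
        # Predict the positive class when the probability reaches the threshold.
        y_pred = [1 if p >= th else 0 for p in proba]
        score = balanced_accuracy(y_true, y_pred)
        if best is None or score > best[1]:
            best = (th, score)
    return best


if __name__ == "__main__":
    y_true = [0, 0, 0, 1, 1, 1]
    proba = [0.1, 0.3, 0.45, 0.4, 0.7, 0.9]
    print(best_threshold(y_true, proba, [0.25, 0.5, 0.75]))
```

A grid search with concurrent threshold search repeats this kind of scan for every parameter combination, so the threshold is tuned alongside the other parameters rather than fixed at 0.5.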
AutoGSTCVDask (AutoGridSearchThresholdCV for Dask)
Perform multiple uninterrupted passes of grid search with pybear-dask GSTCVDask, using progressively narrower search grids.
Access via pybear_dask.model_selection.AutoGSTCVDask.
Changelog
See the changelog for a history of notable changes to pybear-dask.
Development
Important links
Official source code repo: https://github.com/PylarBear/pybear-dask
Download releases: https://pypi.org/project/pybear-dask/
Issue tracker: https://github.com/PylarBear/pybear-dask/issues
Source code
You can clone the latest source code with the command:
git clone https://github.com/PylarBear/pybear-dask.git
Contributing
pybear-dask is not ready for contributions at this time!
Testing
pybear-dask 0.2 is tested via GitHub Actions to run on Linux, Windows, and macOS, with Python versions 3.10, 3.11, and 3.12. pybear-dask is not tested on earlier Python versions, but some features may work.
If you want to test pybear-dask yourself, you will need:
pytest (>=7.0.0)
The tests are not included in the PyPI pip installation. To get them, download the tarball from the pybear-dask project page on pypi.org or clone the pybear-dask repo from GitHub. Once you have the source files in a local project folder, create a poetry environment for the project and install the test dependencies. After installation, launch the poetry environment shell and run the test suite from the root of your pybear-dask project folder with:
(your-pybear-dask-env) you@your_computer:/path/to/pybear-dask/project$ pytest tests/
Project History
pybear-dask was spun off from the main pybear project just prior to the first public release of both. The split was made to ensure maximum stability for the main pybear project while keeping these modules available.
Help and Support
Documentation
Documentation is not expected to be made available via a website for this package. Use the documentation for the analogous modules in the main pybear package. See the repo for pybear: https://github.com/PylarBear/pybear/
Communication
GitHub Discussions: https://github.com/PylarBear/pybear-dask/discussions
Download files
File details
Details for the file pybear_dask-0.2.3.tar.gz.
File metadata
- Download URL: pybear_dask-0.2.3.tar.gz
- Upload date:
- Size: 72.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | d39507e1e96e61bccddf0416767c1fb1af6e15e4440a7ad764c1d9cf9decc9b6 |
| MD5 | ce3c4520ff37f948b430c660febaabae |
| BLAKE2b-256 | 88e498442516c38bd5b806b373a415f237068eabf3b036fc79566c2e7aa44173 |
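If you download a release file manually, you can compare its SHA256 digest with the published value. The following is a minimal sketch using the standard library's hashlib; the function names and the local file path in the usage section are hypothetical, and the expected digest is the SHA256 value listed above for pybear_dask-0.2.3.tar.gz.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected_hex):
    """Return True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of_file(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Hypothetical local path; adjust to wherever the file was downloaded.
    path = "pybear_dask-0.2.3.tar.gz"
    expected = "d39507e1e96e61bccddf0416767c1fb1af6e15e4440a7ad764c1d9cf9decc9b6"
    try:
        print("digest matches:", matches_published_digest(path, expected))
    except FileNotFoundError:
        print("file not found:", path)
```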
Provenance
The following attestation bundles were made for pybear_dask-0.2.3.tar.gz:
Publisher: pypi-publish.yml on PylarBear/pybear-dask
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pybear_dask-0.2.3.tar.gz
- Subject digest: d39507e1e96e61bccddf0416767c1fb1af6e15e4440a7ad764c1d9cf9decc9b6
- Sigstore transparency entry: 571088185
- Sigstore integration time:
- Permalink: PylarBear/pybear-dask@58352c931f463ed4976019dbcee6bacf61e71c9b
- Branch / Tag: refs/heads/main
- Owner: https://github.com/PylarBear
- Access: public
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pypi-publish.yml@58352c931f463ed4976019dbcee6bacf61e71c9b
- Trigger Event: workflow_dispatch
File details
Details for the file pybear_dask-0.2.3-py3-none-any.whl.
File metadata
- Download URL: pybear_dask-0.2.3-py3-none-any.whl
- Upload date:
- Size: 40.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | f7aba36c2869407025238cef4c6de80ab4e1eb957ad5254971775d91c92c67aa |
| MD5 | 349d6dd03a6d967f538df8a8dc4fd134 |
| BLAKE2b-256 | 0893bfa3adfefe89f82a807c3e647051c312a5e6c93c427328f9e11c62cbb1d3 |
Provenance
The following attestation bundles were made for pybear_dask-0.2.3-py3-none-any.whl:
Publisher: pypi-publish.yml on PylarBear/pybear-dask
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pybear_dask-0.2.3-py3-none-any.whl
- Subject digest: f7aba36c2869407025238cef4c6de80ab4e1eb957ad5254971775d91c92c67aa
- Sigstore transparency entry: 571088195
- Sigstore integration time:
- Permalink: PylarBear/pybear-dask@58352c931f463ed4976019dbcee6bacf61e71c9b
- Branch / Tag: refs/heads/main
- Owner: https://github.com/PylarBear
- Access: public
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pypi-publish.yml@58352c931f463ed4976019dbcee6bacf61e71c9b
- Trigger Event: workflow_dispatch