Skip to main content

Python modules for data analytics applications with dask

Project description

Tests Coverage Test Status 313 Test Status 312 Test Status 311 Test Status 310

TestPyPI Build Status

PyPI Build Status PyPI Version PyPI Downloads

pybear-dask is a Python computing library that supplements the pybear library with analogous modules that have dask capability.

Website: https://github.com/PylarBear/pybear-dask

License

BSD 3-Clause License. See License File.


Installation

Dependencies

pybear-dask requires:

  • Python (>=3.10, <3.13)

  • dask (<2025.1.0)

  • dask-ml (<2025.1.0)

  • distributed (<2025.1.0)

  • pybear (>=0.1.22)

User installation

Install pybear-dask from the online PyPI package repository using pip:

(your-env) $ pip install pybear-dask

Conda distributions are expected to be made available sometime after release to PyPI.


Usage

The folder structure of pybear-dask is nearly identical to scikit-learn. This is so those that are familiar with the scikit layout and have experience with writing the associated import statements have an easy transition to pybear-dask. The pybear-dask subfolders are base and model_selection.

You can import pybear-dask’s packages in the same way you would with scikit. Here are a few examples of how you could import and use pybear-dask modules:

from pybear-dask.model_selection import GSTCVDask

search = GSTCVDask()
search.fit(X, y)

from pybear-dask import model_selection as ms

search = ms.AutoGridSearchCVDask()
search.fit(X, y)

Major Modules

AutoGridSearchCVDask

Perform multiple uninterrupted passes of grid search with dask_ml GridSearchCV and dask objects utilizing progressively narrower search grids.

  • Access via pybear-dask.model_selection.AutoGridSearchCVDask.

GSTCVDask (GridSearchThresholdCV for Dask)

Perform conventional grid search on a classifier with concurrent threshold search using dask objects in parallel and distributed environments. Finds the global optima for the passed parameters and thresholds. Fully compliant with the dask_ml/scikit-learn GridSearchCV API.

  • Access via pybear-dask.model_selection.GSTCVDask.

AutoGSTCVDask (AutoGridSearchThresholdCV for Dask)

Perform multiple uninterrupted passes of grid search with pybear-dask GSTCVDask utilizing progressively narrower search grids.

  • Access via pybear-dask.model_selection.AutoGSTCVDask.


Changelog

See the changelog for a history of notable changes to pybear-dask.


Development

Source code

You can clone the latest source code with the command:

git clone https://github.com/PylarBear/pybear-dask.git

Contributing

pybear-dask is not ready for contributions at this time!

Testing

pybear-dask 0.2 is tested via GitHub Actions to run on Linux, Windows, and MacOS, with Python versions 3.10, 3.11, and 3.12. pybear-dask is not tested on earlier versions, but some features may work.

If you want to test pybear-dask yourself, you will need:

  • pytest (>=7.0.0)

The tests are not available in the PyPI pip installation. You can get the tests by downloading the tarball from the pybear-dask project page on pypi.org or cloning the pybear-dask repo from GitHub. Once you have the source files in a local project folder, create a poetry environment for the project and install the test dependencies. After installation, launch the poetry environment shell and you can launch the test suite from the root of your pybear-dask project folder with:

(your-pybear-dask-env) you@your_computer:/path/to/pybear-dask/project$ pytest tests/

Project History

This project was spun off the main pybear project just prior to the first public release of both. pybear-dask was spun off to ensure maximum stability for the main pybear project, while keeping these modules available.

Help and Support

Documentation

Documentation is not expected to be made available via a website for this package. Use the documentation for similar packages in the main pybear package. See the repo for pybear: https://github.com/PylarBear/pybear/

Communication

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pybear_dask-0.2.0.tar.gz (73.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pybear_dask-0.2.0-py3-none-any.whl (40.3 kB view details)

Uploaded Python 3

File details

Details for the file pybear_dask-0.2.0.tar.gz.

File metadata

  • Download URL: pybear_dask-0.2.0.tar.gz
  • Upload date:
  • Size: 73.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.13.5 Linux/6.14.0-24-generic

File hashes

Hashes for pybear_dask-0.2.0.tar.gz
Algorithm Hash digest
SHA256 a4a671f2f52dea32076795db9b63ccf97d643a8f866fa51ba9ccd4d6cd5926c8
MD5 dbc0720496005aa0eec25f9737d7a975
BLAKE2b-256 cd7000297e78cbc0695d94699f797838c425b60550ee0901c71efa69792a9dea

See more details on using hashes here.

File details

Details for the file pybear_dask-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: pybear_dask-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 40.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.3 CPython/3.13.5 Linux/6.14.0-24-generic

File hashes

Hashes for pybear_dask-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 54637a1542f530c9d0c63fa14eda9620db0175a263f0afd1c917c09ac5778e95
MD5 690e147a9dd89132a382085afc02ad8b
BLAKE2b-256 d28faeb6d6e593f1aa8bd95b7e88ef16ee4e849fb41a7de9509405282f4bc0df

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page