pybear

Python modules for miscellaneous data analytics applications

These details have been verified by PyPI

Project links

Owner

PylarBear

GitHub Statistics

These details have not been verified by PyPI

Project links

Documentation

Project description

Python packages that augment your data analytics experience.

pybear is a scikit-style Python computing library that augments data analytics functionality found in popular packages like scikit-learn and xgboost.

Python versions 3.10, 3.11, 3.12, 3.13, and 3.14 are supported.

See documentation for more information.

Website: https://pybear.readthedocs.io/en/stable/index.html

License

BSD 3-Clause License. See License file.

Installation

Dependencies

pybear operating on Python 3.14 requires:

Python (==3.14)
joblib (>=1.5.2)
numpy (>=2.3.2)
pandas (>=2.3.3)
polars (>=1.19.0)
psutil (>=7.1.2)
pyarrow (>=22.0.0)
scikit-learn (>=1.7.2)
scipy (>=1.16.1)
typing_extensions (>=4.6.0)

pybear operating on Python 3.13 requires:

Python (==3.13)
joblib (>=1.3.0)
numpy (>=2.1.0)
pandas (>=2.2.3)
polars (>=1.19.0)
psutil (>=5.9.4)
pyarrow (>=18.0.0)
scikit-learn (>=1.5.2)
scipy (>=1.14.1)
typing_extensions (>=4.6.0)

pybear operating on Python 3.12 requires:

Python (==3.12)
joblib (>=1.3.0)
numpy (>=1.26.0)
pandas (>=2.1.1)
polars (>=1.19.0)
psutil (>=5.9.4)
pyarrow (>=14.0.0)
scikit-learn (>=1.3.1)
scipy (>=1.12.0)
typing_extensions (>=4.6.0)

pybear operating on Python 3.11 requires:

Python (==3.11)
joblib (>=1.3.0)
numpy (>=1.23.3)
pandas (>=1.5.1)
polars (>=1.19.0)
psutil (>=5.9.4)
pyarrow (>=10.0.1)
scikit-learn (>=1.3.0)
scipy (>=1.12.0)
typing_extensions (>=4.6.0)

pybear operating on Python 3.10 requires:

Python (==3.10)
joblib (>=1.3.0)
numpy (>=1.23.3, <2.3)
pandas (>=1.3.4, <3.0)
polars (>=1.19.0)
psutil (>=5.9.0)
pyarrow (>=6.0.0)
scikit-learn (>=1.3.0,<1.8)
scipy (>=1.12.0,<1.16)
typing_extensions (>=4.6.0)

User installation

Install pybear from the online PyPI package repository using pip:

(your-env) $ pip install pybear

A Conda distribution is not expected to be made available anytime soon.

Usage

The folder structure of pybear is nearly identical to scikit-learn. This is so those that are familiar with the scikit layout and have experience with writing the associated import statements have an easy transition to pybear. The pybear subfolders are base, feature_extraction, model_selection, new_numpy, preprocessing, and utilities. For the full layout, see the API section of the pybear website on Read The Docs.

You can import pybear’s packages in the same way you would with scikit. Here are a few examples of how you could import and use pybear modules:

from pybear.preprocessing import InterceptManager as IM

trfm = IM()
trfm.fit(X, y)

from pybear import preprocessing as pp

trfm = pp.ColumnDeduplicator()
trfm.fit(X, y)

Major Modules

AutoGridSearchCV

Perform multiple uninterrupted passes of grid search with sci-kit learn GridSearchCV utilizing progressively narrower search grids.

Access via pybear.model_selection.AutoGridSearchCV.

autogridsearch_wrapper

Create your own auto-gridsearch class. A function that wraps any scikit-learn, pybear, or dask_ml GridSearchCV module to create an identical GridSearch class that performs multiple passes of grid search using progressively narrower search grids.

Access via pybear.model_selection.autogridsearch_wrapper.

GSTCV (GridSearchThresholdCV)

Perform conventional grid search on a classifier with concurrent threshold search. Finds the global optima for the passed parameters and thresholds. Fully compliant with the scikit-learn GridSearchCV API.

Access via pybear.model_selection.GSTCV.

AutoGSTCV

Perform multiple uninterrupted passes of grid search with pybear GSTCV utilizing progressively narrower search grids.

Access via pybear.model_selection.AutoGSTCV.

MinCountTransformer

Perform minimum frequency thresholding on numerical or categorical data simultaneously across an entire array of data. Violates the scikit-learn API in that datasets are modified along the example axis (examples may be deleted.) Otherwise is fully compliant with the sci-kit learn transformer API, with fit, transform, and partial_fit methods.

Access via pybear.preprocessing.MinCountTransformer.

ColumnDeduplicator

Identify and selectively remove duplicate columns in numerical or categorical data. Fully compliant with the scikit-learn transformer API, with fit, transform, and partial_fit methods. Perfect for removing duplicate columns from one-hot encoded data in a scikit-learn pipeline. Also fits and transforms data batch-wise, such as with dask_ml Incremental and ParallelPostFit wrappers.

Access via pybear.preprocessing.ColumnDeduplicator.

InterceptManager

A scikit-style transformer that identifies and manages constant columns in a dataset. IM can remove all, selectively keep one, or append a column of constants. Handles numerical & non-numerical data, and nan-like values. Does batch-wise fitting via a partial_fit method, and can be wrapped with dask_ml Incremental and ParallelPostFit wrappers.

Access via pybear.preprocessing.InterceptManager.

SlimPolyFeatures

Perform a polynomial feature expansion on a dataset omitting constant and duplicate columns. Follows the standard scikit-learn transformer API. Handles scipy sparse matrices/arrays. Suitable for sklearn pipelines. Has a partial_fit method for batch-wise training and can be wrapped with dask_ml Incremental and ParallelPostFit wrappers.

Access via pybear.preprocessing.SlimPolyFeatures.

The pybear Text Wrangling Suite

pybear has a wide selection of text wrangling tools for those who don’t have a PhD in NLP. Most modules have the dual capability of working with regular expressions or literal strings (for those who don’t know regular expressions!) Most of the modules also accept data in 1D list-like format or (ragged!) 2D array-like format. All of these are built in scikit transformer API style and can be stacked in a scikit pipeline.

These modules can be found in pybear.feature_extraction.text. The modules include:

Lexicon - A class exposing 68,000+ English words and a stop words attribute
NGramMerger - Join select adjacent tokens together to handle as a single token
StopRemover - Remove pybear stop words from a body of text
TextJoiner - Join tokenized text into a contiguous string with separators
TextJustifier - Justify to a fixed margin; wrap on literals or regex patterns
TextLookup - Compare words in a body of text against the pybear Lexicon
TextLookupRealTime - Same as TextLookup but with in-situ save capability
TextNormalizer - Normalize text to the same case
TextPadder - Pad ragged text into shaped containers using fill
TextRemover - Remove units of contiguous text
TextReplacer - Remove substrings from contiguous text
TextSplitter - Split contiguous text into tokens using literal strings or regex
TextStatistics - Compile statistics about a body of text
TextStripper - Remove leading and trailing spaces from text

Changelog

See the changelog for a history of notable changes to pybear.

Development

Important links

Official source code repo: https://github.com/PylarBear/pybear
Download releases: https://pypi.org/project/pybear/
Issue tracker: https://github.com/PylarBear/pybear/issues

Source code

You can clone the latest source code with the command:

git clone https://github.com/PylarBear/pybear.git

Contributing

For guidelines on how to contribute to pybear, see the CONTRIBUTING.rst file in the source code repository. You can also access the contributing guidelines on the contributing webpage of the pybear documentation website.

Testing

pybear 0.2 is tested via GitHub Actions to run on Linux, Windows, and MacOS, with Python versions 3.10, 3.11, 3.12, 3.13, and 3.14. pybear is not supported nor test tested on earlier versions.

If you want to test pybear yourself, you will need:

pytest (>=8.0.0) for Python version 3.14
pytest (>=7.0.0) for Python versions 3.13, 3.12, 3.11, and 3.10

The tests are not available in the PyPI pip installation. You can get the tests by downloading the tarball from the pybear project page on pypi.org or cloning the pybear repo from GitHub. Once you have the source files in a local project folder, create a poetry environment for the project and install the test dependencies. After installation, open the poetry environment shell and you can launch the test suite from the root of your pybear project folder with:

(your-pybear-env) you@your_computer:/path/to/pybear/project$ pytest tests/

Project History

The project originated in the early 2020’s as a collection of miscellaneous private modules to enhance the python data analytics ecosystem. In 2025, the modules were formalized and bundled together for their first release as pybear.

Help and Support

Documentation

HTML documentation: https://pybear.readthedocs.io/en/stable/index.html

Communication

GitHub Discussions: https://github.com/PylarBear/pybear/discussions

Project details

These details have been verified by PyPI

Project links

Owner

PylarBear

GitHub Statistics

These details have not been verified by PyPI

Project links

Documentation

Release history Release notifications | RSS feed

This version

0.2.5

Jul 7, 2026

0.2.4

Jun 10, 2026

0.2.3

Sep 30, 2025

0.2.2

Sep 8, 2025

0.2.1

Aug 16, 2025

0.2.0

Jul 28, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pybear-0.2.5.tar.gz (1.1 MB view details)

Uploaded Jul 7, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pybear-0.2.5-py3-none-any.whl (928.9 kB view details)

Uploaded Jul 7, 2026 Python 3

File details

Details for the file pybear-0.2.5.tar.gz.

File metadata

Download URL: pybear-0.2.5.tar.gz
Upload date: Jul 7, 2026
Size: 1.1 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pybear-0.2.5.tar.gz
Algorithm	Hash digest
SHA256	`42823873350a3ee838e6b270dc8e119397c7429325465f9f4f3b709ced910a3a`
MD5	`8e1167f37de343ae67793623a06b6076`
BLAKE2b-256	`9260b075e777500e1786ae34563f19c215ec9461b1ace39fc637506b90007e56`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pybear-0.2.5.tar.gz:

Publisher: pypi-publish.yml on PylarBear/pybear

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pybear-0.2.5.tar.gz
- Subject digest: 42823873350a3ee838e6b270dc8e119397c7429325465f9f4f3b709ced910a3a
- Sigstore transparency entry: 2101108372
- Sigstore integration time: Jul 7, 2026
Source repository:
- Permalink: PylarBear/pybear@1321c0a8eeedb26df44835a4441a7a47def67828
- Branch / Tag: refs/heads/main
- Owner: https://github.com/PylarBear
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pypi-publish.yml@1321c0a8eeedb26df44835a4441a7a47def67828
- Trigger Event: workflow_dispatch

File details

Details for the file pybear-0.2.5-py3-none-any.whl.

File metadata

Download URL: pybear-0.2.5-py3-none-any.whl
Upload date: Jul 7, 2026
Size: 928.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pybear-0.2.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9b109b2b633a84370553a02ed1eb96840002e8927b4e5ed55d8f1ac036fa6536`
MD5	`b17f74e96dc78dc404236c807fdbca12`
BLAKE2b-256	`6bd5e83150647a162a55557fde4bf2a7ddc3483c2b08375e4ccfec700163caf6`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pybear-0.2.5-py3-none-any.whl:

Publisher: pypi-publish.yml on PylarBear/pybear

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pybear-0.2.5-py3-none-any.whl
- Subject digest: 9b109b2b633a84370553a02ed1eb96840002e8927b4e5ed55d8f1ac036fa6536
- Sigstore transparency entry: 2101109053
- Sigstore integration time: Jul 7, 2026
Source repository:
- Permalink: PylarBear/pybear@1321c0a8eeedb26df44835a4441a7a47def67828
- Branch / Tag: refs/heads/main
- Owner: https://github.com/PylarBear
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pypi-publish.yml@1321c0a8eeedb26df44835a4441a7a47def67828
- Trigger Event: workflow_dispatch

pybear 0.2.5

Navigation

Verified details

Project links

Owner

GitHub Statistics

Unverified details

Project links

Meta

Classifiers

Project description

License

Installation

Dependencies

User installation

Usage

Major Modules

AutoGridSearchCV

autogridsearch_wrapper

GSTCV (GridSearchThresholdCV)

AutoGSTCV

MinCountTransformer

ColumnDeduplicator

InterceptManager

SlimPolyFeatures

The pybear Text Wrangling Suite

Related Resources

Changelog

Development

Important links

Source code

Contributing

Testing

Project History

Help and Support

Documentation

Communication

Project details

Verified details

Project links

Owner

GitHub Statistics

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance