Skip to main content

An expansion pack for scikit-learn

Project description

Epic sklearn — An expansion pack for scikit-learn

Epic-sklearn CI

What is it?

The epic-sklearn Python library is a companion to the scikit-learn library for machine learning. It provides additional components and utilities, which can make working within the scikit-learn framework even more convenient and productive.

The main difference in base-assumptions between scikit-learn and epic-sklearn is that epic-sklearn has pandas as a dependency. Moreover, most epic-sklearn components support pandas objects (DataFrame and Series) and "pass along" as much information as possible. For example, in most transformers, if the features matrix is provided as a DataFrame, the transformed matrix will also be a DataFrame, and the index (and columns, if applicable) will be preserved. There are also a few components specifically designed for working only with pandas objects.

Content Highlights

  • composite: Classifiers acting on other classifiers.
  • feature_selection:
    • mutual_info: Calculation of conditional mutual information between a feature and the target given another feature, and feature selection algorithms based on conditional mutual information.
  • metrics:
    • Metrics and scores for evaluating classification results and other data sets.
    • Also includes the leven module, allowing parallel computation of pairwise Levenshtein distances between python strings.
  • neighbors: Utilities relevant for nearest neighbors algorithms.
  • pipeline: Transformers for constructing transformation pipelines.
    • Contains a transformer that splits the samples based on a criterion, and applies different transformations on each sample group.
  • plot: Plotting utilities.
  • preprocessing:
    • categorical: Transformers for encoding and processing categorical data.
    • data: Transformers for binning and manipulating data distribution. Includes the Yeo–Johnson transformation.
    • general: General-purpose transformers (e.g. select DataFrame columns, apply a function in parallel, generate features from an iterator).
    • label: Utilities for encoding labels.
  • utils:
    • data: Generate random batches from data.
    • validation: Functions for input validation and normalization.
    • kneedle: Implementation of the "Kneedle in a Haystack" algorithm.
    • thresholding: A helper for setting and applying a threshold based on classification metrics.

Contributors

Thanks to Yaron Cohen for his contribution to this project.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

epic_sklearn-1.1.2.tar.gz (55.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

epic_sklearn-1.1.2-cp310-cp310-manylinux_2_39_x86_64.whl (192.9 kB view details)

Uploaded CPython 3.10manylinux: glibc 2.39+ x86-64

File details

Details for the file epic_sklearn-1.1.2.tar.gz.

File metadata

  • Download URL: epic_sklearn-1.1.2.tar.gz
  • Upload date:
  • Size: 55.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.5 CPython/3.13.1 Darwin/24.3.0

File hashes

Hashes for epic_sklearn-1.1.2.tar.gz
Algorithm Hash digest
SHA256 0accc0549ae6a7a61a70112382d2327ad8f3d8a0708255a5807018e9a54f66ad
MD5 4f64a26215eec64ff9698828480f3b10
BLAKE2b-256 a21eed6bf0a75e101670fa05b5356a4a58b9c555f636cd566bb1328037b309ef

See more details on using hashes here.

File details

Details for the file epic_sklearn-1.1.2-cp310-cp310-manylinux_2_39_x86_64.whl.

File metadata

File hashes

Hashes for epic_sklearn-1.1.2-cp310-cp310-manylinux_2_39_x86_64.whl
Algorithm Hash digest
SHA256 91c6683ff8f7d9df47dc816927dee69fb0716e2da972c9f81943731e0fa4f9b8
MD5 f4a33e196e15753c858ea8270b740473
BLAKE2b-256 d49233c4b88dac8b2aff8718691df3568067ed05c2897646810f8fa0e889fa28

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page