Skip to main content

An expansion pack for scikit-learn

Project description

Epic sklearn — An expansion pack for scikit-learn

Epic-sklearn CI

What is it?

The epic-sklearn Python library is a companion to the scikit-learn library for machine learning. It provides additional components and utilities, which can make working within the scikit-learn framework even more convenient and productive.

The main difference in base-assumptions between scikit-learn and epic-sklearn is that epic-sklearn has pandas as a dependency. Moreover, most epic-sklearn components support pandas objects (DataFrame and Series) and "pass along" as much information as possible. For example, in most transformers, if the features matrix is provided as a DataFrame, the transformed matrix will also be a DataFrame, and the index (and columns, if applicable) will be preserved. There are also a few components specifically designed for working only with pandas objects.

Content Highlights

  • composite: Classifiers acting on other classifiers.
  • feature_selection:
    • mutual_info: Calculation of conditional mutual information between a feature and the target given another feature, and feature selection algorithms based on conditional mutual information.
  • metrics: Metrics and scores for evaluating classification results and other data sets.
  • neighbors: Utilities relevant for nearest neighbors algorithms.
  • pipeline: Transformers for constructing transformation pipelines.
    • Contains a transformer that splits the samples based on a criterion, and applies different transformations on each sample group.
  • plot: Plotting utilities.
  • preprocessing:
    • categorical: Transformers for encoding and processing categorical data.
    • data: Transformers for binning and manipulating data distribution. Includes the Yeo–Johnson transformation.
    • general: General-purpose transformers (e.g. select DataFrame columns, apply a function in parallel, generate features from an iterator).
    • label: Utilities for encoding labels.
  • utils:
    • data: Generate random batches from data.
    • general: Functions for input validation and normalization.
    • kneedle: Implementation of the "Kneedle in a Haystack" algorithm.
    • thresholding: A helper for setting and applying a threshold based on classification metrics.

Contributors

Thanks to Yaron Cohen for his contribution to this project.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

epic-sklearn-1.0.zip (75.5 kB view details)

Uploaded Source

File details

Details for the file epic-sklearn-1.0.zip.

File metadata

  • Download URL: epic-sklearn-1.0.zip
  • Upload date:
  • Size: 75.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.0

File hashes

Hashes for epic-sklearn-1.0.zip
Algorithm Hash digest
SHA256 de40a02977e566729e9cbd799a65eec8ccd1fe623482f79fe4bd0a1f14ca9f8c
MD5 86de57b97b2a042bd05c5f0f3f77aa28
BLAKE2b-256 073f0dc75851f4b08b013a83672d6394ef9ed72230a4190592245d9d60a45c7f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page