An expansion pack for scikit-learn
Project description
Epic sklearn — An expansion pack for scikit-learn
What is it?
The epic-sklearn Python library is a companion to the scikit-learn library for machine learning. It provides additional components and utilities, which can make working within the scikit-learn framework even more convenient and productive.
The main difference in base-assumptions between scikit-learn and epic-sklearn is that epic-sklearn
has pandas as a dependency. Moreover, most epic-sklearn components support pandas objects (DataFrame
and Series
) and "pass along" as much information as possible. For example, in most transformers,
if the features matrix is provided as a DataFrame
, the transformed matrix will also be a DataFrame
,
and the index
(and columns
, if applicable) will be preserved. There are also a few components specifically
designed for working only with pandas objects.
Content Highlights
- composite: Classifiers acting on other classifiers.
- feature_selection:
- mutual_info: Calculation of conditional mutual information between a feature and the target given another feature, and feature selection algorithms based on conditional mutual information.
- metrics:
- Metrics and scores for evaluating classification results and other data sets.
- Also includes the leven module, allowing parallel computation of pairwise Levenshtein distances between python strings.
- neighbors: Utilities relevant for nearest neighbors algorithms.
- pipeline: Transformers for constructing transformation pipelines.
- Contains a transformer that splits the samples based on a criterion, and applies different transformations on each sample group.
- plot: Plotting utilities.
- preprocessing:
- categorical: Transformers for encoding and processing categorical data.
- data: Transformers for binning and manipulating data distribution. Includes the Yeo–Johnson transformation.
- general: General-purpose transformers (e.g. select DataFrame columns, apply a function in parallel, generate features from an iterator).
- label: Utilities for encoding labels.
- utils:
- data: Generate random batches from data.
- general: Functions for input validation and normalization.
- kneedle: Implementation of the "Kneedle in a Haystack" algorithm.
- thresholding: A helper for setting and applying a threshold based on classification metrics.
Contributors
Thanks to Yaron Cohen for his contribution to this project.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file epic-sklearn-1.0.10.zip
.
File metadata
- Download URL: epic-sklearn-1.0.10.zip
- Upload date:
- Size: 86.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 948e3d68702183efbc430c1d03ec59451fd52ce4c59c739f8b6cc925c35b790c |
|
MD5 | b01e8294b31eb412430fedc0545d97ea |
|
BLAKE2b-256 | d2d1f590ffc8321776509f9b3b7050a9254f663b3f558b8f73fa3e9af01d3b37 |