PerMetrics: A Framework of Performance Metrics for Machine Learning Models

These details have not been verified by PyPI

Project links

Project description

PERMETRICS

PyPI - Python Version PyPI - Status PyPI - Downloads GitHub Release Date GitHub contributors

PerMetrics is a python library for performance metrics of machine learning models. We aim to implement all performance metrics for problems such as regression, classification, clustering, ... problems. Helping users in all field access metrics as fast as possible

Free software: GNU General Public License (GPL) V3 license
Total metrics: 111 (47 regression metrics, 20 classification metrics, 44 clustering metrics)
Documentation: https://permetrics.readthedocs.io/en/latest/
Python versions: >= 3.7.x
Dependencies: numpy, scipy

Notification

Currently, there is a huge misunderstanding among frameworks around the world about the notation of R, R2, and R^2.
Please read the file R-R2-Rsquared.docx to understand the differences between them and why there is such confusion.

My recommendation is to denote the Coefficient of Determination as COD or R2, while the squared Pearson's Correlation Coefficient should be denoted as R^2 or RSQ (as in Excel software).

Installation

Install with pip

Install the current PyPI release:

$ pip install permetrics==1.5.0

Or installing from the source code, use:

$ git clone https://github.com/thieu1995/permetrics.git
$ cd permetrics
$ python setup.py install

Or install the development version from GitHub:

pip install git+https://github.com/thieu1995/permetrics

After installation, you can import Permetrics as any other Python module:

$ python
>>> import permetrics
>>> permetrics.__version__

Let's go through some examples. The more complicated test case in the folder: examples

The documentation includes more detailed installation instructions and explanations.

Example with Regression metrics

import numpy as np
from permetrics import RegressionMetric

## For 1-D array
y_true = [3, -0.5, 2, 7]
y_pred = [2.5, 0.0, 2, 8]

evaluator = RegressionMetric(y_true, y_pred, decimal=5)
print(evaluator.RMSE())
print(evaluator.MSE())

## For > 1-D array
y_true = np.array([[0.5, 1], [-1, 1], [7, -6]])
y_pred = np.array([[0, 2], [-1, 2], [8, -5]])

evaluator = RegressionMetric(y_true, y_pred, decimal=5)
print(evaluator.RMSE(multi_output="raw_values", decimal=5))
print(evaluator.MAE(multi_output="raw_values", decimal=5))

Example with Classification metrics

from permetrics import ClassificationMetric

## For integer labels or categorical labels
y_true = [0, 1, 0, 0, 1, 0]
y_pred = [0, 1, 0, 0, 0, 1]

# y_true = ["cat", "ant", "cat", "cat", "ant", "bird", "bird", "bird"]
# y_pred = ["ant", "ant", "cat", "cat", "ant", "cat", "bird", "ant"]

evaluator = ClassificationMetric(y_true, y_pred, decimal=5)

## Call specific function inside object, each function has 3 names like below

print(evaluator.f1_score())
print(evaluator.F1S(average="micro"))
print(evaluator.F1S(average="macro"))
print(evaluator.F1S(average="weighted"))

Example with Clustering metrics

import numpy as np
from permetrics import ClusteringMetric

# generate sample data
X = np.random.uniform(-1, 10, size=(500, 7))        # 500 examples, 7 features
y_true = np.random.randint(0, 4, size=500)          # 4 clusters
y_pred = np.random.randint(0, 4, size=500)

evaluator = ClusteringMetric(y_true=y_true, y_pred=y_pred, X=X, decimal=5)

## Call specific function inside object, each function has 2 names (fullname and short name)
##    + Internal metrics: Need X and y_pred and has suffix as index
##    + External metrics: Need y_true and y_pred and has suffix as score

print(evaluator.ball_hall_index())
print(evaluator.BHI())

Metrics

Problem	STT	Metric	Metric Fullname	Characteristics
Regression	1	EVS	Explained Variance Score	Bigger is better (Best = 1), Range=(-inf, 1.0]
****	2	ME	Max Error	Smaller is better (Best = 0), Range=[0, +inf)
****	3	MBE	Mean Bias Error	Best = 0, Range=(-inf, +inf)
****	4	MAE	Mean Absolute Error	Smaller is better (Best = 0), Range=[0, +inf)
****	5	MSE	Mean Squared Error	Smaller is better (Best = 0), Range=[0, +inf)
****	6	RMSE	Root Mean Squared Error	Smaller is better (Best = 0), Range=[0, +inf)
****	7	MSLE	Mean Squared Log Error	Smaller is better (Best = 0), Range=[0, +inf)
****	8	MedAE	Median Absolute Error	Smaller is better (Best = 0), Range=[0, +inf)
****	9	MRE / MRB	Mean Relative Error / Mean Relative Bias	Smaller is better (Best = 0), Range=[0, +inf)
****	10	MPE	Mean Percentage Error	Best = 0, Range=(-inf, +inf)
****	11	MAPE	Mean Absolute Percentage Error	Smaller is better (Best = 0), Range=[0, +inf)
****	12	SMAPE	Symmetric Mean Absolute Percentage Error	Smaller is better (Best = 0), Range=[0, 1]
****	13	MAAPE	Mean Arctangent Absolute Percentage Error	Smaller is better (Best = 0), Range=[0, +inf)
****	14	MASE	Mean Absolute Scaled Error	Smaller is better (Best = 0), Range=[0, +inf)
****	15	NSE	Nash-Sutcliffe Efficiency Coefficient	Bigger is better (Best = 1), Range=(-inf, 1]
****	16	NNSE	Normalized Nash-Sutcliffe Efficiency Coefficient	Bigger is better (Best = 1), Range=[0, 1]
****	17	WI	Willmott Index	Bigger is better (Best = 1), Range=[0, 1]
****	18	R / PCC	Pearson’s Correlation Coefficient	Bigger is better (Best = 1), Range=[-1, 1]
****	19	AR / APCC	Absolute Pearson's Correlation Coefficient	Bigger is better (Best = 1), Range=[-1, 1]
****	20	RSQ/R2S	(Pearson’s Correlation Index) ^ 2	Bigger is better (Best = 1), Range=[0, 1]
****	21	R2 / COD	Coefficient of Determination	Bigger is better (Best = 1), Range=(-inf, 1]
****	22	AR2 / ACOD	Adjusted Coefficient of Determination	Bigger is better (Best = 1), Range=(-inf, 1]
****	23	CI	Confidence Index	Bigger is better (Best = 1), Range=(-inf, 1]
****	24	DRV	Deviation of Runoff Volume	Smaller is better (Best = 1.0), Range=[1, +inf)
****	25	KGE	Kling-Gupta Efficiency	Bigger is better (Best = 1), Range=(-inf, 1]
****	26	GINI	Gini Coefficient	Smaller is better (Best = 0), Range=[0, +inf)
****	27	GINI_WIKI	Gini Coefficient on Wikipage	Smaller is better (Best = 0), Range=[0, +inf)
****	28	PCD	Prediction of Change in Direction	Bigger is better (Best = 1.0), Range=[0, 1]
****	29	CE	Cross Entropy	Range(-inf, 0], Can't give comment about this
****	30	KLD	Kullback Leibler Divergence	Best = 0, Range=(-inf, +inf)
****	31	JSD	Jensen Shannon Divergence	Smaller is better (Best = 0), Range=[0, +inf)
****	32	VAF	Variance Accounted For	Bigger is better (Best = 100%), Range=(-inf, 100%]
****	33	RAE	Relative Absolute Error	Smaller is better (Best = 0), Range=[0, +inf)
****	34	A10	A10 Index	Bigger is better (Best = 1), Range=[0, 1]
****	35	A20	A20 Index	Bigger is better (Best = 1), Range=[0, 1]
****	36	A30	A30 Index	Bigger is better (Best = 1), Range=[0, 1]
****	37	NRMSE	Normalized Root Mean Square Error	Smaller is better (Best = 0), Range=[0, +inf)
****	38	RSE	Residual Standard Error	Smaller is better (Best = 0), Range=[0, +inf)
****	39	RE / RB	Relative Error / Relative Bias	Best = 0, Range=(-inf, +inf)
****	40	AE	Absolute Error	Best = 0, Range=(-inf, +inf)
****	41	SE	Squared Error	Smaller is better (Best = 0), Range=[0, +inf)
****	42	SLE	Squared Log Error	Smaller is better (Best = 0), Range=[0, +inf)
****	43	COV	Covariance	Bigger is better (No best value), Range=(-inf, +inf)
****	44	COR	Correlation	Bigger is better (Best = 1), Range=[-1, +1]
****	45	EC	Efficiency Coefficient	Bigger is better (Best = 1), Range=(-inf, +1]
****	46	OI	Overall Index	Bigger is better (Best = 1), Range=(-inf, +1]
****	47	CRM	Coefficient of Residual Mass	Smaller is better (Best = 0), Range=(-inf, +inf)
****	****	****	****	****
Classification	1	PS	Precision Score	Bigger is better (Best = 1), Range = [0, 1]
****	2	NPV	Negative Predictive Value	Bigger is better (Best = 1), Range = [0, 1]
****	3	RS	Recall Score	Bigger is better (Best = 1), Range = [0, 1]
****	4	AS	Accuracy Score	Bigger is better (Best = 1), Range = [0, 1]
****	5	F1S	F1 Score	Bigger is better (Best = 1), Range = [0, 1]
****	6	F2S	F2 Score	Bigger is better (Best = 1), Range = [0, 1]
****	7	FBS	F-Beta Score	Bigger is better (Best = 1), Range = [0, 1]
****	8	SS	Specificity Score	Bigger is better (Best = 1), Range = [0, 1]
****	9	MCC	Matthews Correlation Coefficient	Bigger is better (Best = 1), Range = [-1, +1]
****	10	HS	Hamming Score	Bigger is better (Best = 1), Range = [0, 1]
****	11	CKS	Cohen's kappa score	Bigger is better (Best = +1), Range = [-1, +1]
****	12	JSI	Jaccard Similarity Coefficient	Bigger is better (Best = +1), Range = [0, +1]
****	13	GMS	Geometric Mean Score	Bigger is better (Best = +1), Range = [0, +1]
****	14	ROC-AUC	ROC-AUC	Bigger is better (Best = +1), Range = [0, +1]
****	15	LS	Lift Score	Bigger is better (No best value), Range = [0, +inf)
****	16	GINI	GINI Index	Smaller is better (Best = 0), Range = [0, +1]
****	17	CEL	Cross Entropy Loss	Smaller is better (Best = 0), Range=[0, +inf)
****	18	HL	Hinge Loss	Smaller is better (Best = 0), Range=[0, +inf)
****	19	KLDL	Kullback Leibler Divergence Loss	Smaller is better (Best = 0), Range=[0, +inf)
****	20	BSL	Brier Score Loss	Smaller is better (Best = 0), Range=[0, +1]
****	****	****	****	****
Clustering	1	BHI	Ball Hall Index	Smaller is better (Best = 0), Range=[0, +inf)
****	2	XBI	Xie Beni Index	Smaller is better (Best = 0), Range=[0, +inf)
****	3	DBI	Davies Bouldin Index	Smaller is better (Best = 0), Range=[0, +inf)
****	4	BRI	Banfeld Raftery Index	Smaller is better (No best value), Range=(-inf, inf)
****	5	KDI	Ksq Detw Index	Smaller is better (No best value), Range=(-inf, +inf)
****	6	DRI	Det Ratio Index	Bigger is better (No best value), Range=[0, +inf)
****	7	DI	Dunn Index	Bigger is better (No best value), Range=[0, +inf)
****	8	CHI	Calinski Harabasz Index	Bigger is better (No best value), Range=[0, inf)
****	9	LDRI	Log Det Ratio Index	Bigger is better (No best value), Range=(-inf, +inf)
****	10	LSRI	Log SS Ratio Index	Bigger is better (No best value), Range=(-inf, +inf)
****	11	SI	Silhouette Index	Bigger is better (Best = 1), Range = [-1, +1]
****	12	SSEI	Sum of Squared Error Index	Smaller is better (Best = 0), Range = [0, +inf)
****	13	MSEI	Mean Squared Error Index	Smaller is better (Best = 0), Range = [0, +inf)
****	14	DHI	Duda-Hart Index	Smaller is better (Best = 0), Range = [0, +inf)
****	15	BI	Beale Index	Smaller is better (Best = 0), Range = [0, +inf)
****	16	RSI	R-squared Index	Bigger is better (Best=1), Range = (-inf, 1]
****	17	DBCVI	Density-based Clustering Validation Index	Bigger is better (Best=0), Range = [0, 1]
****	18	HI	Hartigan Index	Bigger is better (best=0), Range = [0, +inf)
****	19	MIS	Mutual Info Score	Bigger is better (No best value), Range = [0, +inf)
****	20	NMIS	Normalized Mutual Info Score	Bigger is better (Best = 1), Range = [0, 1]
****	21	RaS	Rand Score	Bigger is better (Best = 1), Range = [0, 1]
****	22	ARS	Adjusted Rand Score	Bigger is better (Best = 1), Range = [-1, 1]
****	23	FMS	Fowlkes Mallows Score	Bigger is better (Best = 1), Range = [0, 1]
****	24	HS	Homogeneity Score	Bigger is better (Best = 1), Range = [0, 1]
****	25	CS	Completeness Score	Bigger is better (Best = 1), Range = [0, 1]
****	26	VMS	V-Measure Score	Bigger is better (Best = 1), Range = [0, 1]
****	27	PrS	Precision Score	Bigger is better (Best = 1), Range = [0, 1]
****	28	ReS	Recall Score	Bigger is better (Best = 1), Range = [0, 1]
****	29	FmS	F-Measure Score	Bigger is better (Best = 1), Range = [0, 1]
****	30	CDS	Czekanowski Dice Score	Bigger is better (Best = 1), Range = [0, 1]
****	31	HGS	Hubert Gamma Score	Bigger is better (Best = 1), Range=[-1, +1]
****	32	JS	Jaccard Score	Bigger is better (Best = 1), Range = [0, 1]
****	33	KS	Kulczynski Score	Bigger is better (Best = 1), Range = [0, 1]
****	34	MNS	Mc Nemar Score	Bigger is better (No best value), Range=(-inf, +inf)
****	35	PhS	Phi Score	Bigger is better (No best value), Range = (-inf, +inf)
****	36	RTS	Rogers Tanimoto Score	Bigger is better (Best = 1), Range = [0, 1]
****	37	RRS	Russel Rao Score	Bigger is better (Best = 1), Range = [0, 1]
****	38	SS1S	Sokal Sneath1 Score	Bigger is better (Best = 1), Range = [0, 1]
****	39	SS2S	Sokal Sneath2 Score	Bigger is better (Best = 1), Range = [0, 1]
****	40	PuS	Purity Score	Bigger is better (Best = 1), Range = [0, 1]
****	41	ES	Entropy Score	Smaller is better (Best = 0), Range = [0, +inf)
****	42	TS	Tau Score	Bigger is better (No best value), Range = (-inf, +inf)
****	43	GAS	Gamma Score	Bigger is better (Best = 1), Range = [-1, 1]
****	44	GPS	Gplus Score	Smaller is better (Best = 0), Range = [0, 1]
****	****	****	****	****

Support (questions, problems)

Official channels

Official source code repo: https://github.com/thieu1995/permetrics
Official document: https://permetrics.readthedocs.io/
Download releases: https://pypi.org/project/permetrics/
Issue tracker: https://github.com/thieu1995/permetrics/issues
Notable changes log: https://github.com/thieu1995/permetrics/blob/master/ChangeLog.md
Official chat group: https://t.me/+fRVCJGuGJg1mNDg1
This project also related to our another projects which are "optimization" and "machine learning", check it here:

Citation Request

Please include these citations if you plan to use this library:

@software{nguyen_van_thieu_2023_8220489,
  author       = {Nguyen Van Thieu},
  title        = {PerMetrics: A Framework of Performance Metrics for Machine Learning Models},
  month        = aug,
  year         = 2023,
  publisher    = {Zenodo},
  doi          = {10.5281/zenodo.3951205},
  url          = {https://github.com/thieu1995/permetrics}
}

@article{van2023mealpy,
  title={MEALPY: An open-source library for latest meta-heuristic algorithms in Python},
  author={Van Thieu, Nguyen and Mirjalili, Seyedali},
  journal={Journal of Systems Architecture},
  year={2023},
  publisher={Elsevier},
  doi={10.1016/j.sysarc.2023.102871}
}

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

2.0.0

Feb 24, 2024

This version

1.5.0

Aug 25, 2023

1.4.3

Aug 12, 2023

1.4.2

Aug 7, 2023

1.4.1

Aug 4, 2023

1.4.0

Jul 27, 2023

1.3.3

Apr 5, 2023

1.3.2

Jan 6, 2023

1.3.1

Oct 2, 2022

1.3.0

May 23, 2022

1.2.2

Apr 8, 2022

1.2.1

Apr 2, 2022

1.2.0

Mar 25, 2022

1.1.3

Apr 26, 2021

1.1.0

Feb 26, 2021

1.0.4

Sep 23, 2020

1.0.3

Sep 22, 2020

1.0.2

Jul 26, 2020

1.0.1

Jul 23, 2020

1.0.0

Jul 19, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

permetrics-1.5.0.tar.gz (386.1 kB view details)

Uploaded Aug 25, 2023 Source

Built Distribution

permetrics-1.5.0-py3-none-any.whl (56.5 kB view details)

Uploaded Aug 25, 2023 Python 3

File details

Details for the file permetrics-1.5.0.tar.gz.

File metadata

Download URL: permetrics-1.5.0.tar.gz
Upload date: Aug 25, 2023
Size: 386.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for permetrics-1.5.0.tar.gz
Algorithm	Hash digest
SHA256	`582024a4233d8a56b6c172ae9e74864c27e7148e0d6e54b87bd1b720c1ddaf37`
MD5	`e0cf498e607b984060ffc19071b8ce01`
BLAKE2b-256	`af19b73aff4fa7ce08ad98c1d0eaff0d39cab2d6b8b8ff9a4b27e0b19180093f`

See more details on using hashes here.

File details

Details for the file permetrics-1.5.0-py3-none-any.whl.

File metadata

Download URL: permetrics-1.5.0-py3-none-any.whl
Upload date: Aug 25, 2023
Size: 56.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for permetrics-1.5.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8b967dd4e0dffb46346d4f06c156f16c8db3927d9b655106e68bf2e5cd562c1c`
MD5	`daed1c0a8ddfe2dd8f5530212ee69e63`
BLAKE2b-256	`0dde6ad1c0bdf5a9cbd59becca9e26d59335135a6bf5332073860cf4cda13f56`

See more details on using hashes here.

permetrics 1.5.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Notification

Installation

Install with pip

Example with Regression metrics

Example with Classification metrics

Example with Clustering metrics

Metrics

Support (questions, problems)

Official channels

Citation Request

Related Documents

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes