Data exploration tool for assessing optimal classification methods
Project description
Polyssifier
Polyssifier runs a multitude of machine learning models on data. It reports scores, confusion matrices, predictions, and plots the scores ranked by classifier performance.
Installation
pip install polyssifier
How to use
For classification
from polyssifier import poly
# Load data
data = np.load("/path/to/data.npy")
label = np.load("/path/to/labels.npy")
# Run analysis
report = poly(data,label, n_folds=8)
# Plot results
report.plot_scores()
report.plot_features(ntop=10)
For Regression
from polyssifier import polyr
# Load data
data = np.load("/path/to/data.npy")
target = np.load("/path/to/target.npy")
# Run analysis
report = polyr(data, target, n_folds=8)
# Plot results
report.plot_scores()
report.plot_features(ntop=10)
Note for Windows
Windows and Linux multiprocessing are performed differently. To run polyssifier in Windows, an extra line of code must be added. The below example is for classification, but the same addition must also be made for regression.
from polyssifier import poly
# Load data
data = np.load("/path/to/data.npy")
label = np.load("/path/to/labels.npy")
# Run analysis
if name == '__main__':
report = poly(data,label, n_folds=8)
# Plot results
report.plot_scores()
report.plot_features(ntop=10)
In the terminal
poly data.npy label.npy --concurrency 10
Requirements
- Sklearn
- Numpy
- Pandas
Features
- Cross validated scores.
- Report f1 score (scoring='f1') or ROC (scoring='auc') for classification
- Report MSE or R^2 for regression
- Feature ranking for compatible models (Logistic Regression, Linear SVM, Random Forest)
- Parallel processing.
- Control the number of threads with 'concurrency'.
- We recommend setting concurrency to half the number of Cores in your system.
- Saves trained models for future use in case of server malfunction.
- Set project_name for identifying a experiment.
- Activate feature selection step setting
- feature_selection=True
- Automatically scales your data with scale=True
Example: on sample/example.ipynb
It includes the following classifiers:
- Multilayer Perceptron
- Nearest Neighbors
- Linear SVM
- RBF SVM
- Decision Tree
- Random Forest
- Logistic Regression
- Naive Bayes
- Voting Classifier
and the following regressors:
- Linear Regression
- Bayesian Ridge
- PassiveAggressiveRegressor
- GaussianProcessRegressor
- Ridge
- Lasso
- Lars
- LassoLars
- OrthogonalMatchingPursuit
- ElasticNet
You can exclude some of this models by providing a list of names as follows:
from polyssifier import poly
report = poly(data,label, n_folds=8,
exclude=['Multilayer Perceptron'])
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file polyssifier-1.0.15.tar.gz
.
File metadata
- Download URL: polyssifier-1.0.15.tar.gz
- Upload date:
- Size: 22.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.12.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 49442846e61c05ceef24e717e5f7821434344ec4a412a9ed72c678fa59d4cdb2 |
|
MD5 | 900efe36e17f24f8e42e2340a866e35b |
|
BLAKE2b-256 | 8f0ce34a893c23ea7f4058e2732fe09af6b0ba0fd4b97fe3e15d3f0ed74dd523 |
File details
Details for the file polyssifier-1.0.15-py2.py3-none-any.whl
.
File metadata
- Download URL: polyssifier-1.0.15-py2.py3-none-any.whl
- Upload date:
- Size: 20.3 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.12.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4bbb12097ac41919350a732d7d0fb889bff0f3952792c7df9ecc920b88aab84b |
|
MD5 | c8d6d2a8f9219e718e855ce604e0f8a4 |
|
BLAKE2b-256 | f659a971c787c17ee636760d9d0ccc0462acbe7028275a229c352075542c224c |