AMLR - Auto Machine Learning Report

Create a beautiful Machine Learning Report with one line of code


Main Features:

  • Exploratory Data Analysis
    • Dataset Configuration
      • Shape
      • Detect the number of classes (Bernoulli or binary only, for now)
      • Duplicate observations are dropped automatically
      • You can drop duplicate observations manually as well
      • Automatically exclude features with the highest frequencies (names, IDs, FW keys, etc.)
    • Regression Analysis
    • Automatic Balance Classes
    • Correlation Analysis
    • Detecting Multicollinearity with VIF
    • Residual Analysis
  • Grid - Hyperparameter optimization
  • Partial dependence plot (PDP)
  • Individual Conditional Expectation (ICE)
  • Variable Importance by Model
  • AML - Partial Dependence
  • Ensemble - Individual Conditional Expectation (ICE)
  • Correlation Heatmap by Model
  • Model Performance
    • Analytical Performance Modeling
    • Comparative Metrics Table with:
      • Overall ACC
      • Kappa
      • Overall RACC
      • SOA1(Landis & Koch)
      • SOA2(Fleiss)
      • SOA3(Altman)
      • SOA4(Cicchetti)
      • SOA5(Cramer)
      • SOA6(Matthews)
      • TNR Macro
      • TPR Macro
      • FPR Macro
      • FNR Macro
      • PPV Macro
      • ACC Macro
      • F1 Macro
      • TNR Micro
      • FPR Micro
      • TPR Micro
      • FNR Micro
      • PPV Micro
      • F1 Micro
      • Scott PI
      • Gwet AC1
      • Bennett S
      • Kappa Standard Error
      • Kappa 95% CI
      • Chi-Squared
      • Phi-Squared
      • Cramer V
      • Chi-Squared DF
      • 95% CI
      • Standard Error
      • Response Entropy
      • Reference Entropy
      • Cross Entropy
      • Joint Entropy
      • Conditional Entropy
      • KL Divergence
      • Lambda B
      • Lambda A
      • Kappa Unbiased
      • Overall RACCU
      • Kappa No Prevalence
      • Mutual Information
      • Overall J
      • Hamming Loss
      • Zero-one Loss
      • NIR
      • P-Value
      • Overall CEN
      • Overall MCEN
      • Overall MCC
      • RR
      • CBA
      • AUNU
      • AUNP
      • RCI
      • Pearson C
      • CSI
      • ARI
      • Bangdiwala B
      • Krippendorff Alpha
    • The Best Algorithms Table
      • Automatically chooses the best model based on the metrics above
    • Confusion Matrix for all Models
    • Feature Importance for all models
    • Save all Models into a Pickle file
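As an illustration of the first entries in the metrics table, the sketch below computes Overall ACC, Overall RACC (the agreement expected by chance) and Cohen's Kappa from a confusion matrix. It is a minimal, pure-Python example and not AMLR's own implementation; the function name `agreement_metrics` and the sample matrix are assumptions made for this example.

```python
def agreement_metrics(matrix):
    """Return overall accuracy, random accuracy and Cohen's kappa.

    matrix[i][j] counts observations whose actual class is i and
    whose predicted class is j. (Hypothetical helper, not part of AMLR.)
    """
    n_classes = len(matrix)
    total = sum(sum(row) for row in matrix)

    # Observed agreement: share of observations on the diagonal.
    observed = sum(matrix[i][i] for i in range(n_classes)) / total

    # Chance agreement: product of row and column totals per class.
    expected = sum(
        sum(matrix[i]) * sum(row[i] for row in matrix)
        for i in range(n_classes)
    ) / total ** 2

    # Kappa rescales observed agreement by what chance alone would give.
    # It is undefined when expected == 1 (every observation in one class).
    kappa = (observed - expected) / (1 - expected)
    return {"Overall ACC": observed, "Overall RACC": expected, "Kappa": kappa}


# Illustrative binary confusion matrix (made-up counts).
metrics = agreement_metrics([[40, 10], [5, 45]])
```

For this made-up matrix the observed agreement is 0.85 and the chance agreement is 0.5, so Kappa is 0.7.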

How to Install

sudo apt-get install default-jre
pip install amlr
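
Because the install steps include a Java runtime, it can be useful to confirm that a `java` executable is reachable before creating a report. The check below is a small sketch using only the Python standard library; the helper name `java_available` is an assumption for this example and is not part of AMLR.

```python
import shutil


def java_available():
    """Return True when a `java` executable can be found on PATH."""
    # shutil.which returns the full path to the executable, or None.
    return shutil.which("java") is not None


if not java_available():
    print("java not found on PATH; install a JRE (e.g. default-jre) first")
```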



How to use

Syntax:

from amlr import amlr as rp
import webbrowser

rp = rp.report()
rp.create_report(dataset='data/titanic-passengers.csv', target='Survived', max_runtime_secs=1)

webbrowser.open('report/index.html')

Another option is to load your own dataset with pandas and pass the DataFrame to the AMLR report command. You cannot use both methods (dataset and data_frame) at once. The code will be:

import pandas as pd
# Load the CSV yourself, then pass the DataFrame instead of a file path
df = pd.read_csv('data/titanic-passengers.csv', sep=';')
rp.create_report(data_frame=df, target='Survived', max_runtime_secs=1)
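
The "one source or the other" rule can be pictured with a short sketch. This is not AMLR's code: `resolve_source` is a hypothetical helper, written under the assumption that exactly one of `dataset` or `data_frame` must be supplied.

```python
def resolve_source(dataset=None, data_frame=None):
    """Return ("file", path) or ("frame", df), rejecting zero or two sources.

    Hypothetical helper for illustration only; not part of AMLR.
    """
    # Both missing or both given are equally invalid.
    if (dataset is None) == (data_frame is None):
        raise ValueError("pass exactly one of `dataset` or `data_frame`")
    if dataset is not None:
        return ("file", dataset)
    return ("frame", data_frame)
```

Calling `resolve_source(dataset='data/titanic-passengers.csv')` returns the file branch, while passing both arguments raises a `ValueError`.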

Parameters

  • dataset: File to read by AMLR
  • data_frame: Pandas DataFrame
  • target: The target column
  • duplicated: Default True. Looks for duplicated rows
  • sep: Default ;. If the file is a CSV, you must specify the column separator character explicitly
  • exclude: Default True. A list of the columns to exclude from the process
  • max_runtime_secs: Default 1. Time limit, in seconds, for running the deep learning models

max_runtime_secs

When building a model, this option specifies the maximum runtime in seconds that you want to allot in order to complete the model. If this maximum runtime is exceeded before the model build is completed, then the model will fail.

Specifying max_runtime_secs=0 disables this option for production environments, thus allowing for an unlimited amount of runtime. If you just want to run a test, regardless of the results, use 1 second or a maximum of 61 seconds.
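
The failure mode of a runtime budget (a build that runs past its limit fails instead of returning a partial model) can be sketched in a few lines. This is a generic illustration, not how AMLR or its backend enforces the limit; `build_with_budget`, `BudgetExceeded` and the injectable `clock` argument are assumptions introduced for this example.

```python
import time


class BudgetExceeded(RuntimeError):
    """Raised when a build runs past its time budget (hypothetical name)."""


def build_with_budget(steps, max_runtime_secs, clock=time.monotonic):
    """Run each step in order; fail if the budget runs out before the last one.

    `steps` is a list of zero-argument callables standing in for the
    stages of a model build. `clock` is injectable so the behavior can
    be exercised without actually waiting.
    """
    start = clock()
    results = []
    for step in steps:
        # Check the elapsed time before starting the next stage.
        if clock() - start > max_runtime_secs:
            raise BudgetExceeded(
                f"build exceeded {max_runtime_secs} s before completing"
            )
        results.append(step())
    return results
```

With a generous budget every step runs and the results are returned; with a budget that expires midway, the call raises `BudgetExceeded` and no partial result is returned.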




We tested with the following dataset

  • Classic dataset on Titanic disaster
    • Bernoulli Distribution Target or Binary Classification
    • Download here: Titanic

Output

See the output here

This is an example of the test run with the Titanic dataset.



Enjoy!
