Skip to main content

an AutoML library that builds, optimizes, and evaluates machine learning pipelines using domain-specific objective functions

Project description

EvalML

Tests Documentation Status PyPI Version Anaconda Version PyPI Downloads


EvalML is an AutoML library which builds, optimizes, and evaluates machine learning pipelines using domain-specific objective functions.

Key Functionality

  • Automation - Makes machine learning easier. Avoid training and tuning models by hand. Includes data quality checks, cross-validation and more.
  • Data Checks - Catches and warns of problems with your data and problem setup before modeling.
  • End-to-end - Constructs and optimizes pipelines that include state-of-the-art preprocessing, feature engineering, feature selection, and a variety of modeling techniques.
  • Model Understanding - Provides tools to understand and introspect on models, to learn how they'll behave in your problem domain.
  • Domain-specific - Includes repository of domain-specific objective functions and an interface to define your own.

Installation

Install from PyPI:

pip install evalml

or from the conda-forge channel on conda:

conda install -c conda-forge evalml

Add-ons

Update checker - Receive automatic notifications of new Woodwork releases

PyPI:

pip install "evalml[updater]"

Conda:

conda install -c conda-forge alteryx-open-src-update-checker

Start

Load and split example data

import evalml
X, y = evalml.demos.load_breast_cancer()
X_train, X_test, y_train, y_test = evalml.preprocessing.split_data(X, y, problem_type='binary')

Run AutoML

from evalml.automl import AutoMLSearch
automl = AutoMLSearch(X_train=X_train, y_train=y_train, problem_type='binary')
automl.search()

View pipeline rankings

automl.rankings

Get best pipeline and predict on new data

pipeline = automl.best_pipeline
pipeline.predict(X_test)

Next Steps

Read more about EvalML on our documentation page:

Support

The EvalML community is happy to provide support to users of EvalML. Project support can be found in four places depending on the type of question:

  1. For usage questions, use Stack Overflow with the evalml tag.
  2. For bugs, issues, or feature requests start a Github issue.
  3. For discussion regarding development on the core library, use Slack.
  4. For everything else, the core developers can be reached by email at open_source_support@alteryx.com

Built at Alteryx

EvalML is an open source project built by Alteryx. To see the other open source projects we’re working on visit Alteryx Open Source. If building impactful data science pipelines is important to you or your business, please get in touch.

Alteryx Open Source

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

evalml_automining-0.0.2.tar.gz (6.4 MB view details)

Uploaded Source

Built Distribution

evalml_automining-0.0.2-py3-none-any.whl (6.6 MB view details)

Uploaded Python 3

File details

Details for the file evalml_automining-0.0.2.tar.gz.

File metadata

  • Download URL: evalml_automining-0.0.2.tar.gz
  • Upload date:
  • Size: 6.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for evalml_automining-0.0.2.tar.gz
Algorithm Hash digest
SHA256 8212096cd56a8e11d867f09758992cc4452b7b9b5cb730999ae56798b89ae4c4
MD5 e6d1f6972ccadb8750c07cbde812faa3
BLAKE2b-256 cdd6469bc9ff9bb737907a8c0d266f750bc2ef840b2137743735252e716342b3

See more details on using hashes here.

File details

Details for the file evalml_automining-0.0.2-py3-none-any.whl.

File metadata

File hashes

Hashes for evalml_automining-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 851ccf23ba97856f4785d6bf0590349725fc5912b5c05c2777da404ddb2a3727
MD5 fd52a1fea17bfd860b7722a6639ba489
BLAKE2b-256 362c04f9066e15f5ef8462b22e273e5d39a7f2a1578f74031973ebf460a0fab3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page