Skip to main content

PyCaret - An open source, low-code machine learning library in Python.

Project description

drawing

An open-source, low-code machine learning library in Python
:rocket: Version 2.3.5 out now! Check out the release notes here.

OfficialDocsInstallTutorialsDiscussionsContributeResourcesMediumLinkedInYouTubeSlack

Python pytest on push Documentation Status PyPI version License

Slack

alt text

Welcome to PyCaret

PyCaret is an open-source, low-code machine learning library in Python that automates machine learning workflows. It is an end-to-end machine learning and model management tool that speeds up the experiment cycle exponentially and makes you more productive.

In comparison with the other open-source machine learning libraries, PyCaret is an alternate low-code library that can be used to replace hundreds of lines of code with few lines only. This makes experiments exponentially fast and efficient. PyCaret is essentially a Python wrapper around several machine learning libraries and frameworks such as scikit-learn, XGBoost, LightGBM, CatBoost, spaCy, Optuna, Hyperopt, Ray, and few more.

The design and simplicity of PyCaret are inspired by the emerging role of citizen data scientists, a term first used by Gartner. Citizen Data Scientists are power users who can perform both simple and moderately sophisticated analytical tasks that would previously have required more technical expertise.

Important Links
:star: Tutorials New to PyCaret? Checkout our official notebooks!
:clipboard: Example Notebooks Example notebooks created by community.
:orange_book: Blog Tutorials and articles by contributors.
:books: Documentation The detailed API docs of PyCaret
:tv: Video Tutorials Our video tutorial from various events.
:loudspeaker: Discussions Have questions? Engage with community and contributors.
:hammer_and_wrench: Changelog Changes and version history.
:deciduous_tree: Roadmap PyCaret's software and community development plan.

Installation

PyCaret's default installation only installs hard dependencies as listed in the requirements.txt file.

pip install pycaret

To install the full version:

pip install pycaret[full]

Supervised Workflow

Classification Regression

Unsupervised Workflow

Clustering Anomaly Detection

PyCaret ⚡NEW⚡ Time Series Module

PyCaret new time series module is now available in beta. Staying true to simplicity of PyCaret, it is consistent with our existing API and fully loaded with functionalities. Statistical testing, model training and selection (30+ algorithms), model analysis, automated hyperparameter tuning, experiment logging, deployment on cloud, and more. All of this with only few lines of code (just like the other modules of pycaret). If you would like to give it a try, checkout our official quick start notebook.

:books: Time Series Docs

:question: Time Series FAQs

:rocket: Features and Roadmap

The module is still in beta. We are adding new functionalities every day and doing weekly pip releases. Please ensure to create a separate python environment to avoid dependency conflicts with main pycaret. The final release of this module will be merged with the main pycaret in next major release.

pip install pycaret-ts-alpha

alt text

Who should use PyCaret?

PyCaret is an open source library that anybody can use. In our view the ideal target audience of PyCaret is:

  • Experienced Data Scientists who want to increase productivity.
  • Citizen Data Scientists who prefer a low code machine learning solution.
  • Data Science Professionals who want to build rapid prototypes.
  • Data Science and Machine Learning students and enthusiasts.

PyCaret on GPU

With PyCaret >= 2.2, you can train models on GPU and speed up your workflow by 10x. To train models on GPU simply pass use_gpu = True in the setup function. There is no change in the use of the API, however, in some cases, additional libraries have to be installed as they are not installed with the default version or the full version. As of the latest release, the following models can be trained on GPU:

  • Extreme Gradient Boosting (requires no further installation)
  • CatBoost (requires no further installation)
  • Light Gradient Boosting Machine requires GPU installation
  • Logistic Regression, Ridge Classifier, Random Forest, K Neighbors Classifier, K Neighbors Regressor, Support Vector Machine, Linear Regression, Ridge Regression, Lasso Regression requires cuML >= 0.15

License

PyCaret is completely free and open-source and licensed under the MIT license.

Contributors

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pycaret-ts-alpha-3.0.0.dev1643476561.tar.gz (438.0 kB view details)

Uploaded Source

Built Distribution

File details

Details for the file pycaret-ts-alpha-3.0.0.dev1643476561.tar.gz.

File metadata

  • Download URL: pycaret-ts-alpha-3.0.0.dev1643476561.tar.gz
  • Upload date:
  • Size: 438.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/4.10.1 pkginfo/1.8.2 requests/2.27.1 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.10

File hashes

Hashes for pycaret-ts-alpha-3.0.0.dev1643476561.tar.gz
Algorithm Hash digest
SHA256 19d2b4612060184e96d5701af172773ddb589ffceec99636a4da8bc5039cc892
MD5 6ab45a8147a8be6d7e7332fdef15dec2
BLAKE2b-256 81d2a34a69bb6e483d08bba3ba4b49a67a93279af0bf62ec8b9c665744daf073

See more details on using hashes here.

File details

Details for the file pycaret_ts_alpha-3.0.0.dev1643476561-py3-none-any.whl.

File metadata

  • Download URL: pycaret_ts_alpha-3.0.0.dev1643476561-py3-none-any.whl
  • Upload date:
  • Size: 498.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/4.10.1 pkginfo/1.8.2 requests/2.27.1 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.10

File hashes

Hashes for pycaret_ts_alpha-3.0.0.dev1643476561-py3-none-any.whl
Algorithm Hash digest
SHA256 02e3c7d6622e4a0aa2e4c61f5dba221a49eb86a58090d7269dd70eca172969e3
MD5 8d9346605039dbcc84839718bca2ed35
BLAKE2b-256 c28051ef875cf1b44a75afed7d10a88eddcb31a56106e697110fe15d5bbdae77

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page