Skip to main content

Nightly version of PyCaret - An open source, low-code machine learning library in Python.

Project description

This is a nightly version of the PyCaret library, intended as a preview of the upcoming 2.3.4 version. It may contain unstable and untested code. alt text

PyCaret 2.3

Python pytest on push Documentation Status PyPI version License

Slack

What is PyCaret?

PyCaret is an open-source, low-code machine learning library in Python that automates machine learning workflows. It is an end-to-end machine learning and model management tool that speeds up the experiment cycle exponentially and makes you more productive.

In comparison with the other open-source machine learning libraries, PyCaret is an alternate low-code library that can be used to replace hundreds of lines of code with few words only. This makes experiments exponentially fast and efficient. PyCaret is essentially a Python wrapper around several machine learning libraries and frameworks such as scikit-learn, XGBoost, LightGBM, CatBoost, spaCy, Optuna, Hyperopt, Ray, and many more.

The design and simplicity of PyCaret are inspired by the emerging role of citizen data scientists, a term first used by Gartner. Citizen Data Scientists are power users who can perform both simple and moderately sophisticated analytical tasks that would previously have required more expertise. Seasoned data scientists are often difficult to find and expensive to hire but citizen data scientists can be an effective way to mitigate this gap and address data-related challenges in the business setting.

PyCaret is a great library which not only simplifies the machine learning tasks for citizen data scientists but also helps new startups to reduce the cost of investing in a team of data scientists. Therefore, this library has not only helped the citizen data scientists but has also helped individuals who want to start exploring the field of data science, having no prior knowledge in this field. The Initial idea of PyCaret was inspired by Caret library in R.

alt text

Current Release

PyCaret 2.3.4 is now available. See 2.3.4 release notes. The easiest way to install pycaret is using pip.

pip install pycaret

PyCaret's default installation is a slim version of pycaret which only installs hard dependencies that are listed in requirements.txt. To install the full version of pycaret, use the following command:

pip install pycaret[full]

PyCaret on GPU

PyCaret >= 2.2 provides the option to use GPU for select model training and hyperparameter tuning. There is no change in the use of the API, however, in some cases, additional libraries have to be installed as they are not installed with the default slim version or the full version. The following estimators can be trained on GPU.

  • Extreme Gradient Boosting (requires no further installation)

  • CatBoost (requires no further installation)

  • Light Gradient Boosting Machine (requires GPU installation: https://lightgbm.readthedocs.io/en/latest/GPU-Tutorial.html)

  • Logistic Regression, Ridge Classifier, Random Forest, K Neighbors Classifier, K Neighbors Regressor, Support Vector Machine, Linear Regression, Ridge Regression, Lasso Regression (requires cuML >= 0.15 https://github.com/rapidsai/cuml)

If you are using Google Colab you can install Light Gradient Boosting Machine for GPU but first you have to uninstall LightGBM on CPU. Use the below command to do that:

pip uninstall lightgbm -y

# install lightgbm GPU
pip install lightgbm --install-option=--gpu --install-option="--opencl-include-dir=/usr/local/cuda/include/" --install-option="--opencl-library=/usr/local/cuda/lib64/libOpenCL.so"

CatBoost is only enabled on GPU when dataset has > 50,000 rows.

cuML >= 0.15 cannot be installed on Google Colab. Instead use blazingSQL (https://blazingsql.com/) which comes pre-installed with cuML 0.15. Use following command to install pycaret:

# install pycaret on blazingSQL
!/opt/conda-environments/rapids-stable/bin/python -m pip install --upgrade pycaret

Important Links

Who should use PyCaret?

PyCaret is an open source library that anybody can use. In our view the ideal target audience of PyCaret is:

  • Experienced Data Scientists who want to increase productivity.
  • Citizen Data Scientists who prefer a low code machine learning solution.
  • Data Science Students.
  • Data Science Professionals who want to build rapid prototypes.

Contributors

Made with contributors-img.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pycaret-nightly-2.3.4.dev1634864022.tar.gz (248.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pycaret_nightly-2.3.4.dev1634864022-py3-none-any.whl (285.7 kB view details)

Uploaded Python 3

File details

Details for the file pycaret-nightly-2.3.4.dev1634864022.tar.gz.

File metadata

  • Download URL: pycaret-nightly-2.3.4.dev1634864022.tar.gz
  • Upload date:
  • Size: 248.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.7

File hashes

Hashes for pycaret-nightly-2.3.4.dev1634864022.tar.gz
Algorithm Hash digest
SHA256 acd3bbf9835b5b4657f27bf52d50b3c27fb7fa2135c1f83bb4b705f50b7021ea
MD5 01af9a92f3d820c921113c6d3ad6ee8e
BLAKE2b-256 a209d4dd0f5104835bc574185f02d4069b072fbfea22c23637818fbdb34ead03

See more details on using hashes here.

File details

Details for the file pycaret_nightly-2.3.4.dev1634864022-py3-none-any.whl.

File metadata

  • Download URL: pycaret_nightly-2.3.4.dev1634864022-py3-none-any.whl
  • Upload date:
  • Size: 285.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.7

File hashes

Hashes for pycaret_nightly-2.3.4.dev1634864022-py3-none-any.whl
Algorithm Hash digest
SHA256 5cce541ff804238785e9cf824ca925488940711bac320c786db377c0b2464ac1
MD5 5076356fc4ada96633bcf9c0fe1bb530
BLAKE2b-256 d35c1e114b4f0613aa0b43c9d0bb1fe60d0be8ee085ab925292885fea23f8182

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page