cuML - RAPIDS ML Algorithms (C++)

These details have not been verified by PyPI

Project links

Homepage

Environment
- GPU :: NVIDIA CUDA
Intended Audience
- Developers
Programming Language
- C++
Topic
- Scientific/Engineering

Project description

cuML - GPU Machine Learning Algorithms

cuML is a suite of libraries that implement machine learning algorithms and mathematical primitives functions that share compatible APIs with other RAPIDS projects.

cuML enables data scientists, researchers, and software engineers to run traditional tabular ML tasks on GPUs without going into the details of CUDA programming. In most cases, cuML's Python API matches the API from scikit-learn.

For large datasets, these GPU-based implementations can complete 10-50x faster than their CPU equivalents. For details on performance, see the cuML Benchmarks Notebook.

As an example, the following Python snippet loads input and computes DBSCAN clusters, all on GPU, using cuDF:

import cudf
from cuml.cluster import DBSCAN

# Create and populate a GPU DataFrame
gdf_float = cudf.DataFrame()
gdf_float['0'] = [1.0, 2.0, 5.0]
gdf_float['1'] = [4.0, 2.0, 1.0]
gdf_float['2'] = [4.0, 2.0, 1.0]

# Setup and fit clusters
dbscan_float = DBSCAN(eps=1.0, min_samples=1)
dbscan_float.fit(gdf_float)

print(dbscan_float.labels_)

Output:

0    0
1    1
2    2
dtype: int32

cuML also features multi-GPU and multi-node-multi-GPU operation, using Dask, for a growing list of algorithms. The following Python snippet reads input from a CSV file and performs a NearestNeighbors query across a cluster of Dask workers, using multiple GPUs on a single node:

Initialize a LocalCUDACluster configured with UCXX for fast transport of CUDA arrays

# Initialize UCX for high-speed transport of CUDA arrays
from dask_cuda import LocalCUDACluster

# Create a Dask single-node CUDA cluster w/ one worker per device
cluster = LocalCUDACluster(protocol="ucx",
                           enable_tcp_over_ucx=True,
                           enable_nvlink=True,
                           enable_infiniband=False)

Load data and perform k-Nearest Neighbors search. cuml.dask estimators also support Dask.Array as input:

from dask.distributed import Client
client = Client(cluster)

# Read CSV file in parallel across workers
import dask_cudf
df = dask_cudf.read_csv("/path/to/csv")

# Fit a NearestNeighbors model and query it
from cuml.dask.neighbors import NearestNeighbors
nn = NearestNeighbors(n_neighbors = 10, client=client)
nn.fit(df)
neighbors = nn.kneighbors(df)

For additional examples, browse our complete API documentation, or check out our example walkthrough notebooks. Finally, you can find complete end-to-end examples in the notebooks-contrib repo.

Supported Algorithms

Category	Algorithm	Notes
Clustering	Density-Based Spatial Clustering of Applications with Noise (DBSCAN)	Multi-node multi-GPU via Dask
	Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN)
	K-Means	Multi-node multi-GPU via Dask
	Single-Linkage Agglomerative Clustering
	Spectral Clustering
Dimensionality Reduction	Principal Components Analysis (PCA)	Multi-node multi-GPU via Dask
	Incremental PCA
	Truncated Singular Value Decomposition (tSVD)	Multi-node multi-GPU via Dask
	Uniform Manifold Approximation and Projection (UMAP)	Multi-node multi-GPU Inference via Dask
	Random Projection
	t-Distributed Stochastic Neighbor Embedding (TSNE)
	Spectral Embedding
Linear Models for Regression or Classification	Linear Regression (OLS)	Multi-node multi-GPU via Dask
	Linear Regression with Lasso or Ridge Regularization	Multi-node multi-GPU via Dask
	ElasticNet Regression
	LARS Regression	(experimental)
	Logistic Regression	Multi-node multi-GPU via Dask-GLM demo
	Naive Bayes	Multi-node multi-GPU via Dask
	Stochastic Gradient Descent (SGD), Coordinate Descent (CD), and Quasi-Newton (QN) (including L-BFGS and OWL-QN) solvers for linear models
Nonlinear Models for Regression or Classification	Random Forest (RF) Classification	Experimental multi-node multi-GPU via Dask
	Random Forest (RF) Regression	Experimental multi-node multi-GPU via Dask
	Inference for decision tree-based models	Forest Inference Library (FIL)
	K-Nearest Neighbors (KNN) Classification	Multi-node multi-GPU via Dask+UCXX, uses Faiss for Nearest Neighbors Query.
	K-Nearest Neighbors (KNN) Regression	Multi-node multi-GPU via Dask+UCXX, uses Faiss for Nearest Neighbors Query.
	Support Vector Machine Classifier (SVC)
	Epsilon-Support Vector Regression (SVR)
Preprocessing	Standardization, or mean removal and variance scaling / Normalization / Encoding categorical features / Discretization / Imputation of missing values / Polynomial features generation / and coming soon custom transformers and non-linear transformation	Based on Scikit-Learn preprocessing
Time Series	Holt-Winters Exponential Smoothing
	Auto-regressive Integrated Moving Average (ARIMA)	Supports seasonality (SARIMA)
Model Explanation	SHAP Kernel Explainer	Based on SHAP
	SHAP Permutation Explainer	Based on SHAP
Execution device interoperability		Run estimators interchangeably from host/cpu or device/gpu with minimal code change demo
Other	K-Nearest Neighbors (KNN) Search	Multi-node multi-GPU via Dask+UCXX, uses Faiss for Nearest Neighbors Query.

Installation

See the RAPIDS Release Selector for the command line to install either nightly or official release cuML packages via conda, pip, or Docker.

Build/Install from Source

See the build guide.

Scikit-learn Compatibility

cuML is compatible with scikit-learn version 1.4 or higher.

Model serialization and security

cuML models can be serialized with pickle or joblib and loaded later for inference. cuML uses cloudpickle so that models trained with cuml.accel can be loaded and used with scikit-learn.

Only unpickle or deserialize from trusted sources. The pickle module (and by extension joblib) is not secure: malicious payloads can execute arbitrary code during deserialization and compromise your system. Do not unpickle or load data from untrusted or tampered sources. This applies to pickle.load() / pickle.loads(), joblib.load(), and any file-based model loading. For details and patterns, see the Model Serialization and Persistence notebook and the Python pickle security documentation.

Contributing

Please see our guide for contributing to cuML.

References

The RAPIDS team has a number of blogs with deeper technical dives and examples. You can find them here on Medium.

For additional details on the technologies behind cuML, as well as a broader overview of the Python Machine Learning landscape, see Machine Learning in Python: Main developments and technology trends in data science, machine learning, and artificial intelligence (2020) by Sebastian Raschka, Joshua Patterson, and Corey Nolet.

Please consider citing this when using cuML in a project. You can use the citation BibTeX:

@article{raschka2020machine,
  title={Machine Learning in Python: Main developments and technology trends in data science, machine learning, and artificial intelligence},
  author={Raschka, Sebastian and Patterson, Joshua and Nolet, Corey},
  journal={arXiv preprint arXiv:2002.04803},
  year={2020}
}

Contact

Find out more details on the RAPIDS site

Open GPU Data Science

The RAPIDS suite of open source software libraries aim to enable execution of end-to-end data science and analytics pipelines entirely on GPUs. It relies on NVIDIA® CUDA® primitives for low-level compute optimization, but exposing that GPU parallelism and high-bandwidth memory speed through user-friendly Python interfaces.

Project details

These details have not been verified by PyPI

Project links

Homepage

Environment
- GPU :: NVIDIA CUDA
Intended Audience
- Developers
Programming Language
- C++
Topic
- Scientific/Engineering

Release history Release notifications | RSS feed

This version

26.4.0

Apr 9, 2026

26.2.0

Feb 6, 2026

25.12.0

Dec 11, 2025

25.10.0

Oct 9, 2025

25.8.0

Aug 7, 2025

25.6.0

Jun 6, 2025

25.4.0

Apr 10, 2025

25.2.1

Mar 3, 2025

25.2.0

Feb 14, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

libcuml_cu12-26.4.0-py3-none-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (469.1 MB view details)

Uploaded Apr 9, 2026 Python 3manylinux: glibc 2.27+ x86-64manylinux: glibc 2.28+ x86-64

libcuml_cu12-26.4.0-py3-none-manylinux_2_27_aarch64.manylinux_2_28_aarch64.whl (467.7 MB view details)

Uploaded Apr 9, 2026 Python 3manylinux: glibc 2.27+ ARM64manylinux: glibc 2.28+ ARM64

File details

Details for the file libcuml_cu12-26.4.0-py3-none-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.

File metadata

Download URL: libcuml_cu12-26.4.0-py3-none-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
Upload date: Apr 9, 2026
Size: 469.1 MB
Tags: Python 3, manylinux: glibc 2.27+ x86-64, manylinux: glibc 2.28+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.0.0 CPython/3.10.20

File hashes

Hashes for libcuml_cu12-26.4.0-py3-none-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
Algorithm	Hash digest
SHA256	`4586eb12d9c9888650b54a202efdd1341ac161ec9f77ba3c304c145873b551d3`
MD5	`4a43b70f0344a3713b4bd161e08a1a3f`
BLAKE2b-256	`84dd00031bd84a6cd42f028273ef0acab780d6bb5981a024c11fd1bcd66fdec0`

See more details on using hashes here.

File details

Details for the file libcuml_cu12-26.4.0-py3-none-manylinux_2_27_aarch64.manylinux_2_28_aarch64.whl.

File metadata

Download URL: libcuml_cu12-26.4.0-py3-none-manylinux_2_27_aarch64.manylinux_2_28_aarch64.whl
Upload date: Apr 9, 2026
Size: 467.7 MB
Tags: Python 3, manylinux: glibc 2.27+ ARM64, manylinux: glibc 2.28+ ARM64
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.0.0 CPython/3.10.20

File hashes

Hashes for libcuml_cu12-26.4.0-py3-none-manylinux_2_27_aarch64.manylinux_2_28_aarch64.whl
Algorithm	Hash digest
SHA256	`e25c23c0c44453a9e70173f1c25ed5fe2a16a8245d56e53e50cd1a60b2503ebe`
MD5	`2220034b6fe3a3fdac1b4a9e12e2c33a`
BLAKE2b-256	`3022cc64a376d50f985e862ed733dfe8c0f42e2c83fb03d1edddb7e937531c4c`

See more details on using hashes here.

libcuml-cu12 26.4.0

Navigation

Verified details

Owner

Unverified details

Project links

Meta

Classifiers

Project description

cuML - GPU Machine Learning Algorithms

Supported Algorithms

Installation

Build/Install from Source

Scikit-learn Compatibility

Model serialization and security

Contributing

References

Contact

Open GPU Data Science

Project details

Verified details

Owner

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distributions

File details

File metadata

File hashes

File details

File metadata

File hashes