Skip to main content

CKKS Homomorphic Encryption backend with CUDA 12.4 GPU acceleration

Project description

English | 한국어

CuKKS

GPU-accelerated CKKS Homomorphic Encryption for PyTorch

Build Status License Python 3.10-3.13

Run trained PyTorch models on encrypted data — preserving privacy while maintaining accuracy.
Built on OpenFHE with CUDA acceleration.


Quick Start

import torch.nn as nn
import cukks

# 1. Define and train your model (standard PyTorch)
model = nn.Sequential(nn.Linear(784, 128), nn.ReLU(), nn.Linear(128, 10))

# 2. Convert to encrypted model (polynomial ReLU approximation)
enc_model, ctx = cukks.convert(model)

# 3. Run encrypted inference
enc_input = ctx.encrypt(test_input)
enc_output = enc_model(enc_input)
output = ctx.decrypt(enc_output)

Installation

Option 1: Install with extras (Recommended)

Install cukks and the GPU backend matching your PyTorch CUDA version in one command:

# CUDA 12.1 (choose the one matching your PyTorch CUDA version)
pip install cukks[cu121]
Command CUDA Supported GPUs
pip install cukks[cu118] 11.8 V100, T4, RTX 20/30/40xx, A100, H100
pip install cukks[cu121] 12.1 V100, T4, RTX 20/30/40xx, A100, H100
pip install cukks[cu124] 12.4 V100, T4, RTX 20/30/40xx, A100, H100
pip install cukks[cu128] 12.8 All above + RTX 50xx

Option 2: Check CUDA version first

import torch
print(torch.version.cuda)  # prints e.g., '12.1'

Then install with the matching extras command above.

Option 3: Install backend separately

# Install the backend first, then cukks
pip install cukks-cu121
pip install cukks

Or use the CLI for auto-detection:

pip install cukks
cukks-install-backend  # Auto-detects PyTorch CUDA and installs the matching backend
Post-install CLI & environment variables
cukks-install-backend             # Auto-detect & install
cukks-install-backend cu128       # Install specific backend
cukks-install-backend --status    # Show CUDA compatibility status
Variable Effect
CUKKS_BACKEND=cukks-cu128 Force a specific backend
Docker images
CUDA Compatible Docker Images
11.8 pytorch/pytorch:2.1.0-cuda11.8-cudnn8-runtime
12.1 pytorch/pytorch:2.2.0-cuda12.1-cudnn8-runtime
12.4 pytorch/pytorch:2.4.0-cuda12.4-cudnn9-runtime
12.8 nvidia/cuda:12.8.0-cudnn9-runtime-ubuntu22.04
docker run --gpus all -it pytorch/pytorch:2.2.0-cuda12.1-cudnn8-runtime bash
pip install cukks[cu121]  # Install for CUDA 12.1
Build from source
git clone https://github.com/devUuung/CuKKS.git && cd CuKKS
pip install -e .

# Build OpenFHE backend
cd openfhe-gpu-public && mkdir build && cd build
cmake .. -DWITH_CUDA=ON && make -j$(nproc)

cd ../../bindings/openfhe_backend
pip install -e .

Features

Feature Description
PyTorch API Familiar interface — just call cukks.convert(model)
GPU Acceleration CUDA-accelerated HE operations via OpenFHE
Auto Optimization BatchNorm folding, BSGS matrix multiplication
Wide Layer Support Linear, Conv2d, ReLU/GELU/SiLU, Pool, LayerNorm, Attention

Supported Layers

Layer Encrypted Version Notes
nn.Linear EncryptedLinear BSGS optimization
nn.Conv2d EncryptedConv2d im2col method
nn.ReLU/GELU/SiLU Polynomial approx Configurable degree
nn.AvgPool2d EncryptedAvgPool2d Rotation-based
nn.BatchNorm Folded Merged into prev layer
nn.LayerNorm EncryptedLayerNorm Polynomial approx
nn.Attention EncryptedApproxAttention seq_len=1
Full layer support table
PyTorch Layer Encrypted Version Notes
nn.Linear EncryptedLinear Full support with BSGS optimization
nn.Conv2d EncryptedConv2d Via im2col method
nn.ReLU EncryptedReLU Polynomial approximation
nn.GELU EncryptedGELU Polynomial approximation
nn.SiLU EncryptedSiLU Polynomial approximation
nn.Sigmoid EncryptedSigmoid Polynomial approximation
nn.Tanh EncryptedTanh Polynomial approximation
nn.AvgPool2d EncryptedAvgPool2d Full support
nn.MaxPool2d EncryptedMaxPool2d Approximate via polynomial
nn.Flatten EncryptedFlatten Logical reshape
nn.BatchNorm1d/2d Folded Merged into preceding layer
nn.Sequential EncryptedSequential Full support
nn.Dropout EncryptedDropout No-op during inference
nn.LayerNorm EncryptedLayerNorm Pure HE polynomial approximation
nn.MultiheadAttention EncryptedApproxAttention Polynomial softmax (seq_len=1)

Activation Functions

CKKS only supports polynomial operations. CuKKS approximates activations (ReLU, GELU, SiLU, etc.) using polynomial fitting:

# Default: degree-4 polynomial approximation (recommended)
enc_model, ctx = cukks.convert(model)

# Higher degree for better accuracy (costs more multiplicative depth)
enc_model, ctx = cukks.convert(model, activation_degree=8)

The default activation_degree=4 provides a good balance between accuracy and depth consumption. Higher degrees approximate the original activation more closely but require deeper circuits.

GPU Acceleration

Operation Accelerated
Add/Sub/Mul/Square ✅ GPU
Rotate/Rescale ✅ GPU
Bootstrap ✅ GPU
Encrypt/Decrypt CPU
from ckks.torch_api import CKKSContext, CKKSConfig

config = CKKSConfig(poly_mod_degree=8192, scale_bits=40)
ctx = CKKSContext(config, enable_gpu=True)  # GPU enabled by default

Examples

# Quick demo (no GPU required)
python -m cukks.examples.encrypted_inference --demo conversion

# MNIST encrypted inference
python examples/mnist_encrypted.py --hidden 64 --samples 5

Contributing

External contributions are welcome.

If you want to contribute to CuKKS, start here first:

  • Contributor workflow

  • CI/CD overview

  • start with an issue using the templates in .github/ISSUE_TEMPLATE/

  • open a PR using .github/pull_request_template.md

  • maintainers assign milestones and cut releases from closed milestones

Read:

CNN example
import torch.nn as nn
import cukks

class MNISTCNN(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv1 = nn.Conv2d(1, 8, kernel_size=3, padding=1)
        self.act1 = nn.ReLU()
        self.pool1 = nn.AvgPool2d(2)
        self.flatten = nn.Flatten()
        self.fc = nn.Linear(8 * 14 * 14, 10)
    
    def forward(self, x):
        return self.fc(self.flatten(self.pool1(self.act1(self.conv1(x)))))

model = MNISTCNN()
enc_model, ctx = cukks.convert(model)

enc_input = ctx.encrypt(image)
prediction = ctx.decrypt(enc_model(enc_input)).argmax()

Note: All operations in forward() must be layer attributes (e.g., self.act1), not inline operations like x ** 2.

Batch processing
# Pack multiple samples into a single ciphertext (SIMD)
samples = [torch.randn(784) for _ in range(8)]
enc_batch = ctx.encrypt_batch(samples)
enc_output = enc_model(enc_batch)
outputs = ctx.decrypt_batch(enc_output, num_samples=8)

Documentation

License

Apache License 2.0

Citation

@software{cukks,
  title = {CuKKS: PyTorch-compatible Encrypted Deep Learning},
  year = {2024},
  url = {https://github.com/devUuung/CuKKS}
}

Related

Libraries

Papers

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

cukks_cu124-0.1.4-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (6.2 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.27+ x86-64manylinux: glibc 2.28+ x86-64

cukks_cu124-0.1.4-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (6.2 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.27+ x86-64manylinux: glibc 2.28+ x86-64

cukks_cu124-0.1.4-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (6.2 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.27+ x86-64manylinux: glibc 2.28+ x86-64

cukks_cu124-0.1.4-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (6.2 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.27+ x86-64manylinux: glibc 2.28+ x86-64

File details

Details for the file cukks_cu124-0.1.4-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for cukks_cu124-0.1.4-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 f881a8aefac77c7a498886671bc0156e99798aec2ce02c64ebeb9f945a157d4b
MD5 14af02e7349041b659d8e95d4c304b3c
BLAKE2b-256 5bb2b90979ce8d47ce5480b66962ae1534843c3a17817d5b9e134b11a3816692

See more details on using hashes here.

Provenance

The following attestation bundles were made for cukks_cu124-0.1.4-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl:

Publisher: release-milestone.yml on devUuung/CuKKS

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file cukks_cu124-0.1.4-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for cukks_cu124-0.1.4-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 932432e9b4c679220de6a7b0948827a8a67d11feb0d884dd35ad19caab1f4438
MD5 d32a868faafe7a7a001c18421757505b
BLAKE2b-256 27bf4cd32fb594bb2ed545a03ef189f37b8a3ec1d12ad7d75c66b3461b93839d

See more details on using hashes here.

Provenance

The following attestation bundles were made for cukks_cu124-0.1.4-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl:

Publisher: release-milestone.yml on devUuung/CuKKS

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file cukks_cu124-0.1.4-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for cukks_cu124-0.1.4-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 3b90f6dc61bd652069143b40b3e1bccc50660074d5d60b20049219b45c8630c9
MD5 1c5771c4f2539aabe79b1fe0741406e3
BLAKE2b-256 77c3217ff284dafa9275b4b19283919e0959142ed4353311d38893137d364c67

See more details on using hashes here.

Provenance

The following attestation bundles were made for cukks_cu124-0.1.4-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl:

Publisher: release-milestone.yml on devUuung/CuKKS

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file cukks_cu124-0.1.4-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for cukks_cu124-0.1.4-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 282004e4ba138cabed0dc7479ca6c84457c2764a20907580f8898f7c92ee68a3
MD5 88525bbd31d9bc1f32f92c1dd5620969
BLAKE2b-256 2201aba91d23f6e7673a8547a58145e0ebec6923e96cd2af2c9c76d0a6758880

See more details on using hashes here.

Provenance

The following attestation bundles were made for cukks_cu124-0.1.4-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl:

Publisher: release-milestone.yml on devUuung/CuKKS

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page