CKKS Homomorphic Encryption backend with CUDA 12.1 GPU acceleration
Project description
CuKKS
GPU-accelerated CKKS Homomorphic Encryption for PyTorch
Run trained PyTorch models on encrypted data — no decryption needed, no privacy compromised.
Built on OpenFHE with CUDA acceleration.
Why CuKKS?
Traditional machine learning requires access to raw input data — a privacy risk for sensitive domains like healthcare, finance, and biometrics. CuKKS lets you deploy models that never see plaintext:
User: encrypt(input) → [ciphertext] → Server: model([ciphertext]) → [encrypted output] → User: decrypt
The server performs full inference without ever decrypting the data. CuKKS makes this practical with:
- One-line conversion —
cukks.convert(model)transforms any trained PyTorch model - GPU acceleration — CUDA-accelerated HE operations via OpenFHE
- 37 layer types — from Linear and Conv2d to Attention, GroupNorm, and ConvTranspose2d
Quick Start
import torch.nn as nn
import cukks
# 1. Train your model normally
model = nn.Sequential(nn.Linear(784, 128), nn.ReLU(), nn.Linear(128, 10))
# 2. Convert to encrypted model
enc_model, ctx = cukks.convert(model)
# 3. Run encrypted inference
enc_input = ctx.encrypt(test_input)
enc_output = enc_model(enc_input)
output = ctx.decrypt(enc_output) # Same result, never decrypted on server
Installation
pip install cukks[cu121] # Match your PyTorch CUDA version: cu118, cu121, cu124, cu128
| Command | CUDA | Compute Capability |
|---|---|---|
pip install cukks[cu118] |
11.8 | sm_50 – sm_90 (Maxwell ~ Hopper) |
pip install cukks[cu121] |
12.1 | sm_50 – sm_90 (Maxwell ~ Hopper) |
pip install cukks[cu124] |
12.4 | sm_50 – sm_90a (Maxwell ~ Hopper) |
pip install cukks[cu128] |
12.8 | sm_50 – sm_100 (Maxwell ~ Blackwell) |
Not sure which CUDA version? Run python -c "import torch; print(torch.version.cuda)".
Docker, CLI tools, and building from source
Docker
docker run --gpus all -it pytorch/pytorch:2.2.0-cuda12.1-cudnn8-runtime bash
pip install cukks[cu121]
Auto-install CLI
pip install cukks
cukks-install-backend # Auto-detects PyTorch CUDA and installs matching backend
Build from source
git clone https://github.com/devUuung/CuKKS.git && cd CuKKS
pip install -e .
# Build OpenFHE backend
cd openfhe-gpu-public && mkdir build && cd build
cmake .. -DWITH_CUDA=ON && make -j$(nproc)
cd ../../bindings/openfhe_backend
pip install -e .
Key Features
Drop-in Model Conversion
No model rewriting. No custom HE code. Just call cukks.convert(model):
# Any PyTorch model — MLP, CNN, Transformer — converts automatically
enc_model, ctx = cukks.convert(model, activation_degree=4)
BatchNorm folding, BSGS matrix multiplication, and CNN optimizations are applied automatically.
37 Supported Layer Types
| Category | Layers |
|---|---|
| Linear | Linear, BlockDiagonalLinear, BlockDiagLowRankLinear |
| Convolution | Conv1d, Conv2d, ConvTranspose2d |
| Pooling | AvgPool2d, MaxPool2d, AdaptiveAvgPool2d |
| Normalization | LayerNorm, GroupNorm, InstanceNorm1d/2d, BatchNorm (folded) |
| Activation | ReLU, GELU, SiLU, Sigmoid, Tanh, Square |
| Attention | MultiheadAttention (seq_len ≤ 8) |
| Embedding | Embedding |
| Spatial | Upsample, PixelShuffle, PixelUnshuffle |
| Padding | ZeroPad2d, ConstantPad2d, ReflectionPad2d, ReplicationPad2d |
| Other | Flatten, Dropout, Sequential, ResidualBlock |
GPU-Accelerated HE Operations
All core HE operations run on GPU:
| Operation | GPU |
|---|---|
| Add / Sub / Mul / Square | ✅ |
| Rotate / Rescale | ✅ |
| Bootstrap | ✅ |
| Plaintext cache | ✅ |
Polynomial Activation Approximation
CKKS only supports polynomial operations. CuKKS approximates non-polynomial activations (ReLU, GELU, SiLU, etc.) using Chebyshev polynomial fitting:
# Default: degree-4 (good accuracy / depth balance)
enc_model, ctx = cukks.convert(model)
# Higher degree for better accuracy (costs more depth)
enc_model, ctx = cukks.convert(model, activation_degree=8)
Packed Batch Inference
Process multiple samples in a single ciphertext:
samples = [torch.randn(784) for _ in range(8)]
enc_batch = ctx.encrypt_batch(samples)
enc_output = enc_model(enc_batch)
outputs = ctx.decrypt_batch(enc_output, sample_shape=(8,))
Examples
# MNIST classification (MLP)
python examples/mnist_encrypted.py --hidden 64 --samples 5
# UNet-style segmentation (ConvTranspose2d, AdaptiveAvgPool2d, Upsample)
python examples/unet_encrypted.py --samples 2
# ResNet-style classification (GroupNorm, AdaptiveAvgPool2d)
python examples/resnet_encrypted.py --samples 1
# Transformer-style NLP (Embedding, LayerNorm)
python examples/transformer_encrypted.py --samples 2
See examples/ for full scripts.
Benchmarking
CuKKS includes a benchmark suite for measuring encrypted inference performance:
# Run all benchmarks
python benchmarks/run_benchmarks.py
# Benchmark a specific model
python benchmarks/run_benchmarks.py --model mlp
# Save results to JSON
python benchmarks/run_benchmarks.py --output results.json
Supported models:
| Model | Params | Input | Architecture |
|---|---|---|---|
| MLP | 50,890 | (1, 784) | Linear(784→64) → ReLU → Linear(64→10) |
| CNN | 15,770 | (1, 1, 28, 28) | Conv2d(1→8, 3×3) → ReLU → AvgPool2d(2) → Linear(1568→10) |
| ResNet | 1,300 | (1, 1, 8, 8) | Conv2d(1→8, 3×3) → GroupNorm → ReLU → Conv2d(8→16, 1×1) → GroupNorm → ReLU → AdaptiveAvgPool2d(4×4) → Linear(256→4) |
| Transformer | 212 | (1, 8) | Linear(8→16) → ReLU → Linear(16→4) |
Example output:
Model Plain (ms) Encrypted (ms) Overhead MAE
--------------------------------------------------------------
mlp 0.01 84.16 6858 x 0.075594
cnn 0.03 2820.15 94053 x 0.048294
resnet 0.04 11444.55 269018 x 0.098026
transformer 0.01 54.27 7794 x 0.087978
Note: Benchmarks require a GPU backend with OpenFHE. Run on a machine with CUDA support for accurate timing.
Supported Layers
| PyTorch Layer | Encrypted Version | Notes |
|---|---|---|
nn.Linear |
EncryptedLinear |
BSGS optimization |
nn.Conv1d |
EncryptedConv1d |
1D im2col |
nn.Conv2d |
EncryptedConv2d |
im2col, BSGS |
nn.ConvTranspose2d |
EncryptedConvTranspose2d |
Transposed convolution |
nn.ReLU |
EncryptedReLU |
Polynomial approx |
nn.GELU |
EncryptedGELU |
Polynomial approx |
nn.SiLU |
EncryptedSiLU |
Polynomial approx |
nn.Sigmoid |
EncryptedSigmoid |
Polynomial approx |
nn.Tanh |
EncryptedTanh |
Polynomial approx |
nn.AvgPool2d |
EncryptedAvgPool2d |
Rotation-based |
nn.MaxPool2d |
EncryptedMaxPool2d |
Polynomial approx |
nn.AdaptiveAvgPool2d |
EncryptedAdaptiveAvgPool2d |
Global-pool fast path |
nn.Flatten |
EncryptedFlatten |
Logical reshape |
nn.BatchNorm1d/2d |
Folded | Merged into preceding layer |
nn.GroupNorm |
EncryptedGroupNorm |
Per-group polynomial |
nn.InstanceNorm1d/2d |
EncryptedInstanceNorm1d/2d |
Per-channel polynomial |
nn.LayerNorm |
EncryptedLayerNorm |
Polynomial 1/sqrt |
nn.MultiheadAttention |
EncryptedApproxAttention |
seq_len ≤ 8 |
nn.Embedding |
EncryptedEmbedding |
One-hot matmul |
nn.Upsample |
EncryptedUpsample |
Nearest / bilinear |
nn.PixelShuffle |
EncryptedPixelShuffle |
Channel-to-spatial |
nn.PixelUnshuffle |
EncryptedPixelUnshuffle |
Spatial-to-channel |
nn.ZeroPad2d |
EncryptedZeroPad2d |
Scatter matrix |
nn.ConstantPad2d |
EncryptedConstantPad2d |
Scatter + constant |
nn.ReflectionPad2d |
EncryptedReflectionPad2d |
Reflection mapping |
nn.ReplicationPad2d |
EncryptedReplicationPad2d |
Replication mapping |
nn.Sequential |
EncryptedSequential |
Full support |
nn.Dropout |
EncryptedDropout |
No-op during inference |
nn.ResidualBlock |
EncryptedResidualBlock |
Skip connection |
Documentation
License
Apache License 2.0
Citation
@software{cukks,
title = {CuKKS: PyTorch-compatible Encrypted Deep Learning},
year = {2024},
url = {https://github.com/devUuung/CuKKS}
}
Related
Libraries
- OpenFHE — Underlying HE library
- Microsoft SEAL — Alternative HE library
Papers
- Homomorphic Encryption for Arithmetic of Approximate Numbers — Cheon et al. (CKKS)
- Bootstrapping for Approximate Homomorphic Encryption — Cheon et al.
- Faster Homomorphic Linear Transformations in HElib — Halevi & Shoup (BSGS)
- GAZELLE: A Low Latency Framework for Secure Neural Network Inference — Juvekar et al. (Convolution)
- PP-STAT: An Efficient Privacy-Preserving Statistical Analysis Framework using Homomorphic Encryption — Choi (Encrypted Statistics)
- STIP: Efficient and Secure Non-Interactive Transformer Inference via Compact Packing — Wang et al. (Packed Attention)
- Efficient Bootstrapping for Approximate Homomorphic Encryption — Bossuat et al. (Double-Hoisting)
- On the Number of Nonscalar Multiplications Necessary to Evaluate Polynomials — Paterson & Stockmeyer (Polynomial Evaluation)
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file cukks_cu121-0.2.2-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.
File metadata
- Download URL: cukks_cu121-0.2.2-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
- Upload date:
- Size: 6.5 MB
- Tags: CPython 3.13, manylinux: glibc 2.27+ x86-64, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ac0f2878ea6811fa1366cc3adafee4df545422cfd5bcf8e38566d4a6ee5b62d6
|
|
| MD5 |
9e0fd4bfcf9ab74e8c5ed5426a584284
|
|
| BLAKE2b-256 |
2aeb4424aa2805c5f1286dfef2e4e300827b7a23fa0d23a9bb1fc4465f6c96a9
|
Provenance
The following attestation bundles were made for cukks_cu121-0.2.2-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl:
Publisher:
publish-packages.yml on devUuung/CuKKS
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
cukks_cu121-0.2.2-cp313-cp313-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl -
Subject digest:
ac0f2878ea6811fa1366cc3adafee4df545422cfd5bcf8e38566d4a6ee5b62d6 - Sigstore transparency entry: 1237704813
- Sigstore integration time:
-
Permalink:
devUuung/CuKKS@dcca78172ce212a8ddde72171bc0539218fedd80 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/devUuung
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-packages.yml@dcca78172ce212a8ddde72171bc0539218fedd80 -
Trigger Event:
workflow_dispatch
-
Statement type:
File details
Details for the file cukks_cu121-0.2.2-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.
File metadata
- Download URL: cukks_cu121-0.2.2-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
- Upload date:
- Size: 6.5 MB
- Tags: CPython 3.12, manylinux: glibc 2.27+ x86-64, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cdc68d3f97c9b42a358bf204ee84c38852edcfc6b7c5ba03bf4224ad36cfcd25
|
|
| MD5 |
59bd8d8f2db29196536dd1648c226403
|
|
| BLAKE2b-256 |
87a15722ea6ad054d272b64f0d8fab6c6d2efcb70f0e3229d2b87c7421d04d33
|
Provenance
The following attestation bundles were made for cukks_cu121-0.2.2-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl:
Publisher:
publish-packages.yml on devUuung/CuKKS
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
cukks_cu121-0.2.2-cp312-cp312-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl -
Subject digest:
cdc68d3f97c9b42a358bf204ee84c38852edcfc6b7c5ba03bf4224ad36cfcd25 - Sigstore transparency entry: 1237704812
- Sigstore integration time:
-
Permalink:
devUuung/CuKKS@dcca78172ce212a8ddde72171bc0539218fedd80 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/devUuung
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-packages.yml@dcca78172ce212a8ddde72171bc0539218fedd80 -
Trigger Event:
workflow_dispatch
-
Statement type:
File details
Details for the file cukks_cu121-0.2.2-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.
File metadata
- Download URL: cukks_cu121-0.2.2-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
- Upload date:
- Size: 6.5 MB
- Tags: CPython 3.11, manylinux: glibc 2.27+ x86-64, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c05c50d8589876ac186f7f3025691ae90fb1af03492996fb2438d1daa80881b4
|
|
| MD5 |
e382cff5c5e8a6203e61056314bb1ae9
|
|
| BLAKE2b-256 |
3ea2caa698fbed3875e81aff3e6cd8c4e6c60de7a92cc49a3f22e1f371efc9e5
|
Provenance
The following attestation bundles were made for cukks_cu121-0.2.2-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl:
Publisher:
publish-packages.yml on devUuung/CuKKS
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
cukks_cu121-0.2.2-cp311-cp311-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl -
Subject digest:
c05c50d8589876ac186f7f3025691ae90fb1af03492996fb2438d1daa80881b4 - Sigstore transparency entry: 1237704817
- Sigstore integration time:
-
Permalink:
devUuung/CuKKS@dcca78172ce212a8ddde72171bc0539218fedd80 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/devUuung
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-packages.yml@dcca78172ce212a8ddde72171bc0539218fedd80 -
Trigger Event:
workflow_dispatch
-
Statement type:
File details
Details for the file cukks_cu121-0.2.2-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.
File metadata
- Download URL: cukks_cu121-0.2.2-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
- Upload date:
- Size: 6.5 MB
- Tags: CPython 3.10, manylinux: glibc 2.27+ x86-64, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d9cc3fc0ee5154c090e4718d37cd7b283aa9267d28d6651e77403f44b854a5e6
|
|
| MD5 |
450fd8cd6f94ce013a88247b5f1b038b
|
|
| BLAKE2b-256 |
4a0ca463dc06c54d10ba43c3ba032cd46176a52ec296e62566193c40472f708b
|
Provenance
The following attestation bundles were made for cukks_cu121-0.2.2-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl:
Publisher:
publish-packages.yml on devUuung/CuKKS
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
cukks_cu121-0.2.2-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl -
Subject digest:
d9cc3fc0ee5154c090e4718d37cd7b283aa9267d28d6651e77403f44b854a5e6 - Sigstore transparency entry: 1237704814
- Sigstore integration time:
-
Permalink:
devUuung/CuKKS@dcca78172ce212a8ddde72171bc0539218fedd80 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/devUuung
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-packages.yml@dcca78172ce212a8ddde72171bc0539218fedd80 -
Trigger Event:
workflow_dispatch
-
Statement type: