Skip to main content

A library for efficient similarity search and clustering of dense vectors.

Project description

faiss-wheels

This repository is based on kyamagu/faiss-wheels.

PyPI PyPI

Overview

This repository provides scripts to build GPU-enabled wheels for the faiss library. Distributes faiss-gpu-cuXX packages to PyPI using the contents of this repository.

Key Features

  • No local CUDA installation required - Dynamically links to CUDA Runtime and cuBLAS libraries from PyPI
  • Builds CUDA 11.8+ and CUDA 12.1+ compatible wheels
  • Supports Volta to Ada Lovelace architecture GPUs (Compute Capability 7.0–8.9)
  • Bundles OpenBLAS in Linux
  • Reduces wheel file size through dynamic linking instead of static compilation

Important Requirements

The published faiss-gpu-cuXX packages require proper system setup that cannot be managed by pip. It is your responsibility to prepare a suitable environment:

  1. NVIDIA Driver: Your host must have a CUDA-compatible NVIDIA driver installed

  2. GPU Architecture: Your GPU must be compatible (Compute Capability 7.0–8.9)

    • Supported: Volta, Turing, Ampere, Ada Lovelace
  3. Library Compatibility: If you install multiple CUDA-dependent libraries (e.g., PyTorch) in the same environment, they must link to the same CUDA version

GPU Architecture Support for PyPI Packages

Support Policy for faiss-gpu-cu11 and faiss-gpu-cu12

Note: This is an unofficial, personal development project with limited computational resources. Due to these constraints, comprehensive testing across all NVIDIA GPU architectures is not feasible. The pre-built faiss-gpu-cu11 and faiss-gpu-cu12 packages on PyPI aim to support the same GPU architecture range (Compute Capability 7.0–8.9) as the official Faiss repository.

Sponsoring New GPU Architecture Support

Adding support for a new GPU architecture (e.g., Hopper, Blackwell) requires dedicated hardware for building and testing. NVIDIA GPUs have limited compatibility across compute capabilities — binaries built for one architecture do not necessarily work correctly on another. Distributing untested wheels is not an option.

This is an unfunded personal project. If you or your organization need support for an architecture outside the current range, please consider sponsoring this project to help cover the hardware and infrastructure costs. For ongoing discussion and status updates, see Support for New GPU Architectures.

For Unsupported GPU Architectures

If you have a GPU architecture that is not supported by these pre-built wheels:

  1. Official Faiss: Follow the official Faiss repository build instructions
  2. Build from Source: Use this repository's code to build wheels for your specific architecture (see Building from Source section)

Installation

The faiss-gpu-cu11 and faiss-gpu-cu12 wheels are available on PyPI. Choose the appropriate version for your CUDA environment.

For CUDA 12

# Install with fixed CUDA 12.1 (requires NVIDIA Driver ≥R530)
pip install 'faiss-gpu-cu12[fix-cuda]'

# Install with CUDA 12.X (X≥1) - allows flexibility but driver requirement varies
pip install faiss-gpu-cu12

Details:

  • faiss-gpu-cu12 is built with CUDA Toolkit 12.1 and maintains minor version compatibility
  • With [fix-cuda]: Installs exactly CUDA 12.1, requiring NVIDIA Driver ≥R530
  • Without [fix-cuda]: Allows any CUDA 12.X (X≥1), driver requirement depends on the actual CUDA version installed
    • For example: CUDA 12.4 requires Driver ≥R550
  • Use without [fix-cuda] when integrating with other CUDA-dependent packages (e.g., PyTorch with CUDA 12.4)

System Requirements:

  • OS: Linux x86_64 (glibc ≥2.17)
  • GPU: Compute Capability 7.0–8.9

For CUDA 11

# Install with CUDA 11.8 (requires NVIDIA Driver ≥R520)
pip install faiss-gpu-cu11[fix-cuda]

# Same as above (CUDA 11.8 is the final version)
pip install faiss-gpu-cu11

Details:

  • faiss-gpu-cu11 is built with CUDA Toolkit 11.8
  • Both commands install CUDA 11.8 since no newer CUDA 11.X versions exist
  • Requires NVIDIA Driver ≥R520

System Requirements:

  • OS: Linux x86_64 (glibc ≥2.17)
  • GPU: Compute Capability 7.0–8.9

Driver Compatibility Reference

CUDA Version Minimum Driver Version
CUDA 11.8 ≥R520 (520.61.05)
CUDA 12.1 ≥R530 (530.30.02)
CUDA 12.2+ Check NVIDIA Documentation

Warning: When installing without [fix-cuda], pip may resolve to a newer CUDA version that requires a newer driver than you have installed. Always verify driver compatibility before installation.

Advanced: Using System CUDA Libraries

If you need to use system-installed CUDA instead of PyPI CUDA packages, you can bypass the automatic CUDA loading:

  1. Exclude PyPI CUDA dependencies using your package manager (e.g., uv, pdm)
  2. Set environment variable: _FAISS_WHEEL_DISABLE_CUDA_PRELOAD=1
  3. Ensure CUDA libraries are accessible via LD_LIBRARY_PATH

Example with uv (workaround):

# In pyproject.toml
[tool.uv]
override-dependencies = [
    "nvidia-cuda-runtime-cu11==0.0.0; sys_platform == 'never'",
    "nvidia-cublas-cu11==0.0.0; sys_platform == 'never'",
]

Versioning

  • Follows the original faiss repository versioning (e.g., 1.11.0)
  • Patches specific to this repository use postN suffix (e.g., 1.11.0.post1)

Building from Source

Build faiss-gpu-cu11 and faiss-gpu-cu12 wheels using cibuildwheel.

Build Configuration

# Configure build parameters
export NJOB="32"                          # Number of parallel build jobs
export FAISS_OPT_LEVEL="generic"          # Options: generic, avx2, avx512
export CUDA_ARCHITECTURES="70-real;80-real"  # Target GPU architectures

# For builds without GPU testing
export CIBW_TEST_COMMAND_LINUX=""

# For builds with GPU testing (requires NVIDIA Docker)
export CIBW_CONTAINER_ENGINE='docker; create_args: --gpus all'
# Note: GPU testing requires Docker with NVIDIA Container Toolkit configured

Build Commands

# Build faiss-gpu-cu11 wheels
uvx cibuildwheel@2.23.2 variant/gpu-cu11 --output-dir wheelhouse/gpu-cu11

# Build faiss-gpu-cu12 wheels
uvx cibuildwheel@2.23.2 variant/gpu-cu12 --output-dir wheelhouse/gpu-cu12

Wheels will be created in {repository_root}/wheelhouse/gpu-cuXX/.

Build Requirements

  • OS: Linux x86_64
  • NVIDIA Container Toolkit (if running tests)
  • NVIDIA Driver: ≥R530 (if running tests with CUDA 12)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

faiss_gpu_cu12-1.14.1.post1-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (48.4 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.17+ x86-64

faiss_gpu_cu12-1.14.1.post1-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (48.4 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ x86-64

faiss_gpu_cu12-1.14.1.post1-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (48.4 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64

faiss_gpu_cu12-1.14.1.post1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (48.4 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.17+ x86-64

File details

Details for the file faiss_gpu_cu12-1.14.1.post1-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for faiss_gpu_cu12-1.14.1.post1-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 a48e1932ef7d2dc8e113ae89b1479235cf940ce30c14edff16adb45b2bc70d26
MD5 7c0e23980541fbc59035b95e80bc72d3
BLAKE2b-256 9834cd2ce7daae86ddc911d64c3bb2cf1c6088cc4f7ab454dcbaa767e221d3ca

See more details on using hashes here.

Provenance

The following attestation bundles were made for faiss_gpu_cu12-1.14.1.post1-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl:

Publisher: build_wheel.yml on Di-Is/faiss-gpu-wheels

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file faiss_gpu_cu12-1.14.1.post1-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for faiss_gpu_cu12-1.14.1.post1-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 d997b9a4d9c7697e3ce39576edb91733c20ac2e81b455b4c08afea9d69e894cf
MD5 10f16d9a8f0b97983f448dbc046bdc91
BLAKE2b-256 e491cd341417dc21b3ec99ea665c66f6939bc21640944c79a8f3908362fe7d63

See more details on using hashes here.

Provenance

The following attestation bundles were made for faiss_gpu_cu12-1.14.1.post1-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl:

Publisher: build_wheel.yml on Di-Is/faiss-gpu-wheels

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file faiss_gpu_cu12-1.14.1.post1-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for faiss_gpu_cu12-1.14.1.post1-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 87e630faed349ba71e6c10e8340e93edf406bf22b42e08f097d7d3d64e16d0cb
MD5 89a152ad08b041c9fd5d23b1f4cd1236
BLAKE2b-256 7e4bc32a725ec4b8be8a7abbeb05afc973dc72256b2d1ae8a3fd8c120d6199de

See more details on using hashes here.

Provenance

The following attestation bundles were made for faiss_gpu_cu12-1.14.1.post1-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl:

Publisher: build_wheel.yml on Di-Is/faiss-gpu-wheels

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file faiss_gpu_cu12-1.14.1.post1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for faiss_gpu_cu12-1.14.1.post1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 675162965e1fb6c685276c491ad2cf58923ebf99e6571927f554e17b9ff5e84b
MD5 f899da7a35847d18686aabe36a9afef5
BLAKE2b-256 e3151871e23eaad481858e4703069b266f2cfd73326c04b654bb70b68dd4d198

See more details on using hashes here.

Provenance

The following attestation bundles were made for faiss_gpu_cu12-1.14.1.post1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl:

Publisher: build_wheel.yml on Di-Is/faiss-gpu-wheels

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page