Skip to main content

vLLM environment for NVIDIA DGX Spark (GB10, sm_121)

Project description

DGX Spark vLLM

Pre-built vLLM environment for NVIDIA DGX Spark (GB10, sm_121).

About

This package contains the vLLM environment extracted from NVIDIA's official nvcr.io/nvidia/vllm:25.09-py3 Docker image, configured for local installation on DGX Spark without Docker.

Why? The official vLLM doesn't support sm_121 (GB10) yet. NVIDIA's Docker image has patches for Blackwell support, but requires running everything in a container. This package lets you run vLLM natively on DGX Spark.

Requirements

  • NVIDIA DGX Spark (GB10 chip, sm_121)
  • Ubuntu with CUDA 13.0 installed
  • Python 3.12
  • aarch64 (ARM64) architecture

Installation

pip install dgx-spark-vllm
sudo dgx-spark-vllm-install

Then restart your shell or run:

source /etc/profile.d/nvidia-venv.sh

Included Versions

Package Version
PyTorch 2.9.0a0+50eac811a6.nv25.09
vLLM 0.10.1.1+381074ae.nv25.09
CUDA 13.0
Triton 3.4.0
FlashAttention 2.7.4
cuDNN 9.13.0
NCCL 2.27.7

Usage

import torch
print(torch.__version__)  # 2.9.0a0+50eac811a6.nv25.09
print(torch.cuda.get_device_name(0))  # NVIDIA GB10

import vllm
print(vllm.__version__)  # 0.10.1.1+381074ae.nv25.09

Run vLLM server:

python3 -m vllm.entrypoints.openai.api_server --model <model_name>

What's Installed

  • /home/srpost/shared/nvidia-venv/ - Python packages
  • /opt/hpcx/ - HPC-X libraries (UCX, UCC, OpenMPI)
  • /usr/lib/aarch64-linux-gnu/libcudnn* - cuDNN
  • /usr/lib/aarch64-linux-gnu/libnccl* - NCCL
  • /usr/local/lib/libnvpl* - NVPL (NVIDIA Performance Libraries)
  • /etc/ld.so.conf.d/00-hpcx.conf - Library path config
  • /etc/profile.d/nvidia-venv.sh - PYTHONPATH config

License

The installer is Apache 2.0. The installed packages retain their original licenses (PyTorch BSD, vLLM Apache 2.0, NVIDIA components under NVIDIA license).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dgx_spark_vllm-25.9.1.tar.gz (3.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dgx_spark_vllm-25.9.1-py3-none-any.whl (4.1 kB view details)

Uploaded Python 3

File details

Details for the file dgx_spark_vllm-25.9.1.tar.gz.

File metadata

  • Download URL: dgx_spark_vllm-25.9.1.tar.gz
  • Upload date:
  • Size: 3.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for dgx_spark_vllm-25.9.1.tar.gz
Algorithm Hash digest
SHA256 3d6966f39ea6a40feec1a7addca15dd44676702277e75b6637ee9584047c04ec
MD5 8d5a6e6c46acebd003da914ade6c610b
BLAKE2b-256 1e318fb7e27ea629d9095db04047136b29f494667b87e447d6155279fe7e254f

See more details on using hashes here.

File details

Details for the file dgx_spark_vllm-25.9.1-py3-none-any.whl.

File metadata

File hashes

Hashes for dgx_spark_vllm-25.9.1-py3-none-any.whl
Algorithm Hash digest
SHA256 2b53f2f6a3eb0629d54598ddc8d99db46e3283e51b46be19cbb8c919b68cfe14
MD5 6dae9b9168687e2c3c47df32032155ed
BLAKE2b-256 527cd818a245f84b130757c3c507cbfe38fc077089393dbf9f3c9908e2ba08db

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page