Skip to main content

vLLM for DGX Spark (GB10). After pip install, run: sudo dgx-spark-vllm-install

Project description

DGX Spark vLLM

Pre-built vLLM environment for NVIDIA DGX Spark (GB10, sm_121).

About

This package contains the vLLM environment extracted from NVIDIA's official nvcr.io/nvidia/vllm:25.09-py3 Docker image, configured for local installation on DGX Spark without Docker.

Why? The official vLLM doesn't support sm_121 (GB10) yet. NVIDIA's Docker image has patches for Blackwell support, but requires running everything in a container. This package lets you run vLLM natively on DGX Spark.

Requirements

  • NVIDIA DGX Spark (GB10 chip, sm_121)
  • Ubuntu with CUDA 13.0 installed
  • Python 3.12
  • aarch64 (ARM64) architecture

Installation

pip install dgx-spark-vllm
sudo dgx-spark-vllm-install

Then restart your shell or run:

source /etc/profile.d/nvidia-venv.sh

Included Versions

Package Version
PyTorch 2.9.0a0+50eac811a6.nv25.09
vLLM 0.10.1.1+381074ae.nv25.09
CUDA 13.0
Triton 3.4.0
FlashAttention 2.7.4
cuDNN 9.13.0
NCCL 2.27.7

Usage

import torch
print(torch.__version__)  # 2.9.0a0+50eac811a6.nv25.09
print(torch.cuda.get_device_name(0))  # NVIDIA GB10

import vllm
print(vllm.__version__)  # 0.10.1.1+381074ae.nv25.09

Run vLLM server:

python3 -m vllm.entrypoints.openai.api_server --model <model_name>

What's Installed

  • /home/srpost/shared/nvidia-venv/ - Python packages
  • /opt/hpcx/ - HPC-X libraries (UCX, UCC, OpenMPI)
  • /usr/lib/aarch64-linux-gnu/libcudnn* - cuDNN
  • /usr/lib/aarch64-linux-gnu/libnccl* - NCCL
  • /usr/local/lib/libnvpl* - NVPL (NVIDIA Performance Libraries)
  • /etc/ld.so.conf.d/00-hpcx.conf - Library path config
  • /etc/profile.d/nvidia-venv.sh - PYTHONPATH config

License

The installer is Apache 2.0. The installed packages retain their original licenses (PyTorch BSD, vLLM Apache 2.0, NVIDIA components under NVIDIA license).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dgx_spark_vllm-25.9.2.tar.gz (4.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dgx_spark_vllm-25.9.2-py3-none-any.whl (4.4 kB view details)

Uploaded Python 3

File details

Details for the file dgx_spark_vllm-25.9.2.tar.gz.

File metadata

  • Download URL: dgx_spark_vllm-25.9.2.tar.gz
  • Upload date:
  • Size: 4.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for dgx_spark_vllm-25.9.2.tar.gz
Algorithm Hash digest
SHA256 f6cc2db51b3de060e1cc3944c67a5a009413002513ffa8cb471a044f74396d02
MD5 d71eebfc2630baf4dd18eb563c66a3e6
BLAKE2b-256 be90afe18bfa2e06ed8a89e39fd597a746eae8cc7b142b2072702d0fc947485c

See more details on using hashes here.

File details

Details for the file dgx_spark_vllm-25.9.2-py3-none-any.whl.

File metadata

File hashes

Hashes for dgx_spark_vllm-25.9.2-py3-none-any.whl
Algorithm Hash digest
SHA256 b553e949e5cfec9c1e383b6661c2de695121fb45557cce623014ece6e60b08fb
MD5 ca6a7391ef058ff2ca9ad80728d67d52
BLAKE2b-256 f495c047172036c0455614035976a12145777b4a4cf1598187bf07921bae2604

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page