Skip to main content

vLLM environment for NVIDIA DGX Spark (GB10, sm_121)

Project description

DGX Spark vLLM

Pre-built vLLM environment for NVIDIA DGX Spark (GB10, sm_121).

Requirements

  • NVIDIA DGX Spark (GB10 chip, sm_121)
  • Ubuntu with CUDA 13.0 installed
  • Python 3.12
  • aarch64 (ARM64) architecture

Installation

pip install dgx-spark-vllm
sudo dgx-spark-vllm-install

Then restart your shell or run:

source /etc/profile.d/nvidia-venv.sh

Included Versions

Package Version
PyTorch 2.9.0a0+50eac811a6.nv25.09
vLLM 0.10.1.1+381074ae.nv25.09
CUDA 13.0
Triton 3.4.0
FlashAttention 2.7.4
cuDNN 9.13.0
NCCL 2.27.7

Usage

import torch
print(torch.__version__)  # 2.9.0a0+50eac811a6.nv25.09
print(torch.cuda.get_device_name(0))  # NVIDIA GB10

import vllm
print(vllm.__version__)  # 0.10.1.1+381074ae.nv25.09

Run vLLM server:

python3 -m vllm.entrypoints.openai.api_server --model <model_name>

What's Installed

  • /home/srpost/shared/nvidia-venv/ - Python packages
  • /opt/hpcx/ - HPC-X libraries (UCX, UCC, OpenMPI)
  • /usr/lib/aarch64-linux-gnu/libcudnn* - cuDNN
  • /usr/lib/aarch64-linux-gnu/libnccl* - NCCL
  • /usr/local/lib/libnvpl* - NVPL (NVIDIA Performance Libraries)
  • /etc/ld.so.conf.d/00-hpcx.conf - Library path config
  • /etc/profile.d/nvidia-venv.sh - PYTHONPATH config

License

The installer is Apache 2.0. The installed packages retain their original licenses (PyTorch BSD, vLLM Apache 2.0, NVIDIA components under NVIDIA license).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dgx_spark_vllm-25.9.0.tar.gz (3.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dgx_spark_vllm-25.9.0-py3-none-any.whl (3.9 kB view details)

Uploaded Python 3

File details

Details for the file dgx_spark_vllm-25.9.0.tar.gz.

File metadata

  • Download URL: dgx_spark_vllm-25.9.0.tar.gz
  • Upload date:
  • Size: 3.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for dgx_spark_vllm-25.9.0.tar.gz
Algorithm Hash digest
SHA256 1a85c27ed73b3432e1ef2e5068e2f286a01a4adb4a96578c2777eba39a808ad3
MD5 c1be8b5a748abd4a1d3bcb0694fff996
BLAKE2b-256 b23ac8c48139931cac3a18939dc4c0b7c34016763610057846f758de079c6a1d

See more details on using hashes here.

File details

Details for the file dgx_spark_vllm-25.9.0-py3-none-any.whl.

File metadata

File hashes

Hashes for dgx_spark_vllm-25.9.0-py3-none-any.whl
Algorithm Hash digest
SHA256 e67f8b5f52efd53c268fd297c84d64b68336acc2d978661a5807d8661811f95d
MD5 93bd1311f7ffc2d85232f8631d9d8cd1
BLAKE2b-256 78ae8cb3d9bd5763cdbdbeb621c6be02f65eae6f527718329b7502631a33b3f6

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page