A web interface for managing and interacting with vLLM servers

These details have not been verified by PyPI

Project links

Project description

vLLM Playground

A modern web interface for managing and interacting with vLLM servers (www.github.com/vllm-project/vllm). Supports GPU and CPU modes, with special optimizations for macOS Apple Silicon and enterprise deployment on OpenShift/Kubernetes.

✨ Agentic-Ready with MCP Support

vLLM Playground MCP Integration

MCP (Model Context Protocol) integration enables models to use external tools with human-in-the-loop approval.

✨ Tool Calling Support

vLLM Playground Interface

✨ Structured Outputs Support

vLLM Playground with Structured Outputs

🆕 What's New in v0.1.2

🌏 ModelScope Support - Alternative model source for China region users
🌐 i18n Chinese - Comprehensive Chinese language translations
💬 Chat Export - Save conversations with export functionality
🐛 Bug Fixes - Windows Unicode fix, sidebar UI improvements

See Changelog for full details.

🚀 Quick Start

# Install from PyPI
pip install vllm-playground

# Pre-download container image (~10GB for GPU)
vllm-playground pull

# Start the playground
vllm-playground

Open http://localhost:7860 and click "Start Server" - that's it! 🎉

CLI Options

vllm-playground pull                # Pre-download GPU image
vllm-playground pull --cpu          # Pre-download CPU image
vllm-playground --port 8080         # Custom port
vllm-playground stop                # Stop running instance
vllm-playground status              # Check status

✨ Key Features

Feature	Description
💬 Modern Chat UI	Streamlined ChatGPT-style interface with streaming responses
🔧 Tool Calling	Function calling with Llama, Mistral, Qwen, and more
🔗 MCP Integration	Connect to MCP servers for agentic capabilities
🏗️ Structured Outputs	Constrain responses to JSON Schema, Regex, or Grammar
🐳 Container Mode	Zero-setup vLLM via automatic container management
☸️ OpenShift/K8s	Enterprise deployment with dynamic pod creation
📊 Benchmarking	GuideLLM integration for load testing
📚 Recipes	One-click configs from vLLM community recipes

📦 Installation Options

Method	Command	Best For
PyPI	`pip install vllm-playground`	Most users
With Benchmarking	`pip install vllm-playground[benchmark]`	Load testing
From Source	`git clone` + `python run.py`	Development
OpenShift/K8s	`./openshift/deploy.sh`	Enterprise

📖 See Installation Guide for detailed instructions.

🔧 Configuration

Tool Calling

Enable in Server Configuration before starting:

Check "Enable Tool Calling"
Select parser (or "Auto-detect")
Start server
Define tools in the 🔧 toolbar panel

Supported Models:

Llama 3.x (llama3_json)
Mistral (mistral)
Qwen (hermes)
Hermes (hermes)

MCP Servers

Connect to external tools via Model Context Protocol:

Go to MCP Servers in the sidebar
Add a server (presets available: Filesystem, Git, Fetch, Time)
Connect and enable in chat panel

⚠️ MCP requires Python 3.10+

CPU Mode (macOS)

Edit config/vllm_cpu.env:

export VLLM_CPU_KVCACHE_SPACE=40
export VLLM_CPU_OMP_THREADS_BIND=auto

📖 Documentation

Getting Started

Installation Guide - All installation methods
Quick Start - Get running in minutes
macOS CPU Guide - Apple Silicon setup

Features

Features Overview - Complete feature list
Gated Models Guide - Access Llama, Gemma, etc.

Deployment

OpenShift/K8s Deployment - Enterprise deployment
Architecture Overview - System design
Container Variants - Container options

Reference

Troubleshooting - Common issues
Performance Metrics - Benchmarking
Command Reference - CLI cheat sheet

Releases

Changelog - Version history and changes
v0.1.2 - ModelScope integration, i18n improvements
v0.1.1 - MCP integration, runtime detection
v0.1.0 - First release, modern UI, tool calling

🏗️ Architecture

┌──────────────────┐
│   User Browser   │
└────────┬─────────┘
         │ http://localhost:7860
         ↓
┌──────────────────┐
│   Web UI (Host)  │  ← FastAPI + JavaScript
└────────┬─────────┘
         │
    ┌────┴────┐
    ↓         ↓
┌───────-─┐ ┌────────┐
│ vLLM    │ │  MCP   │  ← Containers / External Servers
│Container│ │Servers │
└────────-┘ └────────┘

📖 See Architecture Overview for details.

🆘 Quick Troubleshooting

Issue	Solution
Port in use	`vllm-playground stop`
Container won't start	`podman logs vllm-service`
Tool calling fails	Restart with "Enable Tool Calling" checked
Image pull errors	`vllm-playground pull --all`

📖 See Troubleshooting Guide for more.

🔗 Related Projects

vLLM - High-throughput LLM serving
LLMCompressor Playground - Model compression & quantization
GuideLLM - Performance benchmarking
MCP Servers - Official MCP servers

📝 License

Apache 2.0 License - See LICENSE file for details.

🤝 Contributing

Contributions welcome! Please feel free to submit issues and pull requests.

Made with ❤️ for the vLLM community

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.8

Apr 7, 2026

0.1.8rc2 pre-release

Apr 7, 2026

0.1.8rc1 pre-release

Apr 6, 2026

0.1.7

Mar 4, 2026

0.1.6 yanked

Mar 2, 2026

Reason this release was yanked:

bug

0.1.5

Feb 7, 2026

0.1.4

Feb 1, 2026

0.1.4rc1 pre-release

Feb 1, 2026

0.1.3

Jan 22, 2026

0.1.3rc1 pre-release

Jan 22, 2026

This version

0.1.2

Jan 19, 2026

0.1.1

Jan 8, 2026

0.1.1rc2 pre-release

Jan 8, 2026

0.1.1rc1 pre-release

Jan 8, 2026

0.1.0

Jan 1, 2026

0.0.16

Jan 1, 2026

0.0.15

Jan 1, 2026

0.0.14 yanked

Jan 1, 2026

Reason this release was yanked:

broken

0.0.13

Dec 22, 2025

0.0.12

Dec 22, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vllm_playground-0.1.2.tar.gz (5.3 MB view details)

Uploaded Jan 19, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vllm_playground-0.1.2-py3-none-any.whl (5.3 MB view details)

Uploaded Jan 19, 2026 Python 3

File details

Details for the file vllm_playground-0.1.2.tar.gz.

File metadata

Download URL: vllm_playground-0.1.2.tar.gz
Upload date: Jan 19, 2026
Size: 5.3 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.17

File hashes

Hashes for vllm_playground-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`d97fb1a9d9c6bab6319558cc08494598810f6e7f8655a083874350062aade0e7`
MD5	`3c779d907917af94816cf88b28f81655`
BLAKE2b-256	`5425abe4c21074b9998efd35f40caadf6fdec4f082d9d36a8377104933f09e74`

See more details on using hashes here.

File details

Details for the file vllm_playground-0.1.2-py3-none-any.whl.

File metadata

Download URL: vllm_playground-0.1.2-py3-none-any.whl
Upload date: Jan 19, 2026
Size: 5.3 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.17

File hashes

Hashes for vllm_playground-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`230d049f9a1cd7376041e4e1b154b9c7187f1636186480b3cda8d93e308bf4fe`
MD5	`278bd37442ec79934bac9c590f5afd34`
BLAKE2b-256	`646a27226daff9e100630f7d6553e41aefd821d136438a9531229a7c8702144b`

See more details on using hashes here.

vllm-playground 0.1.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

vLLM Playground

✨ Agentic-Ready with MCP Support

✨ Tool Calling Support

✨ Structured Outputs Support

🆕 What's New in v0.1.2

🚀 Quick Start

CLI Options

✨ Key Features

📦 Installation Options

🔧 Configuration

Tool Calling

MCP Servers

CPU Mode (macOS)

📖 Documentation

Getting Started

Features

Deployment

Reference

Releases

🏗️ Architecture

🆘 Quick Troubleshooting

🔗 Related Projects

📝 License

🤝 Contributing

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes