Skip to main content

OpenTelemetry vLLM instrumentation

Project description

pypi

This library allows tracing and metrics collection for vLLM LLM serving framework.

Installation

pip install opentelemetry-instrumentation-vllm

Features

  • Works with vLLM 0.7.x–0.10.x (set VLLM_USE_V1=1 to opt into the v1 engine when running 0.8.x–0.10.x).

  • Captures spans for non-streaming and streaming create_completion and create_chat_completion calls, including prompt/response bodies, token usage and finish reasons.

  • Hooks the internal LLM engine to emit spans with TTFT, queueing time, per-stage latency and sampling parameters.

  • Hijacks vLLM’s Prometheus loggers so that all engine metrics are pushed through OpenTelemetry meters instead of relying on pull-based scraping.

Usage

from opentelemetry.instrumentation.vllm import VLLMInstrumentor

VLLMInstrumentor().instrument()

References

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tapm_instrumentation_vllm-0.1b3.tar.gz (25.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tapm_instrumentation_vllm-0.1b3-py3-none-any.whl (29.2 kB view details)

Uploaded Python 3

File details

Details for the file tapm_instrumentation_vllm-0.1b3.tar.gz.

File metadata

File hashes

Hashes for tapm_instrumentation_vllm-0.1b3.tar.gz
Algorithm Hash digest
SHA256 71ebd9db44f180a842851ec142aa7c0a642c3ca0c6bf6e81591c9aa83350cb7d
MD5 f84e0c7b0af67df70b61928c03d615f7
BLAKE2b-256 10b2d50ece767017f7b952bd5a0e5da44519205fb234e3929e6caa55b4430e67

See more details on using hashes here.

File details

Details for the file tapm_instrumentation_vllm-0.1b3-py3-none-any.whl.

File metadata

File hashes

Hashes for tapm_instrumentation_vllm-0.1b3-py3-none-any.whl
Algorithm Hash digest
SHA256 8ef076f08f9fb2376c247dc7d026bddf88ad0914802cf7d02e3b625fc7395fe0
MD5 123bc71fbc7e6e82ef9ffd831d11a221
BLAKE2b-256 9a46c575cfb78b503f3bee3d70cb4e7d176ca73463baeba9c8eb021dab455c62

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page