Skip to main content

Google Cloud Vertex AI provider for lmux

Project description

lmux-gcp-vertex

Google Cloud Vertex AI provider for lmux. Wraps the google-genai SDK.

Supports chat completions, streaming, and embeddings.

Part of the lmux ecosystem: standardized interface, cost tracking on every response, and registry-based routing across providers.

Auth

Three authentication methods:

Application Default Credentials (default)

Uses google.auth.default(), which works with GOOGLE_APPLICATION_CREDENTIALS, gcloud CLI, or instance metadata.

from lmux_gcp_vertex import GCPVertexProvider

provider = GCPVertexProvider(project="my-project", location="us-central1")

Service Account

from lmux_gcp_vertex import GCPVertexServiceAccountAuthProvider

provider = GCPVertexProvider(
    project="my-project",
    location="us-central1",
    auth=GCPVertexServiceAccountAuthProvider(service_account_file="/path/to/key.json"),
)

API Key

Set GOOGLE_API_KEY in your environment:

from lmux_gcp_vertex import GCPVertexAPIKeyAuthProvider

provider = GCPVertexProvider(auth=GCPVertexAPIKeyAuthProvider(), vertexai=False)

Usage

Chat

from lmux import UserMessage

response = provider.chat("gemini-2.5-pro", [UserMessage(content="Hello")])
print(response.content)
print(response.cost)

Streaming

for chunk in provider.chat_stream("gemini-2.5-pro", [UserMessage(content="Hello")]):
    if chunk.delta:
        print(chunk.delta, end="")

Embeddings

response = provider.embed("text-embedding-005", "Hello")
print(response.embeddings)

Async

All methods have async variants: achat, achat_stream, aembed.

Registry

Use with the lmux registry to route across multiple providers:

from lmux import Registry

registry = Registry()
registry.register("gcp", provider)
response = registry.chat("gcp/gemini-2.5-pro", messages)

Provider Params

from lmux_gcp_vertex import GCPVertexParams

response = provider.chat(
    "gemini-2.5-pro",
    messages,
    provider_params=GCPVertexParams(thinking_config={"thinking_budget": 1024}),
)
Parameter Type Description
safety_settings list[SafetySetting] Content safety thresholds
presence_penalty float Presence penalty
frequency_penalty float Frequency penalty
seed int Deterministic sampling seed
labels dict[str, str] Request labels
thinking_config dict Thinking/reasoning configuration

Constructor Options

GCPVertexProvider(
    auth=...,       # AuthProvider, default: GCPVertexADCAuthProvider()
    project=...,    # GCP project ID
    location=...,   # GCP region
    vertexai=...,   # Use Vertex AI (default: True) vs. AI Studio
)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lmux_gcp_vertex-0.2.0.tar.gz (10.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

lmux_gcp_vertex-0.2.0-py3-none-any.whl (13.8 kB view details)

Uploaded Python 3

File details

Details for the file lmux_gcp_vertex-0.2.0.tar.gz.

File metadata

  • Download URL: lmux_gcp_vertex-0.2.0.tar.gz
  • Upload date:
  • Size: 10.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for lmux_gcp_vertex-0.2.0.tar.gz
Algorithm Hash digest
SHA256 86f30fa97bc07ee685ce695aba2ed84a359e0c84240dcc5b8193191d46a28297
MD5 b909b318253cab03fa9862a066014e5c
BLAKE2b-256 f1fe0933b694194b9812d5b387558c00df269c887a23e98da076e80d1d34ec3f

See more details on using hashes here.

Provenance

The following attestation bundles were made for lmux_gcp_vertex-0.2.0.tar.gz:

Publisher: publish.yml on cluebbehusen/lmux

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file lmux_gcp_vertex-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for lmux_gcp_vertex-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 0a0e17a42a22c06d8a046972e5b278d0be6093dcfb7fa3ea10eee9f6cdf7b6eb
MD5 e1ff297e30d13ee59fa56f18d149cb3d
BLAKE2b-256 444c1f04e5cc2e9d064f3be5b088f710a66db156a6b327956ae24c511a3fff8e

See more details on using hashes here.

Provenance

The following attestation bundles were made for lmux_gcp_vertex-0.2.0-py3-none-any.whl:

Publisher: publish.yml on cluebbehusen/lmux

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page