LLM plugin for Google Cloud Vertex AI models including Gemini and Model Garden
Project description
llm-gcp-vertex
llm plugin for Google Cloud Vertex AI, providing access to Gemini and Claude models through their official SDKs. If you are looking for support for
Google AI Studio, see the llm-gemini plugin.
Features
- Gemini models via the official
google-genaiSDK - Claude models via the official
anthropic[vertex]SDK - Full streaming support for both model families
- Async support for high-throughput applications
Installation
llm install llm-gcp-vertex
Prerequisites
1. Google Cloud Authentication
This plugin uses Application Default Credentials (ADC):
gcloud auth application-default login
2. Configure Project and Location
Using environment variables:
# Falls back to GOOGLE_CLOUD_PROJECT and GOOGLE_CLOUD_LOCATION if not set
export LLM_VERTEX_CLOUD_PROJECT="your-project-id"
export LLM_VERTEX_CLOUD_LOCATION="us-central1" # optional, defaults to us-central1
Or using LLM keys:
llm keys set vertex-project
llm keys set vertex-location # optional
Available Models
Gemini
| Model ID | Description |
|---|---|
gemini-3-pro-preview |
Latest flagship, complex reasoning (preview) |
gemini-3-flash-preview |
Latest fast model (preview) |
gemini-2.5-pro |
Strong reasoning and coding |
gemini-2.5-flash |
Fast and cost-effective |
gemini-2.0-flash |
Balanced speed and quality |
Claude (via Vertex AI)
| Model ID | Description |
|---|---|
claude-opus-4.5 |
Most capable, complex analysis |
claude-sonnet-4.5 |
Best for coding and agents |
claude-haiku-4.5 |
Fast and affordable |
claude-opus-4.1 |
Extended thinking, agentic tasks |
claude-sonnet-4 |
Balanced speed and capability |
claude-opus-4 |
Strong reasoning |
Usage
Command Line
# Gemini
llm -m gemini-2.5-flash "Explain quantum computing"
llm -m gemini-2.5-pro "Write a poem" -o temperature 0.9
# Claude
llm -m claude-sonnet-4.5 "Review this code"
llm -m claude-haiku-4.5 "Summarize this" -o max_tokens 500
# With system prompt
llm -m gemini-2.0-flash "Explain recursion" -s "You are a patient teacher"
Options
Gemini Options
| Option | Type | Description |
|---|---|---|
temperature |
float (0.0-2.0) | Controls randomness |
max_output_tokens |
int | Maximum tokens to generate |
top_p |
float (0.0-1.0) | Nucleus sampling threshold |
top_k |
int | Top-k sampling parameter |
Claude Options
| Option | Type | Description |
|---|---|---|
temperature |
float (0.0-1.0) | Controls randomness |
max_tokens |
int | Maximum tokens to generate (default: 4096) |
top_p |
float (0.0-1.0) | Nucleus sampling threshold |
top_k |
int | Top-k sampling parameter |
stop_sequences |
list[str] | Custom stop sequences |
Development
git clone https://github.com/ASRagab/llm-gcp-vertex.git
cd llm-gcp-vertex
# Install with dev dependencies
uv sync --group dev
# Run tests
uv run pytest
# Type check (strict)
uv run basedpyright llm_gcp_vertex.py
Troubleshooting
"Vertex AI project ID required"
export LLM_VERTEX_CLOUD_PROJECT="your-project-id"
# or
llm keys set vertex-project
"Could not automatically determine credentials"
gcloud auth application-default login
Claude models not working
Claude on Vertex AI requires:
- Enable the Claude API in your GCP project
- Accept the usage agreement in Model Garden
- Use a supported region (e.g.,
us-east5,europe-west1)
export LLM_VERTEX_CLOUD_LOCATION="us-east5"
License
Apache 2.0
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file llm_gcp_vertex-0.2.0.tar.gz.
File metadata
- Download URL: llm_gcp_vertex-0.2.0.tar.gz
- Upload date:
- Size: 9.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d8c19e792e320a037bf4e14c90fdd8d782048af2545113ac8e8c7367dbd98fd2
|
|
| MD5 |
062d145cfe4fa37dbf314e78c938aec9
|
|
| BLAKE2b-256 |
4027cd8baaf5ef67940087134ccc9d13697a3afeb83cb0a227d286e81e71fb3b
|
Provenance
The following attestation bundles were made for llm_gcp_vertex-0.2.0.tar.gz:
Publisher:
ci.yml on ASRagab/llm-gcp-vertex
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
llm_gcp_vertex-0.2.0.tar.gz -
Subject digest:
d8c19e792e320a037bf4e14c90fdd8d782048af2545113ac8e8c7367dbd98fd2 - Sigstore transparency entry: 790790370
- Sigstore integration time:
-
Permalink:
ASRagab/llm-gcp-vertex@5301286aca98859901dc548871c0138dca2f9d09 -
Branch / Tag:
refs/tags/v0.2.0 - Owner: https://github.com/ASRagab
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
ci.yml@5301286aca98859901dc548871c0138dca2f9d09 -
Trigger Event:
push
-
Statement type:
File details
Details for the file llm_gcp_vertex-0.2.0-py3-none-any.whl.
File metadata
- Download URL: llm_gcp_vertex-0.2.0-py3-none-any.whl
- Upload date:
- Size: 10.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ee229e4476cc5f7620c170625c211322a76a62fed2efa09515bbf2c30fc5da32
|
|
| MD5 |
d2bafcd90f64cacf3da941788c172867
|
|
| BLAKE2b-256 |
afd29e766a61f1764be34b4cbe470607c07a12b69f965c68f656fbc599228050
|
Provenance
The following attestation bundles were made for llm_gcp_vertex-0.2.0-py3-none-any.whl:
Publisher:
ci.yml on ASRagab/llm-gcp-vertex
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
llm_gcp_vertex-0.2.0-py3-none-any.whl -
Subject digest:
ee229e4476cc5f7620c170625c211322a76a62fed2efa09515bbf2c30fc5da32 - Sigstore transparency entry: 790790372
- Sigstore integration time:
-
Permalink:
ASRagab/llm-gcp-vertex@5301286aca98859901dc548871c0138dca2f9d09 -
Branch / Tag:
refs/tags/v0.2.0 - Owner: https://github.com/ASRagab
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
ci.yml@5301286aca98859901dc548871c0138dca2f9d09 -
Trigger Event:
push
-
Statement type: