LLM plugin to access Google's Gemini family of models

Maintainers

simonw

Project description

llm-gemini

API access to Google's Gemini models

Installation

Install this plugin in the same environment as LLM.

llm install llm-gemini

Usage

Configure the model by setting a key called "gemini" to your API key:

llm keys set gemini

<paste key here>

You can also set the API key by assigning it to the environment variable LLM_GEMINI_KEY.

Now run the model using -m gemini-2.0-flash, for example:

llm -m gemini-2.0-flash "A short joke about a pelican and a walrus"

A pelican and a walrus are sitting at a bar. The pelican orders a fishbowl cocktail, and the walrus orders a plate of clams. The bartender asks, "So, what brings you two together?"

The walrus sighs and says, "It's a long story. Let's just say we met through a mutual friend... of the fin."

You can set the default model to avoid the extra -m option:

llm models default gemini-2.0-flash
llm "A joke about a pelican and a walrus"
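The same prompt can also be run from Python. The following is a minimal sketch, assuming LLM and this plugin are installed in the current environment and a key has been configured. The `run_prompt` helper and its `get_model` parameter are hypothetical names introduced here so the function can be exercised with a stand-in model instead of a network call.

```python
def run_prompt(prompt, model_id="gemini-2.0-flash", get_model=None):
    """Send a prompt to a model and return the response text.

    get_model is a hypothetical hook for swapping in a stand-in model;
    by default the real llm.get_model lookup is used.
    """
    if get_model is None:
        import llm  # requires llm and llm-gemini to be installed

        get_model = llm.get_model
    model = get_model(model_id)
    return model.prompt(prompt).text()


# Usage (makes a real API call, so it is left commented out):
# print(run_prompt("A short joke about a pelican and a walrus"))
```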

Available models

gemini/gemini-3.5-flash
gemini/gemini-3.1-flash-lite
gemini/gemma-4-31b-it
gemini/gemma-4-26b-a4b-it
gemini/gemini-3.1-flash-lite-preview
gemini/gemini-3.1-pro-preview-customtools
gemini/gemini-3.1-pro-preview: Gemini 3.1 Pro Preview
gemini/gemini-3-flash-preview
gemini/gemini-3-pro-preview: Gemini 3 Pro Preview
gemini/gemini-2.5-flash-lite-preview-09-2025
gemini/gemini-2.5-flash-preview-09-2025
gemini/gemini-flash-lite-latest: Latest Gemini Flash Lite
gemini/gemini-flash-latest: Latest Gemini Flash
gemini/gemini-2.5-flash-lite: Gemini 2.5 Flash Lite
gemini/gemini-2.5-pro: Gemini 2.5 Pro
gemini/gemini-2.5-flash: Gemini 2.5 Flash
gemini/gemini-2.5-pro-preview-06-05
gemini/gemini-2.5-flash-preview-05-20: Gemini 2.5 Flash preview (priced differently from 2.5 Flash)
gemini/gemini-2.5-pro-preview-05-06
gemini/gemini-2.5-flash-preview-04-17
gemini/gemini-2.5-pro-preview-03-25
gemini/gemini-2.5-pro-exp-03-25
gemini/gemini-2.0-flash-lite
gemini/gemini-2.0-pro-exp-02-05
gemini/gemini-2.0-flash
gemini/gemini-2.0-flash-thinking-exp-01-21: Experimental "thinking" model from January 2025
gemini/gemini-2.0-flash-thinking-exp-1219
gemini/gemma-3n-e4b-it
gemini/gemma-3-27b-it
gemini/gemma-3-12b-it
gemini/gemma-3-4b-it
gemini/gemma-3-1b-it
gemini/learnlm-1.5-pro-experimental
gemini/gemini-2.0-flash-exp
gemini/gemini-exp-1206
gemini/gemini-exp-1121
gemini/gemini-exp-1114
gemini/gemini-1.5-flash-8b-001
gemini/gemini-1.5-flash-8b-latest: The least expensive model
gemini/gemini-1.5-flash-002
gemini/gemini-1.5-pro-002
gemini/gemini-1.5-flash-001
gemini/gemini-1.5-pro-001
gemini/gemini-1.5-flash-latest
gemini/gemini-1.5-pro-latest
gemini/gemini-pro

All of these models have aliases that omit the gemini/ prefix, for example:

llm -m gemini-1.5-flash-8b-latest --schema 'name,age int,bio' 'invent a dog'
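The alias rule is a plain prefix removal. As a small illustration in Python (the `alias_for` helper is a hypothetical name, not part of the plugin):

```python
def alias_for(model_id: str) -> str:
    """Return the short alias for a full model ID by dropping the gemini/ prefix."""
    return model_id.removeprefix("gemini/")  # str.removeprefix needs Python 3.9+


print(alias_for("gemini/gemini-1.5-flash-8b-latest"))  # gemini-1.5-flash-8b-latest
print(alias_for("gemini/gemma-3-27b-it"))  # gemma-3-27b-it
```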

Images, audio and video

Gemini models are multi-modal. You can provide images, audio or video files as input like this:

llm -m gemini-2.0-flash 'extract text' -a image.jpg

Or with a URL:

llm -m gemini-2.0-flash-lite 'describe image' \
  -a https://static.simonwillison.net/static/2024/pelicans.jpg

Audio works too:

llm -m gemini-2.0-flash 'transcribe audio' -a audio.mp3

And video:

llm -m gemini-2.0-flash 'describe what happens' -a video.mp4

The Gemini prompting guide includes extensive advice on multi-modal prompting.
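Attachments can be passed from Python as well. This is a sketch rather than a reference implementation: it assumes LLM and this plugin are installed, and `attachment_kwargs` and `prompt_with_attachment` are hypothetical helper names. The first helper only decides whether a source looks like a URL or a local path.

```python
def attachment_kwargs(source: str) -> dict:
    """Choose the llm.Attachment keyword for a source: url for http(s), else path."""
    if source.startswith(("http://", "https://")):
        return {"url": source}
    return {"path": source}


def prompt_with_attachment(prompt, source, model_id="gemini-2.0-flash"):
    """Run a prompt with one image, audio or video attachment and return the text."""
    import llm  # requires llm and llm-gemini to be installed

    model = llm.get_model(model_id)
    attachment = llm.Attachment(**attachment_kwargs(source))
    return model.prompt(prompt, attachments=[attachment]).text()


# Usage (makes a real API call, so it is left commented out):
# print(prompt_with_attachment("extract text", "image.jpg"))
```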

YouTube videos

You can provide YouTube video URLs as attachments as well:

llm -m gemini-3-pro-preview -a 'https://www.youtube.com/watch?v=9o1_DL9uNlM' \
  'Produce a summary with relevant URLs and code example snippets, then an accurate transcript with timestamps.'

Example output here.

These will be processed with media resolution low by default. You can use the -o media_resolution X option to set that to medium, high, or unspecified.

JSON output

Use -o json_object 1 to force the output to be JSON:

llm -m gemini-2.0-flash -o json_object 1 \
  '3 largest cities in California, list of {"name": "..."}'

Outputs:

{"cities": [{"name": "Los Angeles"}, {"name": "San Diego"}, {"name": "San Jose"}]}
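Because the output is valid JSON, it can be parsed directly. A short Python sketch using the example output above; note that the top-level "cities" key was chosen by the model for this prompt and may differ between runs:

```python
import json

# The example output shown above, captured as a string.
output = '{"cities": [{"name": "Los Angeles"}, {"name": "San Diego"}, {"name": "San Jose"}]}'

data = json.loads(output)
names = [city["name"] for city in data["cities"]]
print(names)  # ['Los Angeles', 'San Diego', 'San Jose']
```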

Code execution

Gemini models can write and execute code - they can decide to write Python code, execute it in a secure sandbox and use the result as part of their response.

To enable this feature, use -o code_execution 1:

llm -m gemini-2.0-flash -o code_execution 1 \
  'use python to calculate (factorial of 13) * 3'
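To check the model's answer to this prompt locally, the same arithmetic takes two lines of Python:

```python
import math

expected = math.factorial(13) * 3  # 13! is 6227020800
print(expected)  # 18681062400
```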

Google search

Some Gemini models support Grounding with Google Search, where the model can run a Google search and use the results as part of answering a prompt.

Using this feature may incur additional requirements in terms of how you use the results. Consult Google's documentation for more details.

To run a prompt with Google search enabled, use -o google_search 1:

llm -m gemini-2.0-flash -o google_search 1 \
  'What happened in Ireland today?'

Use llm logs -c --json after running a prompt to see the full JSON response, which includes additional information about grounded results.

URL context

Gemini models support a URL context tool which, when enabled, allows the models to fetch additional content from URLs as part of their execution.

You can enable that with the -o url_context 1 option - for example:

llm -m gemini-2.5-flash -o url_context 1 'Latest headline on simonwillison.net'

Extra tokens introduced by this tool will be charged as input tokens. Use --usage to see details of those:

llm -m gemini-2.5-flash -o url_context 1 --usage \
  'Latest headline on simonwillison.net'

Outputs:

The latest headline on simonwillison.net as of August 17, 2025, is "TIL: Running a gpt-oss eval suite against LM Studio on a Mac.".
Token usage: 9,613 input, 87 output, {"candidatesTokenCount": 57, "promptTokensDetails": [{"modality": "TEXT", "tokenCount": 10}], "toolUsePromptTokenCount": 9603, "toolUsePromptTokensDetails": [{"modality": "TEXT", "tokenCount": 9603}], "thoughtsTokenCount": 30}

The "toolUsePromptTokenCount" key shows how many tokens were used for that URL context.
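In the example above the totals can be reproduced from the detail keys: 10 prompt tokens plus 9,603 tool-use tokens gives 9,613 input tokens, and 57 candidate tokens plus 30 thought tokens gives 87 output tokens. The Python sketch below checks that arithmetic against this one example. Whether the totals are always derived this way is an assumption.

```python
import json

# The usage details printed by --usage in the example above.
details = json.loads(
    '{"candidatesTokenCount": 57, '
    '"promptTokensDetails": [{"modality": "TEXT", "tokenCount": 10}], '
    '"toolUsePromptTokenCount": 9603, '
    '"toolUsePromptTokensDetails": [{"modality": "TEXT", "tokenCount": 9603}], '
    '"thoughtsTokenCount": 30}'
)

prompt_tokens = sum(d["tokenCount"] for d in details["promptTokensDetails"])
input_tokens = prompt_tokens + details["toolUsePromptTokenCount"]
output_tokens = details["candidatesTokenCount"] + details["thoughtsTokenCount"]
print(input_tokens, output_tokens)  # 9613 87
```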

Chat

To chat interactively with the model, run llm chat:

llm chat -m gemini-2.0-flash

Timeouts

By default there is no timeout against the Gemini API. You can use the timeout option to protect against API requests that hang indefinitely.

With the CLI tool, set a 1.5 second timeout like this:

llm -m gemini-2.5-flash-preview-05-20 'epic saga about mice' -o timeout 1.5

In the Python library timeouts are used like this:

import httpx
import llm

model = llm.get_model("gemini/gemini-2.5-flash-preview-05-20")

try:
    response = model.prompt(
        "epic saga about mice", timeout=1.5
    )
    print(response.text())
except httpx.TimeoutException:
    print("Timeout exceeded")

An httpx.TimeoutException subclass will be raised if the timeout is exceeded.

Embeddings

The plugin also adds support for the gemini-embedding-exp-03-07 and text-embedding-004 embedding models.

Run an embedding model against a single string like this:

llm embed -m text-embedding-004 -c 'hello world'

This returns a JSON array of 768 numbers.

The gemini-embedding-exp-03-07 model is larger, returning 3072 numbers. You can also use variants of it that are truncated down to smaller sizes:

gemini-embedding-exp-03-07 - 3072 numbers
gemini-embedding-exp-03-07-2048 - 2048 numbers
gemini-embedding-exp-03-07-1024 - 1024 numbers
gemini-embedding-exp-03-07-512 - 512 numbers
gemini-embedding-exp-03-07-256 - 256 numbers
gemini-embedding-exp-03-07-128 - 128 numbers
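The naming pattern in the list above can be captured in a small Python helper. This is an illustrative sketch; `embedding_model_id` is a hypothetical name, and only the sizes listed above are accepted.

```python
BASE_MODEL = "gemini-embedding-exp-03-07"
FULL_SIZE = 3072
TRUNCATED_SIZES = (2048, 1024, 512, 256, 128)


def embedding_model_id(dimensions: int = FULL_SIZE) -> str:
    """Return the model ID that yields vectors of the requested length."""
    if dimensions == FULL_SIZE:
        return BASE_MODEL
    if dimensions in TRUNCATED_SIZES:
        return f"{BASE_MODEL}-{dimensions}"
    raise ValueError(f"No listed variant returns {dimensions} numbers")


print(embedding_model_id(128))  # gemini-embedding-exp-03-07-128
```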

This command will embed every README.md file in child directories of the current directory and store the results in a SQLite database called embed.db in a collection called readmes:

llm embed-multi readmes -d embed.db -m gemini-embedding-exp-03-07-128 \
  --files . '*/README.md'

You can then run similarity searches against that collection like this:

llm similar readmes -c 'upload csvs to stuff' -d embed.db

See the LLM embeddings documentation for further details.
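For intuition about what a similarity search compares, here is a minimal Python sketch of cosine similarity between two embedding vectors. It assumes cosine similarity is the scoring measure; the LLM embeddings documentation is the place to confirm how `llm similar` scores results.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


print(cosine_similarity([1.0, 0.0], [1.0, 0.0]))  # 1.0
print(cosine_similarity([1.0, 0.0], [0.0, 1.0]))  # 0.0
```

Vectors pointing in the same direction score close to 1 and unrelated (orthogonal) vectors score 0.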

Listing all Gemini API models

The llm gemini models command lists all of the models that are exposed by the Gemini API, some of which may not be available through this plugin.

llm gemini models

You can add a --key X option to use a different API key.

To filter models by their supported generation methods use --method one or more times:

llm gemini models --method embedContent

If you provide multiple methods you will see models that support any of them.
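The "any of them" behaviour amounts to a set intersection test. A Python sketch of that logic follows. The model entries are made-up sample data, and `filter_by_methods` is a hypothetical helper, not the plugin's own code.

```python
def filter_by_methods(models, methods):
    """Keep models that support at least one of the requested generation methods."""
    wanted = set(methods)
    return [
        model
        for model in models
        if wanted & set(model.get("supportedGenerationMethods", []))
    ]


# Made-up sample entries for illustration only.
sample = [
    {"name": "models/example-chat", "supportedGenerationMethods": ["generateContent", "countTokens"]},
    {"name": "models/example-embed", "supportedGenerationMethods": ["embedContent"]},
    {"name": "models/example-other", "supportedGenerationMethods": []},
]

matches = filter_by_methods(sample, ["embedContent", "countTokens"])
print([m["name"] for m in matches])  # ['models/example-chat', 'models/example-embed']
```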

Development

To set up this plugin locally, first check out the code, then run the tests with uv:

cd llm-gemini
uv run pytest

Run llm with the plugin like this:

uv run llm models -q gemini

This project uses pytest-recording to record Gemini API responses for the tests.

If you add a new test that calls the API you can capture the API response like this:

PYTEST_GEMINI_API_KEY="$(llm keys get gemini)" uv run pytest --record-mode once

You will need to have stored a valid Gemini API key using this command first:

llm keys set gemini
# Paste key here

Release history

0.32 - May 19, 2026 (this version)
0.32a0 (pre-release) - May 19, 2026
0.31 - May 7, 2026
0.30 - Apr 2, 2026
0.29 - Feb 19, 2026
0.28.2 - Dec 23, 2025
0.28.1 - Dec 18, 2025
0.28 - Dec 17, 2025
0.27 - Nov 18, 2025
0.26.1 - Oct 11, 2025
0.26 - Sep 25, 2025
0.25 - Aug 18, 2025
0.24 - Jul 22, 2025
0.23 - Jun 17, 2025
0.22 - Jun 5, 2025
0.21 - May 27, 2025
0.20 - May 20, 2025
0.20a2 (pre-release) - May 20, 2025
0.20a1 (pre-release) - May 16, 2025
0.20a0 (pre-release) - May 14, 2025
0.19.1 - May 8, 2025
0.19 - May 6, 2025
0.18.1 - Apr 18, 2025
0.18 - Apr 17, 2025
0.17 - Apr 4, 2025
0.16 - Mar 25, 2025
0.15 - Mar 12, 2025
0.14.1 - Mar 8, 2025
0.14 - Mar 7, 2025
0.13.1 - Mar 4, 2025
0.13 - Feb 28, 2025
0.13a0 (pre-release) - Feb 27, 2025
0.12 - Feb 25, 2025
0.11 - Feb 17, 2025
0.10 - Feb 5, 2025
0.9 - Jan 22, 2025
0.8 - Dec 19, 2024
0.7 - Dec 11, 2024
0.6 - Dec 6, 2024
0.5 - Dec 2, 2024
0.5a0 (pre-release) - Nov 20, 2024
0.4.2 - Nov 22, 2024
0.4.1 - Nov 18, 2024
0.4 - Nov 18, 2024
0.3 - Oct 29, 2024
0.3a0 (pre-release) - Oct 28, 2024
0.2 - Oct 3, 2024
0.1a5 (pre-release) - Sep 24, 2024
0.1a4 (pre-release) - May 14, 2024
0.1a3 (pre-release) - Apr 10, 2024
0.1a2 (pre-release) - Apr 10, 2024
0.1a1 (pre-release) - Mar 27, 2024
0.1a0 (pre-release) - Dec 13, 2023

Download files

Source Distribution

llm_gemini-0.32.tar.gz (23.3 kB), uploaded May 19, 2026, Source

Built Distribution

llm_gemini-0.32-py3-none-any.whl (17.8 kB), uploaded May 19, 2026, Python 3

File details

Details for the file llm_gemini-0.32.tar.gz.

File metadata

Download URL: llm_gemini-0.32.tar.gz
Upload date: May 19, 2026
Size: 23.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for llm_gemini-0.32.tar.gz
Algorithm	Hash digest
SHA256	`de51b03309590c36bbff42aa1aab66384f7549725c43a8cd6dbaa5dfb9b61007`
MD5	`41d6af3883809cf6bd49f356c4d0dace`
BLAKE2b-256	`6aadd2082db62a9301a962bed8b3e769d12755c7975a113811bd17996301ee02`
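
A downloaded file can be checked against the SHA256 digest above with Python's standard library. This is a minimal sketch; `sha256_of` is a hypothetical helper name, and the file is assumed to have been saved in the current directory under its original name.

```python
import hashlib

# SHA256 digest listed above for llm_gemini-0.32.tar.gz.
EXPECTED_SHA256 = "de51b03309590c36bbff42aa1aab66384f7549725c43a8cd6dbaa5dfb9b61007"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Usage, once the file has been downloaded:
# assert sha256_of("llm_gemini-0.32.tar.gz") == EXPECTED_SHA256
```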

Provenance

The following attestation bundles were made for llm_gemini-0.32.tar.gz:

Publisher: publish.yml on simonw/llm-gemini

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: llm_gemini-0.32.tar.gz
- Subject digest: de51b03309590c36bbff42aa1aab66384f7549725c43a8cd6dbaa5dfb9b61007
- Sigstore transparency entry: 1575731807
- Sigstore integration time: May 19, 2026
Source repository:
- Permalink: simonw/llm-gemini@5327fac695ddc5f0e2bda6a7edacfd2f56f89abf
- Branch / Tag: refs/tags/0.32
- Owner: https://github.com/simonw
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@5327fac695ddc5f0e2bda6a7edacfd2f56f89abf
- Trigger Event: release

File details

Details for the file llm_gemini-0.32-py3-none-any.whl.

File metadata

Download URL: llm_gemini-0.32-py3-none-any.whl
Upload date: May 19, 2026
Size: 17.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for llm_gemini-0.32-py3-none-any.whl
Algorithm	Hash digest
SHA256	`69d82aff597eda8eaca62bd0e805bbb9b06035c138dbb7874c1ad0e95ed32dac`
MD5	`edc2f79d2fd207cb6bf381b0d2ee73a9`
BLAKE2b-256	`3b6c80d01107144165d43089614ff2977bdb1ae8dcc9781e5ebeae7f6b1f1e7e`

Provenance

The following attestation bundles were made for llm_gemini-0.32-py3-none-any.whl:

Publisher: publish.yml on simonw/llm-gemini

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: llm_gemini-0.32-py3-none-any.whl
- Subject digest: 69d82aff597eda8eaca62bd0e805bbb9b06035c138dbb7874c1ad0e95ed32dac
- Sigstore transparency entry: 1575731846
- Sigstore integration time: May 19, 2026
Source repository:
- Permalink: simonw/llm-gemini@5327fac695ddc5f0e2bda6a7edacfd2f56f89abf
- Branch / Tag: refs/tags/0.32
- Owner: https://github.com/simonw
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@5327fac695ddc5f0e2bda6a7edacfd2f56f89abf
- Trigger Event: release
