Skip to main content

A CLI tool to list available LLM models from various providers

Project description

llm-models

Python License

A simple command-line tool to list available LLM models from various providers (OpenAI, Google, Anthropic, xAI, Baseten, OpenRouter).

Installation

Linux/macOS

$ pipx install llm-models

Windows (untested)

pip install llm-models

Usage

$ llm-models -h
usage: llm-models [-h] -p {OpenAI,Anthropic,xAI,GoogleAI,VertexAI,Baseten,OpenRouter}
                  [-r REGION] [-c]

List available LLM models from various providers

options:
  -h, --help            show this help message and exit
  -p {OpenAI,Anthropic,xAI,GoogleAI,VertexAI,Baseten,OpenRouter}, --provider {OpenAI,Anthropic,xAI,GoogleAI,VertexAI,Baseten,OpenRouter}
                        The LLM provider backend.
                        - 'GoogleAI': Google AI Studio (API Key). Global/Auto-routed.
                        - 'VertexAI': Google Cloud Vertex AI (IAM Auth). Region-specific.
  -r REGION, --region REGION
                        Google Cloud region (e.g., 'us-central1').
                        *Required* if provider is VertexAI. Ignored for other providers.
  -c, --check           Probe each model with a minimal 1-token request to report
                        live availability instead of just listing the catalog.
                        Anthropic only; consumes a tiny amount of credits per model.

The tool requires API keys set as environment variables:

  • OPENAI_API_KEY for OpenAI
  • GOOGLE_API_KEY for GoogleAI API, or GOOGLE_CLOUD_PROJECT for VertexAI API
  • ANTHROPIC_API_KEY for Anthropic
  • XAI_API_KEY for xAI
  • BASETEN_API_KEY for Baseten
  • OPENROUTER_API_KEY for OpenRouter (optional — listing is public)

Examples

List OpenAI models:

$ llm-models --provider OpenAI
Listing available OpenAI models...
================================================================================
Model: babbage-002
Model: chatgpt-4o-latest
Model: codex-mini-latest
...

List Google models using GoogleAI API:

$ llm-models -p GoogleAI
Listing available Google AI Studio models (auto-routed region)...
================================================================================
Model: models/embedding-gecko-001
Model: models/gemini-2.5-pro-preview-03-25
Model: models/gemini-2.5-flash
...

List Google models using Vertex AI API (with regional endpoint):

$ llm-models -p VertexAI -r us-central1
Listing available Vertex AI models (project: ZZZ, region: us-central1)...
================================================================================
Model: publishers/google/models/imageclassification-efficientnet
Model: publishers/google/models/occupancy-analytics
Model: publishers/google/models/multimodalembedding
...

List Anthropic models:

$ llm-models -p Anthropic
Listing available Anthropic models...
================================================================================
Model: claude-haiku-4-5-20251001 (Claude Haiku 4.5)
Model: claude-sonnet-4-5-20250929 (Claude Sonnet 4.5)
...

Listing vs. checking (--check)

A plain listing reads Anthropic's model catalog (/v1/models): the set of model IDs your API key is entitled to address. It is free and instant, but it is not a health check — a model can appear in the catalog while not actually being served to your key (still rolling out, gated by tier, or mid-incident).

--check answers the other question — can I get a completion from it right now? — by sending each model a minimal 1-token request and reporting the live result:

$ llm-models -p Anthropic --check
Checking live availability of Anthropic models...
================================================================================ claude-fable-5 (Claude Fable 5) - not found for this key
✓ claude-opus-4-8 (Claude Opus 4.8) - available
✓ claude-sonnet-4-6 (Claude Sonnet 4.6) - available
✓ claude-haiku-4-5-20251001 (Claude Haiku 4.5) - available
...

Status meanings:

Symbol Meaning
✓ available Served right now (HTTP 200).
✗ unavailable (overloaded) Real model, temporarily over capacity (HTTP 529) — retry later.
✗ not found for this key In the catalog but not serveable to your key (HTTP 404): not yet rolled out, or gated by tier/region.
✗ unauthorized API key rejected (HTTP 401).
⚠ rate-limited Throttled (HTTP 429) — couldn't determine availability.

The two views fail for different reasons and have different fixes: a 404 is a configuration/entitlement problem (retrying won't help), while a 529 is transient (retrying will). The catalog tells you whether it's worth trying; --check tells you whether trying works right now.

Note: --check spends a tiny amount of credits per model. If your account is out of credits, it prints a friendly message and exits instead of dumping a stack trace — plain listing still works, since it costs nothing.

List xAI models:

$ llm-models -p xAI
Listing available xAI models (NOTE: xAI uses aliases, so grok-4 is an acceptable API name, resolving to grok-4-0709 as of Nov. 2025)...
================================================================================
Model: grok-2-1212
Model: grok-2-vision-1212
Model: grok-3
...

List Baseten models:

$ llm-models -p Baseten
Listing available Baseten models...
================================================================================
Model: openai/gpt-oss-120b (OpenAI GPT 120B) context: 128,072
Model: deepseek-ai/DeepSeek-V3.1 (DeepSeek V3.1) context: 163,840
Model: zai-org/GLM-4.7 (GLM 4.7) context: 200,000
Model: moonshotai/Kimi-K2.5 (Kimi K2.5) context: 262,000
Model: MiniMaxAI/MiniMax-M2.5 (Minimax M2.5) context: 204,000
Model: zai-org/GLM-5 (GLM 5) context: 202,800
Model: nvidia/Nemotron-120B-A12B (Nemotron Super) context: 202,800
Model: moonshotai/Kimi-K2.6 (Kimi K2.6) context: 262,000

List OpenRouter models (sorted by id, with context window and input price; OPENROUTER_API_KEY is optional since the listing endpoint is public):

$ llm-models -p OpenRouter
Listing available OpenRouter models...
================================================================================
Model: ai21/jamba-large-1.7 (AI21: Jamba Large 1.7) context: 256,000 $0.000002/in-tok
Model: aion-labs/aion-1.0 (AionLabs: Aion-1.0) context: 131,072 $0.000004/in-tok
Model: allenai/olmo-3-32b-think (AllenAI: Olmo 3 32B Think) context: 65,536 $0.00000015/in-tok
...

Note: OpenRouter lists 300+ models, so expect long output.

Requirements

  • Python 3.10+
  • tested on Ubuntu 24.04 and macOS 26.1

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llm_models-0.8.0.tar.gz (8.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

llm_models-0.8.0-py3-none-any.whl (8.9 kB view details)

Uploaded Python 3

File details

Details for the file llm_models-0.8.0.tar.gz.

File metadata

  • Download URL: llm_models-0.8.0.tar.gz
  • Upload date:
  • Size: 8.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.4

File hashes

Hashes for llm_models-0.8.0.tar.gz
Algorithm Hash digest
SHA256 e6ae2022f6322013b77e323a57ab7d36450ebcd4a01c04f76d5b65f13c47963f
MD5 6656a366322aa08417d5ec33426479ec
BLAKE2b-256 78db42ed75601a7c3be0dde481e61f8c17a914468527b27caab815d4467c7e1b

See more details on using hashes here.

File details

Details for the file llm_models-0.8.0-py3-none-any.whl.

File metadata

File hashes

Hashes for llm_models-0.8.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6371159589c165c2c5f6cc70c0aa27d2075467c902bae3587e17a5be7349a9f0
MD5 624c25b713d621273f93b2ff34ca1651
BLAKE2b-256 b8a404238c63eeb4173e20fb149f5afee359231f8ea26bb7efe1685ca5410f82

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page