To calculate token and translated USD cost of string and message calls to OpenAI, for example when used by AI agents
Project description
Clientside token counting + price estimation for LLM apps and AI agents.
🐦 Twitter • 📢 Discord • 🖇️ AgentOps
TokenCostAuto
Tokencost helps calculate the USD cost of using major Large Language Model (LLMs) APIs by calculating the estimated cost of prompts and completions.
Building AI agents? Check out AgentOps
Features
- LLM Price Tracking Major LLM providers frequently add new models and update pricing. This repo helps track the latest price changes
- Token counting Accurately count prompt tokens before sending OpenAI requests
- Easy integration Get the cost of a prompt or completion with a single function
Example usage:
from tokencostauto import calculate_prompt_cost, calculate_completion_cost
model = "gpt-3.5-turbo"
prompt = [{ "role": "user", "content": "Hello world"}]
completion = "How may I assist you today?"
prompt_cost = calculate_prompt_cost(prompt, model)
completion_cost = calculate_completion_cost(completion, model)
print(f"{prompt_cost} + {completion_cost} = {prompt_cost + completion_cost}")
# 0.0000135 + 0.000014 = 0.0000275
Installation
Recommended: PyPI:
pip install tokencostauto
Usage
Cost estimates
Calculating the cost of prompts and completions from OpenAI requests
from openai import OpenAI
client = OpenAI()
model = "gpt-3.5-turbo"
prompt = [{ "role": "user", "content": "Say this is a test"}]
chat_completion = client.chat.completions.create(
messages=prompt, model=model
)
completion = chat_completion.choices[0].message.content
# "This is a test."
prompt_cost = calculate_prompt_cost(prompt, model)
completion_cost = calculate_completion_cost(completion, model)
print(f"{prompt_cost} + {completion_cost} = {prompt_cost + completion_cost}")
# 0.0000180 + 0.000010 = 0.0000280
Calculating cost using string prompts instead of messages:
from tokencostauto import calculate_prompt_cost
prompt_string = "Hello world"
response = "How may I assist you today?"
model= "gpt-3.5-turbo"
prompt_cost = calculate_prompt_cost(prompt_string, model)
print(f"Cost: ${prompt_cost}")
# Cost: $3e-06
Counting tokens
from tokencostauto import count_message_tokens, count_string_tokens
message_prompt = [{ "role": "user", "content": "Hello world"}]
# Counting tokens in prompts formatted as message lists
print(count_message_tokens(message_prompt, model="gpt-3.5-turbo"))
# 9
# Alternatively, counting tokens in string prompts
print(count_string_tokens(prompt="Hello world", model="gpt-3.5-turbo"))
# 2
How tokens are counted
Under the hood, strings and ChatML messages are tokenized using Tiktoken, OpenAI's official tokenizer. Tiktoken splits text into tokens (which can be parts of words or individual characters) and handles both raw strings and message formats with additional tokens for message formatting and roles.
For Anthropic models above version 3 (i.e. Sonnet 3.5, Haiku 3.5, and Opus 3), we use the Anthropic beta token counting API to ensure accurate token counts. For older Claude models, we approximate using Tiktoken with the cl100k_base encoding.
Cost table
Units denominated in USD. All prices can be located in model_prices.json.
- Prices last updated Jan 30, 2024 from LiteLLM's cost dictionary
| Model Name | Prompt Cost (USD) per 1M tokens | Completion Cost (USD) per 1M tokens | Max Prompt Tokens | Max Output Tokens |
|---|---|---|---|---|
| gpt-4 | $30 | $60 | 8192 | 4096 |
| gpt-4o | $2.5 | $10 | 128000 | 16384 |
| gpt-4o-audio-preview | $2.5 | $10 | 128000 | 16384 |
| gpt-4o-audio-preview-2024-10-01 | $2.5 | $10 | 128000 | 16384 |
| gpt-4o-mini | $0.15 | $0.6 | 128000 | 16384 |
| gpt-4o-mini-2024-07-18 | $0.15 | $0.6 | 128000 | 16384 |
| o1-mini | $1.1 | $4.4 | 128000 | 65536 |
| o1-mini-2024-09-12 | $3 | $12 | 128000 | 65536 |
| o1-preview | $15 | $60 | 128000 | 32768 |
| o1-preview-2024-09-12 | $15 | $60 | 128000 | 32768 |
| chatgpt-4o-latest | $5 | $15 | 128000 | 4096 |
| gpt-4o-2024-05-13 | $5 | $15 | 128000 | 4096 |
| gpt-4o-2024-08-06 | $2.5 | $10 | 128000 | 16384 |
| gpt-4-turbo-preview | $10 | $30 | 128000 | 4096 |
| gpt-4-0314 | $30 | $60 | 8192 | 4096 |
| gpt-4-0613 | $30 | $60 | 8192 | 4096 |
| gpt-4-32k | $60 | $120 | 32768 | 4096 |
| gpt-4-32k-0314 | $60 | $120 | 32768 | 4096 |
| gpt-4-32k-0613 | $60 | $120 | 32768 | 4096 |
| gpt-4-turbo | $10 | $30 | 128000 | 4096 |
| gpt-4-turbo-2024-04-09 | $10 | $30 | 128000 | 4096 |
| gpt-4-1106-preview | $10 | $30 | 128000 | 4096 |
| gpt-4-0125-preview | $10 | $30 | 128000 | 4096 |
| gpt-4-vision-preview | $10 | $30 | 128000 | 4096 |
| gpt-4-1106-vision-preview | $10 | $30 | 128000 | 4096 |
| gpt-3.5-turbo | $0.5 | $1.5 | 16385 | 4096 |
| gpt-3.5-turbo-0301 | $1.5 | $2 | 4097 | 4096 |
| gpt-3.5-turbo-0613 | $1.5 | $2 | 4097 | 4096 |
| gpt-3.5-turbo-1106 | $1 | $2 | 16385 | 4096 |
| gpt-3.5-turbo-0125 | $0.5 | $1.5 | 16385 | 4096 |
| gpt-3.5-turbo-16k | $3 | $4 | 16385 | 4096 |
| gpt-3.5-turbo-16k-0613 | $3 | $4 | 16385 | 4096 |
| ft:gpt-3.5-turbo | $3 | $6 | 16385 | 4096 |
| ft:gpt-3.5-turbo-0125 | $3 | $6 | 16385 | 4096 |
| ft:gpt-3.5-turbo-1106 | $3 | $6 | 16385 | 4096 |
| ft:gpt-3.5-turbo-0613 | $3 | $6 | 4096 | 4096 |
| ft:gpt-4-0613 | $30 | $60 | 8192 | 4096 |
| ft:gpt-4o-2024-08-06 | $3.75 | $15 | 128000 | 16384 |
| ft:gpt-4o-mini-2024-07-18 | $0.3 | $1.2 | 128000 | 16384 |
| ft:davinci-002 | $12 | $12 | 16384 | 4096 |
| ft:babbage-002 | $1.6 | $1.6 | 16384 | 4096 |
| text-embedding-3-large | $0.13 | $0 | 8191 | nan |
| text-embedding-3-small | $0.02 | $0 | 8191 | nan |
| text-embedding-ada-002 | $0.1 | $0 | 8191 | nan |
| text-embedding-ada-002-v2 | $0.1 | $0 | 8191 | nan |
| text-moderation-stable | $0 | $0 | 32768 | 0 |
| text-moderation-007 | $0 | $0 | 32768 | 0 |
| text-moderation-latest | $0 | $0 | 32768 | 0 |
| 256-x-256/dall-e-2 | -- | -- | nan | nan |
| 512-x-512/dall-e-2 | -- | -- | nan | nan |
| 1024-x-1024/dall-e-2 | -- | -- | nan | nan |
| hd/1024-x-1792/dall-e-3 | -- | -- | nan | nan |
| hd/1792-x-1024/dall-e-3 | -- | -- | nan | nan |
| hd/1024-x-1024/dall-e-3 | -- | -- | nan | nan |
| standard/1024-x-1792/dall-e-3 | -- | -- | nan | nan |
| standard/1792-x-1024/dall-e-3 | -- | -- | nan | nan |
| standard/1024-x-1024/dall-e-3 | -- | -- | nan | nan |
| whisper-1 | -- | -- | nan | nan |
| tts-1 | -- | -- | nan | nan |
| tts-1-hd | -- | -- | nan | nan |
| azure/tts-1 | -- | -- | nan | nan |
| azure/tts-1-hd | -- | -- | nan | nan |
| azure/whisper-1 | -- | -- | nan | nan |
| azure/o1-mini | $1.21 | $4.84 | 128000 | 65536 |
| azure/o1-mini-2024-09-12 | $1.1 | $4.4 | 128000 | 65536 |
| azure/o1-preview | $15 | $60 | 128000 | 32768 |
| azure/o1-preview-2024-09-12 | $15 | $60 | 128000 | 32768 |
| azure/gpt-4o | $2.5 | $10 | 128000 | 16384 |
| azure/gpt-4o-2024-08-06 | $2.5 | $10 | 128000 | 16384 |
| azure/gpt-4o-2024-05-13 | $5 | $15 | 128000 | 4096 |
| azure/global-standard/gpt-4o-2024-08-06 | $2.5 | $10 | 128000 | 16384 |
| azure/global-standard/gpt-4o-mini | $0.15 | $0.6 | 128000 | 16384 |
| azure/gpt-4o-mini | $0.16 | $0.66 | 128000 | 16384 |
| azure/gpt-4-turbo-2024-04-09 | $10 | $30 | 128000 | 4096 |
| azure/gpt-4-0125-preview | $10 | $30 | 128000 | 4096 |
| azure/gpt-4-1106-preview | $10 | $30 | 128000 | 4096 |
| azure/gpt-4-0613 | $30 | $60 | 8192 | 4096 |
| azure/gpt-4-32k-0613 | $60 | $120 | 32768 | 4096 |
| azure/gpt-4-32k | $60 | $120 | 32768 | 4096 |
| azure/gpt-4 | $30 | $60 | 8192 | 4096 |
| azure/gpt-4-turbo | $10 | $30 | 128000 | 4096 |
| azure/gpt-4-turbo-vision-preview | $10 | $30 | 128000 | 4096 |
| azure/gpt-35-turbo-16k-0613 | $3 | $4 | 16385 | 4096 |
| azure/gpt-35-turbo-1106 | $1 | $2 | 16384 | 4096 |
| azure/gpt-35-turbo-0613 | $1.5 | $2 | 4097 | 4096 |
| azure/gpt-35-turbo-0301 | $0.2 | $2 | 4097 | 4096 |
| azure/gpt-35-turbo-0125 | $0.5 | $1.5 | 16384 | 4096 |
| azure/gpt-35-turbo-16k | $3 | $4 | 16385 | 4096 |
| azure/gpt-35-turbo | $0.5 | $1.5 | 4097 | 4096 |
| azure/gpt-3.5-turbo-instruct-0914 | $1.5 | $2 | 4097 | nan |
| azure/gpt-35-turbo-instruct | $1.5 | $2 | 4097 | nan |
| azure/gpt-35-turbo-instruct-0914 | $1.5 | $2 | 4097 | nan |
| azure/mistral-large-latest | $8 | $24 | 32000 | nan |
| azure/mistral-large-2402 | $8 | $24 | 32000 | nan |
| azure/command-r-plus | $3 | $15 | 128000 | 4096 |
| azure/ada | $0.1 | $0 | 8191 | nan |
| azure/text-embedding-ada-002 | $0.1 | $0 | 8191 | nan |
| azure/text-embedding-3-large | $0.13 | $0 | 8191 | nan |
| azure/text-embedding-3-small | $0.02 | $0 | 8191 | nan |
| azure/standard/1024-x-1024/dall-e-3 | -- | $0 | nan | nan |
| azure/hd/1024-x-1024/dall-e-3 | -- | $0 | nan | nan |
| azure/standard/1024-x-1792/dall-e-3 | -- | $0 | nan | nan |
| azure/standard/1792-x-1024/dall-e-3 | -- | $0 | nan | nan |
| azure/hd/1024-x-1792/dall-e-3 | -- | $0 | nan | nan |
| azure/hd/1792-x-1024/dall-e-3 | -- | $0 | nan | nan |
| azure/standard/1024-x-1024/dall-e-2 | -- | $0 | nan | nan |
| azure_ai/jamba-instruct | $0.5 | $0.7 | 70000 | 4096 |
| azure_ai/mistral-large | $4 | $12 | 32000 | 8191 |
| azure_ai/mistral-small | $1 | $3 | 32000 | 8191 |
| azure_ai/Meta-Llama-3-70B-Instruct | $1.1 | $0.37 | 8192 | 2048 |
| azure_ai/Meta-Llama-3.1-8B-Instruct | $0.3 | $0.61 | 128000 | 2048 |
| azure_ai/Meta-Llama-3.1-70B-Instruct | $2.68 | $3.54 | 128000 | 2048 |
| azure_ai/Meta-Llama-3.1-405B-Instruct | $5.33 | $16 | 128000 | 2048 |
| azure_ai/cohere-rerank-v3-multilingual | $0 | $0 | 4096 | 4096 |
| azure_ai/cohere-rerank-v3-english | $0 | $0 | 4096 | 4096 |
| azure_ai/Cohere-embed-v3-english | $0.1 | $0 | 512 | nan |
| azure_ai/Cohere-embed-v3-multilingual | $0.1 | $0 | 512 | nan |
| babbage-002 | $0.4 | $0.4 | 16384 | 4096 |
| davinci-002 | $2 | $2 | 16384 | 4096 |
| gpt-3.5-turbo-instruct | $1.5 | $2 | 8192 | 4096 |
| gpt-3.5-turbo-instruct-0914 | $1.5 | $2 | 8192 | 4097 |
| claude-instant-1 | $1.63 | $5.51 | 100000 | 8191 |
| mistral/mistral-tiny | $0.25 | $0.25 | 32000 | 8191 |
| mistral/mistral-small | $0.1 | $0.3 | 32000 | 8191 |
| mistral/mistral-small-latest | $0.06 | $0.18 | 131072 | 131072 |
| mistral/mistral-medium | $2.7 | $8.1 | 32000 | 8191 |
| mistral/mistral-medium-latest | $0.4 | $2 | 131072 | 131072 |
| mistral/mistral-medium-2312 | $2.7 | $8.1 | 32000 | 8191 |
| mistral/mistral-large-latest | $0.5 | $1.5 | 262144 | 262144 |
| mistral/mistral-large-2402 | $4 | $12 | 32000 | 8191 |
| mistral/mistral-large-2407 | $3 | $9 | 128000 | 128000 |
| mistral/pixtral-12b-2409 | $0.15 | $0.15 | 128000 | 128000 |
| mistral/open-mistral-7b | $0.25 | $0.25 | 32000 | 8191 |
| mistral/open-mixtral-8x7b | $0.7 | $0.7 | 32000 | 8191 |
| mistral/open-mixtral-8x22b | $2 | $6 | 65336 | 8191 |
| mistral/codestral-latest | $1 | $3 | 32000 | 8191 |
| mistral/codestral-2405 | $1 | $3 | 32000 | 8191 |
| mistral/open-mistral-nemo | $0.3 | $0.3 | 128000 | 128000 |
| mistral/open-mistral-nemo-2407 | $0.3 | $0.3 | 128000 | 128000 |
| mistral/open-codestral-mamba | $0.25 | $0.25 | 256000 | 256000 |
| mistral/codestral-mamba-latest | $0.25 | $0.25 | 256000 | 256000 |
| mistral/mistral-embed | $0.1 | -- | 8192 | nan |
| deepseek-chat | $0.28 | $0.42 | 131072 | 8192 |
| codestral/codestral-latest | $0 | $0 | 32000 | 8191 |
| codestral/codestral-2405 | $0 | $0 | 32000 | 8191 |
| text-completion-codestral/codestral-latest | $0 | $0 | 32000 | 8191 |
| text-completion-codestral/codestral-2405 | $0 | $0 | 32000 | 8191 |
| deepseek-coder | $0.14 | $0.28 | 128000 | 4096 |
| groq/llama2-70b-4096 | $0.7 | $0.8 | 4096 | 4096 |
| groq/llama3-8b-8192 | $0.05 | $0.08 | 8192 | 8192 |
| groq/llama3-70b-8192 | $0.59 | $0.79 | 8192 | 8192 |
| groq/llama-3.1-8b-instant | $0.05 | $0.08 | 128000 | 8192 |
| groq/llama-3.1-70b-versatile | $0.59 | $0.79 | 8192 | 8192 |
| groq/llama-3.1-405b-reasoning | $0.59 | $0.79 | 8192 | 8192 |
| groq/mixtral-8x7b-32768 | $0.24 | $0.24 | 32768 | 32768 |
| groq/gemma-7b-it | $0.05 | $0.08 | 8192 | 8192 |
| groq/gemma2-9b-it | $0.2 | $0.2 | 8192 | 8192 |
| groq/llama3-groq-70b-8192-tool-use-preview | $0.89 | $0.89 | 8192 | 8192 |
| groq/llama3-groq-8b-8192-tool-use-preview | $0.19 | $0.19 | 8192 | 8192 |
| cerebras/llama3.1-8b | $0.1 | $0.1 | 128000 | 128000 |
| cerebras/llama3.1-70b | $0.6 | $0.6 | 128000 | 128000 |
| friendliai/mixtral-8x7b-instruct-v0-1 | $0.4 | $0.4 | 32768 | 32768 |
| friendliai/meta-llama-3-8b-instruct | $0.1 | $0.1 | 8192 | 8192 |
| friendliai/meta-llama-3-70b-instruct | $0.8 | $0.8 | 8192 | 8192 |
| claude-instant-1.2 | $0.16 | $0.55 | 100000 | 8191 |
| claude-2 | $8 | $24 | 100000 | 8191 |
| claude-2.1 | $8 | $24 | 200000 | 8191 |
| claude-3-haiku-20240307 | $0.25 | $1.25 | 200000 | 4096 |
| claude-3-haiku-latest | $0.25 | $1.25 | 200000 | 4096 |
| claude-3-opus-20240229 | $15 | $75 | 200000 | 4096 |
| claude-3-opus-latest | $15 | $75 | 200000 | 4096 |
| claude-3-sonnet-20240229 | $3 | $15 | 200000 | 4096 |
| claude-3-5-sonnet-20240620 | $3 | $15 | 200000 | 8192 |
| claude-3-5-sonnet-20241022 | $3 | $15 | 200000 | 8192 |
| claude-3-5-sonnet-latest | $3 | $15 | 200000 | 8192 |
| text-bison | -- | -- | 8192 | 2048 |
| text-bison@001 | -- | -- | 8192 | 1024 |
| text-bison@002 | -- | -- | 8192 | 1024 |
| text-bison32k | $0.12 | $0.12 | 8192 | 1024 |
| text-bison32k@002 | $0.12 | $0.12 | 8192 | 1024 |
| text-unicorn | $10 | $28 | 8192 | 1024 |
| text-unicorn@001 | $10 | $28 | 8192 | 1024 |
| chat-bison | $0.12 | $0.12 | 8192 | 4096 |
| chat-bison@001 | $0.12 | $0.12 | 8192 | 4096 |
| chat-bison@002 | $0.12 | $0.12 | 8192 | 4096 |
| chat-bison-32k | $0.12 | $0.12 | 32000 | 8192 |
| chat-bison-32k@002 | $0.12 | $0.12 | 32000 | 8192 |
| code-bison | $0.12 | $0.12 | 6144 | 1024 |
| code-bison@001 | $0.12 | $0.12 | 6144 | 1024 |
| code-bison@002 | $0.12 | $0.12 | 6144 | 1024 |
| code-bison32k | $0.12 | $0.12 | 6144 | 1024 |
| code-bison-32k@002 | $0.12 | $0.12 | 6144 | 1024 |
| code-gecko@001 | $0.12 | $0.12 | 2048 | 64 |
| code-gecko@002 | $0.12 | $0.12 | 2048 | 64 |
| code-gecko | $0.12 | $0.12 | 2048 | 64 |
| code-gecko-latest | $0.12 | $0.12 | 2048 | 64 |
| codechat-bison@latest | $0.12 | $0.12 | 6144 | 1024 |
| codechat-bison | $0.12 | $0.12 | 6144 | 1024 |
| codechat-bison@001 | $0.12 | $0.12 | 6144 | 1024 |
| codechat-bison@002 | $0.12 | $0.12 | 6144 | 1024 |
| codechat-bison-32k | $0.12 | $0.12 | 32000 | 8192 |
| codechat-bison-32k@002 | $0.12 | $0.12 | 32000 | 8192 |
| gemini-pro | $0.5 | $1.5 | 32760 | 8192 |
| gemini-1.0-pro | $0.5 | $1.5 | 32760 | 8192 |
| gemini-1.0-pro-001 | $0.5 | $1.5 | 32760 | 8192 |
| gemini-1.0-ultra | $0.5 | $1.5 | 8192 | 2048 |
| gemini-1.0-ultra-001 | $0.5 | $1.5 | 8192 | 2048 |
| gemini-1.0-pro-002 | $0.5 | $1.5 | 32760 | 8192 |
| gemini-1.5-pro | $1.25 | $5 | 2.09715e+06 | 8192 |
| gemini-1.5-pro-002 | $1.25 | $5 | 2.09715e+06 | 8192 |
| gemini-1.5-pro-001 | $1.25 | $5 | 1e+06 | 8192 |
| gemini-1.5-pro-preview-0514 | $0.08 | $0.31 | 1e+06 | 8192 |
| gemini-1.5-pro-preview-0215 | $0.08 | $0.31 | 1e+06 | 8192 |
| gemini-1.5-pro-preview-0409 | $0.08 | $0.31 | 1e+06 | 8192 |
| gemini-1.5-flash | $0.08 | $0.3 | 1e+06 | 8192 |
| gemini-1.5-flash-exp-0827 | $0 | $0 | 1e+06 | 8192 |
| gemini-1.5-flash-002 | $0.08 | $0.3 | 1.04858e+06 | 8192 |
| gemini-1.5-flash-001 | $0.08 | $0.3 | 1e+06 | 8192 |
| gemini-1.5-flash-preview-0514 | $0.08 | $0 | 1e+06 | 8192 |
| gemini-pro-experimental | $0 | $0 | 1e+06 | 8192 |
| gemini-flash-experimental | $0 | $0 | 1e+06 | 8192 |
| gemini-pro-vision | $0.5 | $1.5 | 16384 | 2048 |
| gemini-1.0-pro-vision | $0.5 | $1.5 | 16384 | 2048 |
| gemini-1.0-pro-vision-001 | $0.5 | $1.5 | 16384 | 2048 |
| medlm-medium | -- | -- | 32768 | 8192 |
| medlm-large | -- | -- | 8192 | 1024 |
| vertex_ai/claude-3-sonnet@20240229 | $3 | $15 | 200000 | 4096 |
| vertex_ai/claude-3-5-sonnet@20240620 | $3 | $15 | 200000 | 8192 |
| vertex_ai/claude-3-5-sonnet-v2@20241022 | $3 | $15 | 200000 | 8192 |
| vertex_ai/claude-3-haiku@20240307 | $0.25 | $1.25 | 200000 | 4096 |
| vertex_ai/claude-3-opus@20240229 | $15 | $75 | 200000 | 4096 |
| vertex_ai/meta/llama3-405b-instruct-maas | $0 | $0 | 32000 | 32000 |
| vertex_ai/meta/llama3-70b-instruct-maas | $0 | $0 | 32000 | 32000 |
| vertex_ai/meta/llama3-8b-instruct-maas | $0 | $0 | 32000 | 32000 |
| vertex_ai/meta/llama-3.2-90b-vision-instruct-maas | $0 | $0 | 128000 | 2048 |
| vertex_ai/mistral-large@latest | $2 | $6 | 128000 | 8191 |
| vertex_ai/mistral-large@2407 | $2 | $6 | 128000 | 8191 |
| vertex_ai/mistral-nemo@latest | $0.15 | $0.15 | 128000 | 128000 |
| vertex_ai/jamba-1.5-mini@001 | $0.2 | $0.4 | 256000 | 256000 |
| vertex_ai/jamba-1.5-large@001 | $2 | $8 | 256000 | 256000 |
| vertex_ai/jamba-1.5 | $0.2 | $0.4 | 256000 | 256000 |
| vertex_ai/jamba-1.5-mini | $0.2 | $0.4 | 256000 | 256000 |
| vertex_ai/jamba-1.5-large | $2 | $8 | 256000 | 256000 |
| vertex_ai/mistral-nemo@2407 | $3 | $3 | 128000 | 128000 |
| vertex_ai/codestral@latest | $0.2 | $0.6 | 128000 | 128000 |
| vertex_ai/codestral@2405 | $0.2 | $0.6 | 128000 | 128000 |
| vertex_ai/imagegeneration@006 | -- | -- | nan | nan |
| vertex_ai/imagen-3.0-generate-001 | -- | -- | nan | nan |
| vertex_ai/imagen-3.0-fast-generate-001 | -- | -- | nan | nan |
| text-embedding-004 | $0.1 | $0 | 2048 | nan |
| text-multilingual-embedding-002 | $0.1 | $0 | 2048 | nan |
| textembedding-gecko | $0.1 | $0 | 3072 | nan |
| textembedding-gecko-multilingual | $0.1 | $0 | 3072 | nan |
| textembedding-gecko-multilingual@001 | $0.1 | $0 | 3072 | nan |
| textembedding-gecko@001 | $0.1 | $0 | 3072 | nan |
| textembedding-gecko@003 | $0.1 | $0 | 3072 | nan |
| text-embedding-preview-0409 | $0.01 | $0 | 3072 | nan |
| text-multilingual-embedding-preview-0409 | $0.01 | $0 | 3072 | nan |
| palm/chat-bison | $0.12 | $0.12 | 8192 | 4096 |
| palm/chat-bison-001 | $0.12 | $0.12 | 8192 | 4096 |
| palm/text-bison | $0.12 | $0.12 | 8192 | 1024 |
| palm/text-bison-001 | $0.12 | $0.12 | 8192 | 1024 |
| palm/text-bison-safety-off | $0.12 | $0.12 | 8192 | 1024 |
| palm/text-bison-safety-recitation-off | $0.12 | $0.12 | 8192 | 1024 |
| gemini/gemini-1.5-flash-002 | $0.08 | $0.3 | 1.04858e+06 | 8192 |
| gemini/gemini-1.5-flash-001 | $0.08 | $0.3 | 1.04858e+06 | 8192 |
| gemini/gemini-1.5-flash | $0.08 | $0 | 8192 | nan |
| gemini/gemini-1.5-flash-latest | $0.08 | $0.3 | 1.04858e+06 | 8192 |
| gemini/gemini-1.5-flash-8b-exp-0924 | $0 | $0 | 1.04858e+06 | 8192 |
| gemini/gemini-1.5-flash-exp-0827 | $0 | $0 | 1.04858e+06 | 8192 |
| gemini/gemini-1.5-flash-8b-exp-0827 | $0 | $0 | 1e+06 | 8192 |
| gemini/gemini-pro | $0.35 | $1.05 | 32760 | 8192 |
| gemini/gemini-1.5-pro | $3.5 | $10.5 | 2.09715e+06 | 8192 |
| gemini/gemini-1.5-pro-002 | $3.5 | $10.5 | 2.09715e+06 | 8192 |
| gemini/gemini-1.5-pro-001 | $3.5 | $10.5 | 2.09715e+06 | 8192 |
| gemini/gemini-1.5-pro-exp-0801 | $3.5 | $10.5 | 2.09715e+06 | 8192 |
| gemini/gemini-1.5-pro-exp-0827 | $0 | $0 | 2.09715e+06 | 8192 |
| gemini/gemini-1.5-pro-latest | $3.5 | $1.05 | 1.04858e+06 | 8192 |
| gemini/gemini-pro-vision | $0.35 | $1.05 | 30720 | 2048 |
| gemini/gemini-gemma-2-27b-it | $0.35 | $1.05 | nan | 8192 |
| gemini/gemini-gemma-2-9b-it | $0.35 | $1.05 | nan | 8192 |
| command-r | $0.15 | $0.6 | 128000 | 4096 |
| command-r-08-2024 | $0.15 | $0.6 | 128000 | 4096 |
| command-light | $0.3 | $0.6 | 4096 | 4096 |
| command-r-plus | $2.5 | $10 | 128000 | 4096 |
| command-r-plus-08-2024 | $2.5 | $10 | 128000 | 4096 |
| command-nightly | $1 | $2 | 4096 | 4096 |
| command | $1 | $2 | 4096 | 4096 |
| rerank-english-v3.0 | $0 | $0 | 4096 | 4096 |
| rerank-multilingual-v3.0 | $0 | $0 | 4096 | 4096 |
| rerank-english-v2.0 | $0 | $0 | 4096 | 4096 |
| rerank-multilingual-v2.0 | $0 | $0 | 4096 | 4096 |
| embed-english-v3.0 | $0.1 | $0 | 1024 | nan |
| embed-english-light-v3.0 | $0.1 | $0 | 1024 | nan |
| embed-multilingual-v3.0 | $0.1 | $0 | 1024 | nan |
| embed-english-v2.0 | $0.1 | $0 | 4096 | nan |
| embed-english-light-v2.0 | $0.1 | $0 | 1024 | nan |
| embed-multilingual-v2.0 | $0.1 | $0 | 768 | nan |
| replicate/meta/llama-2-13b | $0.1 | $0.5 | 4096 | 4096 |
| replicate/meta/llama-2-13b-chat | $0.1 | $0.5 | 4096 | 4096 |
| replicate/meta/llama-2-70b | $0.65 | $2.75 | 4096 | 4096 |
| replicate/meta/llama-2-70b-chat | $0.65 | $2.75 | 4096 | 4096 |
| replicate/meta/llama-2-7b | $0.05 | $0.25 | 4096 | 4096 |
| replicate/meta/llama-2-7b-chat | $0.05 | $0.25 | 4096 | 4096 |
| replicate/meta/llama-3-70b | $0.65 | $2.75 | 8192 | 8192 |
| replicate/meta/llama-3-70b-instruct | $0.65 | $2.75 | 8192 | 8192 |
| replicate/meta/llama-3-8b | $0.05 | $0.25 | 8086 | 8086 |
| replicate/meta/llama-3-8b-instruct | $0.05 | $0.25 | 8086 | 8086 |
| replicate/mistralai/mistral-7b-v0.1 | $0.05 | $0.25 | 4096 | 4096 |
| replicate/mistralai/mistral-7b-instruct-v0.2 | $0.05 | $0.25 | 4096 | 4096 |
| replicate/mistralai/mixtral-8x7b-instruct-v0.1 | $0.3 | $1 | 4096 | 4096 |
| openrouter/deepseek/deepseek-coder | $0.14 | $0.28 | 66000 | 4096 |
| openrouter/microsoft/wizardlm-2-8x22b:nitro | $1 | $1 | nan | nan |
| openrouter/google/gemini-pro-1.5 | $2.5 | $7.5 | 1e+06 | 8192 |
| openrouter/mistralai/mixtral-8x22b-instruct | $0.65 | $0.65 | 65536 | 65536 |
| openrouter/cohere/command-r-plus | $3 | $15 | nan | nan |
| openrouter/databricks/dbrx-instruct | $0.6 | $0.6 | nan | nan |
| openrouter/anthropic/claude-3-haiku | $0.25 | $1.25 | 200000 | 4096 |
| openrouter/anthropic/claude-3-haiku-20240307 | $0.25 | $1.25 | 200000 | 4096 |
| anthropic/claude-3-5-sonnet-20241022 | $3 | $15 | 200000 | 8192 |
| anthropic/claude-3-5-sonnet-latest | $3 | $15 | 200000 | 8192 |
| openrouter/anthropic/claude-3.5-sonnet | $3 | $15 | 200000 | 8192 |
| openrouter/anthropic/claude-3.5-sonnet:beta | $3 | $15 | 200000 | 8192 |
| openrouter/anthropic/claude-3-sonnet | $3 | $15 | nan | nan |
| openrouter/mistralai/mistral-large | $8 | $24 | 128000 | 8191 |
| openrouter/cognitivecomputations/dolphin-mixtral-8x7b | $0.5 | $0.5 | nan | nan |
| openrouter/google/gemini-pro-vision | $0.12 | $0.38 | nan | nan |
| openrouter/fireworks/firellava-13b | $0.2 | $0.2 | nan | nan |
| openrouter/meta-llama/llama-3-8b-instruct:free | $0 | $0 | nan | nan |
| openrouter/meta-llama/llama-3-8b-instruct:extended | $0.22 | $2.25 | nan | nan |
| openrouter/meta-llama/llama-3-70b-instruct:nitro | $0.9 | $0.9 | nan | nan |
| openrouter/meta-llama/llama-3-70b-instruct | $0.59 | $0.79 | 8192 | 8000 |
| openrouter/openai/o1-mini | $3 | $12 | 128000 | 65536 |
| openrouter/openai/o1-mini-2024-09-12 | $3 | $12 | 128000 | 65536 |
| openrouter/openai/o1-preview | $15 | $60 | 128000 | 32768 |
| openrouter/openai/o1-preview-2024-09-12 | $15 | $60 | 128000 | 32768 |
| openrouter/openai/gpt-4o | $2.5 | $10 | 128000 | 4096 |
| openrouter/openai/gpt-4o-2024-05-13 | $5 | $15 | 128000 | 4096 |
| openrouter/openai/gpt-4-vision-preview | $10 | $30 | nan | nan |
| openrouter/openai/gpt-3.5-turbo | $1.5 | $2 | 16385 | 4096 |
| openrouter/openai/gpt-3.5-turbo-16k | $3 | $4 | 16385 | 4096 |
| openrouter/openai/gpt-4 | $30 | $60 | 8191 | 4096 |
| openrouter/anthropic/claude-instant-v1 | $1.63 | $5.51 | nan | 8191 |
| openrouter/anthropic/claude-2 | $11.02 | $32.68 | nan | 8191 |
| openrouter/anthropic/claude-3-opus | $15 | $75 | 200000 | 4096 |
| openrouter/google/palm-2-chat-bison | $0.5 | $0.5 | nan | nan |
| openrouter/google/palm-2-codechat-bison | $0.5 | $0.5 | nan | nan |
| openrouter/meta-llama/llama-2-13b-chat | $0.2 | $0.2 | nan | nan |
| openrouter/meta-llama/llama-2-70b-chat | $1.5 | $1.5 | nan | nan |
| openrouter/meta-llama/codellama-34b-instruct | $0.5 | $0.5 | nan | nan |
| openrouter/nousresearch/nous-hermes-llama2-13b | $0.2 | $0.2 | nan | nan |
| openrouter/mancer/weaver | $5.62 | $5.62 | 8000 | 2000 |
| openrouter/gryphe/mythomax-l2-13b | $1.88 | $1.88 | nan | nan |
| openrouter/jondurbin/airoboros-l2-70b-2.1 | $13.88 | $13.88 | nan | nan |
| openrouter/undi95/remm-slerp-l2-13b | $1.88 | $1.88 | 6144 | 4096 |
| openrouter/pygmalionai/mythalion-13b | $1.88 | $1.88 | nan | nan |
| openrouter/mistralai/mistral-7b-instruct | $0.13 | $0.13 | 32768 | 8191 |
| openrouter/mistralai/mistral-7b-instruct:free | $0 | $0 | nan | nan |
| j2-ultra | $15 | $15 | 8192 | 8192 |
| jamba-1.5-mini@001 | $0.2 | $0.4 | 256000 | 256000 |
| jamba-1.5-large@001 | $2 | $8 | 256000 | 256000 |
| jamba-1.5 | $0.2 | $0.4 | 256000 | 256000 |
| jamba-1.5-mini | $0.2 | $0.4 | 256000 | 256000 |
| jamba-1.5-large | $2 | $8 | 256000 | 256000 |
| j2-mid | $10 | $10 | 8192 | 8192 |
| j2-light | $3 | $3 | 8192 | 8192 |
| dolphin | $0.5 | $0.5 | 16384 | 16384 |
| chatdolphin | $0.5 | $0.5 | 16384 | 16384 |
| luminous-base | $30 | $33 | nan | nan |
| luminous-base-control | $37.5 | $41.25 | nan | nan |
| luminous-extended | $45 | $49.5 | nan | nan |
| luminous-extended-control | $56.25 | $61.88 | nan | nan |
| luminous-supreme | $175 | $192.5 | nan | nan |
| luminous-supreme-control | $218.75 | $240.62 | nan | nan |
| ai21.j2-mid-v1 | $12.5 | $12.5 | 8191 | 8191 |
| ai21.j2-ultra-v1 | $18.8 | $18.8 | 8191 | 8191 |
| ai21.jamba-instruct-v1:0 | $0.5 | $0.7 | 70000 | 4096 |
| amazon.titan-text-lite-v1 | $0.3 | $0.4 | 42000 | 4000 |
| amazon.titan-text-express-v1 | $1.3 | $1.7 | 42000 | 8000 |
| amazon.titan-text-premier-v1:0 | $0.5 | $1.5 | 42000 | 32000 |
| amazon.titan-embed-text-v1 | $0.1 | $0 | 8192 | nan |
| amazon.titan-embed-text-v2:0 | $0.2 | $0 | 8192 | nan |
| mistral.mistral-7b-instruct-v0:2 | $0.15 | $0.2 | 32000 | 8191 |
| mistral.mixtral-8x7b-instruct-v0:1 | $0.45 | $0.7 | 32000 | 8191 |
| mistral.mistral-large-2402-v1:0 | $8 | $24 | 32000 | 8191 |
| mistral.mistral-large-2407-v1:0 | $3 | $9 | 128000 | 8191 |
| mistral.mistral-small-2402-v1:0 | $1 | $3 | 32000 | 8191 |
| bedrock/us-west-2/mistral.mixtral-8x7b-instruct-v0:1 | $0.45 | $0.7 | 32000 | 8191 |
| bedrock/us-east-1/mistral.mixtral-8x7b-instruct-v0:1 | $0.45 | $0.7 | 32000 | 8191 |
| bedrock/eu-west-3/mistral.mixtral-8x7b-instruct-v0:1 | $0.59 | $0.91 | 32000 | 8191 |
| bedrock/us-west-2/mistral.mistral-7b-instruct-v0:2 | $0.15 | $0.2 | 32000 | 8191 |
| bedrock/us-east-1/mistral.mistral-7b-instruct-v0:2 | $0.15 | $0.2 | 32000 | 8191 |
| bedrock/eu-west-3/mistral.mistral-7b-instruct-v0:2 | $0.2 | $0.26 | 32000 | 8191 |
| bedrock/us-east-1/mistral.mistral-large-2402-v1:0 | $8 | $24 | 32000 | 8191 |
| bedrock/us-west-2/mistral.mistral-large-2402-v1:0 | $8 | $24 | 32000 | 8191 |
| bedrock/eu-west-3/mistral.mistral-large-2402-v1:0 | $10.4 | $31.2 | 32000 | 8191 |
| anthropic.claude-3-sonnet-20240229-v1:0 | $3 | $15 | 200000 | 4096 |
| anthropic.claude-3-5-sonnet-20240620-v1:0 | $3 | $15 | 1e+06 | 4096 |
| anthropic.claude-3-5-sonnet-20241022-v2:0 | $3 | $15 | 1e+06 | 8192 |
| anthropic.claude-3-5-sonnet-latest-v2:0 | $3 | $15 | 200000 | 4096 |
| anthropic.claude-3-haiku-20240307-v1:0 | $0.25 | $1.25 | 200000 | 4096 |
| anthropic.claude-3-opus-20240229-v1:0 | $15 | $75 | 200000 | 4096 |
| us.anthropic.claude-3-sonnet-20240229-v1:0 | $3 | $15 | 200000 | 4096 |
| us.anthropic.claude-3-5-sonnet-20240620-v1:0 | $3 | $15 | 200000 | 4096 |
| us.anthropic.claude-3-5-sonnet-20241022-v2:0 | $3 | $15 | 200000 | 8192 |
| us.anthropic.claude-3-haiku-20240307-v1:0 | $0.25 | $1.25 | 200000 | 4096 |
| us.anthropic.claude-3-opus-20240229-v1:0 | $15 | $75 | 200000 | 4096 |
| eu.anthropic.claude-3-sonnet-20240229-v1:0 | $3 | $15 | 200000 | 4096 |
| eu.anthropic.claude-3-5-sonnet-20240620-v1:0 | $3 | $15 | 200000 | 4096 |
| eu.anthropic.claude-3-5-sonnet-20241022-v2:0 | $3 | $15 | 200000 | 8192 |
| eu.anthropic.claude-3-haiku-20240307-v1:0 | $0.25 | $1.25 | 200000 | 4096 |
| eu.anthropic.claude-3-opus-20240229-v1:0 | $15 | $75 | 200000 | 4096 |
| anthropic.claude-v1 | $8 | $24 | 100000 | 8191 |
| bedrock/us-east-1/anthropic.claude-v1 | $8 | $24 | 100000 | 8191 |
| bedrock/us-west-2/anthropic.claude-v1 | $8 | $24 | 100000 | 8191 |
| bedrock/ap-northeast-1/anthropic.claude-v1 | $8 | $24 | 100000 | 8191 |
| bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-v1 | -- | -- | 100000 | 8191 |
| bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-v1 | -- | -- | 100000 | 8191 |
| bedrock/eu-central-1/anthropic.claude-v1 | $8 | $24 | 100000 | 8191 |
| bedrock/eu-central-1/1-month-commitment/anthropic.claude-v1 | -- | -- | 100000 | 8191 |
| bedrock/eu-central-1/6-month-commitment/anthropic.claude-v1 | -- | -- | 100000 | 8191 |
| bedrock/us-east-1/1-month-commitment/anthropic.claude-v1 | -- | -- | 100000 | 8191 |
| bedrock/us-east-1/6-month-commitment/anthropic.claude-v1 | -- | -- | 100000 | 8191 |
| bedrock/us-west-2/1-month-commitment/anthropic.claude-v1 | -- | -- | 100000 | 8191 |
| bedrock/us-west-2/6-month-commitment/anthropic.claude-v1 | -- | -- | 100000 | 8191 |
| anthropic.claude-v2 | $8 | $24 | 100000 | 8191 |
| bedrock/us-east-1/anthropic.claude-v2 | $8 | $24 | 100000 | 8191 |
| bedrock/us-west-2/anthropic.claude-v2 | $8 | $24 | 100000 | 8191 |
| bedrock/ap-northeast-1/anthropic.claude-v2 | $8 | $24 | 100000 | 8191 |
| bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-v2 | -- | -- | 100000 | 8191 |
| bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-v2 | -- | -- | 100000 | 8191 |
| bedrock/eu-central-1/anthropic.claude-v2 | $8 | $24 | 100000 | 8191 |
| bedrock/eu-central-1/1-month-commitment/anthropic.claude-v2 | -- | -- | 100000 | 8191 |
| bedrock/eu-central-1/6-month-commitment/anthropic.claude-v2 | -- | -- | 100000 | 8191 |
| bedrock/us-east-1/1-month-commitment/anthropic.claude-v2 | -- | -- | 100000 | 8191 |
| bedrock/us-east-1/6-month-commitment/anthropic.claude-v2 | -- | -- | 100000 | 8191 |
| bedrock/us-west-2/1-month-commitment/anthropic.claude-v2 | -- | -- | 100000 | 8191 |
| bedrock/us-west-2/6-month-commitment/anthropic.claude-v2 | -- | -- | 100000 | 8191 |
| anthropic.claude-v2:1 | $8 | $24 | 100000 | 8191 |
| bedrock/us-east-1/anthropic.claude-v2:1 | $8 | $24 | 100000 | 8191 |
| bedrock/us-west-2/anthropic.claude-v2:1 | $8 | $24 | 100000 | 8191 |
| bedrock/ap-northeast-1/anthropic.claude-v2:1 | $8 | $24 | 100000 | 8191 |
| bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-v2:1 | -- | -- | 100000 | 8191 |
| bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-v2:1 | -- | -- | 100000 | 8191 |
| bedrock/eu-central-1/anthropic.claude-v2:1 | $8 | $24 | 100000 | 8191 |
| bedrock/eu-central-1/1-month-commitment/anthropic.claude-v2:1 | -- | -- | 100000 | 8191 |
| bedrock/eu-central-1/6-month-commitment/anthropic.claude-v2:1 | -- | -- | 100000 | 8191 |
| bedrock/us-east-1/1-month-commitment/anthropic.claude-v2:1 | -- | -- | 100000 | 8191 |
| bedrock/us-east-1/6-month-commitment/anthropic.claude-v2:1 | -- | -- | 100000 | 8191 |
| bedrock/us-west-2/1-month-commitment/anthropic.claude-v2:1 | -- | -- | 100000 | 8191 |
| bedrock/us-west-2/6-month-commitment/anthropic.claude-v2:1 | -- | -- | 100000 | 8191 |
| anthropic.claude-instant-v1 | $0.8 | $2.4 | 100000 | 8191 |
| bedrock/us-east-1/anthropic.claude-instant-v1 | $0.8 | $2.4 | 100000 | 8191 |
| bedrock/us-east-1/1-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100000 | 8191 |
| bedrock/us-east-1/6-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100000 | 8191 |
| bedrock/us-west-2/1-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100000 | 8191 |
| bedrock/us-west-2/6-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100000 | 8191 |
| bedrock/us-west-2/anthropic.claude-instant-v1 | $0.8 | $2.4 | 100000 | 8191 |
| bedrock/ap-northeast-1/anthropic.claude-instant-v1 | $2.23 | $7.55 | 100000 | 8191 |
| bedrock/ap-northeast-1/1-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100000 | 8191 |
| bedrock/ap-northeast-1/6-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100000 | 8191 |
| bedrock/eu-central-1/anthropic.claude-instant-v1 | $2.48 | $8.38 | 100000 | 8191 |
| bedrock/eu-central-1/1-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100000 | 8191 |
| bedrock/eu-central-1/6-month-commitment/anthropic.claude-instant-v1 | -- | -- | 100000 | 8191 |
| cohere.command-text-v14 | $1.5 | $2 | 4096 | 4096 |
| bedrock/*/1-month-commitment/cohere.command-text-v14 | -- | -- | 4096 | 4096 |
| bedrock/*/6-month-commitment/cohere.command-text-v14 | -- | -- | 4096 | 4096 |
| cohere.command-light-text-v14 | $0.3 | $0.6 | 4096 | 4096 |
| bedrock/*/1-month-commitment/cohere.command-light-text-v14 | -- | -- | 4096 | 4096 |
| bedrock/*/6-month-commitment/cohere.command-light-text-v14 | -- | -- | 4096 | 4096 |
| cohere.command-r-plus-v1:0 | $3 | $15 | 128000 | 4096 |
| cohere.command-r-v1:0 | $0.5 | $1.5 | 128000 | 4096 |
| cohere.embed-english-v3 | $0.1 | $0 | 512 | nan |
| cohere.embed-multilingual-v3 | $0.1 | $0 | 512 | nan |
| meta.llama2-13b-chat-v1 | $0.75 | $1 | 4096 | 4096 |
| meta.llama2-70b-chat-v1 | $1.95 | $2.56 | 4096 | 4096 |
| meta.llama3-8b-instruct-v1:0 | $0.3 | $0.6 | 8192 | 8192 |
| bedrock/us-east-1/meta.llama3-8b-instruct-v1:0 | $0.3 | $0.6 | 8192 | 8192 |
| bedrock/us-west-1/meta.llama3-8b-instruct-v1:0 | $0.3 | $0.6 | 8192 | 8192 |
| bedrock/ap-south-1/meta.llama3-8b-instruct-v1:0 | $0.36 | $0.72 | 8192 | 8192 |
| bedrock/ca-central-1/meta.llama3-8b-instruct-v1:0 | $0.35 | $0.69 | 8192 | 8192 |
| bedrock/eu-west-1/meta.llama3-8b-instruct-v1:0 | $0.32 | $0.65 | 8192 | 8192 |
| bedrock/eu-west-2/meta.llama3-8b-instruct-v1:0 | $0.39 | $0.78 | 8192 | 8192 |
| bedrock/sa-east-1/meta.llama3-8b-instruct-v1:0 | $0.5 | $1.01 | 8192 | 8192 |
| meta.llama3-70b-instruct-v1:0 | $2.65 | $3.5 | 8192 | 8192 |
| bedrock/us-east-1/meta.llama3-70b-instruct-v1:0 | $2.65 | $3.5 | 8192 | 8192 |
| bedrock/us-west-1/meta.llama3-70b-instruct-v1:0 | $2.65 | $3.5 | 8192 | 8192 |
| bedrock/ap-south-1/meta.llama3-70b-instruct-v1:0 | $3.18 | $4.2 | 8192 | 8192 |
| bedrock/ca-central-1/meta.llama3-70b-instruct-v1:0 | $3.05 | $4.03 | 8192 | 8192 |
| bedrock/eu-west-1/meta.llama3-70b-instruct-v1:0 | $2.86 | $3.78 | 8192 | 8192 |
| bedrock/eu-west-2/meta.llama3-70b-instruct-v1:0 | $3.45 | $4.55 | 8192 | 8192 |
| bedrock/sa-east-1/meta.llama3-70b-instruct-v1:0 | $4.45 | $5.88 | 8192 | 8192 |
| meta.llama3-1-8b-instruct-v1:0 | $0.22 | $0.22 | 128000 | 2048 |
| meta.llama3-1-70b-instruct-v1:0 | $0.99 | $0.99 | 128000 | 2048 |
| meta.llama3-1-405b-instruct-v1:0 | $5.32 | $16 | 128000 | 4096 |
| meta.llama3-2-1b-instruct-v1:0 | $0.1 | $0.1 | 128000 | 4096 |
| us.meta.llama3-2-1b-instruct-v1:0 | $0.1 | $0.1 | 128000 | 4096 |
| eu.meta.llama3-2-1b-instruct-v1:0 | $0.13 | $0.13 | 128000 | 4096 |
| meta.llama3-2-3b-instruct-v1:0 | $0.15 | $0.15 | 128000 | 4096 |
| us.meta.llama3-2-3b-instruct-v1:0 | $0.15 | $0.15 | 128000 | 4096 |
| eu.meta.llama3-2-3b-instruct-v1:0 | $0.19 | $0.19 | 128000 | 4096 |
| meta.llama3-2-11b-instruct-v1:0 | $0.35 | $0.35 | 128000 | 4096 |
| us.meta.llama3-2-11b-instruct-v1:0 | $0.35 | $0.35 | 128000 | 4096 |
| meta.llama3-2-90b-instruct-v1:0 | $2 | $2 | 128000 | 4096 |
| us.meta.llama3-2-90b-instruct-v1:0 | $2 | $2 | 128000 | 4096 |
| 512-x-512/50-steps/stability.stable-diffusion-xl-v0 | -- | -- | 77 | nan |
| 512-x-512/max-steps/stability.stable-diffusion-xl-v0 | -- | -- | 77 | nan |
| max-x-max/50-steps/stability.stable-diffusion-xl-v0 | -- | -- | 77 | nan |
| max-x-max/max-steps/stability.stable-diffusion-xl-v0 | -- | -- | 77 | nan |
| 1024-x-1024/50-steps/stability.stable-diffusion-xl-v1 | -- | -- | 77 | nan |
| 1024-x-1024/max-steps/stability.stable-diffusion-xl-v1 | -- | -- | 77 | nan |
| sagemaker/meta-textgeneration-llama-2-7b | $0 | $0 | 4096 | 4096 |
| sagemaker/meta-textgeneration-llama-2-7b-f | $0 | $0 | 4096 | 4096 |
| sagemaker/meta-textgeneration-llama-2-13b | $0 | $0 | 4096 | 4096 |
| sagemaker/meta-textgeneration-llama-2-13b-f | $0 | $0 | 4096 | 4096 |
| sagemaker/meta-textgeneration-llama-2-70b | $0 | $0 | 4096 | 4096 |
| sagemaker/meta-textgeneration-llama-2-70b-b-f | $0 | $0 | 4096 | 4096 |
| together-ai-up-to-4b | $0.1 | $0.1 | nan | nan |
| together-ai-4.1b-8b | $0.2 | $0.2 | nan | nan |
| together-ai-8.1b-21b | $0.3 | $0.3 | nan | nan |
| together-ai-21.1b-41b | $0.8 | $0.8 | nan | nan |
| together-ai-41.1b-80b | $0.9 | $0.9 | nan | nan |
| together-ai-81.1b-110b | $1.8 | $1.8 | nan | nan |
| together-ai-embedding-up-to-150m | $0.01 | $0 | nan | nan |
| together-ai-embedding-151m-to-350m | $0.02 | $0 | nan | nan |
| together_ai/mistralai/Mixtral-8x7B-Instruct-v0.1 | $0.6 | $0.6 | nan | nan |
| together_ai/mistralai/Mistral-7B-Instruct-v0.1 | -- | -- | nan | nan |
| together_ai/togethercomputer/CodeLlama-34b-Instruct | -- | -- | nan | nan |
| ollama/codegemma | $0 | $0 | 8192 | 8192 |
| ollama/codegeex4 | $0 | $0 | 32768 | 8192 |
| ollama/deepseek-coder-v2-instruct | $0 | $0 | 32768 | 8192 |
| ollama/deepseek-coder-v2-base | $0 | $0 | 8192 | 8192 |
| ollama/deepseek-coder-v2-lite-instruct | $0 | $0 | 32768 | 8192 |
| ollama/deepseek-coder-v2-lite-base | $0 | $0 | 8192 | 8192 |
| ollama/internlm2_5-20b-chat | $0 | $0 | 32768 | 8192 |
| ollama/llama2 | $0 | $0 | 4096 | 4096 |
| ollama/llama2:7b | $0 | $0 | 4096 | 4096 |
| ollama/llama2:13b | $0 | $0 | 4096 | 4096 |
| ollama/llama2:70b | $0 | $0 | 4096 | 4096 |
| ollama/llama2-uncensored | $0 | $0 | 4096 | 4096 |
| ollama/llama3 | $0 | $0 | 8192 | 8192 |
| ollama/llama3:8b | $0 | $0 | 8192 | 8192 |
| ollama/llama3:70b | $0 | $0 | 8192 | 8192 |
| ollama/llama3.1 | $0 | $0 | 8192 | 8192 |
| ollama/mistral-large-instruct-2407 | $0 | $0 | 65536 | 8192 |
| ollama/mistral | $0 | $0 | 8192 | 8192 |
| ollama/mistral-7B-Instruct-v0.1 | $0 | $0 | 8192 | 8192 |
| ollama/mistral-7B-Instruct-v0.2 | $0 | $0 | 32768 | 32768 |
| ollama/mixtral-8x7B-Instruct-v0.1 | $0 | $0 | 32768 | 32768 |
| ollama/mixtral-8x22B-Instruct-v0.1 | $0 | $0 | 65536 | 65536 |
| ollama/codellama | $0 | $0 | 4096 | 4096 |
| ollama/orca-mini | $0 | $0 | 4096 | 4096 |
| ollama/vicuna | $0 | $0 | 2048 | 2048 |
| deepinfra/lizpreciatior/lzlv_70b_fp16_hf | $0.35 | $0.4 | 4096 | 4096 |
| deepinfra/Gryphe/MythoMax-L2-13b | $0.08 | $0.09 | 4096 | 4096 |
| deepinfra/mistralai/Mistral-7B-Instruct-v0.1 | $0.06 | $0.06 | 32768 | 32768 |
| deepinfra/meta-llama/Llama-2-70b-chat-hf | $0.64 | $0.8 | 4096 | 4096 |
| deepinfra/cognitivecomputations/dolphin-2.6-mixtral-8x7b | $0.24 | $0.24 | 32768 | 32768 |
| deepinfra/codellama/CodeLlama-34b-Instruct-hf | $0.6 | $0.6 | 4096 | 4096 |
| deepinfra/deepinfra/mixtral | $0.27 | $0.27 | 32000 | 4096 |
| deepinfra/Phind/Phind-CodeLlama-34B-v2 | $0.6 | $0.6 | 4096 | 4096 |
| deepinfra/mistralai/Mixtral-8x7B-Instruct-v0.1 | $0.4 | $0.4 | 32768 | 32768 |
| deepinfra/deepinfra/airoboros-70b | $0.7 | $0.9 | 4096 | 4096 |
| deepinfra/01-ai/Yi-34B-Chat | $0.6 | $0.6 | 4096 | 4096 |
| deepinfra/01-ai/Yi-6B-200K | $0.13 | $0.13 | 200000 | 4096 |
| deepinfra/jondurbin/airoboros-l2-70b-gpt4-1.4.1 | $0.7 | $0.9 | 4096 | 4096 |
| deepinfra/meta-llama/Llama-2-13b-chat-hf | $0.13 | $0.13 | 4096 | 4096 |
| deepinfra/amazon/MistralLite | $0.2 | $0.2 | 32768 | 8191 |
| deepinfra/meta-llama/Llama-2-7b-chat-hf | $0.13 | $0.13 | 4096 | 4096 |
| deepinfra/meta-llama/Meta-Llama-3-8B-Instruct | $0.03 | $0.06 | 8192 | 8192 |
| deepinfra/meta-llama/Meta-Llama-3-70B-Instruct | $0.3 | $0.4 | 8192 | 8192 |
| deepinfra/01-ai/Yi-34B-200K | $0.6 | $0.6 | 200000 | 4096 |
| deepinfra/openchat/openchat_3.5 | $0.06 | $0.06 | 8192 | 8192 |
| perplexity/codellama-34b-instruct | $0.35 | $1.4 | 16384 | 16384 |
| perplexity/codellama-70b-instruct | $0.7 | $2.8 | 16384 | 16384 |
| perplexity/llama-3.1-70b-instruct | $1 | $1 | 131072 | 131072 |
| perplexity/llama-3.1-8b-instruct | $0.2 | $0.2 | 131072 | 131072 |
| perplexity/llama-3.1-sonar-huge-128k-online | $5 | $5 | 127072 | 127072 |
| perplexity/llama-3.1-sonar-large-128k-online | $1 | $1 | 127072 | 127072 |
| perplexity/llama-3.1-sonar-large-128k-chat | $1 | $1 | 131072 | 131072 |
| perplexity/llama-3.1-sonar-small-128k-chat | $0.2 | $0.2 | 131072 | 131072 |
| perplexity/llama-3.1-sonar-small-128k-online | $0.2 | $0.2 | 127072 | 127072 |
| perplexity/pplx-7b-chat | $0.07 | $0.28 | 8192 | 8192 |
| perplexity/pplx-70b-chat | $0.7 | $2.8 | 4096 | 4096 |
| perplexity/pplx-7b-online | $0 | $0.28 | 4096 | 4096 |
| perplexity/pplx-70b-online | $0 | $2.8 | 4096 | 4096 |
| perplexity/llama-2-70b-chat | $0.7 | $2.8 | 4096 | 4096 |
| perplexity/mistral-7b-instruct | $0.07 | $0.28 | 4096 | 4096 |
| perplexity/mixtral-8x7b-instruct | $0.07 | $0.28 | 4096 | 4096 |
| perplexity/sonar-small-chat | $0.07 | $0.28 | 16384 | 16384 |
| perplexity/sonar-small-online | $0 | $0.28 | 12000 | 12000 |
| perplexity/sonar-medium-chat | $0.6 | $1.8 | 16384 | 16384 |
| perplexity/sonar-medium-online | $0 | $1.8 | 12000 | 12000 |
| fireworks_ai/accounts/fireworks/models/llama-v3p2-1b-instruct | $0.1 | $0.1 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/llama-v3p2-3b-instruct | $0.1 | $0.1 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/llama-v3p2-11b-vision-instruct | $0.2 | $0.2 | 16384 | 16384 |
| accounts/fireworks/models/llama-v3p2-90b-vision-instruct | $0.9 | $0.9 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/firefunction-v2 | $0.9 | $0.9 | 8192 | 8192 |
| fireworks_ai/accounts/fireworks/models/mixtral-8x22b-instruct-hf | $1.2 | $1.2 | 65536 | 65536 |
| fireworks_ai/accounts/fireworks/models/qwen2-72b-instruct | $0.9 | $0.9 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/yi-large | $3 | $3 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-instruct | $1.2 | $1.2 | 65536 | 65536 |
| fireworks_ai/nomic-ai/nomic-embed-text-v1.5 | $0.01 | $0 | 8192 | nan |
| fireworks_ai/nomic-ai/nomic-embed-text-v1 | $0.01 | $0 | 8192 | nan |
| fireworks_ai/WhereIsAI/UAE-Large-V1 | $0.02 | $0 | 512 | nan |
| fireworks_ai/thenlper/gte-large | $0.02 | $0 | 512 | nan |
| fireworks_ai/thenlper/gte-base | $0.01 | $0 | 512 | nan |
| fireworks-ai-up-to-16b | $0.2 | $0.2 | nan | nan |
| fireworks-ai-16.1b-to-80b | $0.9 | $0.9 | nan | nan |
| fireworks-ai-moe-up-to-56b | $0.5 | $0.5 | nan | nan |
| fireworks-ai-56b-to-176b | $1.2 | $1.2 | nan | nan |
| fireworks-ai-default | $0 | $0 | nan | nan |
| fireworks-ai-embedding-up-to-150m | $0.01 | $0 | nan | nan |
| fireworks-ai-embedding-150m-to-350m | $0.02 | $0 | nan | nan |
| anyscale/mistralai/Mistral-7B-Instruct-v0.1 | $0.15 | $0.15 | 16384 | 16384 |
| anyscale/mistralai/Mixtral-8x7B-Instruct-v0.1 | $0.15 | $0.15 | 16384 | 16384 |
| anyscale/mistralai/Mixtral-8x22B-Instruct-v0.1 | $0.9 | $0.9 | 65536 | 65536 |
| anyscale/HuggingFaceH4/zephyr-7b-beta | $0.15 | $0.15 | 16384 | 16384 |
| anyscale/google/gemma-7b-it | $0.15 | $0.15 | 8192 | 8192 |
| anyscale/meta-llama/Llama-2-7b-chat-hf | $0.15 | $0.15 | 4096 | 4096 |
| anyscale/meta-llama/Llama-2-13b-chat-hf | $0.25 | $0.25 | 4096 | 4096 |
| anyscale/meta-llama/Llama-2-70b-chat-hf | $1 | $1 | 4096 | 4096 |
| anyscale/codellama/CodeLlama-34b-Instruct-hf | $1 | $1 | 4096 | 4096 |
| anyscale/codellama/CodeLlama-70b-Instruct-hf | $1 | $1 | 4096 | 4096 |
| anyscale/meta-llama/Meta-Llama-3-8B-Instruct | $0.15 | $0.15 | 8192 | 8192 |
| anyscale/meta-llama/Meta-Llama-3-70B-Instruct | $1 | $1 | 8192 | 8192 |
| cloudflare/@cf/meta/llama-2-7b-chat-fp16 | $1.92 | $1.92 | 3072 | 3072 |
| cloudflare/@cf/meta/llama-2-7b-chat-int8 | $1.92 | $1.92 | 2048 | 2048 |
| cloudflare/@cf/mistral/mistral-7b-instruct-v0.1 | $1.92 | $1.92 | 8192 | 8192 |
| cloudflare/@hf/thebloke/codellama-7b-instruct-awq | $1.92 | $1.92 | 4096 | 4096 |
| voyage/voyage-01 | $0.1 | $0 | 4096 | nan |
| voyage/voyage-lite-01 | $0.1 | $0 | 4096 | nan |
| voyage/voyage-large-2 | $0.12 | $0 | 16000 | nan |
| voyage/voyage-law-2 | $0.12 | $0 | 16000 | nan |
| voyage/voyage-code-2 | $0.12 | $0 | 16000 | nan |
| voyage/voyage-2 | $0.1 | $0 | 4000 | nan |
| voyage/voyage-lite-02-instruct | $0.1 | $0 | 4000 | nan |
| voyage/voyage-finance-2 | $0.12 | $0 | 32000 | nan |
| databricks/databricks-meta-llama-3-1-405b-instruct | $5 | $15 | 128000 | 128000 |
| databricks/databricks-meta-llama-3-1-70b-instruct | $1 | $3 | 128000 | 128000 |
| databricks/databricks-dbrx-instruct | $0.75 | $2.25 | 32768 | 32768 |
| databricks/databricks-meta-llama-3-70b-instruct | $1 | $3 | 128000 | 128000 |
| databricks/databricks-llama-2-70b-chat | $0.5 | $1.5 | 4096 | 4096 |
| databricks/databricks-mixtral-8x7b-instruct | $0.5 | $1 | 4096 | 4096 |
| databricks/databricks-mpt-30b-instruct | $1 | $1 | 8192 | 8192 |
| databricks/databricks-mpt-7b-instruct | $0.5 | $0 | 8192 | 8192 |
| databricks/databricks-bge-large-en | $0.1 | $0 | 512 | nan |
| databricks/databricks-gte-large-en | $0.13 | $0 | 8192 | nan |
| azure/gpt-4o-mini-2024-07-18 | $0.16 | $0.66 | 128000 | 16384 |
| amazon.titan-embed-image-v1 | $0.8 | $0 | 128 | nan |
| azure_ai/mistral-large-2407 | $2 | $6 | 128000 | 4096 |
| azure_ai/ministral-3b | $0.04 | $0.04 | 128000 | 4096 |
| azure_ai/Llama-3.2-11B-Vision-Instruct | $0.37 | $0.37 | 128000 | 2048 |
| azure_ai/Llama-3.2-90B-Vision-Instruct | $2.04 | $2.04 | 128000 | 2048 |
| azure_ai/Phi-3.5-mini-instruct | $0.13 | $0.52 | 128000 | 4096 |
| azure_ai/Phi-3.5-vision-instruct | $0.13 | $0.52 | 128000 | 4096 |
| azure_ai/Phi-3.5-MoE-instruct | $0.16 | $0.64 | 128000 | 4096 |
| azure_ai/Phi-3-mini-4k-instruct | $0.13 | $0.52 | 4096 | 4096 |
| azure_ai/Phi-3-mini-128k-instruct | $0.13 | $0.52 | 128000 | 4096 |
| azure_ai/Phi-3-small-8k-instruct | $0.15 | $0.6 | 8192 | 4096 |
| azure_ai/Phi-3-small-128k-instruct | $0.15 | $0.6 | 128000 | 4096 |
| azure_ai/Phi-3-medium-4k-instruct | $0.17 | $0.68 | 4096 | 4096 |
| azure_ai/Phi-3-medium-128k-instruct | $0.17 | $0.68 | 128000 | 4096 |
| xai/grok-beta | $5 | $15 | 131072 | 131072 |
| claude-3-5-haiku-20241022 | $0.8 | $4 | 200000 | 8192 |
| vertex_ai/claude-3-5-haiku@20241022 | $1 | $5 | 200000 | 8192 |
| openrouter/anthropic/claude-3-5-haiku | $1 | $5 | nan | nan |
| openrouter/anthropic/claude-3-5-haiku-20241022 | $1 | $5 | 200000 | 8192 |
| anthropic.claude-3-5-haiku-20241022-v1:0 | $0.8 | $4 | 200000 | 8192 |
| us.anthropic.claude-3-5-haiku-20241022-v1:0 | $0.8 | $4 | 200000 | 8192 |
| eu.anthropic.claude-3-5-haiku-20241022-v1:0 | $0.25 | $1.25 | 200000 | 8192 |
| stability.sd3-large-v1:0 | -- | -- | 77 | nan |
| gpt-4o-2024-11-20 | $2.5 | $10 | 128000 | 16384 |
| ft:gpt-4o-2024-11-20 | $3.75 | $15 | 128000 | 16384 |
| azure/gpt-4o-2024-11-20 | $2.75 | $11 | 128000 | 16384 |
| azure/global-standard/gpt-4o-2024-11-20 | $2.5 | $10 | 128000 | 16384 |
| groq/llama-3.2-1b-preview | $0.04 | $0.04 | 8192 | 8192 |
| groq/llama-3.2-3b-preview | $0.06 | $0.06 | 8192 | 8192 |
| groq/llama-3.2-11b-text-preview | $0.18 | $0.18 | 8192 | 8192 |
| groq/llama-3.2-11b-vision-preview | $0.18 | $0.18 | 8192 | 8192 |
| groq/llama-3.2-90b-text-preview | $0.9 | $0.9 | 8192 | 8192 |
| groq/llama-3.2-90b-vision-preview | $0.9 | $0.9 | 8192 | 8192 |
| vertex_ai/claude-3-sonnet | $3 | $15 | 200000 | 4096 |
| vertex_ai/claude-3-5-sonnet | $3 | $15 | 200000 | 8192 |
| vertex_ai/claude-3-5-sonnet-v2 | $3 | $15 | 200000 | 8192 |
| vertex_ai/claude-3-haiku | $0.25 | $1.25 | 200000 | 4096 |
| vertex_ai/claude-3-5-haiku | $1 | $5 | 200000 | 8192 |
| vertex_ai/claude-3-opus | $15 | $75 | 200000 | 4096 |
| gemini/gemini-exp-1114 | $0 | $0 | 1.04858e+06 | 8192 |
| openrouter/qwen/qwen-2.5-coder-32b-instruct | $0.18 | $0.18 | 33792 | 33792 |
| us.meta.llama3-1-8b-instruct-v1:0 | $0.22 | $0.22 | 128000 | 2048 |
| us.meta.llama3-1-70b-instruct-v1:0 | $0.99 | $0.99 | 128000 | 2048 |
| us.meta.llama3-1-405b-instruct-v1:0 | $5.32 | $16 | 128000 | 4096 |
| stability.stable-image-ultra-v1:0 | -- | -- | 77 | nan |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct | $0.9 | $0.9 | 4096 | 4096 |
| omni-moderation-latest | $0 | $0 | 32768 | 0 |
| omni-moderation-latest-intents | $0 | $0 | 32768 | 0 |
| omni-moderation-2024-09-26 | $0 | $0 | 32768 | 0 |
| gpt-4o-audio-preview-2024-12-17 | $2.5 | $10 | 128000 | 16384 |
| gpt-4o-mini-audio-preview-2024-12-17 | $0.15 | $0.6 | 128000 | 16384 |
| o1 | $15 | $60 | 200000 | 100000 |
| o1-2024-12-17 | $15 | $60 | 200000 | 100000 |
| gpt-4o-realtime-preview-2024-10-01 | $5 | $20 | 128000 | 4096 |
| gpt-4o-realtime-preview | $5 | $20 | 128000 | 4096 |
| gpt-4o-realtime-preview-2024-12-17 | $5 | $20 | 128000 | 4096 |
| gpt-4o-mini-realtime-preview | $0.6 | $2.4 | 128000 | 4096 |
| gpt-4o-mini-realtime-preview-2024-12-17 | $0.6 | $2.4 | 128000 | 4096 |
| azure/o1 | $15 | $60 | 200000 | 100000 |
| azure_ai/Llama-3.3-70B-Instruct | $0.71 | $0.71 | 128000 | 2048 |
| mistral/mistral-large-2411 | $2 | $6 | 128000 | 128000 |
| mistral/pixtral-large-latest | $2 | $6 | 128000 | 128000 |
| mistral/pixtral-large-2411 | $2 | $6 | 128000 | 128000 |
| deepseek/deepseek-chat | $0.28 | $0.42 | 131072 | 8192 |
| deepseek/deepseek-coder | $0.14 | $0.28 | 128000 | 4096 |
| groq/llama-3.3-70b-versatile | $0.59 | $0.79 | 128000 | 32768 |
| groq/llama-3.3-70b-specdec | $0.59 | $0.99 | 8192 | 8192 |
| friendliai/meta-llama-3.1-8b-instruct | $0.1 | $0.1 | 8192 | 8192 |
| friendliai/meta-llama-3.1-70b-instruct | $0.6 | $0.6 | 8192 | 8192 |
| gemini-2.0-flash-exp | $0.15 | $0.6 | 1.04858e+06 | 8192 |
| gemini/gemini-2.0-flash-exp | $0 | $0 | 1.04858e+06 | 8192 |
| vertex_ai/mistral-large@2411-001 | $2 | $6 | 128000 | 8191 |
| vertex_ai/mistral-large-2411 | $2 | $6 | 128000 | 8191 |
| text-embedding-005 | $0.1 | $0 | 2048 | nan |
| gemini/gemini-1.5-flash-8b | $0 | $0 | 1.04858e+06 | 8192 |
| gemini/gemini-exp-1206 | $0 | $0 | 2.09715e+06 | 8192 |
| command-r7b-12-2024 | $0.15 | $0.04 | 128000 | 4096 |
| rerank-v3.5 | $0 | $0 | 4096 | 4096 |
| openrouter/deepseek/deepseek-chat | $0.14 | $0.28 | 65536 | 8192 |
| openrouter/openai/o1 | $15 | $60 | 200000 | 100000 |
| amazon.nova-micro-v1:0 | $0.04 | $0.14 | 128000 | 10000 |
| amazon.nova-lite-v1:0 | $0.06 | $0.24 | 300000 | 10000 |
| amazon.nova-pro-v1:0 | $0.8 | $3.2 | 300000 | 10000 |
| meta.llama3-3-70b-instruct-v1:0 | $0.72 | $0.72 | 128000 | 4096 |
| together_ai/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | $0.18 | $0.18 | nan | nan |
| together_ai/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo | $0.88 | $0.88 | nan | nan |
| together_ai/meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo | $3.5 | $3.5 | nan | nan |
| deepinfra/meta-llama/Meta-Llama-3.1-405B-Instruct | $0.8 | $0.8 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/deepseek-v3 | $0.9 | $0.9 | 128000 | 8192 |
| voyage/voyage-3-large | $0.18 | $0 | 32000 | nan |
| voyage/voyage-3 | $0.06 | $0 | 32000 | nan |
| voyage/voyage-3-lite | $0.02 | $0 | 32000 | nan |
| voyage/voyage-code-3 | $0.18 | $0 | 32000 | nan |
| voyage/voyage-multimodal-3 | $0.12 | $0 | 32000 | nan |
| voyage/rerank-2 | $0.05 | $0 | 16000 | 16000 |
| voyage/rerank-2-lite | $0.02 | $0 | 8000 | 8000 |
| databricks/meta-llama-3.3-70b-instruct | $1 | $3 | 128000 | 128000 |
| sambanova/Meta-Llama-3.1-8B-Instruct | $0.1 | $0.2 | 16384 | 16384 |
| sambanova/Meta-Llama-3.1-70B-Instruct | $0.6 | $1.2 | 128000 | 128000 |
| sambanova/Meta-Llama-3.1-405B-Instruct | $5 | $10 | 16384 | 16384 |
| sambanova/Meta-Llama-3.2-1B-Instruct | $0.04 | $0.08 | 16384 | 16384 |
| sambanova/Meta-Llama-3.2-3B-Instruct | $0.08 | $0.16 | 4096 | 4096 |
| sambanova/Meta-Llama-3.3-70B-Instruct | $0.6 | $1.2 | 131072 | 131072 |
| sambanova/Qwen2.5-Coder-32B-Instruct | $1.5 | $3 | 8000 | 8000 |
| sambanova/Qwen2.5-72B-Instruct | $2 | $4 | 8000 | 8000 |
| o3-mini | $1.1 | $4.4 | 200000 | 100000 |
| o3-mini-2025-01-31 | $1.1 | $4.4 | 200000 | 100000 |
| azure/o3-mini-2025-01-31 | $1.1 | $4.4 | 200000 | 100000 |
| azure/o3-mini | $1.1 | $4.4 | 200000 | 100000 |
| azure/o1-2024-12-17 | $15 | $60 | 200000 | 100000 |
| azure_ai/deepseek-r1 | $1.35 | $5.4 | 128000 | 8192 |
| deepseek/deepseek-reasoner | $0.28 | $0.42 | 131072 | 65536 |
| xai/grok-2-vision-1212 | $2 | $10 | 32768 | 32768 |
| xai/grok-2-vision-latest | $2 | $10 | 32768 | 32768 |
| xai/grok-2-vision | $2 | $10 | 32768 | 32768 |
| xai/grok-vision-beta | $5 | $15 | 8192 | 8192 |
| xai/grok-2-1212 | $2 | $10 | 131072 | 131072 |
| xai/grok-2 | $2 | $10 | 131072 | 131072 |
| xai/grok-2-latest | $2 | $10 | 131072 | 131072 |
| groq/deepseek-r1-distill-llama-70b | $0.75 | $0.99 | 128000 | 128000 |
| gemini/gemini-2.0-flash | $0.1 | $0.4 | 1.04858e+06 | 8192 |
| gemini-2.0-flash-001 | $0.15 | $0.6 | 1.04858e+06 | 8192 |
| gemini-2.0-flash-thinking-exp | $0 | $0 | 1.04858e+06 | 8192 |
| gemini-2.0-flash-thinking-exp-01-21 | $0 | $0 | 1.04858e+06 | 65536 |
| gemini/gemini-2.0-flash-001 | $0.1 | $0.4 | 1.04858e+06 | 8192 |
| gemini/gemini-2.0-flash-lite-preview-02-05 | $0.08 | $0.3 | 1.04858e+06 | 8192 |
| gemini/gemini-2.0-flash-thinking-exp | $0 | $0 | 1.04858e+06 | 65536 |
| vertex_ai/codestral-2501 | $0.2 | $0.6 | 128000 | 128000 |
| openrouter/deepseek/deepseek-r1 | $0.55 | $2.19 | 65336 | 8192 |
| ai21.jamba-1-5-large-v1:0 | $2 | $8 | 256000 | 256000 |
| ai21.jamba-1-5-mini-v1:0 | $0.2 | $0.4 | 256000 | 256000 |
| us.amazon.nova-micro-v1:0 | $0.04 | $0.14 | 128000 | 10000 |
| us.amazon.nova-lite-v1:0 | $0.06 | $0.24 | 300000 | 10000 |
| us.amazon.nova-pro-v1:0 | $0.8 | $3.2 | 300000 | 10000 |
| stability.sd3-5-large-v1:0 | -- | -- | 77 | nan |
| stability.stable-image-core-v1:0 | -- | -- | 77 | nan |
| stability.stable-image-core-v1:1 | -- | -- | 77 | nan |
| stability.stable-image-ultra-v1:1 | -- | -- | 77 | nan |
| together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo | $0.88 | $0.88 | nan | nan |
| together_ai/meta-llama/Llama-3.3-70B-Instruct-Turbo-Free | $0 | $0 | nan | nan |
| fireworks_ai/accounts/fireworks/models/llama-v3p1-8b-instruct | $0.1 | $0.1 | 16384 | 16384 |
| assemblyai/nano | -- | -- | nan | nan |
| assemblyai/best | -- | -- | nan | nan |
| azure/gpt-3.5-turbo-0125 | $0.5 | $1.5 | 16384 | 4096 |
| azure/gpt-3.5-turbo | $0.5 | $1.5 | 4097 | 4096 |
| gemini-2.0-pro-exp-02-05 | $1.25 | $10 | 2.09715e+06 | 8192 |
| us.meta.llama3-3-70b-instruct-v1:0 | $0.72 | $0.72 | 128000 | 4096 |
| perplexity/sonar | $1 | $1 | 128000 | nan |
| perplexity/sonar-pro | $3 | $15 | 200000 | 8000 |
| openrouter/google/gemini-2.0-flash-001 | $0.1 | $0.4 | 1.04858e+06 | 8192 |
| gpt-4.5-preview | $75 | $150 | 128000 | 16384 |
| gpt-4.5-preview-2025-02-27 | $75 | $150 | 128000 | 16384 |
| azure_ai/Phi-4 | $0.12 | $0.5 | 16384 | 16384 |
| cerebras/llama3.3-70b | $0.85 | $1.2 | 128000 | 128000 |
| claude-3-5-haiku-latest | $1 | $5 | 200000 | 8192 |
| claude-3-7-sonnet-latest | $3 | $15 | 200000 | 64000 |
| claude-3-7-sonnet-20250219 | $3 | $15 | 200000 | 64000 |
| vertex_ai/claude-3-7-sonnet@20250219 | $3 | $15 | 200000 | 8192 |
| openrouter/anthropic/claude-3.7-sonnet | $3 | $15 | 200000 | 128000 |
| openrouter/anthropic/claude-3.7-sonnet:beta | $3 | $15 | 200000 | 128000 |
| amazon.rerank-v1:0 | $0 | $0 | 32000 | 32000 |
| anthropic.claude-3-7-sonnet-20250219-v1:0 | $3 | $15 | 200000 | 8192 |
| us.anthropic.claude-3-7-sonnet-20250219-v1:0 | $3 | $15 | 200000 | 8192 |
| cohere.rerank-v3-5:0 | $0 | $0 | 32000 | 32000 |
| jina-reranker-v2-base-multilingual | $0.02 | $0.02 | 1024 | 1024 |
| bedrock/invoke/anthropic.claude-3-5-sonnet-20240620-v1:0 | $3 | $15 | 200000 | 4096 |
| azure/gpt-4o-mini-realtime-preview-2024-12-17 | $0.6 | $2.4 | 128000 | 4096 |
| azure/eu/gpt-4o-mini-realtime-preview-2024-12-17 | $0.66 | $2.64 | 128000 | 4096 |
| azure/us/gpt-4o-mini-realtime-preview-2024-12-17 | $0.66 | $2.64 | 128000 | 4096 |
| azure/gpt-4o-realtime-preview-2024-10-01 | $5 | $20 | 128000 | 4096 |
| azure/us/gpt-4o-realtime-preview-2024-10-01 | $5.5 | $22 | 128000 | 4096 |
| azure/eu/gpt-4o-realtime-preview-2024-10-01 | $5.5 | $22 | 128000 | 4096 |
| azure/us/o3-mini-2025-01-31 | $1.21 | $4.84 | 200000 | 100000 |
| azure/eu/o3-mini-2025-01-31 | $1.21 | $4.84 | 200000 | 100000 |
| azure/us/o1-mini-2024-09-12 | $1.21 | $4.84 | 128000 | 65536 |
| azure/eu/o1-mini-2024-09-12 | $1.21 | $4.84 | 128000 | 65536 |
| azure/us/o1-2024-12-17 | $16.5 | $66 | 200000 | 100000 |
| azure/eu/o1-2024-12-17 | $16.5 | $66 | 200000 | 100000 |
| azure/us/o1-preview-2024-09-12 | $16.5 | $66 | 128000 | 32768 |
| azure/eu/o1-preview-2024-09-12 | $16.5 | $66 | 128000 | 32768 |
| azure/us/gpt-4o-2024-11-20 | $2.75 | $11 | 128000 | 16384 |
| azure/eu/gpt-4o-2024-11-20 | $2.75 | $11 | 128000 | 16384 |
| azure/us/gpt-4o-2024-08-06 | $2.75 | $11 | 128000 | 16384 |
| azure/eu/gpt-4o-2024-08-06 | $2.75 | $11 | 128000 | 16384 |
| azure/us/gpt-4o-mini-2024-07-18 | $0.16 | $0.66 | 128000 | 16384 |
| azure/eu/gpt-4o-mini-2024-07-18 | $0.16 | $0.66 | 128000 | 16384 |
| azure_ai/deepseek-v3 | $1.14 | $4.56 | 128000 | 8192 |
| azure_ai/mistral-nemo | $0.15 | $0.15 | 131072 | 4096 |
| azure_ai/Phi-4-mini-instruct | $0.08 | $0.3 | 131072 | 4096 |
| azure_ai/Phi-4-multimodal-instruct | $0.08 | $0.32 | 131072 | 4096 |
| gemini/gemini-2.0-pro-exp-02-05 | $0 | $0 | 2.09715e+06 | 8192 |
| gemini/gemini-2.0-flash-thinking-exp-01-21 | $0 | $0 | 1.04858e+06 | 65536 |
| gemini/gemma-3-27b-it | $0 | $0 | 131072 | 8192 |
| gemini/learnlm-1.5-pro-experimental | $0 | $0 | 32767 | 8192 |
| vertex_ai/imagen-3.0-generate-002 | -- | -- | nan | nan |
| jamba-large-1.6 | $2 | $8 | 256000 | 256000 |
| jamba-mini-1.6 | $0.2 | $0.4 | 256000 | 256000 |
| eu.amazon.nova-micro-v1:0 | $0.05 | $0.18 | 128000 | 10000 |
| eu.amazon.nova-lite-v1:0 | $0.08 | $0.31 | 300000 | 10000 |
| 1024-x-1024/50-steps/bedrock/amazon.nova-canvas-v1:0 | -- | -- | 2600 | nan |
| eu.amazon.nova-pro-v1:0 | $1.05 | $4.2 | 300000 | 10000 |
| us.deepseek.r1-v1:0 | $1.35 | $5.4 | 128000 | 4096 |
| snowflake/deepseek-r1 | -- | -- | 32768 | 8192 |
| snowflake/snowflake-arctic | -- | -- | 4096 | 8192 |
| snowflake/claude-3-5-sonnet | -- | -- | 18000 | 8192 |
| snowflake/mistral-large | -- | -- | 32000 | 8192 |
| snowflake/mistral-large2 | -- | -- | 128000 | 8192 |
| snowflake/reka-flash | -- | -- | 100000 | 8192 |
| snowflake/reka-core | -- | -- | 32000 | 8192 |
| snowflake/jamba-instruct | -- | -- | 256000 | 8192 |
| snowflake/jamba-1.5-mini | -- | -- | 256000 | 8192 |
| snowflake/jamba-1.5-large | -- | -- | 256000 | 8192 |
| snowflake/mixtral-8x7b | -- | -- | 32000 | 8192 |
| snowflake/llama2-70b-chat | -- | -- | 4096 | 8192 |
| snowflake/llama3-8b | -- | -- | 8000 | 8192 |
| snowflake/llama3-70b | -- | -- | 8000 | 8192 |
| snowflake/llama3.1-8b | -- | -- | 128000 | 8192 |
| snowflake/llama3.1-70b | -- | -- | 128000 | 8192 |
| snowflake/llama3.3-70b | -- | -- | 128000 | 8192 |
| snowflake/snowflake-llama-3.3-70b | -- | -- | 8000 | 8192 |
| snowflake/llama3.1-405b | -- | -- | 128000 | 8192 |
| snowflake/snowflake-llama-3.1-405b | -- | -- | 8000 | 8192 |
| snowflake/llama3.2-1b | -- | -- | 128000 | 8192 |
| snowflake/llama3.2-3b | -- | -- | 128000 | 8192 |
| snowflake/mistral-7b | -- | -- | 32000 | 8192 |
| snowflake/gemma-7b | -- | -- | 8000 | 8192 |
| azure/global/gpt-4o-2024-11-20 | $2.5 | $10 | 128000 | 16384 |
| azure/global/gpt-4o-2024-08-06 | $2.5 | $10 | 128000 | 16384 |
| o1-pro | $150 | $600 | 200000 | 100000 |
| o1-pro-2025-03-19 | $150 | $600 | 200000 | 100000 |
| gpt-4o-search-preview-2025-03-11 | $2.5 | $10 | 128000 | 16384 |
| gpt-4o-search-preview | $2.5 | $10 | 128000 | 16384 |
| gpt-4o-mini-search-preview-2025-03-11 | $0.15 | $0.6 | 128000 | 16384 |
| gpt-4o-mini-search-preview | $0.15 | $0.6 | 128000 | 16384 |
| azure/gpt-4.5-preview | $75 | $150 | 128000 | 16384 |
| azure_ai/mistral-small-2503 | $0.1 | $0.3 | 128000 | 128000 |
| text-embedding-large-exp-03-07 | $0.1 | $0 | 8192 | nan |
| gpt-4.1 | $2 | $8 | 1.04758e+06 | 32768 |
| gpt-4.1-2025-04-14 | $2 | $8 | 1.04758e+06 | 32768 |
| gpt-4.1-mini | $0.4 | $1.6 | 1.04758e+06 | 32768 |
| gpt-4.1-mini-2025-04-14 | $0.4 | $1.6 | 1.04758e+06 | 32768 |
| gpt-4.1-nano | $0.1 | $0.4 | 1.04758e+06 | 32768 |
| gpt-4.1-nano-2025-04-14 | $0.1 | $0.4 | 1.04758e+06 | 32768 |
| watsonx/ibm/granite-3-8b-instruct | $0.2 | $0.2 | 8192 | 1024 |
| computer-use-preview | $3 | $12 | 8192 | 1024 |
| o3 | $2 | $8 | 200000 | 100000 |
| o3-2025-04-16 | $2 | $8 | 200000 | 100000 |
| o4-mini | $1.1 | $4.4 | 200000 | 100000 |
| o4-mini-2025-04-16 | $1.1 | $4.4 | 200000 | 100000 |
| gpt-image-1 | $5 | -- | nan | nan |
| low/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
| medium/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
| high/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
| low/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
| medium/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
| high/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
| low/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
| medium/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
| high/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
| gpt-4o-transcribe | $2.5 | $10 | 16000 | 2000 |
| gpt-4o-mini-transcribe | $1.25 | $5 | 16000 | 2000 |
| gpt-4o-mini-tts | $2.5 | $10 | nan | nan |
| azure/computer-use-preview | $3 | $12 | 8192 | 1024 |
| azure/gpt-4o-audio-preview-2024-12-17 | $2.5 | $10 | 128000 | 16384 |
| azure/gpt-4o-mini-audio-preview-2024-12-17 | $2.5 | $10 | 128000 | 16384 |
| azure/gpt-4.1 | $2 | $8 | 1.04758e+06 | 32768 |
| azure/gpt-4.1-2025-04-14 | $2 | $8 | 1.04758e+06 | 32768 |
| azure/gpt-4.1-mini | $0.4 | $1.6 | 1.04758e+06 | 32768 |
| azure/gpt-4.1-mini-2025-04-14 | $0.4 | $1.6 | 1.04758e+06 | 32768 |
| azure/gpt-4.1-nano | $0.1 | $0.4 | 1.04758e+06 | 32768 |
| azure/gpt-4.1-nano-2025-04-14 | $0.1 | $0.4 | 1.04758e+06 | 32768 |
| azure/o3 | $2 | $8 | 200000 | 100000 |
| azure/o3-2025-04-16 | $2 | $8 | 200000 | 100000 |
| azure/o4-mini | $1.1 | $4.4 | 200000 | 100000 |
| azure/gpt-4o-realtime-preview-2024-12-17 | $5 | $20 | 128000 | 4096 |
| azure/us/gpt-4o-realtime-preview-2024-12-17 | $5.5 | $22 | 128000 | 4096 |
| azure/eu/gpt-4o-realtime-preview-2024-12-17 | $5.5 | $22 | 128000 | 4096 |
| azure/o4-mini-2025-04-16 | $1.1 | $4.4 | 200000 | 100000 |
| azure/gpt-image-1 | $5 | -- | nan | nan |
| azure/low/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
| azure/medium/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
| azure/high/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
| azure/low/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
| azure/medium/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
| azure/high/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
| azure/low/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
| azure/medium/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
| azure/high/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
| azure_ai/mistral-large-latest | $2 | $6 | 128000 | 4096 |
| xai/grok-3-beta | $3 | $15 | 131072 | 131072 |
| xai/grok-3-fast-beta | $5 | $25 | 131072 | 131072 |
| xai/grok-3-fast-latest | $5 | $25 | 131072 | 131072 |
| xai/grok-3-mini-beta | $0.3 | $0.5 | 131072 | 131072 |
| xai/grok-3-mini-fast-beta | $0.6 | $4 | 131072 | 131072 |
| xai/grok-3-mini-fast-latest | $0.6 | $4 | 131072 | 131072 |
| groq/whisper-large-v3 | -- | -- | nan | nan |
| groq/whisper-large-v3-turbo | -- | -- | nan | nan |
| groq/distil-whisper-large-v3-en | -- | -- | nan | nan |
| meta_llama/Llama-4-Scout-17B-16E-Instruct-FP8 | -- | -- | 1e+07 | 4028 |
| meta_llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | -- | -- | 1e+06 | 4028 |
| meta_llama/Llama-3.3-70B-Instruct | -- | -- | 128000 | 4028 |
| meta_llama/Llama-3.3-8B-Instruct | -- | -- | 128000 | 4028 |
| gemini-2.5-pro-exp-03-25 | $1.25 | $10 | 1.04858e+06 | 65535 |
| gemini/gemini-2.5-pro-exp-03-25 | $0 | $0 | 1.04858e+06 | 65535 |
| gemini/gemini-2.5-flash-preview-04-17 | $0.15 | $0.6 | 1.04858e+06 | 65535 |
| gemini-2.5-flash-preview-04-17 | $0.15 | $0.6 | 1.04858e+06 | 65535 |
| gemini-2.0-flash | $0.1 | $0.4 | 1.04858e+06 | 8192 |
| gemini-2.0-flash-lite | $0.08 | $0.3 | 1.04858e+06 | 8192 |
| gemini-2.0-flash-lite-001 | $0.08 | $0.3 | 1.04858e+06 | 8192 |
| gemini-2.5-pro-preview-05-06 | $1.25 | $10 | 1.04858e+06 | 65535 |
| gemini-2.5-pro-preview-03-25 | $1.25 | $10 | 1.04858e+06 | 65535 |
| gemini/gemini-2.0-flash-lite | $0.08 | $0.3 | 1.04858e+06 | 8192 |
| gemini/gemini-2.5-pro-preview-05-06 | $1.25 | $10 | 1.04858e+06 | 65535 |
| gemini/gemini-2.5-pro-preview-03-25 | $1.25 | $10 | 1.04858e+06 | 65535 |
| vertex_ai/meta/llama-4-scout-17b-16e-instruct-maas | $0.25 | $0.7 | 1e+07 | 1e+07 |
| vertex_ai/meta/llama-4-scout-17b-128e-instruct-maas | $0.25 | $0.7 | 1e+07 | 1e+07 |
| vertex_ai/meta/llama-4-maverick-17b-128e-instruct-maas | $0.35 | $1.15 | 1e+06 | 1e+06 |
| vertex_ai/meta/llama-4-maverick-17b-16e-instruct-maas | $0.35 | $1.15 | 1e+06 | 1e+06 |
| vertex_ai/mistral-small-2503@001 | $1 | $3 | 32000 | 8191 |
| vertex_ai/mistral-small-2503 | $1 | $3 | 128000 | 128000 |
| multimodalembedding | $0.8 | $0 | 2048 | nan |
| multimodalembedding@001 | $0.8 | $0 | 2048 | nan |
| command-a-03-2025 | $2.5 | $10 | 256000 | 8000 |
| mistralai/mistral-small-3.1-24b-instruct | $0.1 | $0.3 | nan | nan |
| openrouter/openai/o3-mini | $1.1 | $4.4 | 128000 | 65536 |
| openrouter/openai/o3-mini-high | $1.1 | $4.4 | 128000 | 65536 |
| us.amazon.nova-premier-v1:0 | $2.5 | $12.5 | 1e+06 | 10000 |
| meta.llama4-maverick-17b-instruct-v1:0 | $0.24 | $0.97 | 128000 | 4096 |
| us.meta.llama4-maverick-17b-instruct-v1:0 | $0.24 | $0.97 | 128000 | 4096 |
| meta.llama4-scout-17b-instruct-v1:0 | $0.17 | $0.66 | 128000 | 4096 |
| us.meta.llama4-scout-17b-instruct-v1:0 | $0.17 | $0.66 | 128000 | 4096 |
| together_ai/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | $0.27 | $0.85 | nan | nan |
| together_ai/meta-llama/Llama-4-Scout-17B-16E-Instruct | $0.18 | $0.59 | nan | nan |
| together_ai/meta-llama/Llama-3.2-3B-Instruct-Turbo | -- | -- | nan | nan |
| together_ai/Qwen/Qwen2.5-7B-Instruct-Turbo | -- | -- | nan | nan |
| together_ai/Qwen/Qwen2.5-72B-Instruct-Turbo | -- | -- | nan | nan |
| together_ai/deepseek-ai/DeepSeek-V3 | $1.25 | $1.25 | 65536 | 8192 |
| together_ai/mistralai/Mistral-Small-24B-Instruct-2501 | -- | -- | nan | nan |
| perplexity/sonar-deep-research | $2 | $8 | 128000 | nan |
| fireworks_ai/accounts/fireworks/models/deepseek-r1 | $3 | $8 | 128000 | 20480 |
| fireworks_ai/accounts/fireworks/models/deepseek-r1-basic | $0.55 | $2.19 | 128000 | 20480 |
| fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct | $3 | $3 | 128000 | 16384 |
| fireworks_ai/accounts/fireworks/models/llama4-maverick-instruct-basic | $0.22 | $0.88 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/llama4-scout-instruct-basic | $0.15 | $0.6 | 131072 | 131072 |
| fireworks-ai-up-to-4b | $0.2 | $0.2 | nan | nan |
| fireworks-ai-4.1b-to-16b | $0.2 | $0.2 | nan | nan |
| fireworks-ai-above-16b | $0.9 | $0.9 | nan | nan |
| databricks/databricks-claude-3-7-sonnet | $3 | $15 | 200000 | 128000 |
| databricks/databricks-meta-llama-3-3-70b-instruct | $0.5 | $1.5 | 128000 | 128000 |
| azure_ai/deepseek-v3-0324 | $1.14 | $4.56 | 128000 | 8192 |
| azure_ai/Llama-4-Scout-17B-16E-Instruct | $0.2 | $0.78 | 1e+07 | 16384 |
| azure_ai/Llama-4-Maverick-17B-128E-Instruct-FP8 | $1.41 | $0.35 | 1e+06 | 16384 |
| cerebras/llama-3.3-70b | $0.85 | $1.2 | 128000 | 128000 |
| perplexity/sonar-reasoning | $1 | $5 | 128000 | nan |
| perplexity/sonar-reasoning-pro | $2 | $8 | 128000 | nan |
| nscale/meta-llama/Llama-4-Scout-17B-16E-Instruct | $0.09 | $0.29 | nan | nan |
| nscale/Qwen/Qwen2.5-Coder-3B-Instruct | $0.01 | $0.03 | nan | nan |
| nscale/Qwen/Qwen2.5-Coder-7B-Instruct | $0.01 | $0.03 | nan | nan |
| nscale/Qwen/Qwen2.5-Coder-32B-Instruct | $0.06 | $0.2 | nan | nan |
| nscale/Qwen/QwQ-32B | $0.18 | $0.2 | nan | nan |
| nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-70B | $0.38 | $0.38 | nan | nan |
| nscale/deepseek-ai/DeepSeek-R1-Distill-Llama-8B | $0.02 | $0.02 | nan | nan |
| nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | $0.09 | $0.09 | nan | nan |
| nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | $0.2 | $0.2 | nan | nan |
| nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | $0.07 | $0.07 | nan | nan |
| nscale/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | $0.15 | $0.15 | nan | nan |
| nscale/mistralai/mixtral-8x22b-instruct-v0.1 | $0.6 | $0.6 | nan | nan |
| nscale/meta-llama/Llama-3.1-8B-Instruct | $0.03 | $0.03 | nan | nan |
| nscale/meta-llama/Llama-3.3-70B-Instruct | $0.2 | $0.2 | nan | nan |
| nscale/black-forest-labs/FLUX.1-schnell | -- | -- | nan | nan |
| nscale/stabilityai/stable-diffusion-xl-base-1.0 | -- | -- | nan | nan |
| azure/gpt-4o-mini-tts | $2.5 | $10 | nan | nan |
| azure_ai/embed-v-4-0 | $0.12 | $0 | 128000 | nan |
| eu.anthropic.claude-3-7-sonnet-20250219-v1:0 | $3 | $15 | 200000 | 8192 |
| groq/llama-guard-3-8b | $0.2 | $0.2 | 8192 | 8192 |
| groq/meta-llama/llama-4-scout-17b-16e-instruct | $0.11 | $0.34 | 131072 | 8192 |
| groq/meta-llama/llama-4-maverick-17b-128e-instruct | $0.2 | $0.6 | 131072 | 8192 |
| groq/mistral-saba-24b | $0.79 | $0.79 | 32000 | 32000 |
| groq/qwen-qwq-32b | $0.29 | $0.39 | 128000 | 128000 |
| groq/playai-tts | -- | -- | 10000 | 10000 |
| featherless_ai/featherless-ai/Qwerky-72B | -- | -- | 32768 | 4096 |
| featherless_ai/featherless-ai/Qwerky-QwQ-32B | -- | -- | 32768 | 4096 |
| sambanova/Llama-4-Maverick-17B-128E-Instruct | $0.63 | $1.8 | 131072 | 131072 |
| sambanova/Llama-4-Scout-17B-16E-Instruct | $0.4 | $0.7 | 8192 | 8192 |
| sambanova/Meta-Llama-Guard-3-8B | $0.3 | $0.3 | 16384 | 16384 |
| sambanova/Qwen3-32B | $0.4 | $0.8 | 8192 | 8192 |
| sambanova/QwQ-32B | $0.5 | $1 | 16384 | 16384 |
| sambanova/Qwen2-Audio-7B-Instruct | $0.5 | $100 | 4096 | 4096 |
| sambanova/DeepSeek-R1-Distill-Llama-70B | $0.7 | $1.4 | 131072 | 131072 |
| sambanova/DeepSeek-R1 | $5 | $7 | 32768 | 32768 |
| sambanova/DeepSeek-V3-0324 | $3 | $4.5 | 32768 | 32768 |
| xai/grok-3 | $3 | $15 | 131072 | 131072 |
| gemini/gemini-2.5-flash-preview-tts | $0.3 | $2.5 | nan | nan |
| gemini/gemini-2.5-flash-preview-05-20 | $0.3 | $2.5 | 1.04858e+06 | 65535 |
| gemini-2.5-flash-preview-05-20 | $0.3 | $2.5 | 1.04858e+06 | 65535 |
| gemini-2.0-flash-preview-image-generation | $0.1 | $0.4 | 1.04858e+06 | 8192 |
| gemini-2.5-pro-preview-tts | $1.25 | $10 | 1.04858e+06 | 65535 |
| gemini/gemini-2.0-flash-preview-image-generation | $0.1 | $0.4 | 1.04858e+06 | 8192 |
| gemini/gemini-2.5-pro-preview-tts | $1.25 | $10 | 1.04858e+06 | 65535 |
| claude-opus-4-20250514 | $15 | $75 | 200000 | 32000 |
| claude-sonnet-4-20250514 | $3 | $15 | 1e+06 | 64000 |
| vertex_ai/claude-opus-4@20250514 | $15 | $75 | 200000 | 32000 |
| vertex_ai/claude-sonnet-4@20250514 | $3 | $15 | 1e+06 | 64000 |
| anthropic.claude-opus-4-20250514-v1:0 | $15 | $75 | 200000 | 32000 |
| anthropic.claude-sonnet-4-20250514-v1:0 | $3 | $15 | 1e+06 | 64000 |
| us.anthropic.claude-opus-4-20250514-v1:0 | $15 | $75 | 200000 | 32000 |
| us.anthropic.claude-sonnet-4-20250514-v1:0 | $3 | $15 | 1e+06 | 64000 |
| eu.anthropic.claude-opus-4-20250514-v1:0 | $15 | $75 | 200000 | 32000 |
| eu.anthropic.claude-sonnet-4-20250514-v1:0 | $3 | $15 | 1e+06 | 64000 |
| databricks/databricks-llama-4-maverick | $0.5 | $1.5 | 128000 | 128000 |
| azure_ai/mistral-medium-2505 | $0.4 | $2 | 131072 | 8191 |
| mistral/devstral-small-2505 | $0.1 | $0.3 | 128000 | 128000 |
| gpt-4o-mini-audio-preview | $0.15 | $0.6 | 128000 | 16384 |
| mistral/mistral-medium-2505 | $0.4 | $2 | 131072 | 8191 |
| embed-v4.0 | $0.12 | $0 | 1024 | nan |
| cerebras/qwen-3-32b | $0.4 | $0.8 | 128000 | 128000 |
| gemini-embedding-001 | $0.15 | $0 | 2048 | nan |
| claude-4-opus-20250514 | $15 | $75 | 200000 | 32000 |
| claude-4-sonnet-20250514 | $3 | $15 | 1e+06 | 64000 |
| fireworks_ai/accounts/fireworks/models/llama-v3p2-90b-vision-instruct | $0.9 | $0.9 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/deepseek-r1-0528 | $3 | $8 | 160000 | 160000 |
| codex-mini-latest | $1.5 | $6 | 200000 | 100000 |
| azure/codex-mini-latest | $1.5 | $6 | 200000 | 100000 |
| gemini-2.5-pro-preview-06-05 | $1.25 | $10 | 1.04858e+06 | 65535 |
| gemini/gemini-2.5-pro-preview-06-05 | $1.25 | $10 | 1.04858e+06 | 65535 |
| gpt-4o-audio-preview-2025-06-03 | $2.5 | $10 | 128000 | 16384 |
| o3-pro | $20 | $80 | 200000 | 100000 |
| o3-pro-2025-06-10 | $20 | $80 | 200000 | 100000 |
| mistral/magistral-medium-2506 | $2 | $5 | 40000 | 40000 |
| mistral/magistral-small-2506 | $0.5 | $1.5 | 40000 | 40000 |
| vertex_ai/claude-opus-4 | $15 | $75 | 200000 | 32000 |
| vertex_ai/claude-sonnet-4 | $3 | $15 | 1e+06 | 64000 |
| deepgram/nova-3 | -- | -- | nan | nan |
| deepgram/nova-3-general | -- | -- | nan | nan |
| deepgram/nova-3-medical | -- | -- | nan | nan |
| deepgram/nova-2 | -- | -- | nan | nan |
| deepgram/nova-2-general | -- | -- | nan | nan |
| deepgram/nova-2-meeting | -- | -- | nan | nan |
| deepgram/nova-2-phonecall | -- | -- | nan | nan |
| deepgram/nova-2-voicemail | -- | -- | nan | nan |
| deepgram/nova-2-finance | -- | -- | nan | nan |
| deepgram/nova-2-conversationalai | -- | -- | nan | nan |
| deepgram/nova-2-video | -- | -- | nan | nan |
| deepgram/nova-2-drivethru | -- | -- | nan | nan |
| deepgram/nova-2-automotive | -- | -- | nan | nan |
| deepgram/nova-2-atc | -- | -- | nan | nan |
| deepgram/nova | -- | -- | nan | nan |
| deepgram/nova-general | -- | -- | nan | nan |
| deepgram/nova-phonecall | -- | -- | nan | nan |
| deepgram/enhanced | -- | -- | nan | nan |
| deepgram/enhanced-general | -- | -- | nan | nan |
| deepgram/enhanced-meeting | -- | -- | nan | nan |
| deepgram/enhanced-phonecall | -- | -- | nan | nan |
| deepgram/enhanced-finance | -- | -- | nan | nan |
| deepgram/base | -- | -- | nan | nan |
| deepgram/base-general | -- | -- | nan | nan |
| deepgram/base-meeting | -- | -- | nan | nan |
| deepgram/base-phonecall | -- | -- | nan | nan |
| deepgram/base-voicemail | -- | -- | nan | nan |
| deepgram/base-finance | -- | -- | nan | nan |
| deepgram/base-conversationalai | -- | -- | nan | nan |
| deepgram/base-video | -- | -- | nan | nan |
| deepgram/whisper | -- | -- | nan | nan |
| deepgram/whisper-tiny | -- | -- | nan | nan |
| deepgram/whisper-base | -- | -- | nan | nan |
| deepgram/whisper-small | -- | -- | nan | nan |
| deepgram/whisper-medium | -- | -- | nan | nan |
| deepgram/whisper-large | -- | -- | nan | nan |
| azure/gpt-4o-transcribe | $2.5 | $10 | 16000 | 2000 |
| azure/gpt-4o-mini-transcribe | $1.25 | $5 | 16000 | 2000 |
| mistral/magistral-medium-latest | $2 | $5 | 40000 | 40000 |
| mistral/magistral-small-latest | $0.5 | $1.5 | 40000 | 40000 |
| xai/grok-3-latest | $3 | $15 | 131072 | 131072 |
| xai/grok-3-mini | $0.3 | $0.5 | 131072 | 131072 |
| xai/grok-3-mini-latest | $0.3 | $0.5 | 131072 | 131072 |
| xai/grok-3-mini-fast | $0.6 | $4 | 131072 | 131072 |
| vertex_ai/imagen-4.0-generate-preview-06-06 | -- | -- | nan | nan |
| vertex_ai/imagen-4.0-ultra-generate-preview-06-06 | -- | -- | nan | nan |
| vertex_ai/imagen-4.0-fast-generate-preview-06-06 | -- | -- | nan | nan |
| gemini/gemini-2.5-pro | $1.25 | $10 | 1.04858e+06 | 65535 |
| gemini-2.5-flash | $0.3 | $2.5 | 1.04858e+06 | 65535 |
| gemini-2.5-flash-lite-preview-06-17 | $0.1 | $0.4 | 1.04858e+06 | 65535 |
| openrouter/deepseek/deepseek-r1-0528 | $0.5 | $2.15 | 65336 | 8192 |
| openrouter/google/gemini-2.5-pro | $1.25 | $10 | 1.04858e+06 | 8192 |
| openrouter/google/gemini-2.5-flash | $0.3 | $2.5 | 1.04858e+06 | 8192 |
| openrouter/anthropic/claude-sonnet-4 | $3 | $15 | 1e+06 | 64000 |
| gemini/gemini-2.5-flash | $0.3 | $2.5 | 1.04858e+06 | 65535 |
| gemini/gemini-2.5-flash-lite-preview-06-17 | $0.1 | $0.4 | 1.04858e+06 | 65535 |
| gemini-2.5-pro | $1.25 | $10 | 1.04858e+06 | 65535 |
| apac.amazon.nova-micro-v1:0 | $0.04 | $0.15 | 128000 | 10000 |
| apac.amazon.nova-lite-v1:0 | $0.06 | $0.25 | 300000 | 10000 |
| apac.amazon.nova-pro-v1:0 | $0.84 | $3.36 | 300000 | 10000 |
| apac.anthropic.claude-3-haiku-20240307-v1:0 | $0.25 | $1.25 | 200000 | 4096 |
| apac.anthropic.claude-3-sonnet-20240229-v1:0 | $3 | $15 | 200000 | 4096 |
| apac.anthropic.claude-3-5-sonnet-20240620-v1:0 | $3 | $15 | 200000 | 4096 |
| apac.anthropic.claude-3-5-sonnet-20241022-v2:0 | $3 | $15 | 200000 | 8192 |
| apac.anthropic.claude-sonnet-4-20250514-v1:0 | $3 | $15 | 1e+06 | 64000 |
| azure/codex-mini | $1.5 | $6 | 200000 | 100000 |
| openrouter/mistralai/mistral-small-3.1-24b-instruct | $0.1 | $0.3 | 131072 | 131072 |
| openrouter/mistralai/mistral-small-3.2-24b-instruct | $0.1 | $0.3 | 128000 | 128000 |
| azure/o3-pro | $20 | $80 | 200000 | 100000 |
| azure/o3-pro-2025-06-10 | $20 | $80 | 200000 | 100000 |
| o3-deep-research | $10 | $40 | 200000 | 100000 |
| o3-deep-research-2025-06-26 | $10 | $40 | 200000 | 100000 |
| o4-mini-deep-research | $2 | $8 | 200000 | 100000 |
| o4-mini-deep-research-2025-06-26 | $2 | $8 | 200000 | 100000 |
| deepseek/deepseek-r1 | $0.55 | $2.19 | 65536 | 8192 |
| deepseek/deepseek-v3 | $0.27 | $1.1 | 65536 | 8192 |
| elevenlabs/scribe_v1 | -- | -- | nan | nan |
| elevenlabs/scribe_v1_experimental | -- | -- | nan | nan |
| azure_ai/cohere-rerank-v3.5 | $0 | $0 | 4096 | 4096 |
| watsonx/mistralai/mistral-large | $3 | $10 | 131072 | 16384 |
| azure/o3-deep-research | $10 | $40 | 200000 | 100000 |
| mistral/devstral-small-2507 | $0.1 | $0.3 | 128000 | 128000 |
| mistral/devstral-medium-2507 | $0.4 | $2 | 128000 | 128000 |
| xai/grok-4 | $3 | $15 | 256000 | 256000 |
| xai/grok-4-0709 | $3 | $15 | 256000 | 256000 |
| xai/grok-4-latest | $3 | $15 | 256000 | 256000 |
| eu.mistral.pixtral-large-2502-v1:0 | $2 | $6 | 128000 | 4096 |
| us.mistral.pixtral-large-2502-v1:0 | $2 | $6 | 128000 | 4096 |
| dashscope/qwen-max | $1.6 | $6.4 | 30720 | 8192 |
| dashscope/qwen-plus-latest | -- | -- | 997952 | 32768 |
| dashscope/qwen-turbo-latest | $0.05 | $0.2 | 1e+06 | 16384 |
| dashscope/qwen3-30b-a3b | -- | -- | 129024 | 16384 |
| jamba-large-1.7 | $2 | $8 | 256000 | 256000 |
| jamba-mini-1.7 | $0.2 | $0.4 | 256000 | 256000 |
| moonshot/moonshot-v1-8k | $0.2 | $2 | 8192 | 8192 |
| moonshot/moonshot-v1-32k | $1 | $3 | 32768 | 32768 |
| moonshot/moonshot-v1-128k | $2 | $5 | 131072 | 131072 |
| moonshot/moonshot-v1-auto | $2 | $5 | 131072 | 131072 |
| moonshot/kimi-k2-0711-preview | $0.6 | $2.5 | 131072 | 131072 |
| moonshot/moonshot-v1-32k-0430 | $1 | $3 | 32768 | 32768 |
| moonshot/moonshot-v1-128k-0430 | $2 | $5 | 131072 | 131072 |
| moonshot/moonshot-v1-8k-0430 | $0.2 | $2 | 8192 | 8192 |
| groq/moonshotai-kimi-k2-instruct | $1 | $3 | 131072 | 16384 |
| together_ai/deepseek-ai/DeepSeek-R1 | $3 | $7 | 128000 | 20480 |
| together_ai/moonshotai/Kimi-K2-Instruct | $1 | $3 | nan | nan |
| azure_ai/grok-3 | $3 | $15 | 131072 | 131072 |
| azure_ai/global/grok-3 | $3 | $15 | 131072 | 131072 |
| azure_ai/global/grok-3-mini | $0.25 | $1.27 | 131072 | 131072 |
| azure_ai/grok-3-mini | $0.25 | $1.27 | 131072 | 131072 |
| azure_ai/jais-30b-chat | $3200 | $9710 | 8192 | 8192 |
| groq/moonshotai/kimi-k2-instruct | $1 | $3 | 131072 | 16384 |
| openrouter/switchpoint/router | $0.85 | $3.4 | 131072 | 131072 |
| v0/v0-1.0-md | $3 | $15 | 128000 | 128000 |
| v0/v0-1.5-md | $3 | $15 | 128000 | 128000 |
| v0/v0-1.5-lg | $15 | $75 | 512000 | 512000 |
| bedrock/us-gov-east-1/amazon.titan-embed-text-v1 | $0.1 | $0 | 8192 | nan |
| bedrock/us-gov-east-1/amazon.titan-embed-text-v2:0 | $0.2 | $0 | 8192 | nan |
| bedrock/us-gov-east-1/amazon.titan-text-express-v1 | $1.3 | $1.7 | 42000 | 8000 |
| bedrock/us-gov-east-1/amazon.titan-text-lite-v1 | $0.3 | $0.4 | 42000 | 4000 |
| bedrock/us-gov-east-1/amazon.titan-text-premier-v1:0 | $0.5 | $1.5 | 42000 | 32000 |
| bedrock/us-gov-east-1/anthropic.claude-3-5-sonnet-20240620-v1:0 | $3.6 | $18 | 200000 | 8192 |
| bedrock/us-gov-east-1/anthropic.claude-3-haiku-20240307-v1:0 | $0.3 | $1.5 | 200000 | 4096 |
| bedrock/us-gov-east-1/meta.llama3-70b-instruct-v1:0 | $2.65 | $3.5 | 8000 | 2048 |
| bedrock/us-gov-east-1/meta.llama3-8b-instruct-v1:0 | $0.3 | $2.65 | 8000 | 2048 |
| bedrock/us-gov-west-1/amazon.titan-embed-text-v1 | $0.1 | $0 | 8192 | nan |
| bedrock/us-gov-west-1/amazon.titan-embed-text-v2:0 | $0.2 | $0 | 8192 | nan |
| bedrock/us-gov-west-1/amazon.titan-text-express-v1 | $1.3 | $1.7 | 42000 | 8000 |
| bedrock/us-gov-west-1/amazon.titan-text-lite-v1 | $0.3 | $0.4 | 42000 | 4000 |
| bedrock/us-gov-west-1/amazon.titan-text-premier-v1:0 | $0.5 | $1.5 | 42000 | 32000 |
| bedrock/us-gov-west-1/anthropic.claude-3-5-sonnet-20240620-v1:0 | $3.6 | $18 | 200000 | 8192 |
| bedrock/us-gov-west-1/anthropic.claude-3-haiku-20240307-v1:0 | $0.3 | $1.5 | 200000 | 4096 |
| bedrock/us-gov-west-1/meta.llama3-70b-instruct-v1:0 | $2.65 | $3.5 | 8000 | 2048 |
| bedrock/us-gov-west-1/meta.llama3-8b-instruct-v1:0 | $0.3 | $2.65 | 8000 | 2048 |
| bedrock/us-gov-east-1/amazon.nova-pro-v1:0 | $0.96 | $3.84 | 300000 | 10000 |
| bedrock/us-gov-west-1/amazon.nova-pro-v1:0 | $0.96 | $3.84 | 300000 | 10000 |
| moonshot/kimi-latest | $2 | $5 | 131072 | 131072 |
| moonshot/kimi-latest-8k | $0.2 | $2 | 8192 | 8192 |
| moonshot/kimi-latest-32k | $1 | $3 | 32768 | 32768 |
| moonshot/kimi-latest-128k | $2 | $5 | 131072 | 131072 |
| moonshot/kimi-thinking-preview | $0.6 | $2.5 | 131072 | 131072 |
| moonshot/moonshot-v1-8k-vision-preview | $0.2 | $2 | 8192 | 8192 |
| moonshot/moonshot-v1-32k-vision-preview | $1 | $3 | 32768 | 32768 |
| moonshot/moonshot-v1-128k-vision-preview | $2 | $5 | 131072 | 131072 |
| groq/qwen/qwen3-32b | $0.29 | $0.59 | 131000 | 131000 |
| openrouter/qwen/qwen-vl-plus | $0.21 | $0.63 | 8192 | 2048 |
| fireworks_ai/accounts/fireworks/models/kimi-k2-instruct | $0.6 | $2.5 | 131072 | 16384 |
| lambda_ai/deepseek-llama3.3-70b | $0.2 | $0.6 | 131072 | 131072 |
| lambda_ai/deepseek-r1-0528 | $0.2 | $0.6 | 131072 | 131072 |
| lambda_ai/deepseek-r1-671b | $0.8 | $0.8 | 131072 | 131072 |
| lambda_ai/deepseek-v3-0324 | $0.2 | $0.6 | 131072 | 131072 |
| lambda_ai/hermes3-405b | $0.8 | $0.8 | 131072 | 131072 |
| lambda_ai/hermes3-70b | $0.12 | $0.3 | 131072 | 131072 |
| lambda_ai/hermes3-8b | $0.02 | $0.04 | 131072 | 131072 |
| lambda_ai/lfm-40b | $0.1 | $0.2 | 131072 | 131072 |
| lambda_ai/lfm-7b | $0.02 | $0.04 | 131072 | 131072 |
| lambda_ai/llama-4-maverick-17b-128e-instruct-fp8 | $0.05 | $0.1 | 131072 | 8192 |
| lambda_ai/llama-4-scout-17b-16e-instruct | $0.05 | $0.1 | 16384 | 8192 |
| lambda_ai/llama3.1-405b-instruct-fp8 | $0.8 | $0.8 | 131072 | 131072 |
| lambda_ai/llama3.1-70b-instruct-fp8 | $0.12 | $0.3 | 131072 | 131072 |
| lambda_ai/llama3.1-8b-instruct | $0.02 | $0.04 | 131072 | 131072 |
| lambda_ai/llama3.1-nemotron-70b-instruct-fp8 | $0.12 | $0.3 | 131072 | 131072 |
| lambda_ai/llama3.2-11b-vision-instruct | $0.02 | $0.02 | 131072 | 131072 |
| lambda_ai/llama3.2-3b-instruct | $0.02 | $0.02 | 131072 | 131072 |
| lambda_ai/llama3.3-70b-instruct-fp8 | $0.12 | $0.3 | 131072 | 131072 |
| lambda_ai/qwen25-coder-32b-instruct | $0.05 | $0.1 | 131072 | 131072 |
| lambda_ai/qwen3-32b-fp8 | $0.05 | $0.1 | 131072 | 131072 |
| recraft/recraftv3 | -- | -- | nan | nan |
| recraft/recraftv2 | -- | -- | nan | nan |
| morph/morph-v3-fast | $0.8 | $1.2 | 16000 | 16000 |
| morph/morph-v3-large | $0.9 | $1.9 | 16000 | 16000 |
| gemini/gemini-2.0-flash-live-001 | $0.35 | $1.5 | 1.04858e+06 | 65535 |
| gemini-2.0-flash-live-preview-04-09 | $0.5 | $2 | 1.04858e+06 | 65535 |
| vertex_ai/meta/llama-3.1-8b-instruct-maas | $0 | $0 | 128000 | 2048 |
| vertex_ai/meta/llama-3.1-70b-instruct-maas | $0 | $0 | 128000 | 2048 |
| vertex_ai/meta/llama-3.1-405b-instruct-maas | $5 | $16 | 128000 | 2048 |
| hyperbolic/moonshotai/Kimi-K2-Instruct | $2 | $2 | 131072 | 131072 |
| hyperbolic/deepseek-ai/DeepSeek-R1-0528 | $0.25 | $0.25 | 131072 | 131072 |
| hyperbolic/Qwen/Qwen3-235B-A22B | $2 | $2 | 131072 | 131072 |
| hyperbolic/deepseek-ai/DeepSeek-V3-0324 | $0.4 | $0.4 | 32768 | 32768 |
| hyperbolic/Qwen/QwQ-32B | $0.2 | $0.2 | 131072 | 131072 |
| hyperbolic/deepseek-ai/DeepSeek-R1 | $0.4 | $0.4 | 32768 | 32768 |
| hyperbolic/deepseek-ai/DeepSeek-V3 | $0.2 | $0.2 | 32768 | 32768 |
| hyperbolic/meta-llama/Llama-3.3-70B-Instruct | $0.12 | $0.3 | 131072 | 131072 |
| hyperbolic/Qwen/Qwen2.5-Coder-32B-Instruct | $0.12 | $0.3 | 32768 | 32768 |
| hyperbolic/meta-llama/Llama-3.2-3B-Instruct | $0.12 | $0.3 | 32768 | 32768 |
| hyperbolic/Qwen/Qwen2.5-72B-Instruct | $0.12 | $0.3 | 131072 | 131072 |
| hyperbolic/meta-llama/Meta-Llama-3-70B-Instruct | $0.12 | $0.3 | 131072 | 131072 |
| hyperbolic/NousResearch/Hermes-3-Llama-3.1-70B | $0.12 | $0.3 | 32768 | 32768 |
| hyperbolic/meta-llama/Meta-Llama-3.1-405B-Instruct | $0.12 | $0.3 | 32768 | 32768 |
| hyperbolic/meta-llama/Meta-Llama-3.1-8B-Instruct | $0.12 | $0.3 | 32768 | 32768 |
| hyperbolic/meta-llama/Meta-Llama-3.1-70B-Instruct | $0.12 | $0.3 | 32768 | 32768 |
| gemini/gemini-2.5-flash-lite | $0.1 | $0.4 | 1.04858e+06 | 65535 |
| gemini-2.5-flash-lite | $0.1 | $0.4 | 1.04858e+06 | 65535 |
| gpt-4o-realtime-preview-2025-06-03 | $5 | $20 | 128000 | 4096 |
| openrouter/bytedance/ui-tars-1.5-7b | $0.1 | $0.2 | 131072 | 2048 |
| openrouter/qwen/qwen3-coder | $0.22 | $0.95 | 262100 | 262100 |
| gemini/imagen-4.0-generate-preview-06-06 | -- | -- | nan | nan |
| gemini/imagen-4.0-ultra-generate-preview-06-06 | -- | -- | nan | nan |
| gemini/imagen-4.0-fast-generate-preview-06-06 | -- | -- | nan | nan |
| gemini/imagen-3.0-generate-002 | -- | -- | nan | nan |
| gemini/imagen-3.0-generate-001 | -- | -- | nan | nan |
| gemini/imagen-3.0-fast-generate-001 | -- | -- | nan | nan |
| openrouter/x-ai/grok-4 | $3 | $15 | 256000 | 256000 |
| claude-opus-4-1-20250805 | $15 | $75 | 200000 | 32000 |
| anthropic.claude-opus-4-1-20250805-v1:0 | $15 | $75 | 200000 | 32000 |
| us.anthropic.claude-opus-4-1-20250805-v1:0 | $15 | $75 | 200000 | 32000 |
| eu.anthropic.claude-opus-4-1-20250805-v1:0 | $15 | $75 | 200000 | 32000 |
| fireworks_ai/accounts/fireworks/models/glm-4p5 | $0.55 | $2.19 | 128000 | 96000 |
| fireworks_ai/accounts/fireworks/models/glm-4p5-air | $0.22 | $0.88 | 128000 | 96000 |
| fireworks_ai/accounts/fireworks/models/gpt-oss-120b | $0.15 | $0.6 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/gpt-oss-20b | $0.05 | $0.2 | 131072 | 131072 |
| vertex_ai/claude-opus-4-1 | $15 | $75 | 200000 | 32000 |
| vertex_ai/claude-opus-4-1@20250805 | $15 | $75 | 200000 | 32000 |
| openai.gpt-oss-20b-1:0 | $0.07 | $0.3 | 128000 | 128000 |
| openai.gpt-oss-120b-1:0 | $0.15 | $0.6 | 128000 | 128000 |
| gpt-5 | $1.25 | $10 | 272000 | 128000 |
| gpt-5-mini | $0.25 | $2 | 272000 | 128000 |
| gpt-5-nano | $0.05 | $0.4 | 272000 | 128000 |
| gpt-5-chat | $1.25 | $10 | 128000 | 16384 |
| gpt-5-chat-latest | $1.25 | $10 | 128000 | 16384 |
| gpt-5-2025-08-07 | $1.25 | $10 | 272000 | 128000 |
| gpt-5-mini-2025-08-07 | $0.25 | $2 | 272000 | 128000 |
| gpt-5-nano-2025-08-07 | $0.05 | $0.4 | 272000 | 128000 |
| azure/gpt-5 | $1.25 | $10 | 272000 | 128000 |
| azure/gpt-5-2025-08-07 | $1.25 | $10 | 272000 | 128000 |
| azure/gpt-5-mini | $0.25 | $2 | 272000 | 128000 |
| azure/gpt-5-mini-2025-08-07 | $0.25 | $2 | 272000 | 128000 |
| azure/gpt-5-nano-2025-08-07 | $0.05 | $0.4 | 272000 | 128000 |
| azure/gpt-5-nano | $0.05 | $0.4 | 272000 | 128000 |
| azure/gpt-5-chat | $1.25 | $10 | 128000 | 16384 |
| azure/gpt-5-chat-latest | $1.25 | $10 | 128000 | 16384 |
| groq/openai/gpt-oss-20b | $0.08 | $0.3 | 131072 | 32768 |
| groq/openai/gpt-oss-120b | $0.15 | $0.6 | 131072 | 32766 |
| claude-opus-4-1 | $15 | $75 | 200000 | 32000 |
| oci/meta.llama-4-maverick-17b-128e-instruct-fp8 | $0.72 | $0.72 | 512000 | 4000 |
| oci/meta.llama-4-scout-17b-16e-instruct | $0.72 | $0.72 | 192000 | 4000 |
| oci/meta.llama-3.3-70b-instruct | $0.72 | $0.72 | 128000 | 4000 |
| oci/meta.llama-3.2-90b-vision-instruct | $2 | $2 | 128000 | 4000 |
| oci/meta.llama-3.1-405b-instruct | $10.68 | $10.68 | 128000 | 4000 |
| oci/xai.grok-4 | $3 | $15 | 128000 | 128000 |
| oci/xai.grok-3 | $3 | $15 | 131072 | 131072 |
| oci/xai.grok-3-mini | $0.3 | $0.5 | 131072 | 131072 |
| oci/xai.grok-3-fast | $5 | $25 | 131072 | 131072 |
| oci/xai.grok-3-mini-fast | $0.6 | $4 | 131072 | 131072 |
| cerebras/openai/gpt-oss-20b | $0.07 | $0.3 | 131072 | 32768 |
| cerebras/openai/gpt-oss-120b | $0.25 | $0.69 | 131072 | 32768 |
| openrouter/openai/gpt-oss-20b | $0.02 | $0.1 | 131072 | 32768 |
| openrouter/openai/gpt-oss-120b | $0.18 | $0.8 | 131072 | 32768 |
| gradient_ai/anthropic-claude-3.7-sonnet | $3 | $15 | 200000 | 1024 |
| gradient_ai/anthropic-claude-3.5-sonnet | $3 | $15 | 200000 | 1024 |
| gradient_ai/anthropic-claude-3.5-haiku | $0.8 | $4 | 200000 | 1024 |
| gradient_ai/anthropic-claude-3-opus | $15 | $75 | 200000 | 1024 |
| gradient_ai/deepseek-r1-distill-llama-70b | $0.99 | $0.99 | 32768 | 8000 |
| gradient_ai/llama3.3-70b-instruct | $0.65 | $0.65 | 128000 | 2048 |
| gradient_ai/llama3-8b-instruct | $0.2 | $0.2 | 8192 | 512 |
| gradient_ai/mistral-nemo-instruct-2407 | $0.3 | $0.3 | 128000 | 512 |
| gradient_ai/openai-o3 | $2 | $8 | 200000 | 100000 |
| gradient_ai/openai-o3-mini | $1.1 | $4.4 | 200000 | 100000 |
| gradient_ai/openai-gpt-4o | -- | -- | 128000 | 16384 |
| gradient_ai/openai-gpt-4o-mini | -- | -- | 128000 | 16384 |
| gradient_ai/alibaba-qwen3-32b | -- | -- | 131072 | 40960 |
| azure_ai/FLUX-1.1-pro | -- | -- | nan | nan |
| azure_ai/FLUX.1-Kontext-pro | -- | -- | nan | nan |
| vertex_ai/deepseek-ai/deepseek-r1-0528-maas | $1.35 | $5.4 | 65336 | 8192 |
| openrouter/deepseek/deepseek-chat-v3-0324 | $0.14 | $0.28 | 65536 | 8192 |
| vertex_ai/qwen/qwen3-coder-480b-a35b-instruct-maas | $1 | $4 | 262144 | 32768 |
| vertex_ai/qwen/qwen3-235b-a22b-instruct-2507-maas | $0.25 | $1 | 262144 | 16384 |
| together_ai/Qwen/Qwen3-235B-A22B-Instruct-2507-tput | $0.2 | $6 | 262000 | nan |
| together_ai/Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 | $2 | $2 | 256000 | nan |
| together_ai/Qwen/Qwen3-235B-A22B-Thinking-2507 | $0.65 | $3 | 256000 | nan |
| together_ai/Qwen/Qwen3-235B-A22B-fp8-tput | $0.2 | $0.6 | 40000 | nan |
| together_ai/deepseek-ai/DeepSeek-R1-0528-tput | $0.55 | $2.19 | 128000 | nan |
| together_ai/openai/gpt-oss-120b | $0.15 | $0.6 | 131072 | 131072 |
| together_ai/OpenAI/gpt-oss-20B | $0.05 | $0.2 | 128000 | nan |
| together_ai/zai-org/GLM-4.5-Air-FP8 | $0.2 | $1.1 | 128000 | nan |
| fireworks_ai/accounts/fireworks/models/deepseek-v3-0324 | $0.9 | $0.9 | 163840 | 163840 |
| vertex_ai/imagen-4.0-generate-001 | -- | -- | nan | nan |
| vertex_ai/imagen-4.0-ultra-generate-001 | -- | -- | nan | nan |
| vertex_ai/imagen-4.0-fast-generate-001 | -- | -- | nan | nan |
| gemini/imagen-4.0-generate-001 | -- | -- | nan | nan |
| gemini/imagen-4.0-ultra-generate-001 | -- | -- | nan | nan |
| gemini/imagen-4.0-fast-generate-001 | -- | -- | nan | nan |
| deepinfra/deepseek-ai/DeepSeek-V3 | $0.38 | $0.89 | 163840 | 163840 |
| deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo | $0.02 | $0.03 | 131072 | 131072 |
| deepinfra/google/gemma-2-9b-it | $0.03 | $0.06 | 8192 | 8192 |
| deepinfra/deepseek-ai/DeepSeek-R1-0528-Turbo | $1 | $3 | 32768 | 32768 |
| deepinfra/Qwen/Qwen2-7B-Instruct | $0.06 | $0.06 | 32768 | 32768 |
| deepinfra/Qwen/QVQ-72B-Preview | $0.25 | $0.5 | 32000 | 32000 |
| deepinfra/meta-llama/Llama-3.3-70B-Instruct | $0.23 | $0.4 | 131072 | 131072 |
| deepinfra/microsoft/Phi-4-multimodal-instruct | $0.05 | $0.1 | 131072 | 131072 |
| deepinfra/mistralai/Devstral-Small-2507 | $0.07 | $0.28 | 128000 | 128000 |
| deepinfra/microsoft/WizardLM-2-7B | $0.06 | $0.06 | 32768 | 32768 |
| deepinfra/meta-llama/Llama-3.2-90B-Vision-Instruct | $0.35 | $0.4 | 32768 | 32768 |
| deepinfra/mistralai/Mistral-Small-3.2-24B-Instruct-2506 | $0.08 | $0.2 | 128000 | 128000 |
| deepinfra/deepseek-ai/DeepSeek-V3-0324 | $0.25 | $0.88 | 163840 | 163840 |
| deepinfra/anthropic/claude-3-7-sonnet-latest | $3.3 | $16.5 | 200000 | 200000 |
| deepinfra/cognitivecomputations/dolphin-2.9.1-llama-3-70b | $0.35 | $0.4 | 8192 | 8192 |
| deepinfra/Qwen/Qwen2.5-Coder-32B-Instruct | $0.06 | $0.15 | 32768 | 32768 |
| deepinfra/Qwen/Qwen3-235B-A22B | $0.18 | $0.54 | 40960 | 40960 |
| deepinfra/deepseek-ai/DeepSeek-V3-0324-Turbo | $1 | $3 | 32768 | 32768 |
| deepinfra/microsoft/WizardLM-2-8x22B | $0.48 | $0.48 | 65536 | 65536 |
| deepinfra/Sao10K/L3-8B-Lunaris-v1-Turbo | $0.04 | $0.05 | 8192 | 8192 |
| deepinfra/meta-llama/Llama-Guard-4-12B | $0.18 | $0.18 | 163840 | 163840 |
| deepinfra/meta-llama/Llama-3.2-1B-Instruct | $0 | $0.01 | 131072 | 131072 |
| deepinfra/google/gemma-2-27b-it | $0.27 | $0.27 | 8192 | 8192 |
| deepinfra/Qwen/Qwen2.5-VL-32B-Instruct | $0.2 | $0.6 | 128000 | 128000 |
| deepinfra/nvidia/Llama-3.1-Nemotron-70B-Instruct | $0.6 | $0.6 | 131072 | 131072 |
| deepinfra/mistralai/Mixtral-8x22B-Instruct-v0.1 | $0.65 | $0.65 | 65536 | 65536 |
| deepinfra/Qwen/Qwen2.5-7B-Instruct | $0.04 | $0.1 | 32768 | 32768 |
| deepinfra/google/gemini-1.5-flash-8b | $0.04 | $0.15 | 1e+06 | 1e+06 |
| deepinfra/NousResearch/Hermes-3-Llama-3.1-70B | $0.3 | $0.3 | 131072 | 131072 |
| deepinfra/deepseek-ai/DeepSeek-R1-Distill-Llama-70B | $0.2 | $0.6 | 131072 | 131072 |
| deepinfra/meta-llama/Llama-Guard-3-8B | $0.06 | $0.06 | 131072 | 131072 |
| deepinfra/mistralai/Mistral-Small-24B-Instruct-2501 | $0.05 | $0.08 | 32768 | 32768 |
| deepinfra/anthropic/claude-4-opus | $16.5 | $82.5 | 200000 | 200000 |
| deepinfra/openchat/openchat-3.6-8b | $0.06 | $0.06 | 8192 | 8192 |
| deepinfra/google/gemma-3-27b-it | $0.09 | $0.16 | 131072 | 131072 |
| deepinfra/Austism/chronos-hermes-13b-v2 | $0.13 | $0.13 | 4096 | 4096 |
| deepinfra/Sao10K/L3.1-70B-Euryale-v2.2 | $0.65 | $0.75 | 131072 | 131072 |
| deepinfra/Qwen/QwQ-32B-Preview | $0.12 | $0.18 | 32768 | 32768 |
| deepinfra/anthropic/claude-4-sonnet | $3.3 | $16.5 | 200000 | 200000 |
| deepinfra/microsoft/Phi-3-medium-4k-instruct | $0.14 | $0.14 | 4096 | 4096 |
| deepinfra/mattshumer/Reflection-Llama-3.1-70B | $0.35 | $0.4 | 8192 | 8192 |
| deepinfra/Sao10K/L3.3-70B-Euryale-v2.3 | $0.65 | $0.75 | 131072 | 131072 |
| deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct | $0.4 | $0.4 | 131072 | 131072 |
| deepinfra/deepseek-ai/DeepSeek-V3.1 | $0.27 | $1 | 163840 | 163840 |
| deepinfra/Qwen/Qwen2.5-Coder-7B | $0.02 | $0.05 | 32768 | 32768 |
| deepinfra/deepseek-ai/DeepSeek-Prover-V2-671B | $0.5 | $2.18 | 163840 | 163840 |
| deepinfra/zai-org/GLM-4.5 | $0.4 | $1.6 | 131072 | 131072 |
| deepinfra/meta-llama/Llama-3.2-3B-Instruct | $0.02 | $0.02 | 131072 | 131072 |
| deepinfra/google/gemini-1.5-flash | $0.08 | $0.3 | 1e+06 | 1e+06 |
| deepinfra/KoboldAI/LLaMA2-13B-Tiefighter | $0.1 | $0.1 | 4096 | 4096 |
| deepinfra/google/gemini-2.5-pro | $1.25 | $10 | 1e+06 | 1e+06 |
| deepinfra/Qwen/Qwen3-30B-A3B | $0.08 | $0.29 | 40960 | 40960 |
| deepinfra/Qwen/QwQ-32B | $0.15 | $0.4 | 131072 | 131072 |
| deepinfra/moonshotai/Kimi-K2-Instruct | $0.5 | $2 | 131072 | 131072 |
| deepinfra/Sao10K/L3-70B-Euryale-v2.1 | $0.7 | $0.8 | 8192 | 8192 |
| deepinfra/microsoft/phi-4-reasoning-plus | $0.07 | $0.35 | 32768 | 32768 |
| deepinfra/google/gemma-3-12b-it | $0.05 | $0.1 | 131072 | 131072 |
| deepinfra/google/gemini-2.5-flash | $0.3 | $2.5 | 1e+06 | 1e+06 |
| deepinfra/deepseek-ai/DeepSeek-R1 | $0.7 | $2.4 | 163840 | 163840 |
| deepinfra/mistralai/Mistral-7B-Instruct-v0.3 | $0.03 | $0.05 | 32768 | 32768 |
| deepinfra/Qwen/Qwen2.5-72B-Instruct | $0.12 | $0.39 | 32768 | 32768 |
| deepinfra/Qwen/Qwen3-14B | $0.06 | $0.24 | 40960 | 40960 |
| deepinfra/allenai/olmOCR-7B-0725-FP8 | $0.27 | $1.5 | 16384 | 16384 |
| deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct | $0.4 | $1.6 | 262144 | 262144 |
| deepinfra/microsoft/phi-4 | $0.07 | $0.14 | 16384 | 16384 |
| deepinfra/NousResearch/Hermes-3-Llama-3.1-405B | $1 | $1 | 131072 | 131072 |
| deepinfra/zai-org/GLM-4.5-Air | $0.2 | $1.1 | 131072 | 131072 |
| deepinfra/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | $0.27 | $0.27 | 131072 | 131072 |
| deepinfra/openai/gpt-oss-120b | $0.05 | $0.45 | 131072 | 131072 |
| deepinfra/google/codegemma-7b-it | $0.07 | $0.07 | 8192 | 8192 |
| deepinfra/Qwen/Qwen3-Coder-480B-A35B-Instruct-Turbo | $0.29 | $1.2 | 262144 | 262144 |
| deepinfra/mistralai/Mistral-Nemo-Instruct-2407 | $0.02 | $0.04 | 131072 | 131072 |
| deepinfra/openbmb/MiniCPM-Llama3-V-2_5 | $0.34 | $0.34 | 8192 | 8192 |
| deepinfra/bigcode/starcoder2-15b-instruct-v0.1 | $0.15 | $0.15 | 4096 | 4096 |
| deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8 | $0.15 | $0.6 | 1.04858e+06 | 1.04858e+06 |
| deepinfra/meta-llama/Llama-4-Scout-17B-16E-Instruct | $0.08 | $0.3 | 327680 | 327680 |
| deepinfra/google/gemini-2.0-flash-001 | $0.1 | $0.4 | 1e+06 | 1e+06 |
| deepinfra/Gryphe/MythoMax-L2-13b-turbo | $0.13 | $0.13 | 4096 | 4096 |
| deepinfra/google/gemma-1.1-7b-it | $0.07 | $0.07 | 8192 | 8192 |
| deepinfra/meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo | $0.1 | $0.28 | 131072 | 131072 |
| deepinfra/meta-llama/Meta-Llama-3.1-8B-Instruct | $0.03 | $0.05 | 131072 | 131072 |
| deepinfra/Qwen/Qwen3-32B | $0.1 | $0.28 | 40960 | 40960 |
| deepinfra/Qwen/Qwen3-235B-A22B-Thinking-2507 | $0.3 | $2.9 | 262144 | 262144 |
| deepinfra/nvidia/Nemotron-4-340B-Instruct | $4.2 | $4.2 | 4096 | 4096 |
| deepinfra/deepseek-ai/DeepSeek-R1-0528 | $0.5 | $2.15 | 163840 | 163840 |
| deepinfra/deepseek-ai/DeepSeek-R1-Turbo | $1 | $3 | 40960 | 40960 |
| deepinfra/NovaSky-AI/Sky-T1-32B-Preview | $0.12 | $0.18 | 32768 | 32768 |
| deepinfra/Qwen/Qwen3-235B-A22B-Instruct-2507 | $0.09 | $0.6 | 262144 | 262144 |
| deepinfra/mistralai/Mistral-Small-3.1-24B-Instruct-2503 | $0.05 | $0.1 | 128000 | 128000 |
| deepinfra/Qwen/Qwen2-72B-Instruct | $0.35 | $0.4 | 32768 | 32768 |
| deepinfra/meta-llama/Llama-4-Maverick-17B-128E-Instruct-Turbo | $0.5 | $0.5 | 8192 | 8192 |
| deepinfra/Sao10K/L3-8B-Lunaris-v1 | $0.03 | $0.06 | 8192 | 8192 |
| deepinfra/google/gemma-3-4b-it | $0.04 | $0.08 | 131072 | 131072 |
| deepinfra/mistralai/Mistral-7B-Instruct-v0.2 | $0.06 | $0.06 | 32768 | 32768 |
| deepinfra/meta-llama/Llama-3.3-70B-Instruct-Turbo | $0.13 | $0.39 | 131072 | 131072 |
| deepinfra/mistralai/Devstral-Small-2505 | $0.06 | $0.12 | 128000 | 128000 |
| deepinfra/meta-llama/Llama-3.2-11B-Vision-Instruct | $0.05 | $0.05 | 131072 | 131072 |
| deepinfra/openai/gpt-oss-20b | $0.04 | $0.15 | 131072 | 131072 |
| voyage/voyage-context-3 | $0.18 | $0 | 120000 | nan |
| openrouter/deepseek/deepseek-chat-v3.1 | $0.2 | $0.8 | 163840 | 163840 |
| aiml/flux/kontext-pro/text-to-image | -- | -- | nan | nan |
| aiml/flux/kontext-max/text-to-image | -- | -- | nan | nan |
| aiml/flux-pro/v1.1-ultra | -- | -- | nan | nan |
| aiml/flux-pro/v1.1 | -- | -- | nan | nan |
| aiml/flux-realism | -- | -- | nan | nan |
| aiml/flux/schnell | -- | -- | nan | nan |
| aiml/flux/dev | -- | -- | nan | nan |
| aiml/flux-pro | -- | -- | nan | nan |
| aiml/dall-e-3 | -- | -- | nan | nan |
| aiml/dall-e-2 | -- | -- | nan | nan |
| gemini/gemini-2.5-flash-image-preview | $0.3 | $30 | 1.04858e+06 | 65535 |
| gemini-2.5-flash-image-preview | $0.3 | $30 | 1.04858e+06 | 65535 |
| openrouter/anthropic/claude-opus-4 | $15 | $75 | 200000 | 32000 |
| openrouter/anthropic/claude-opus-4.1 | $15 | $75 | 200000 | 32000 |
| fireworks_ai/accounts/fireworks/models/deepseek-v3p1 | $0.56 | $1.68 | 128000 | 8192 |
| openrouter/openai/gpt-5-mini | $0.25 | $2 | 272000 | 128000 |
| openrouter/openai/gpt-5-nano | $0.05 | $0.4 | 272000 | 128000 |
| openrouter/openai/gpt-5-chat | $1.25 | $10 | 128000 | 16384 |
| xai/grok-code-fast-1 | $0.2 | $1.5 | 256000 | 256000 |
| xai/grok-code-fast | $0.2 | $1.5 | 256000 | 256000 |
| xai/grok-code-fast-1-0825 | $0.2 | $1.5 | 256000 | 256000 |
| gpt-realtime | $4 | $16 | 32000 | 4096 |
| gpt-realtime-2025-08-28 | $4 | $16 | 32000 | 4096 |
| vercel_ai_gateway/alibaba/qwen3-coder | $0.4 | $1.6 | 262144 | 66536 |
| vercel_ai_gateway/mistral/codestral-embed | $0.15 | $0 | 0 | 0 |
| vercel_ai_gateway/google/gemini-2.5-pro | $2.5 | $10 | 1.04858e+06 | 65536 |
| vercel_ai_gateway/deepseek/deepseek-v3 | $0.9 | $0.9 | 128000 | 8192 |
| vercel_ai_gateway/amazon/nova-lite | $0.06 | $0.24 | 300000 | 8192 |
| vercel_ai_gateway/meta/llama-4-scout | $0.1 | $0.3 | 131072 | 8192 |
| vercel_ai_gateway/meta/llama-3.2-1b | $0.1 | $0.1 | 128000 | 8192 |
| vercel_ai_gateway/mistral/mistral-small | $0.1 | $0.3 | 32000 | 4000 |
| vercel_ai_gateway/google/gemini-2.5-flash | $0.3 | $2.5 | 1e+06 | 65536 |
| vercel_ai_gateway/inception/mercury-coder-small | $0.25 | $1 | 32000 | 16384 |
| vercel_ai_gateway/openai/text-embedding-3-small | $0.02 | $0 | 0 | 0 |
| vercel_ai_gateway/xai/grok-2-vision | $2 | $10 | 32768 | 32768 |
| vercel_ai_gateway/xai/grok-2 | $2 | $10 | 131072 | 4000 |
| vercel_ai_gateway/deepseek/deepseek-r1-distill-llama-70b | $0.75 | $0.99 | 131072 | 131072 |
| vercel_ai_gateway/meta/llama-3.1-70b | $0.72 | $0.72 | 128000 | 8192 |
| vercel_ai_gateway/xai/grok-3 | $3 | $15 | 131072 | 131072 |
| vercel_ai_gateway/alibaba/qwen-3-235b | $0.2 | $0.6 | 40960 | 16384 |
| vercel_ai_gateway/xai/grok-3-fast | $5 | $25 | 131072 | 131072 |
| vercel_ai_gateway/vercel/v0-1.5-md | $3 | $15 | 128000 | 32768 |
| vercel_ai_gateway/openai/o4-mini | $1.1 | $4.4 | 200000 | 100000 |
| vercel_ai_gateway/mistral/magistral-medium | $2 | $5 | 128000 | 64000 |
| vercel_ai_gateway/amazon/titan-embed-text-v2 | $0.02 | $0 | 0 | 0 |
| vercel_ai_gateway/alibaba/qwen-3-30b | $0.1 | $0.3 | 40960 | 16384 |
| vercel_ai_gateway/zai/glm-4.5-air | $0.2 | $1.1 | 128000 | 96000 |
| vercel_ai_gateway/openai/gpt-4-turbo | $10 | $30 | 128000 | 4096 |
| vercel_ai_gateway/mistral/mistral-large | $2 | $6 | 32000 | 4000 |
| vercel_ai_gateway/perplexity/sonar-pro | $3 | $15 | 200000 | 8000 |
| vercel_ai_gateway/meta/llama-3.2-90b | $0.72 | $0.72 | 128000 | 8192 |
| vercel_ai_gateway/meta/llama-3-8b | $0.05 | $0.08 | 8192 | 8192 |
| vercel_ai_gateway/google/text-embedding-005 | $0.02 | $0 | 0 | 0 |
| vercel_ai_gateway/mistral/pixtral-large | $2 | $6 | 128000 | 4000 |
| vercel_ai_gateway/anthropic/claude-3.5-sonnet | $3 | $15 | 200000 | 8192 |
| vercel_ai_gateway/amazon/nova-micro | $0.04 | $0.14 | 128000 | 8192 |
| vercel_ai_gateway/cohere/command-r | $0.15 | $0.6 | 128000 | 4096 |
| vercel_ai_gateway/morph/morph-v3-large | $0.9 | $1.9 | 32768 | 16384 |
| vercel_ai_gateway/mistral/mixtral-8x22b-instruct | $1.2 | $1.2 | 65536 | 2048 |
| vercel_ai_gateway/xai/grok-4 | $3 | $15 | 256000 | 256000 |
| vercel_ai_gateway/meta/llama-3.1-8b | $0.05 | $0.08 | 131000 | 131072 |
| vercel_ai_gateway/anthropic/claude-3-opus | $15 | $75 | 200000 | 4096 |
| vercel_ai_gateway/zai/glm-4.5 | $0.6 | $2.2 | 131072 | 131072 |
| vercel_ai_gateway/openai/gpt-4o | $2.5 | $10 | 128000 | 16384 |
| vercel_ai_gateway/openai/o3-mini | $1.1 | $4.4 | 200000 | 100000 |
| vercel_ai_gateway/mistral/ministral-8b | $0.1 | $0.1 | 128000 | 4000 |
| vercel_ai_gateway/openai/o3 | $2 | $8 | 200000 | 100000 |
| vercel_ai_gateway/vercel/v0-1.0-md | $3 | $15 | 128000 | 32000 |
| vercel_ai_gateway/google/text-multilingual-embedding-002 | $0.02 | $0 | 0 | 0 |
| vercel_ai_gateway/amazon/nova-pro | $0.8 | $3.2 | 300000 | 8192 |
| vercel_ai_gateway/morph/morph-v3-fast | $0.8 | $1.2 | 32768 | 16384 |
| vercel_ai_gateway/openai/gpt-3.5-turbo | $0.5 | $1.5 | 16385 | 4096 |
| vercel_ai_gateway/mistral/codestral | $0.3 | $0.9 | 256000 | 4000 |
| vercel_ai_gateway/meta/llama-3.2-11b | $0.16 | $0.16 | 128000 | 8192 |
| vercel_ai_gateway/meta/llama-3-70b | $0.59 | $0.79 | 8192 | 8192 |
| vercel_ai_gateway/xai/grok-3-mini-fast | $0.6 | $4 | 131072 | 131072 |
| vercel_ai_gateway/openai/text-embedding-3-large | $0.13 | $0 | 0 | 0 |
| vercel_ai_gateway/google/gemini-2.0-flash-lite | $0.08 | $0.3 | 1.04858e+06 | 8192 |
| vercel_ai_gateway/mistral/ministral-3b | $0.04 | $0.04 | 128000 | 4000 |
| vercel_ai_gateway/perplexity/sonar-reasoning-pro | $2 | $8 | 127000 | 8000 |
| vercel_ai_gateway/google/gemini-embedding-001 | $0.15 | $0 | 0 | 0 |
| vercel_ai_gateway/anthropic/claude-3-haiku | $0.25 | $1.25 | 200000 | 4096 |
| vercel_ai_gateway/openai/o1 | $15 | $60 | 200000 | 100000 |
| vercel_ai_gateway/deepseek/deepseek-r1 | $0.55 | $2.19 | 128000 | 8192 |
| vercel_ai_gateway/mistral/mistral-embed | $0.1 | $0 | 0 | 0 |
| vercel_ai_gateway/openai/gpt-4.1-mini | $0.4 | $1.6 | 1.04758e+06 | 32768 |
| vercel_ai_gateway/openai/gpt-4o-mini | $0.15 | $0.6 | 128000 | 16384 |
| vercel_ai_gateway/alibaba/qwen-3-14b | $0.08 | $0.24 | 40960 | 16384 |
| vercel_ai_gateway/anthropic/claude-4-opus | $15 | $75 | 200000 | 32000 |
| vercel_ai_gateway/mistral/mistral-saba-24b | $0.79 | $0.79 | 32768 | 32768 |
| vercel_ai_gateway/perplexity/sonar-reasoning | $1 | $5 | 127000 | 8000 |
| vercel_ai_gateway/anthropic/claude-3.5-haiku | $0.8 | $4 | 200000 | 8192 |
| vercel_ai_gateway/cohere/command-a | $2.5 | $10 | 256000 | 8000 |
| vercel_ai_gateway/google/gemma-2-9b | $0.2 | $0.2 | 8192 | 8192 |
| vercel_ai_gateway/meta/llama-3.2-3b | $0.15 | $0.15 | 128000 | 8192 |
| vercel_ai_gateway/openai/gpt-4.1-nano | $0.1 | $0.4 | 1.04758e+06 | 32768 |
| vercel_ai_gateway/anthropic/claude-4-sonnet | $3 | $15 | 200000 | 64000 |
| vercel_ai_gateway/perplexity/sonar | $1 | $1 | 127000 | 8000 |
| vercel_ai_gateway/meta/llama-4-maverick | $0.2 | $0.6 | 131072 | 8192 |
| vercel_ai_gateway/openai/text-embedding-ada-002 | $0.1 | $0 | 0 | 0 |
| vercel_ai_gateway/xai/grok-3-mini | $0.3 | $0.5 | 131072 | 131072 |
| vercel_ai_gateway/cohere/embed-v4.0 | $0.12 | $0 | 0 | 0 |
| vercel_ai_gateway/meta/llama-3.3-70b | $0.72 | $0.72 | 128000 | 8192 |
| vercel_ai_gateway/cohere/command-r-plus | $2.5 | $10 | 128000 | 4096 |
| vercel_ai_gateway/openai/gpt-3.5-turbo-instruct | $1.5 | $2 | 8192 | 4096 |
| vercel_ai_gateway/mistral/devstral-small | $0.07 | $0.28 | 128000 | 128000 |
| vercel_ai_gateway/anthropic/claude-3.7-sonnet | $3 | $15 | 200000 | 64000 |
| vercel_ai_gateway/google/gemini-2.0-flash | $0.15 | $0.6 | 1.04858e+06 | 8192 |
| vercel_ai_gateway/mistral/pixtral-12b | $0.15 | $0.15 | 128000 | 4000 |
| vercel_ai_gateway/mistral/magistral-small | $0.5 | $1.5 | 128000 | 64000 |
| vercel_ai_gateway/moonshotai/kimi-k2 | $0.55 | $2.2 | 131072 | 16384 |
| vercel_ai_gateway/alibaba/qwen-3-32b | $0.1 | $0.3 | 40960 | 16384 |
| vercel_ai_gateway/openai/gpt-4.1 | $2 | $8 | 1.04758e+06 | 32768 |
| openrouter/openai/gpt-4.1 | $2 | $8 | 1.04758e+06 | 32768 |
| openrouter/openai/gpt-4.1-2025-04-14 | $2 | $8 | 1.04758e+06 | 32768 |
| openrouter/openai/gpt-4.1-mini | $0.4 | $1.6 | 1.04758e+06 | 32768 |
| openrouter/openai/gpt-4.1-mini-2025-04-14 | $0.4 | $1.6 | 1.04758e+06 | 32768 |
| openrouter/openai/gpt-4.1-nano | $0.1 | $0.4 | 1.04758e+06 | 32768 |
| openrouter/openai/gpt-4.1-nano-2025-04-14 | $0.1 | $0.4 | 1.04758e+06 | 32768 |
| together_ai/deepseek-ai/DeepSeek-V3.1 | $0.6 | $1.7 | 128000 | 16384 |
| vertex_ai/openai/gpt-oss-20b-maas | $0.08 | $0.3 | 131072 | 32768 |
| vertex_ai/openai/gpt-oss-120b-maas | $0.15 | $0.6 | 131072 | 32768 |
| gemini/veo-3.0-generate-preview | -- | -- | 1024 | nan |
| gemini/veo-3.0-fast-generate-preview | -- | -- | 1024 | nan |
| gemini/veo-2.0-generate-001 | -- | -- | 1024 | nan |
| vertex_ai/veo-3.0-generate-preview | -- | -- | 1024 | nan |
| vertex_ai/veo-3.0-fast-generate-preview | -- | -- | 1024 | nan |
| vertex_ai/veo-2.0-generate-001 | -- | -- | 1024 | nan |
| doubao-embedding-large | $0 | $0 | 4096 | nan |
| doubao-embedding-large-text-250515 | $0 | $0 | 4096 | nan |
| doubao-embedding-large-text-240915 | $0 | $0 | 4096 | nan |
| doubao-embedding | $0 | $0 | 4096 | nan |
| doubao-embedding-text-240715 | $0 | $0 | 4096 | nan |
| heroku/claude-4-sonnet | -- | -- | 200000 | 8192 |
| heroku/claude-3-7-sonnet | -- | -- | 200000 | 8192 |
| heroku/claude-3-5-sonnet-latest | -- | -- | 200000 | 8192 |
| heroku/claude-3-5-haiku | -- | -- | 200000 | 8192 |
| together_ai/openai/gpt-oss-20b | $0.05 | $0.2 | 128000 | nan |
| dashscope/qwen3-max-preview | -- | -- | 258048 | 65536 |
| dashscope/qwen-plus | $0.4 | $1.2 | 129024 | 16384 |
| dashscope/qwen-flash | -- | -- | 997952 | 32768 |
| dashscope/qwen-coder | $0.3 | $1.5 | 1e+06 | 16384 |
| dashscope/qwen3-coder-plus | -- | -- | 997952 | 65536 |
| dashscope/qwen3-coder-plus-2025-07-22 | -- | -- | 997952 | 65536 |
| dashscope/qwen3-coder-flash | -- | -- | 997952 | 65536 |
| dashscope/qwen3-coder-flash-2025-07-28 | -- | -- | 997952 | 65536 |
| dashscope/qwen-plus-2025-09-11 | -- | -- | 997952 | 32768 |
| dashscope/qwen-plus-2025-07-28 | -- | -- | 997952 | 32768 |
| dashscope/qwen-plus-2025-07-14 | $0.4 | $1.2 | 129024 | 16384 |
| dashscope/qwen-plus-2025-04-28 | $0.4 | $1.2 | 129024 | 16384 |
| dashscope/qwen-plus-2025-01-25 | $0.4 | $1.2 | 129024 | 8192 |
| dashscope/qwen-flash-2025-07-28 | -- | -- | 997952 | 32768 |
| dashscope/qwen-turbo | $0.05 | $0.2 | 129024 | 16384 |
| dashscope/qwen-turbo-2025-04-28 | $0.05 | $0.2 | 1e+06 | 16384 |
| dashscope/qwen-turbo-2024-11-01 | $0.05 | $0.2 | 1e+06 | 8192 |
| dashscope/qwq-plus | $0.8 | $2.4 | 98304 | 8192 |
| ovhcloud/Qwen2.5-VL-72B-Instruct | $0.91 | $0.91 | 32000 | 32000 |
| ovhcloud/llava-v1.6-mistral-7b-hf | $0.29 | $0.29 | 32000 | 32000 |
| ovhcloud/gpt-oss-120b | $0.08 | $0.4 | 131000 | 131000 |
| ovhcloud/Meta-Llama-3_3-70B-Instruct | $0.67 | $0.67 | 131000 | 131000 |
| ovhcloud/Qwen2.5-Coder-32B-Instruct | $0.87 | $0.87 | 32000 | 32000 |
| ovhcloud/Mixtral-8x7B-Instruct-v0.1 | $0.63 | $0.63 | 32000 | 32000 |
| ovhcloud/Meta-Llama-3_1-70B-Instruct | $0.67 | $0.67 | 131000 | 131000 |
| ovhcloud/Mistral-Small-3.2-24B-Instruct-2506 | $0.09 | $0.28 | 128000 | 128000 |
| ovhcloud/DeepSeek-R1-Distill-Llama-70B | $0.67 | $0.67 | 131000 | 131000 |
| ovhcloud/Llama-3.1-8B-Instruct | $0.1 | $0.1 | 131000 | 131000 |
| ovhcloud/Mistral-7B-Instruct-v0.3 | $0.1 | $0.1 | 127000 | 127000 |
| ovhcloud/gpt-oss-20b | $0.04 | $0.15 | 131000 | 131000 |
| ovhcloud/Mistral-Nemo-Instruct-2407 | $0.13 | $0.13 | 118000 | 118000 |
| ovhcloud/Qwen3-32B | $0.08 | $0.23 | 32000 | 32000 |
| ovhcloud/mamba-codestral-7B-v0.1 | $0.19 | $0.19 | 256000 | 256000 |
| bedrock/us.anthropic.claude-3-5-haiku-20241022-v1:0 | $0.8 | $4 | 200000 | 8192 |
| twelvelabs.marengo-embed-2-7-v1:0 | $70 | $0 | 77 | nan |
| us.twelvelabs.marengo-embed-2-7-v1:0 | $70 | $0 | 77 | nan |
| eu.twelvelabs.marengo-embed-2-7-v1:0 | $70 | $0 | 77 | nan |
| twelvelabs.pegasus-1-2-v1:0 | -- | $7.5 | nan | nan |
| us.twelvelabs.pegasus-1-2-v1:0 | -- | $7.5 | nan | nan |
| eu.twelvelabs.pegasus-1-2-v1:0 | -- | $7.5 | nan | nan |
| wandb/openai/gpt-oss-120b | $15000 | $60000 | 131072 | 131072 |
| wandb/openai/gpt-oss-20b | $5000 | $20000 | 131072 | 131072 |
| wandb/zai-org/GLM-4.5 | $55000 | $200000 | 131072 | 131072 |
| wandb/Qwen/Qwen3-235B-A22B-Instruct-2507 | $10000 | $10000 | 262144 | 262144 |
| wandb/Qwen/Qwen3-Coder-480B-A35B-Instruct | $100000 | $150000 | 262144 | 262144 |
| wandb/Qwen/Qwen3-235B-A22B-Thinking-2507 | $10000 | $10000 | 262144 | 262144 |
| wandb/moonshotai/Kimi-K2-Instruct | $0.6 | $2.5 | 128000 | 128000 |
| wandb/meta-llama/Llama-3.1-8B-Instruct | $22000 | $22000 | 128000 | 128000 |
| wandb/deepseek-ai/DeepSeek-V3.1 | $55000 | $165000 | 128000 | 128000 |
| wandb/deepseek-ai/DeepSeek-R1-0528 | $135000 | $540000 | 161000 | 161000 |
| wandb/deepseek-ai/DeepSeek-V3-0324 | $114000 | $275000 | 161000 | 161000 |
| wandb/meta-llama/Llama-3.3-70B-Instruct | $71000 | $71000 | 128000 | 128000 |
| wandb/meta-llama/Llama-4-Scout-17B-16E-Instruct | $17000 | $66000 | 64000 | 64000 |
| wandb/microsoft/Phi-4-mini-instruct | $8000 | $35000 | 128000 | 128000 |
| openrouter/x-ai/grok-4-fast:free | $0 | $0 | 2e+06 | 30000 |
| vertex_ai/qwen/qwen3-next-80b-a3b-instruct-maas | $0.15 | $1.2 | 262144 | 262144 |
| vertex_ai/qwen/qwen3-next-80b-a3b-thinking-maas | $0.15 | $1.2 | 262144 | 262144 |
| xai/grok-4-fast-reasoning | $0.2 | $0.5 | 2e+06 | 2e+06 |
| xai/grok-4-fast-non-reasoning | $0.2 | $0.5 | 2e+06 | 2e+06 |
| azure/gpt-5-codex | $1.25 | $10 | 272000 | 128000 |
| deepseek-reasoner | $0.28 | $0.42 | 131072 | 65536 |
| deepseek.v3-v1:0 | $0.58 | $1.68 | 163840 | 81920 |
| gpt-5-codex | $1.25 | $10 | 272000 | 128000 |
| qwen.qwen3-coder-480b-a35b-v1:0 | $0.22 | $1.8 | 262000 | 65536 |
| qwen.qwen3-235b-a22b-2507-v1:0 | $0.22 | $0.88 | 262144 | 131072 |
| qwen.qwen3-coder-30b-a3b-v1:0 | $0.15 | $0.6 | 262144 | 131072 |
| qwen.qwen3-32b-v1:0 | $0.15 | $0.6 | 131072 | 16384 |
| sambanova/DeepSeek-V3.1 | $3 | $4.5 | 32768 | 32768 |
| sambanova/gpt-oss-120b | $3 | $4.5 | 131072 | 131072 |
| vertex_ai/deepseek-ai/deepseek-v3.1-maas | $1.35 | $5.4 | 163840 | 32768 |
| gemini-2.5-flash-lite-preview-09-2025 | $0.1 | $0.4 | 1.04858e+06 | 65535 |
| gemini-2.5-flash-preview-09-2025 | $0.3 | $2.5 | 1.04858e+06 | 65535 |
| gemini-flash-latest | $0.3 | $2.5 | 1.04858e+06 | 65535 |
| gemini-flash-lite-latest | $0.1 | $0.4 | 1.04858e+06 | 65535 |
| gemini/gemini-2.5-flash-lite-preview-09-2025 | $0.1 | $0.4 | 1.04858e+06 | 65535 |
| gemini/gemini-2.5-flash-preview-09-2025 | $0.3 | $2.5 | 1.04858e+06 | 65535 |
| gemini/gemini-flash-latest | $0.3 | $2.5 | 1.04858e+06 | 65535 |
| gemini/gemini-flash-lite-latest | $0.1 | $0.4 | 1.04858e+06 | 65535 |
| openrouter/openai/gpt-5-codex | $1.25 | $10 | 272000 | 128000 |
| openrouter/openai/gpt-5 | $1.25 | $10 | 272000 | 128000 |
| anthropic/claude-sonnet-4-5 | $3 | $15 | 200000 | 64000 |
| claude-sonnet-4-5-20250929 | $3 | $15 | 200000 | 64000 |
| ollama/deepseek-v3.1:671b-cloud | $0 | $0 | 163840 | 163840 |
| ollama/gpt-oss:120b-cloud | $0 | $0 | 131072 | 131072 |
| ollama/gpt-oss:20b-cloud | $0 | $0 | 131072 | 131072 |
| ollama/qwen3-coder:480b-cloud | $0 | $0 | 262144 | 262144 |
| us.anthropic.claude-sonnet-4-5-20250929-v1:0 | $3.3 | $16.5 | 200000 | 64000 |
| vertex_ai/claude-sonnet-4-5 | $3 | $15 | 200000 | 64000 |
| vertex_ai/claude-sonnet-4-5@20250929 | $3 | $15 | 200000 | 64000 |
| claude-sonnet-4-5 | $3 | $15 | 200000 | 64000 |
| lemonade/Qwen3-Coder-30B-A3B-Instruct-GGUF | $0 | $0 | 262144 | 32768 |
| groq/moonshotai/kimi-k2-instruct-0905 | $1 | $3 | 262144 | 16384 |
| azure_ai/grok-4 | $3 | $15 | 131072 | 131072 |
| azure_ai/grok-4-fast-non-reasoning | $0.2 | $0.5 | 131072 | 131072 |
| azure_ai/grok-4-fast-reasoning | $0.2 | $0.5 | 131072 | 131072 |
| azure_ai/grok-code-fast-1 | $0.2 | $1.5 | 131072 | 131072 |
| eu.anthropic.claude-sonnet-4-5-20250929-v1:0 | $3.3 | $16.5 | 200000 | 64000 |
| nvidia_nim/nvidia/nv-rerankqa-mistral-4b-v3 | $0 | $0 | nan | nan |
| nvidia_nim/nvidia/llama-3_2-nv-rerankqa-1b-v2 | $0 | $0 | nan | nan |
| deepinfra/Qwen/Qwen3-Next-80B-A3B-Instruct | $0.14 | $1.4 | 262144 | 262144 |
| deepinfra/Qwen/Qwen3-Next-80B-A3B-Thinking | $0.14 | $1.4 | 262144 | 262144 |
| deepinfra/deepseek-ai/DeepSeek-V3.1-Terminus | $0.27 | $1 | 163840 | 163840 |
| deepinfra/moonshotai/Kimi-K2-Instruct-0905 | $0.5 | $2 | 262144 | 262144 |
| deepinfra/nvidia/Llama-3.3-Nemotron-Super-49B-v1.5 | $0.1 | $0.4 | 131072 | 131072 |
| deepinfra/nvidia/NVIDIA-Nemotron-Nano-9B-v2 | $0.04 | $0.16 | 131072 | 131072 |
| jp.anthropic.claude-sonnet-4-5-20250929-v1:0 | $3.3 | $16.5 | 200000 | 64000 |
| watsonx/bigscience/mt0-xxl-13b | $500 | $2000 | 8192 | 8192 |
| watsonx/core42/jais-13b-chat | $500 | $2000 | 8192 | 8192 |
| watsonx/google/flan-t5-xl-3b | $0.6 | $0.6 | 8192 | 8192 |
| watsonx/ibm/granite-13b-chat-v2 | $0.6 | $0.6 | 8192 | 8192 |
| watsonx/ibm/granite-13b-instruct-v2 | $0.6 | $0.6 | 8192 | 8192 |
| watsonx/ibm/granite-3-3-8b-instruct | $0.2 | $0.2 | 8192 | 8192 |
| watsonx/ibm/granite-4-h-small | $0.06 | $0.25 | 20480 | 20480 |
| watsonx/ibm/granite-guardian-3-2-2b | $0.1 | $0.1 | 8192 | 8192 |
| watsonx/ibm/granite-guardian-3-3-8b | $0.2 | $0.2 | 8192 | 8192 |
| watsonx/ibm/granite-ttm-1024-96-r2 | $0.38 | $0.38 | 512 | 512 |
| watsonx/ibm/granite-ttm-1536-96-r2 | $0.38 | $0.38 | 512 | 512 |
| watsonx/ibm/granite-ttm-512-96-r2 | $0.38 | $0.38 | 512 | 512 |
| watsonx/ibm/granite-vision-3-2-2b | $0.1 | $0.1 | 8192 | 8192 |
| watsonx/meta-llama/llama-3-2-11b-vision-instruct | $0.35 | $0.35 | 128000 | 128000 |
| watsonx/meta-llama/llama-3-2-1b-instruct | $0.1 | $0.1 | 128000 | 128000 |
| watsonx/meta-llama/llama-3-2-3b-instruct | $0.15 | $0.15 | 128000 | 128000 |
| watsonx/meta-llama/llama-3-2-90b-vision-instruct | $2 | $2 | 128000 | 128000 |
| watsonx/meta-llama/llama-3-3-70b-instruct | $0.71 | $0.71 | 128000 | 128000 |
| watsonx/meta-llama/llama-4-maverick-17b | $0.35 | $1.4 | 128000 | 128000 |
| watsonx/meta-llama/llama-guard-3-11b-vision | $0.35 | $0.35 | 128000 | 128000 |
| watsonx/mistralai/mistral-medium-2505 | $3 | $10 | 128000 | 128000 |
| watsonx/mistralai/mistral-small-2503 | $0.1 | $0.3 | 32000 | 32000 |
| watsonx/mistralai/pixtral-12b-2409 | $0.35 | $0.35 | 128000 | 128000 |
| watsonx/openai/gpt-oss-120b | $0.15 | $0.6 | 8192 | 8192 |
| watsonx/sdaia/allam-1-13b-instruct | $1.8 | $1.8 | 8192 | 8192 |
| global.anthropic.claude-sonnet-4-5-20250929-v1:0 | $3 | $15 | 200000 | 64000 |
| global.anthropic.claude-sonnet-4-20250514-v1:0 | $3 | $15 | 1e+06 | 64000 |
| gpt-5-pro | $15 | $120 | 128000 | 272000 |
| gpt-image-1-mini | $2 | -- | nan | nan |
| gpt-realtime-mini | $0.6 | $2.4 | 128000 | 4096 |
| low/1024-x-1024/gpt-image-1-mini | -- | -- | nan | nan |
| low/1024-x-1536/gpt-image-1-mini | -- | -- | nan | nan |
| low/1536-x-1024/gpt-image-1-mini | -- | -- | nan | nan |
| medium/1024-x-1024/gpt-image-1-mini | -- | -- | nan | nan |
| medium/1024-x-1536/gpt-image-1-mini | -- | -- | nan | nan |
| medium/1536-x-1024/gpt-image-1-mini | -- | -- | nan | nan |
| cohere.embed-v4:0 | $0.12 | $0 | 128000 | nan |
| gpt-5-pro-2025-10-06 | $15 | $120 | 128000 | 272000 |
| azure_ai/Phi-4-mini-reasoning | $0.08 | $0.32 | 131072 | 4096 |
| azure_ai/Phi-4-reasoning | $0.12 | $0.5 | 32768 | 4096 |
| azure_ai/MAI-DS-R1 | $1.35 | $5.4 | 128000 | 8192 |
| au.anthropic.claude-sonnet-4-5-20250929-v1:0 | $3.3 | $16.5 | 200000 | 64000 |
| oci/cohere.command-latest | $1.56 | $1.56 | 128000 | 4000 |
| oci/cohere.command-a-03-2025 | $1.56 | $1.56 | 256000 | 4000 |
| oci/cohere.command-plus-latest | $1.56 | $1.56 | 128000 | 4000 |
| together_ai/moonshotai/Kimi-K2-Instruct-0905 | $1 | $3 | 262144 | nan |
| together_ai/Qwen/Qwen3-Next-80B-A3B-Instruct | $0.15 | $1.5 | 262144 | nan |
| together_ai/Qwen/Qwen3-Next-80B-A3B-Thinking | $0.15 | $1.5 | 262144 | nan |
| together_ai/baai/bge-base-en-v1.5 | $0.01 | $0 | 512 | nan |
| together_ai/BAAI/bge-base-en-v1.5 | $0.01 | $0 | 512 | nan |
| openrouter/anthropic/claude-sonnet-4.5 | $3 | $15 | 1e+06 | 1e+06 |
| anthropic.claude-haiku-4-5-20251001-v1:0 | $1 | $5 | 200000 | 64000 |
| anthropic.claude-haiku-4-5@20251001 | $1 | $5 | 200000 | 64000 |
| apac.anthropic.claude-haiku-4-5-20251001-v1:0 | $1.1 | $5.5 | 200000 | 64000 |
| claude-haiku-4-5-20251001 | $1 | $5 | 200000 | 64000 |
| claude-haiku-4-5 | $1 | $5 | 200000 | 64000 |
| eu.anthropic.claude-haiku-4-5-20251001-v1:0 | $1.1 | $5.5 | 200000 | 64000 |
| gemini-2.5-flash-image | $0.3 | $2.5 | 32768 | 32768 |
| gemini/gemini-2.5-flash-image | $0.3 | $2.5 | 32768 | 32768 |
| lemonade/gpt-oss-20b-mxfp4-GGUF | $0 | $0 | 131072 | 32768 |
| lemonade/gpt-oss-120b-mxfp-GGUF | $0 | $0 | 131072 | 32768 |
| lemonade/Gemma-3-4b-it-GGUF | $0 | $0 | 128000 | 8192 |
| lemonade/Qwen3-4B-Instruct-2507-GGUF | $0 | $0 | 262144 | 32768 |
| jp.anthropic.claude-haiku-4-5-20251001-v1:0 | $1.1 | $5.5 | 200000 | 64000 |
| us.anthropic.claude-haiku-4-5-20251001-v1:0 | $1.1 | $5.5 | 200000 | 64000 |
| vertex_ai/claude-haiku-4-5@20251001 | $1 | $5 | 200000 | 8192 |
| azure_ai/mistral-document-ai-2505 | -- | -- | nan | nan |
| mistral/mistral-ocr-latest | -- | -- | nan | nan |
| mistral/mistral-ocr-2505-completion | -- | -- | nan | nan |
| watsonx/mistralai/mistral-small-3-1-24b-instruct-2503 | $0.1 | $0.3 | 32000 | 32000 |
| global.anthropic.claude-haiku-4-5-20251001-v1:0 | $1 | $5 | 200000 | 64000 |
| au.anthropic.claude-haiku-4-5-20251001-v1:0 | $1.1 | $5.5 | 200000 | 64000 |
| vercel_ai_gateway/glm-4.6 | $0.6 | $2.2 | 200000 | 200000 |
| azure/speech/azure-tts | -- | -- | nan | nan |
| azure/speech/azure-tts-hd | -- | -- | nan | nan |
| bedrock/us-gov-west-1/anthropic.claude-3-7-sonnet-20240620-v1:0 | $3.6 | $18 | 200000 | 8192 |
| anthropic.claude-3-7-sonnet-20240620-v1:0 | $3.6 | $18 | 200000 | 8192 |
| bedrock/us-gov-west-1/anthropic.claude-3-7-sonnet-20250219-v1:0 | $3.6 | $18 | 200000 | 8192 |
| dataforseo/search | -- | -- | nan | nan |
| exa_ai/search | -- | -- | nan | nan |
| perplexity/search | -- | -- | nan | nan |
| google_pse/search | -- | -- | nan | nan |
| parallel_ai/search | -- | -- | nan | nan |
| parallel_ai/search-pro | -- | -- | nan | nan |
| tavily/search | -- | -- | nan | nan |
| tavily/search-advanced | -- | -- | nan | nan |
| vertex_ai/mistralai/codestral-2@001 | $0.3 | $0.9 | 128000 | 128000 |
| vertex_ai/codestral-2 | $0.3 | $0.9 | 128000 | 128000 |
| vertex_ai/codestral-2@001 | $0.3 | $0.9 | 128000 | 128000 |
| vertex_ai/mistralai/codestral-2 | $0.3 | $0.9 | 128000 | 128000 |
| vertex_ai/mistral-medium-3 | $0.4 | $2 | 128000 | 8191 |
| vertex_ai/mistral-medium-3@001 | $0.4 | $2 | 128000 | 8191 |
| vertex_ai/mistralai/mistral-medium-3 | $0.4 | $2 | 128000 | 8191 |
| vertex_ai/mistralai/mistral-medium-3@001 | $0.4 | $2 | 128000 | 8191 |
| openai/sora-2 | -- | -- | nan | nan |
| azure/sora-2 | -- | -- | nan | nan |
| azure/sora-2-pro | -- | -- | nan | nan |
| azure/sora-2-pro-high-res | -- | -- | nan | nan |
| amazon.titan-image-generator-v1 | -- | -- | nan | nan |
| amazon.titan-image-generator-v2 | -- | -- | nan | nan |
| vertex_ai/search_api | -- | -- | nan | nan |
| openai/sora-2-pro | -- | -- | nan | nan |
| openrouter/anthropic/claude-haiku-4.5 | $1 | $5 | 200000 | 200000 |
| fal_ai/bria/text-to-image/3.2 | -- | -- | nan | nan |
| fal_ai/fal-ai/flux-pro/v1.1-ultra | -- | -- | nan | nan |
| fal_ai/fal-ai/imagen4/preview | -- | -- | nan | nan |
| fal_ai/fal-ai/recraft/v3/text-to-image | -- | -- | nan | nan |
| fal_ai/fal-ai/stable-diffusion-v35-medium | -- | -- | nan | nan |
| mistral/codestral-embed | $0.15 | -- | 8192 | nan |
| mistral/codestral-embed-2505 | $0.15 | -- | 8192 | nan |
| gemini/gemini-embedding-001 | $0.15 | $0 | 2048 | nan |
| vercel_ai_gateway/zai/glm-4.6 | $0.45 | $1.8 | 200000 | 200000 |
| openrouter/nvidia/nemotron-nano-9b-v2:free | $0 | $0 | nan | nan |
| openai/container | -- | -- | nan | nan |
| azure/container | -- | -- | nan | nan |
| vertex_ai/mistral-ocr-2505 | -- | -- | nan | nan |
| azure_ai/doc-intelligence/prebuilt-read | -- | -- | nan | nan |
| azure_ai/doc-intelligence/prebuilt-layout | -- | -- | nan | nan |
| azure_ai/doc-intelligence/prebuilt-document | -- | -- | nan | nan |
| gemini-live-2.5-flash-preview-native-audio-09-2025 | $0.3 | $2 | 1.04858e+06 | 65535 |
| gemini/gemini-live-2.5-flash-preview-native-audio-09-2025 | $0.3 | $2 | 1.04858e+06 | 65535 |
| azure/gpt-image-1-mini | $2 | -- | nan | nan |
| azure/low/1024-x-1024/gpt-image-1-mini | -- | -- | nan | nan |
| azure/low/1024-x-1536/gpt-image-1-mini | -- | -- | nan | nan |
| azure/low/1536-x-1024/gpt-image-1-mini | -- | -- | nan | nan |
| azure/medium/1024-x-1024/gpt-image-1-mini | -- | -- | nan | nan |
| azure/medium/1024-x-1536/gpt-image-1-mini | -- | -- | nan | nan |
| azure/medium/1536-x-1024/gpt-image-1-mini | -- | -- | nan | nan |
| azure/high/1024-x-1024/gpt-image-1-mini | -- | -- | nan | nan |
| azure/high/1024-x-1536/gpt-image-1-mini | -- | -- | nan | nan |
| azure/high/1536-x-1024/gpt-image-1-mini | -- | -- | nan | nan |
| firecrawl/search | -- | -- | nan | nan |
| searxng/search | -- | -- | nan | nan |
| azure/gpt-5-pro | $15 | $120 | 272000 | 128000 |
| vertex_ai/minimaxai/minimax-m2-maas | $0.3 | $1.2 | 196608 | 196608 |
| cohere/embed-v4.0 | $0.12 | $0 | 128000 | nan |
| gemini/veo-3.1-fast-generate-preview | -- | -- | 1024 | nan |
| gemini/veo-3.1-generate-preview | -- | -- | 1024 | nan |
| vertex_ai/veo-3.1-generate-preview | -- | -- | 1024 | nan |
| vertex_ai/veo-3.1-fast-generate-preview | -- | -- | 1024 | nan |
| moonshot/kimi-k2-thinking | $0.6 | $2.5 | 262144 | 262144 |
| mistral/magistral-medium-2509 | $2 | $5 | 40000 | 40000 |
| openrouter/deepseek/deepseek-v3.2-exp | $0.2 | $0.4 | 163840 | 163840 |
| openrouter/minimax/minimax-m2 | $0.26 | $1.02 | 204800 | 204800 |
| openrouter/z-ai/glm-4.6 | $0.4 | $1.75 | 202800 | 131000 |
| openrouter/z-ai/glm-4.6:exacto | $0.45 | $1.9 | 202800 | 131000 |
| runwayml/gen4_turbo | -- | -- | nan | nan |
| runwayml/gen4_aleph | -- | -- | nan | nan |
| runwayml/gen3a_turbo | -- | -- | nan | nan |
| runwayml/gen4_image | -- | -- | nan | nan |
| runwayml/gen4_image_turbo | -- | -- | nan | nan |
| gpt-5.1 | $1.25 | $10 | 272000 | 128000 |
| gpt-5.1-2025-11-13 | $1.25 | $10 | 272000 | 128000 |
| gpt-5.1-chat-latest | $1.25 | $10 | 128000 | 16384 |
| gpt-5.1-codex | $1.25 | $10 | 272000 | 128000 |
| gpt-5.1-codex-mini | $0.25 | $2 | 272000 | 128000 |
| fal_ai/fal-ai/flux/schnell | -- | -- | nan | nan |
| fal_ai/fal-ai/imagen4/preview/fast | -- | -- | nan | nan |
| fal_ai/fal-ai/imagen4/preview/ultra | -- | -- | nan | nan |
| voyage/voyage-3.5 | $0.06 | $0 | 32000 | nan |
| voyage/voyage-3.5-lite | $0.02 | $0 | 32000 | nan |
| runwayml/eleven_multilingual_v2 | -- | -- | nan | nan |
| fal_ai/fal-ai/flux-pro/v1.1 | -- | -- | nan | nan |
| fal_ai/fal-ai/bytedance/seedream/v3/text-to-image | -- | -- | nan | nan |
| fal_ai/fal-ai/bytedance/dreamina/v3.1/text-to-image | -- | -- | nan | nan |
| fal_ai/fal-ai/ideogram/v3 | -- | -- | nan | nan |
| vertex_ai/moonshotai/kimi-k2-thinking-maas | $0.6 | $2.5 | 256000 | 256000 |
| azure/eu/gpt-5-2025-08-07 | $1.38 | $11 | 272000 | 128000 |
| azure/eu/gpt-5-mini-2025-08-07 | $0.28 | $2.2 | 272000 | 128000 |
| azure/eu/gpt-5-nano-2025-08-07 | $0.06 | $0.44 | 272000 | 128000 |
| azure/us/gpt-4.1-2025-04-14 | $2.2 | $8.8 | 1.04758e+06 | 32768 |
| azure/us/gpt-4.1-mini-2025-04-14 | $0.44 | $1.76 | 1.04758e+06 | 32768 |
| azure/us/gpt-4.1-nano-2025-04-14 | $0.11 | $0.44 | 1.04758e+06 | 32768 |
| azure/us/gpt-5-2025-08-07 | $1.38 | $11 | 272000 | 128000 |
| azure/us/gpt-5-mini-2025-08-07 | $0.28 | $2.2 | 272000 | 128000 |
| azure/us/gpt-5-nano-2025-08-07 | $0.06 | $0.44 | 272000 | 128000 |
| azure/us/o3-2025-04-16 | $2.2 | $8.8 | 200000 | 100000 |
| azure/us/o4-mini-2025-04-16 | $1.21 | $4.84 | 200000 | 100000 |
| fireworks_ai/accounts/fireworks/models/deepseek-v3p1-terminus | $0.56 | $1.68 | 128000 | 8192 |
| fireworks_ai/accounts/fireworks/models/kimi-k2-thinking | $0.6 | $2.5 | 262144 | 262144 |
| gemini-3-pro-preview | $2 | $12 | 1.04858e+06 | 65535 |
| gemini/gemini-3-pro-preview | $2 | $12 | 1.04858e+06 | 65535 |
| vertex_ai/veo-3.0-fast-generate-001 | -- | -- | 1024 | nan |
| vertex_ai/veo-3.0-generate-001 | -- | -- | 1024 | nan |
| azure/gpt-audio-2025-08-28 | $2.5 | $10 | 128000 | 16384 |
| azure/gpt-audio-mini-2025-10-06 | $0.6 | $2.4 | 128000 | 16384 |
| azure/gpt-realtime-2025-08-28 | $4 | $16 | 32000 | 4096 |
| azure/gpt-realtime-mini-2025-10-06 | $0.6 | $2.4 | 32000 | 4096 |
| azure/gpt-4o-transcribe-diarize | $2.5 | $10 | 16000 | 2000 |
| azure/gpt-5.1-2025-11-13 | $1.25 | $10 | 272000 | 128000 |
| azure/gpt-5.1-chat-2025-11-13 | $1.25 | $10 | 128000 | 16384 |
| azure/gpt-5.1-codex-2025-11-13 | $1.25 | $10 | 272000 | 128000 |
| azure/gpt-5.1-codex-mini-2025-11-13 | $0.25 | $2 | 272000 | 128000 |
| azure/eu/gpt-5.1 | $1.38 | $11 | 272000 | 128000 |
| azure/eu/gpt-5.1-chat | $1.38 | $11 | 128000 | 128000 |
| azure/eu/gpt-5.1-codex | $1.38 | $11 | 272000 | 128000 |
| azure/eu/gpt-5.1-codex-mini | $0.28 | $2.2 | 272000 | 128000 |
| azure/global/gpt-5.1 | $1.25 | $10 | 272000 | 128000 |
| azure/global/gpt-5.1-chat | $1.25 | $10 | 128000 | 128000 |
| azure/global/gpt-5.1-codex | $1.25 | $10 | 272000 | 128000 |
| azure/global/gpt-5.1-codex-mini | $0.25 | $2 | 272000 | 128000 |
| azure/gpt-5.1 | $1.25 | $10 | 272000 | 128000 |
| azure/gpt-5.1-chat | $1.25 | $10 | 128000 | 128000 |
| azure/gpt-5.1-codex | $1.25 | $10 | 272000 | 128000 |
| azure/gpt-5.1-codex-mini | $0.25 | $2 | 272000 | 128000 |
| azure/us/gpt-5.1 | $1.38 | $11 | 272000 | 128000 |
| azure/us/gpt-5.1-chat | $1.38 | $11 | 128000 | 128000 |
| azure/us/gpt-5.1-codex | $1.38 | $11 | 272000 | 128000 |
| azure/us/gpt-5.1-codex-mini | $0.28 | $2.2 | 272000 | 128000 |
| vertex_ai/gemini-3-pro-preview | $2 | $12 | 1.04858e+06 | 65535 |
| vertex_ai/gemini-2.5-flash-image | $0.3 | $2.5 | 32768 | 32768 |
| vertex_ai/imagen-3.0-capability-001 | -- | -- | nan | nan |
| cerebras/gpt-oss-120b | $0.35 | $0.75 | 131072 | 32768 |
| gemini-3-pro-image-preview | $2 | $12 | 65536 | 32768 |
| gemini/gemini-3-pro-image-preview | $2 | $12 | 65536 | 32768 |
| openrouter/google/gemini-3-pro-preview | $2 | $12 | 1.04858e+06 | 65535 |
| together_ai/zai-org/GLM-4.6 | $0.6 | $2.2 | 200000 | 200000 |
| vertex_ai/gemini-3-pro-image-preview | $2 | $12 | 65536 | 32768 |
| xai/grok-4-1-fast | $0.2 | $0.5 | 2e+06 | 2e+06 |
| xai/grok-4-1-fast-reasoning | $0.2 | $0.5 | 2e+06 | 2e+06 |
| xai/grok-4-1-fast-reasoning-latest | $0.2 | $0.5 | 2e+06 | 2e+06 |
| xai/grok-4-1-fast-non-reasoning | $0.2 | $0.5 | 2e+06 | 2e+06 |
| xai/grok-4-1-fast-non-reasoning-latest | $0.2 | $0.5 | 2e+06 | 2e+06 |
| anthropic.claude-sonnet-4-5-20250929-v1:0 | $3 | $15 | 200000 | 64000 |
| bedrock/us-gov-east-1/claude-sonnet-4-5-20250929-v1:0 | $3.3 | $16.5 | 200000 | 8192 |
| bedrock/us-gov-west-1/claude-sonnet-4-5-20250929-v1:0 | $3.3 | $16.5 | 200000 | 8192 |
| claude-sonnet-4-5-20250929-v1:0 | $3 | $15 | 200000 | 64000 |
| anthropic.claude-opus-4-5-20251101-v1:0 | $5 | $25 | 200000 | 64000 |
| claude-opus-4-5-20251101 | $5 | $25 | 200000 | 64000 |
| us.anthropic.claude-opus-4-5-20251101-v1:0 | $5.5 | $27.5 | 200000 | 64000 |
| azure/claude-haiku-4-5 | $1 | $5 | 200000 | 64000 |
| azure/claude-opus-4-1 | $15 | $75 | 200000 | 32000 |
| azure/claude-sonnet-4-5 | $3 | $15 | 200000 | 64000 |
| vertex_ai/claude-opus-4-5 | $5 | $25 | 200000 | 64000 |
| vertex_ai/claude-opus-4-5@20251101 | $5 | $25 | 200000 | 64000 |
| embed-multilingual-light-v3.0 | $100 | $0 | 1024 | nan |
| fireworks_ai/accounts/fireworks/models/glm-4p6 | $0.55 | $2.19 | 202800 | 202800 |
| openrouter/anthropic/claude-opus-4.5 | $5 | $25 | 200000 | 32000 |
| watsonx/whisper-large-v3-turbo | -- | -- | nan | nan |
| amazon.nova-canvas-v1:0 | -- | -- | 2600 | nan |
| publicai/swiss-ai/apertus-8b-instruct | $0 | $0 | 8192 | 4096 |
| publicai/swiss-ai/apertus-70b-instruct | $0 | $0 | 8192 | 4096 |
| publicai/aisingapore/Gemma-SEA-LION-v4-27B-IT | $0 | $0 | 8192 | 4096 |
| publicai/BSC-LT/salamandra-7b-instruct-tools-16k | $0 | $0 | 16384 | 4096 |
| publicai/BSC-LT/ALIA-40b-instruct_Q8_0 | $0 | $0 | 8192 | 4096 |
| publicai/allenai/Olmo-3-7B-Instruct | $0 | $0 | 32768 | 4096 |
| publicai/aisingapore/Qwen-SEA-LION-v4-32B-IT | $0 | $0 | 32768 | 4096 |
| publicai/allenai/Olmo-3-7B-Think | $0 | $0 | 32768 | 4096 |
| publicai/allenai/Olmo-3-32B-Think | $0 | $0 | 32768 | 4096 |
| fireworks_ai/accounts/fireworks/models/kimi-k2-instruct-0905 | $0.6 | $2.5 | 262144 | 32768 |
| claude-opus-4-5 | $5 | $25 | 200000 | 64000 |
| databricks/databricks-claude-haiku-4-5 | $1 | $5 | 200000 | 64000 |
| databricks/databricks-claude-opus-4 | $15 | $75 | 200000 | 32000 |
| databricks/databricks-claude-opus-4-1 | $15 | $75 | 200000 | 32000 |
| databricks/databricks-claude-opus-4-5 | $5 | $25 | 200000 | 64000 |
| databricks/databricks-claude-sonnet-4 | $3 | $15 | 200000 | 64000 |
| databricks/databricks-claude-sonnet-4-1 | $3 | $15 | 200000 | 64000 |
| databricks/databricks-claude-sonnet-4-5 | $3 | $15 | 200000 | 64000 |
| databricks/databricks-gemini-2-5-flash | $0.3 | $2.5 | 1.04858e+06 | 65535 |
| databricks/databricks-gemini-2-5-pro | $1.25 | $10 | 1.04858e+06 | 65536 |
| databricks/databricks-gemma-3-12b | $0.15 | $0.5 | 128000 | 32000 |
| databricks/databricks-gpt-5 | $1.25 | $10 | 272000 | 128000 |
| databricks/databricks-gpt-5-1 | $1.25 | $10 | 272000 | 128000 |
| databricks/databricks-gpt-5-mini | $0.25 | $2 | 272000 | 128000 |
| databricks/databricks-gpt-5-nano | $0.05 | $0.4 | 272000 | 128000 |
| databricks/databricks-gpt-oss-120b | $0.15 | $0.6 | 131072 | 131072 |
| databricks/databricks-gpt-oss-20b | $0.07 | $0.3 | 131072 | 131072 |
| databricks/databricks-meta-llama-3-1-8b-instruct | $0.15 | $0.45 | 200000 | 128000 |
| fireworks_ai/accounts/fireworks/models/deepseek-v3p2 | $0.56 | $1.68 | 163840 | 163840 |
| vertex_ai/chirp | -- | -- | nan | nan |
| zai/glm-4.6 | $0.6 | $2.2 | 200000 | 128000 |
| zai/glm-4.5 | $0.6 | $2.2 | 128000 | 32000 |
| zai/glm-4.5v | $0.6 | $1.8 | 128000 | 32000 |
| zai/glm-4.5-x | $2.2 | $8.9 | 128000 | 32000 |
| zai/glm-4.5-air | $0.2 | $1.1 | 128000 | 32000 |
| zai/glm-4.5-airx | $1.1 | $4.5 | 128000 | 32000 |
| zai/glm-4-32b-0414-128k | $0.1 | $0.1 | 128000 | 32000 |
| zai/glm-4.5-flash | $0 | $0 | 128000 | 32000 |
| amazon.nova-2-lite-v1:0 | $0.3 | $2.5 | 1e+06 | 64000 |
| apac.amazon.nova-2-lite-v1:0 | $0.33 | $2.75 | 1e+06 | 64000 |
| eu.amazon.nova-2-lite-v1:0 | $0.33 | $2.75 | 1e+06 | 64000 |
| us.amazon.nova-2-lite-v1:0 | $0.33 | $2.75 | 1e+06 | 64000 |
| deepseek/deepseek-v3.2 | $0.28 | $0.4 | 163840 | 163840 |
| ft:gpt-4.1-2025-04-14 | $3 | $12 | 1.04758e+06 | 32768 |
| ft:gpt-4.1-mini-2025-04-14 | $0.8 | $3.2 | 1.04758e+06 | 32768 |
| ft:gpt-4.1-nano-2025-04-14 | $0.2 | $0.8 | 1.04758e+06 | 32768 |
| ft:o4-mini-2025-04-16 | $4 | $16 | 200000 | 100000 |
| openrouter/deepseek/deepseek-v3.2 | $0.28 | $0.4 | 163840 | 163840 |
| global.anthropic.claude-opus-4-5-20251101-v1:0 | $5 | $25 | 200000 | 64000 |
| amazon.titan-image-generator-v2:0 | -- | -- | nan | nan |
| moonshot/kimi-k2-0905-preview | $0.6 | $2.5 | 262144 | 262144 |
| moonshot/kimi-k2-turbo-preview | $1.15 | $8 | 262144 | 262144 |
| moonshot/kimi-k2-thinking-turbo | $1.15 | $8 | 262144 | 262144 |
| azure/gpt-5.1-codex-max | $1.25 | $10 | 272000 | 128000 |
| azure_ai/mistral-large-3 | $0.5 | $1.5 | 256000 | 8191 |
| gpt-5.1-codex-max | $1.25 | $10 | 272000 | 128000 |
| mistral/mistral-large-3 | $0.5 | $1.5 | 262144 | 262144 |
| amazon-nova/nova-micro-v1 | $0.04 | $0.14 | 128000 | 10000 |
| amazon-nova/nova-lite-v1 | $0.06 | $0.24 | 300000 | 10000 |
| amazon-nova/nova-premier-v1 | $2.5 | $12.5 | 1e+06 | 10000 |
| amazon-nova/nova-pro-v1 | $0.8 | $3.2 | 300000 | 10000 |
| google.gemma-3-12b-it | $0.09 | $0.29 | 128000 | 8192 |
| google.gemma-3-27b-it | $0.23 | $0.38 | 128000 | 8192 |
| google.gemma-3-4b-it | $0.04 | $0.08 | 128000 | 8192 |
| global.amazon.nova-2-lite-v1:0 | $0.3 | $2.5 | 1e+06 | 64000 |
| minimax.minimax-m2 | $0.3 | $1.2 | 128000 | 8192 |
| mistral.magistral-small-2509 | $0.5 | $1.5 | 128000 | 8192 |
| mistral.ministral-3-14b-instruct | $0.2 | $0.2 | 128000 | 8192 |
| mistral.ministral-3-3b-instruct | $0.1 | $0.1 | 128000 | 8192 |
| mistral.ministral-3-8b-instruct | $0.15 | $0.15 | 128000 | 8192 |
| mistral.mistral-large-3-675b-instruct | $0.5 | $1.5 | 128000 | 8192 |
| mistral.voxtral-mini-3b-2507 | $0.04 | $0.04 | 128000 | 8192 |
| mistral.voxtral-small-24b-2507 | $0.1 | $0.3 | 128000 | 8192 |
| moonshot.kimi-k2-thinking | $0.6 | $2.5 | 128000 | 8192 |
| nvidia.nemotron-nano-12b-v2 | $0.2 | $0.6 | 128000 | 8192 |
| nvidia.nemotron-nano-9b-v2 | $0.06 | $0.23 | 128000 | 8192 |
| openai.gpt-oss-safeguard-120b | $0.15 | $0.6 | 128000 | 8192 |
| openai.gpt-oss-safeguard-20b | $0.07 | $0.2 | 128000 | 8192 |
| qwen.qwen3-next-80b-a3b | $0.15 | $1.2 | 128000 | 8192 |
| qwen.qwen3-vl-235b-a22b | $0.53 | $2.66 | 128000 | 8192 |
| nvidia_nim/ranking/nvidia/llama-3.2-nv-rerankqa-1b-v2 | $0 | $0 | nan | nan |
| us.writer.palmyra-x4-v1:0 | $2.5 | $10 | 128000 | 8192 |
| us.writer.palmyra-x5-v1:0 | $0.6 | $6 | 1e+06 | 8192 |
| writer.palmyra-x4-v1:0 | $2.5 | $10 | 128000 | 8192 |
| writer.palmyra-x5-v1:0 | $0.6 | $6 | 1e+06 | 8192 |
| cerebras/zai-glm-4.6 | $2.25 | $2.75 | 128000 | 128000 |
| fireworks_ai/accounts/fireworks/models/qwen3-coder-480b-a35b-instruct | $0.45 | $1.8 | 262144 | 262144 |
| fireworks_ai/accounts/fireworks/models/flux-kontext-pro | $0.04 | $0.04 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/SSD-1B | $0 | $0 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/chronos-hermes-13b-v2 | $0.2 | $0.2 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/code-llama-13b | $0.2 | $0.2 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/code-llama-13b-instruct | $0.2 | $0.2 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/code-llama-13b-python | $0.2 | $0.2 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/code-llama-34b | $0.9 | $0.9 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/code-llama-34b-instruct | $0.9 | $0.9 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/code-llama-34b-python | $0.9 | $0.9 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/code-llama-70b | $0.9 | $0.9 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/code-llama-70b-instruct | $0.9 | $0.9 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/code-llama-70b-python | $0.9 | $0.9 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/code-llama-7b | $0.2 | $0.2 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/code-llama-7b-instruct | $0.2 | $0.2 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/code-llama-7b-python | $0.2 | $0.2 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/code-qwen-1p5-7b | $0.2 | $0.2 | 65536 | 65536 |
| fireworks_ai/accounts/fireworks/models/codegemma-2b | $0.1 | $0.1 | 8192 | 8192 |
| fireworks_ai/accounts/fireworks/models/codegemma-7b | $0.2 | $0.2 | 8192 | 8192 |
| fireworks_ai/accounts/fireworks/models/cogito-671b-v2-p1 | $1.2 | $1.2 | 163840 | 163840 |
| fireworks_ai/accounts/fireworks/models/cogito-v1-preview-llama-3b | $0.1 | $0.1 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/cogito-v1-preview-llama-70b | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/cogito-v1-preview-llama-8b | $0.2 | $0.2 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/cogito-v1-preview-qwen-14b | $0.2 | $0.2 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/cogito-v1-preview-qwen-32b | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/flux-kontext-max | $0.08 | $0.08 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/dbrx-instruct | $1.2 | $1.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/deepseek-coder-1b-base | $0.1 | $0.1 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/deepseek-coder-33b-instruct | $0.9 | $0.9 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/deepseek-coder-7b-base | $0.2 | $0.2 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/deepseek-coder-7b-base-v1p5 | $0.2 | $0.2 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/deepseek-coder-7b-instruct-v1p5 | $0.2 | $0.2 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-lite-base | $0.5 | $0.5 | 163840 | 163840 |
| fireworks_ai/accounts/fireworks/models/deepseek-coder-v2-lite-instruct | $0.5 | $0.5 | 163840 | 163840 |
| fireworks_ai/accounts/fireworks/models/deepseek-prover-v2 | $1.2 | $1.2 | 163840 | 163840 |
| fireworks_ai/accounts/fireworks/models/deepseek-r1-0528-distill-qwen3-8b | $0.2 | $0.2 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-llama-70b | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-llama-8b | $0.2 | $0.2 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-qwen-14b | $0.2 | $0.2 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-qwen-1p5b | $0.1 | $0.1 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-qwen-32b | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/deepseek-r1-distill-qwen-7b | $0.2 | $0.2 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/deepseek-v2-lite-chat | $0.5 | $0.5 | 163840 | 163840 |
| fireworks_ai/accounts/fireworks/models/deepseek-v2p5 | $1.2 | $1.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/devstral-small-2505 | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/dobby-mini-unhinged-plus-llama-3-1-8b | $0.2 | $0.2 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/dobby-unhinged-llama-3-3-70b-new | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/dolphin-2-9-2-qwen2-72b | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/dolphin-2p6-mixtral-8x7b | $0.5 | $0.5 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/ernie-4p5-21b-a3b-pt | $0.1 | $0.1 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/ernie-4p5-300b-a47b-pt | $0.1 | $0.1 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/fare-20b | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/firefunction-v1 | $0.5 | $0.5 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/firellava-13b | $0.2 | $0.2 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/firesearch-ocr-v6 | $0.2 | $0.2 | 8192 | 8192 |
| fireworks_ai/accounts/fireworks/models/fireworks-asr-large | $0 | $0 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/fireworks-asr-v2 | $0 | $0 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/flux-1-dev | $0.1 | $0.1 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/flux-1-dev-controlnet-union | $0 | $0 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/flux-1-dev-fp8 | $0 | $0 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/flux-1-schnell | $0.1 | $0.1 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/flux-1-schnell-fp8 | $0 | $0 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/gemma-2b-it | $0.1 | $0.1 | 8192 | 8192 |
| fireworks_ai/accounts/fireworks/models/gemma-3-27b-it | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/gemma-7b | $0.2 | $0.2 | 8192 | 8192 |
| fireworks_ai/accounts/fireworks/models/gemma-7b-it | $0.2 | $0.2 | 8192 | 8192 |
| fireworks_ai/accounts/fireworks/models/gemma2-9b-it | $0.2 | $0.2 | 8192 | 8192 |
| fireworks_ai/accounts/fireworks/models/glm-4p5v | $1.2 | $1.2 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/gpt-oss-safeguard-120b | $1.2 | $1.2 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/gpt-oss-safeguard-20b | $0.5 | $0.5 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/hermes-2-pro-mistral-7b | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/internvl3-38b | $0.9 | $0.9 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/internvl3-78b | $0.9 | $0.9 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/internvl3-8b | $0.2 | $0.2 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/japanese-stable-diffusion-xl | $0 | $0 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/kat-coder | $0.9 | $0.9 | 262144 | 262144 |
| fireworks_ai/accounts/fireworks/models/kat-dev-32b | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/kat-dev-72b-exp | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/llama-guard-2-8b | $0.2 | $0.2 | 8192 | 8192 |
| fireworks_ai/accounts/fireworks/models/llama-guard-3-1b | $0.1 | $0.1 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/llama-guard-3-8b | $0.2 | $0.2 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/llama-v2-13b | $0.2 | $0.2 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/llama-v2-13b-chat | $0.2 | $0.2 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/llama-v2-70b | $0.1 | $0.1 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/llama-v2-70b-chat | $0.9 | $0.9 | 2048 | 2048 |
| fireworks_ai/accounts/fireworks/models/llama-v2-7b | $0.2 | $0.2 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/llama-v2-7b-chat | $0.2 | $0.2 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/llama-v3-70b-instruct | $0.9 | $0.9 | 8192 | 8192 |
| fireworks_ai/accounts/fireworks/models/llama-v3-70b-instruct-hf | $0.9 | $0.9 | 8192 | 8192 |
| fireworks_ai/accounts/fireworks/models/llama-v3-8b | $0.2 | $0.2 | 8192 | 8192 |
| fireworks_ai/accounts/fireworks/models/llama-v3-8b-instruct-hf | $0.2 | $0.2 | 8192 | 8192 |
| fireworks_ai/accounts/fireworks/models/llama-v3p1-405b-instruct-long | $0.1 | $0.1 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/llama-v3p1-70b-instruct | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/llama-v3p1-70b-instruct-1b | $0.1 | $0.1 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/llama-v3p1-nemotron-70b-instruct | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/llama-v3p2-1b | $0.1 | $0.1 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/llama-v3p2-3b | $0.1 | $0.1 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/llama-v3p3-70b-instruct | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/llamaguard-7b | $0.2 | $0.2 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/llava-yi-34b | $0.9 | $0.9 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/minimax-m1-80k | $0.1 | $0.1 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/minimax-m2 | $0.3 | $1.2 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/ministral-3-14b-instruct-2512 | $0.2 | $0.2 | 256000 | 256000 |
| fireworks_ai/accounts/fireworks/models/ministral-3-3b-instruct-2512 | $0.1 | $0.1 | 256000 | 256000 |
| fireworks_ai/accounts/fireworks/models/ministral-3-8b-instruct-2512 | $0.2 | $0.2 | 256000 | 256000 |
| fireworks_ai/accounts/fireworks/models/mistral-7b | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/mistral-7b-instruct-4k | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/mistral-7b-instruct-v0p2 | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/mistral-7b-instruct-v3 | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/mistral-7b-v0p2 | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/mistral-large-3-fp8 | $1.2 | $1.2 | 256000 | 256000 |
| fireworks_ai/accounts/fireworks/models/mistral-nemo-base-2407 | $0.2 | $0.2 | 128000 | 128000 |
| fireworks_ai/accounts/fireworks/models/mistral-nemo-instruct-2407 | $0.2 | $0.2 | 128000 | 128000 |
| fireworks_ai/accounts/fireworks/models/mistral-small-24b-instruct-2501 | $0.9 | $0.9 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/mixtral-8x22b | $1.2 | $1.2 | 65536 | 65536 |
| fireworks_ai/accounts/fireworks/models/mixtral-8x22b-instruct | $1.2 | $1.2 | 65536 | 65536 |
| fireworks_ai/accounts/fireworks/models/mixtral-8x7b | $0.5 | $0.5 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/mixtral-8x7b-instruct | $0.5 | $0.5 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/mixtral-8x7b-instruct-hf | $0.5 | $0.5 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/mythomax-l2-13b | $0.2 | $0.2 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/nemotron-nano-v2-12b-vl | $0.1 | $0.1 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/nous-capybara-7b-v1p9 | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/nous-hermes-2-mixtral-8x7b-dpo | $0.5 | $0.5 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/nous-hermes-2-yi-34b | $0.9 | $0.9 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/nous-hermes-llama2-13b | $0.2 | $0.2 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/nous-hermes-llama2-70b | $0.9 | $0.9 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/nous-hermes-llama2-7b | $0.2 | $0.2 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/nvidia-nemotron-nano-12b-v2 | $0.2 | $0.2 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/nvidia-nemotron-nano-9b-v2 | $0.2 | $0.2 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/openchat-3p5-0106-7b | $0.2 | $0.2 | 8192 | 8192 |
| fireworks_ai/accounts/fireworks/models/openhermes-2-mistral-7b | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/openhermes-2p5-mistral-7b | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/openorca-7b | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/phi-2-3b | $0.1 | $0.1 | 2048 | 2048 |
| fireworks_ai/accounts/fireworks/models/phi-3-mini-128k-instruct | $0.1 | $0.1 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/phi-3-vision-128k-instruct | $0.2 | $0.2 | 32064 | 32064 |
| fireworks_ai/accounts/fireworks/models/phind-code-llama-34b-python-v1 | $0.9 | $0.9 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/phind-code-llama-34b-v1 | $0.9 | $0.9 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/phind-code-llama-34b-v2 | $0.9 | $0.9 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/playground-v2-1024px-aesthetic | $0 | $0 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/playground-v2-5-1024px-aesthetic | $0 | $0 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/pythia-12b | $0.2 | $0.2 | 2048 | 2048 |
| fireworks_ai/accounts/fireworks/models/qwen-qwq-32b-preview | $0.9 | $0.9 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen-v2p5-14b-instruct | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen-v2p5-7b | $0.2 | $0.2 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/qwen1p5-72b-chat | $0.9 | $0.9 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2-7b-instruct | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2-vl-2b-instruct | $0.1 | $0.1 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2-vl-72b-instruct | $0.9 | $0.9 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2-vl-7b-instruct | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-0p5b-instruct | $0.1 | $0.1 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-14b | $0.2 | $0.2 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-1p5b-instruct | $0.1 | $0.1 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-32b | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-32b-instruct | $0.9 | $0.9 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-72b | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-72b-instruct | $0.9 | $0.9 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-7b-instruct | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-0p5b | $0.1 | $0.1 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-0p5b-instruct | $0.1 | $0.1 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-14b | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-14b-instruct | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-1p5b | $0.1 | $0.1 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-1p5b-instruct | $0.1 | $0.1 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b | $0.9 | $0.9 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct-128k | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct-32k-rope | $0.9 | $0.9 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-32b-instruct-64k | $0.9 | $0.9 | 65536 | 65536 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-3b | $0.1 | $0.1 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-3b-instruct | $0.1 | $0.1 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-7b | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-coder-7b-instruct | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-math-72b-instruct | $0.9 | $0.9 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-vl-32b-instruct | $0.9 | $0.9 | 128000 | 128000 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-vl-3b-instruct | $0.2 | $0.2 | 128000 | 128000 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-vl-72b-instruct | $0.9 | $0.9 | 128000 | 128000 |
| fireworks_ai/accounts/fireworks/models/qwen2p5-vl-7b-instruct | $0.2 | $0.2 | 128000 | 128000 |
| fireworks_ai/accounts/fireworks/models/qwen3-0p6b | $0.1 | $0.1 | 40960 | 40960 |
| fireworks_ai/accounts/fireworks/models/qwen3-14b | $0.2 | $0.2 | 40960 | 40960 |
| fireworks_ai/accounts/fireworks/models/qwen3-1p7b | $0.1 | $0.1 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/qwen3-1p7b-fp8-draft | $0.1 | $0.1 | 262144 | 262144 |
| fireworks_ai/accounts/fireworks/models/qwen3-1p7b-fp8-draft-131072 | $0.1 | $0.1 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/qwen3-1p7b-fp8-draft-40960 | $0.1 | $0.1 | 40960 | 40960 |
| fireworks_ai/accounts/fireworks/models/qwen3-235b-a22b | $0.22 | $0.88 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/qwen3-235b-a22b-instruct-2507 | $0.22 | $0.88 | 262144 | 262144 |
| fireworks_ai/accounts/fireworks/models/qwen3-235b-a22b-thinking-2507 | $0.22 | $0.88 | 262144 | 262144 |
| fireworks_ai/accounts/fireworks/models/qwen3-30b-a3b | $0.15 | $0.6 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/qwen3-30b-a3b-instruct-2507 | $0.5 | $0.5 | 262144 | 262144 |
| fireworks_ai/accounts/fireworks/models/qwen3-30b-a3b-thinking-2507 | $0.9 | $0.9 | 262144 | 262144 |
| fireworks_ai/accounts/fireworks/models/qwen3-32b | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/qwen3-4b | $0.2 | $0.2 | 40960 | 40960 |
| fireworks_ai/accounts/fireworks/models/qwen3-4b-instruct-2507 | $0.2 | $0.2 | 262144 | 262144 |
| fireworks_ai/accounts/fireworks/models/qwen3-8b | $0.2 | $0.2 | 40960 | 40960 |
| fireworks_ai/accounts/fireworks/models/qwen3-coder-30b-a3b-instruct | $0.15 | $0.6 | 262144 | 262144 |
| fireworks_ai/accounts/fireworks/models/qwen3-coder-480b-instruct-bf16 | $0.9 | $0.9 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/qwen3-embedding-0p6b | $0 | $0 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/qwen3-embedding-4b | $0 | $0 | 40960 | 40960 |
| fireworks_ai/accounts/fireworks/models/qwen3-embedding-8b | $0 | $0 | 40960 | 40960 |
| fireworks_ai/accounts/fireworks/models/qwen3-next-80b-a3b-instruct | $0.9 | $0.9 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/qwen3-next-80b-a3b-thinking | $0.9 | $0.9 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/qwen3-reranker-0p6b | $0 | $0 | 40960 | 40960 |
| fireworks_ai/accounts/fireworks/models/qwen3-reranker-4b | $0 | $0 | 40960 | 40960 |
| fireworks_ai/accounts/fireworks/models/qwen3-reranker-8b | $0 | $0 | 40960 | 40960 |
| fireworks_ai/accounts/fireworks/models/qwen3-vl-235b-a22b-instruct | $0.22 | $0.88 | 262144 | 262144 |
| fireworks_ai/accounts/fireworks/models/qwen3-vl-235b-a22b-thinking | $0.22 | $0.88 | 262144 | 262144 |
| fireworks_ai/accounts/fireworks/models/qwen3-vl-30b-a3b-instruct | $0.15 | $0.6 | 262144 | 262144 |
| fireworks_ai/accounts/fireworks/models/qwen3-vl-30b-a3b-thinking | $0.15 | $0.6 | 262144 | 262144 |
| fireworks_ai/accounts/fireworks/models/qwen3-vl-32b-instruct | $0.9 | $0.9 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/qwen3-vl-8b-instruct | $0.2 | $0.2 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/qwq-32b | $0.9 | $0.9 | 131072 | 131072 |
| fireworks_ai/accounts/fireworks/models/rolm-ocr | $0.2 | $0.2 | 128000 | 128000 |
| fireworks_ai/accounts/fireworks/models/snorkel-mistral-7b-pairrm-dpo | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/stable-diffusion-xl-1024-v1-0 | $0 | $0 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/stablecode-3b | $0.1 | $0.1 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/starcoder-16b | $0.2 | $0.2 | 8192 | 8192 |
| fireworks_ai/accounts/fireworks/models/starcoder-7b | $0.2 | $0.2 | 8192 | 8192 |
| fireworks_ai/accounts/fireworks/models/starcoder2-15b | $0.2 | $0.2 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/starcoder2-3b | $0.1 | $0.1 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/starcoder2-7b | $0.2 | $0.2 | 16384 | 16384 |
| fireworks_ai/accounts/fireworks/models/toppy-m-7b | $0.2 | $0.2 | 32768 | 32768 |
| fireworks_ai/accounts/fireworks/models/whisper-v3 | $0 | $0 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/whisper-v3-turbo | $0 | $0 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/yi-34b | $0.9 | $0.9 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/yi-34b-200k-capybara | $0.9 | $0.9 | 200000 | 200000 |
| fireworks_ai/accounts/fireworks/models/yi-34b-chat | $0.9 | $0.9 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/yi-6b | $0.2 | $0.2 | 4096 | 4096 |
| fireworks_ai/accounts/fireworks/models/zephyr-7b-beta | $0.2 | $0.2 | 32768 | 32768 |
| gemini/gemini-2.5-computer-use-preview-10-2025 | $1.25 | $10 | 128000 | 64000 |
| vertex_ai/deepseek-ai/deepseek-v3.2-maas | $0.56 | $1.68 | 163840 | 32768 |
| voyage/rerank-2.5 | $0.05 | $0 | 32000 | 32000 |
| voyage/rerank-2.5-lite | $0.02 | $0 | 32000 | 32000 |
| azure_ai/claude-haiku-4-5 | $1 | $5 | 200000 | 64000 |
| azure_ai/claude-opus-4-1 | $15 | $75 | 200000 | 32000 |
| azure_ai/claude-sonnet-4-5 | $3 | $15 | 200000 | 64000 |
| gpt-5.2 | $1.75 | $14 | 272000 | 128000 |
| gpt-5.2-2025-12-11 | $1.75 | $14 | 272000 | 128000 |
| gpt-5.2-chat-latest | $1.75 | $14 | 128000 | 16384 |
| gpt-5.2-pro | $21 | $168 | 272000 | 128000 |
| gpt-5.2-pro-2025-12-11 | $21 | $168 | 272000 | 128000 |
| mistral/codestral-2508 | $0.3 | $0.9 | 256000 | 256000 |
| mistral/labs-devstral-small-2512 | $0.1 | $0.3 | 256000 | 256000 |
| mistral/devstral-2512 | $0.4 | $2 | 256000 | 256000 |
| azure/gpt-5.2 | $1.75 | $14 | 272000 | 128000 |
| azure/gpt-5.2-2025-12-11 | $1.75 | $14 | 272000 | 128000 |
| azure/gpt-5.2-chat-2025-12-11 | $1.75 | $14 | 128000 | 16384 |
| azure/gpt-5.2-pro | $21 | $168 | 272000 | 128000 |
| azure/gpt-5.2-pro-2025-12-11 | $21 | $168 | 272000 | 128000 |
| eu.anthropic.claude-opus-4-5-20251101-v1:0 | $5 | $25 | 200000 | 64000 |
| azure_ai/cohere-rerank-v4.0-pro | $0 | $0 | 32768 | 32768 |
| azure_ai/cohere-rerank-v4.0-fast | $0 | $0 | 32768 | 32768 |
| stability/sd3 | -- | -- | nan | nan |
| stability/sd3-large | -- | -- | nan | nan |
| stability/sd3-large-turbo | -- | -- | nan | nan |
| stability/sd3-medium | -- | -- | nan | nan |
| stability/sd3.5-large | -- | -- | nan | nan |
| stability/sd3.5-large-turbo | -- | -- | nan | nan |
| stability/sd3.5-medium | -- | -- | nan | nan |
| stability/stable-image-ultra | -- | -- | nan | nan |
| stability/stable-image-core | -- | -- | nan | nan |
| openrouter/mistralai/devstral-2512:free | $0 | $0 | 262144 | 262144 |
| openrouter/mistralai/devstral-2512 | $0.15 | $0.6 | 262144 | 65536 |
| openrouter/mistralai/ministral-3b-2512 | $0.1 | $0.1 | 131072 | 131072 |
| openrouter/mistralai/ministral-8b-2512 | $0.15 | $0.15 | 262144 | 262144 |
| openrouter/mistralai/ministral-14b-2512 | $0.2 | $0.2 | 262144 | 262144 |
| openrouter/mistralai/mistral-large-2512 | $0.5 | $1.5 | 262144 | 262144 |
| openrouter/openai/gpt-5.2 | $1.75 | $14 | 272000 | 128000 |
| openrouter/openai/gpt-5.2-chat | $1.75 | $14 | 128000 | 16384 |
| openrouter/openai/gpt-5.2-pro | $21 | $168 | 272000 | 128000 |
| azure_ai/deepseek-v3.2 | $0.58 | $1.68 | 163840 | 163840 |
| azure_ai/deepseek-v3.2-speciale | $0.58 | $1.68 | 163840 | 163840 |
| github_copilot/claude-haiku-4.5 | -- | -- | 128000 | 16000 |
| github_copilot/claude-opus-4.5 | -- | -- | 128000 | 16000 |
| github_copilot/claude-opus-41 | -- | -- | 80000 | 16000 |
| github_copilot/claude-sonnet-4 | -- | -- | 128000 | 16000 |
| github_copilot/claude-sonnet-4.5 | -- | -- | 128000 | 16000 |
| github_copilot/gemini-2.5-pro | -- | -- | 128000 | 64000 |
| github_copilot/gemini-3-pro-preview | -- | -- | 128000 | 64000 |
| github_copilot/gpt-3.5-turbo | -- | -- | 16384 | 4096 |
| github_copilot/gpt-3.5-turbo-0613 | -- | -- | 16384 | 4096 |
| github_copilot/gpt-4 | -- | -- | 32768 | 4096 |
| github_copilot/gpt-4-0613 | -- | -- | 32768 | 4096 |
| github_copilot/gpt-4-o-preview | -- | -- | 64000 | 4096 |
| github_copilot/gpt-4.1 | -- | -- | 128000 | 16384 |
| github_copilot/gpt-4.1-2025-04-14 | -- | -- | 128000 | 16384 |
| github_copilot/gpt-41-copilot | -- | -- | nan | nan |
| github_copilot/gpt-4o | -- | -- | 64000 | 4096 |
| github_copilot/gpt-4o-2024-05-13 | -- | -- | 64000 | 4096 |
| github_copilot/gpt-4o-2024-08-06 | -- | -- | 64000 | 16384 |
| github_copilot/gpt-4o-2024-11-20 | -- | -- | 64000 | 16384 |
| github_copilot/gpt-4o-mini | -- | -- | 64000 | 4096 |
| github_copilot/gpt-4o-mini-2024-07-18 | -- | -- | 64000 | 4096 |
| github_copilot/gpt-5 | -- | -- | 128000 | 128000 |
| github_copilot/gpt-5-mini | -- | -- | 128000 | 64000 |
| github_copilot/gpt-5.1 | -- | -- | 128000 | 64000 |
| github_copilot/gpt-5.1-codex-max | -- | -- | 128000 | 128000 |
| github_copilot/gpt-5.2 | -- | -- | 128000 | 64000 |
| github_copilot/text-embedding-3-small | -- | -- | 8191 | nan |
| github_copilot/text-embedding-3-small-inference | -- | -- | 8191 | nan |
| github_copilot/text-embedding-ada-002 | -- | -- | 8191 | nan |
| fireworks_ai/accounts/fireworks/models/ | $0.1 | $0 | 40960 | 40960 |
| gpt-4o-transcribe-diarize | $2.5 | $10 | 16000 | 2000 |
| gemini/gemini-3-flash-preview | $0.5 | $3 | 1.04858e+06 | 65535 |
| gemini-3-flash-preview | $0.5 | $3 | 1.04858e+06 | 65535 |
| gpt-image-1.5 | $5 | $10 | nan | nan |
| gpt-image-1.5-2025-12-16 | $5 | $10 | nan | nan |
| vertex_ai/gemini-3-flash-preview | $0.5 | $3 | 1.04858e+06 | 65535 |
| linkup/search | -- | -- | nan | nan |
| linkup/search-deep | -- | -- | nan | nan |
| stability/inpaint | -- | -- | nan | nan |
| stability/outpaint | -- | -- | nan | nan |
| stability/erase | -- | -- | nan | nan |
| stability/search-and-replace | -- | -- | nan | nan |
| stability/search-and-recolor | -- | -- | nan | nan |
| stability/remove-background | -- | -- | nan | nan |
| stability/replace-background-and-relight | -- | -- | nan | nan |
| stability/sketch | -- | -- | nan | nan |
| stability/structure | -- | -- | nan | nan |
| stability/style | -- | -- | nan | nan |
| stability/style-transfer | -- | -- | nan | nan |
| stability/fast | -- | -- | nan | nan |
| stability/conservative | -- | -- | nan | nan |
| stability/creative | -- | -- | nan | nan |
| stability.stable-conservative-upscale-v1:0 | -- | -- | 77 | nan |
| stability.stable-creative-upscale-v1:0 | -- | -- | 77 | nan |
| stability.stable-fast-upscale-v1:0 | -- | -- | 77 | nan |
| stability.stable-outpaint-v1:0 | -- | -- | 77 | nan |
| stability.stable-image-control-sketch-v1:0 | -- | -- | 77 | nan |
| stability.stable-image-control-structure-v1:0 | -- | -- | 77 | nan |
| stability.stable-image-erase-object-v1:0 | -- | -- | 77 | nan |
| stability.stable-image-inpaint-v1:0 | -- | -- | 77 | nan |
| stability.stable-image-remove-background-v1:0 | -- | -- | 77 | nan |
| stability.stable-image-search-recolor-v1:0 | -- | -- | 77 | nan |
| stability.stable-image-search-replace-v1:0 | -- | -- | 77 | nan |
| stability.stable-image-style-guide-v1:0 | -- | -- | 77 | nan |
| stability.stable-style-transfer-v1:0 | -- | -- | 77 | nan |
| vertex_ai/deepseek-ai/deepseek-ocr-maas | $0.3 | $1.2 | nan | nan |
| gemini/veo-3.1-fast-generate-001 | -- | -- | 1024 | nan |
| gemini/veo-3.1-generate-001 | -- | -- | 1024 | nan |
| aws_polly/standard | -- | -- | nan | nan |
| aws_polly/neural | -- | -- | nan | nan |
| aws_polly/long-form | -- | -- | nan | nan |
| aws_polly/generative | -- | -- | nan | nan |
| vertex_ai/veo-3.1-generate-001 | -- | -- | 1024 | nan |
| vertex_ai/veo-3.1-fast-generate-001 | -- | -- | 1024 | nan |
| azure_ai/gpt-oss-120b | $0.15 | $0.6 | 131072 | 131072 |
| azure/gpt-image-1.5 | $5 | -- | nan | nan |
| azure/gpt-image-1.5-2025-12-16 | $5 | -- | nan | nan |
| groq/meta-llama/llama-guard-4-12b | $0.2 | $0.2 | 8192 | 8192 |
| minimax/speech-02-hd | -- | -- | nan | nan |
| minimax/speech-02-turbo | -- | -- | nan | nan |
| minimax/speech-2.6-hd | -- | -- | nan | nan |
| minimax/speech-2.6-turbo | -- | -- | nan | nan |
| minimax/MiniMax-M2.1 | $0.3 | $1.2 | 1e+06 | 8192 |
| minimax/MiniMax-M2.1-lightning | $0.3 | $2.4 | 1e+06 | 8192 |
| minimax/MiniMax-M2 | $0.3 | $1.2 | 200000 | 8192 |
| azure/gpt-5.2-chat | $1.75 | $14 | 128000 | 16384 |
| aiml/google/imagen-4.0-ultra-generate-001 | -- | -- | nan | nan |
| aiml/google/nano-banana-pro | -- | -- | nan | nan |
| low/1024-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| low/1024-x-1536/gpt-image-1.5 | -- | -- | nan | nan |
| low/1536-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| medium/1024-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| medium/1024-x-1536/gpt-image-1.5 | -- | -- | nan | nan |
| medium/1536-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| high/1024-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| high/1024-x-1536/gpt-image-1.5 | -- | -- | nan | nan |
| high/1536-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| standard/1024-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| standard/1024-x-1536/gpt-image-1.5 | -- | -- | nan | nan |
| standard/1536-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| 1024-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| 1024-x-1536/gpt-image-1.5 | -- | -- | nan | nan |
| 1536-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| low/1024-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| low/1024-x-1536/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| low/1536-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| medium/1024-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| medium/1024-x-1536/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| medium/1536-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| high/1024-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| high/1024-x-1536/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| high/1536-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| standard/1024-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| standard/1024-x-1536/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| standard/1536-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| 1024-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| 1024-x-1536/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| 1536-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| zai/glm-4.7 | $0.6 | $2.2 | 200000 | 128000 |
| openrouter/google/gemini-3-flash-preview | $0.5 | $3 | 1.04858e+06 | 65535 |
| amazon.nova-2-multimodal-embeddings-v1:0 | $0.14 | $0 | 8172 | nan |
| gigachat/GigaChat-2-Lite | $0 | $0 | 128000 | 8192 |
| gigachat/GigaChat-2-Max | $0 | $0 | 128000 | 8192 |
| gigachat/GigaChat-2-Pro | $0 | $0 | 128000 | 8192 |
| gigachat/Embeddings | $0 | $0 | 512 | nan |
| gigachat/Embeddings-2 | $0 | $0 | 512 | nan |
| gigachat/EmbeddingsGigaR | $0 | $0 | 4096 | nan |
| llamagate/llama-3.1-8b | $0.03 | $0.05 | 131072 | 8192 |
| llamagate/llama-3.2-3b | $0.04 | $0.08 | 131072 | 8192 |
| llamagate/mistral-7b-v0.3 | $0.1 | $0.15 | 32768 | 8192 |
| llamagate/qwen3-8b | $0.04 | $0.14 | 32768 | 8192 |
| llamagate/dolphin3-8b | $0.08 | $0.15 | 128000 | 8192 |
| llamagate/deepseek-r1-8b | $0.1 | $0.2 | 65536 | 16384 |
| llamagate/deepseek-r1-7b-qwen | $0.08 | $0.15 | 131072 | 16384 |
| llamagate/openthinker-7b | $0.08 | $0.15 | 32768 | 8192 |
| llamagate/qwen2.5-coder-7b | $0.06 | $0.12 | 32768 | 8192 |
| llamagate/deepseek-coder-6.7b | $0.06 | $0.12 | 16384 | 4096 |
| llamagate/codellama-7b | $0.06 | $0.12 | 16384 | 4096 |
| llamagate/qwen3-vl-8b | $0.15 | $0.55 | 32768 | 8192 |
| llamagate/llava-7b | $0.1 | $0.2 | 4096 | 2048 |
| llamagate/gemma3-4b | $0.03 | $0.08 | 128000 | 8192 |
| llamagate/nomic-embed-text | $0.02 | $0 | 8192 | nan |
| llamagate/qwen3-embedding-8b | $0.02 | $0 | 40960 | nan |
| azure_ai/flux.2-pro | -- | -- | nan | nan |
| vertex_ai/zai-org/glm-4.7-maas | $0.6 | $2.2 | 200000 | 128000 |
| novita/deepseek/deepseek-v3.2 | $0.27 | $0.4 | 163840 | 65536 |
| novita/minimax/minimax-m2.1 | $0.3 | $1.2 | 204800 | 131072 |
| novita/zai-org/glm-4.7 | $0.6 | $2.2 | 204800 | 131072 |
| novita/xiaomimimo/mimo-v2-flash | $0.1 | $0.3 | 262144 | 32000 |
| novita/zai-org/autoglm-phone-9b-multilingual | $0.04 | $0.14 | 65536 | 65536 |
| novita/moonshotai/kimi-k2-thinking | $0.6 | $2.5 | 262144 | 262144 |
| novita/minimax/minimax-m2 | $0.3 | $1.2 | 204800 | 131072 |
| novita/paddlepaddle/paddleocr-vl | $0.02 | $0.02 | 16384 | 16384 |
| novita/deepseek/deepseek-v3.2-exp | $0.27 | $0.41 | 163840 | 65536 |
| novita/qwen/qwen3-vl-235b-a22b-thinking | $0.98 | $3.95 | 131072 | 32768 |
| novita/zai-org/glm-4.6v | $0.3 | $0.9 | 131072 | 32768 |
| novita/zai-org/glm-4.6 | $0.55 | $2.2 | 204800 | 131072 |
| novita/qwen/qwen3-next-80b-a3b-instruct | $0.15 | $1.5 | 131072 | 32768 |
| novita/qwen/qwen3-next-80b-a3b-thinking | $0.15 | $1.5 | 131072 | 32768 |
| novita/deepseek/deepseek-ocr | $0.03 | $0.03 | 8192 | 8192 |
| novita/deepseek/deepseek-v3.1-terminus | $0.27 | $1 | 131072 | 32768 |
| novita/qwen/qwen3-vl-235b-a22b-instruct | $0.3 | $1.5 | 131072 | 32768 |
| novita/qwen/qwen3-max | $2.11 | $8.45 | 262144 | 65536 |
| novita/skywork/r1v4-lite | $0.2 | $0.6 | 262144 | 65536 |
| novita/deepseek/deepseek-v3.1 | $0.27 | $1 | 131072 | 32768 |
| novita/moonshotai/kimi-k2-0905 | $0.6 | $2.5 | 262144 | 262144 |
| novita/qwen/qwen3-coder-480b-a35b-instruct | $0.3 | $1.3 | 262144 | 65536 |
| novita/qwen/qwen3-coder-30b-a3b-instruct | $0.07 | $0.27 | 160000 | 32768 |
| novita/openai/gpt-oss-120b | $0.05 | $0.25 | 131072 | 32768 |
| novita/moonshotai/kimi-k2-instruct | $0.57 | $2.3 | 131072 | 131072 |
| novita/deepseek/deepseek-v3-0324 | $0.27 | $1.12 | 163840 | 163840 |
| novita/zai-org/glm-4.5 | $0.6 | $2.2 | 131072 | 98304 |
| novita/qwen/qwen3-235b-a22b-thinking-2507 | $0.3 | $3 | 131072 | 32768 |
| novita/meta-llama/llama-3.1-8b-instruct | $0.02 | $0.05 | 16384 | 16384 |
| novita/google/gemma-3-12b-it | $0.05 | $0.1 | 131072 | 8192 |
| novita/zai-org/glm-4.5v | $0.6 | $1.8 | 65536 | 16384 |
| novita/openai/gpt-oss-20b | $0.04 | $0.15 | 131072 | 32768 |
| novita/qwen/qwen3-235b-a22b-instruct-2507 | $0.09 | $0.58 | 131072 | 16384 |
| novita/deepseek/deepseek-r1-distill-qwen-14b | $0.15 | $0.15 | 32768 | 16384 |
| novita/meta-llama/llama-3.3-70b-instruct | $0.14 | $0.4 | 131072 | 120000 |
| novita/qwen/qwen-2.5-72b-instruct | $0.38 | $0.4 | 32000 | 8192 |
| novita/mistralai/mistral-nemo | $0.04 | $0.17 | 60288 | 16000 |
| novita/minimaxai/minimax-m1-80k | $0.55 | $2.2 | 1e+06 | 40000 |
| novita/deepseek/deepseek-r1-0528 | $0.7 | $2.5 | 163840 | 32768 |
| novita/deepseek/deepseek-r1-distill-qwen-32b | $0.3 | $0.3 | 64000 | 32000 |
| novita/meta-llama/llama-3-8b-instruct | $0.04 | $0.04 | 8192 | 8192 |
| novita/microsoft/wizardlm-2-8x22b | $0.62 | $0.62 | 65535 | 8000 |
| novita/deepseek/deepseek-r1-0528-qwen3-8b | $0.06 | $0.09 | 128000 | 32000 |
| novita/deepseek/deepseek-r1-distill-llama-70b | $0.8 | $0.8 | 8192 | 8192 |
| novita/meta-llama/llama-3-70b-instruct | $0.51 | $0.74 | 8192 | 8000 |
| novita/qwen/qwen3-235b-a22b-fp8 | $0.2 | $0.8 | 40960 | 20000 |
| novita/meta-llama/llama-4-maverick-17b-128e-instruct-fp8 | $0.27 | $0.85 | 1.04858e+06 | 8192 |
| novita/meta-llama/llama-4-scout-17b-16e-instruct | $0.18 | $0.59 | 131072 | 131072 |
| novita/nousresearch/hermes-2-pro-llama-3-8b | $0.14 | $0.14 | 8192 | 8192 |
| novita/qwen/qwen2.5-vl-72b-instruct | $0.8 | $0.8 | 32768 | 32768 |
| novita/sao10k/l3-70b-euryale-v2.1 | $1.48 | $1.48 | 8192 | 8192 |
| novita/baidu/ernie-4.5-21B-a3b-thinking | $0.07 | $0.28 | 131072 | 65536 |
| novita/sao10k/l3-8b-lunaris | $0.05 | $0.05 | 8192 | 8192 |
| novita/baichuan/baichuan-m2-32b | $0.07 | $0.07 | 131072 | 131072 |
| novita/thudm/glm-4.1v-9b-thinking | $280 | $1104 | 65536 | 8000 |
| novita/baidu/ernie-4.5-vl-424b-a47b | $0.42 | $1.25 | 123000 | 16000 |
| novita/baidu/ernie-4.5-300b-a47b-paddle | $0.28 | $1.1 | 123000 | 12000 |
| novita/deepseek/deepseek-prover-v2-671b | $0.7 | $2.5 | 160000 | 160000 |
| novita/qwen/qwen3-32b-fp8 | $0.1 | $0.45 | 40960 | 20000 |
| novita/qwen/qwen3-30b-a3b-fp8 | $0.09 | $0.45 | 40960 | 20000 |
| novita/google/gemma-3-27b-it | $0.12 | $0.2 | 98304 | 16384 |
| novita/deepseek/deepseek-v3-turbo | $0.4 | $1.3 | 64000 | 16000 |
| novita/deepseek/deepseek-r1-turbo | $0.7 | $2.5 | 64000 | 16000 |
| novita/Sao10K/L3-8B-Stheno-v3.2 | $0.05 | $0.05 | 8192 | 32000 |
| novita/gryphe/mythomax-l2-13b | $0.09 | $0.09 | 4096 | 3200 |
| novita/baidu/ernie-4.5-vl-28b-a3b-thinking | $0.39 | $0.39 | 131072 | 65536 |
| novita/qwen/qwen3-vl-8b-instruct | $0.08 | $0.5 | 131072 | 32768 |
| novita/zai-org/glm-4.5-air | $0.13 | $0.85 | 131072 | 98304 |
| novita/qwen/qwen3-vl-30b-a3b-instruct | $0.2 | $0.7 | 131072 | 32768 |
| novita/qwen/qwen3-vl-30b-a3b-thinking | $0.2 | $1 | 131072 | 32768 |
| novita/qwen/qwen-mt-plus | $0.25 | $0.75 | 16384 | 8192 |
| novita/baidu/ernie-4.5-vl-28b-a3b | $0.14 | $0.56 | 30000 | 8000 |
| novita/baidu/ernie-4.5-21B-a3b | $0.07 | $0.28 | 120000 | 8000 |
| novita/qwen/qwen3-8b-fp8 | $0.04 | $0.14 | 128000 | 20000 |
| novita/qwen/qwen3-4b-fp8 | $0.03 | $0.03 | 128000 | 20000 |
| novita/qwen/qwen2.5-7b-instruct | $0.07 | $0.07 | 32000 | 32000 |
| novita/meta-llama/llama-3.2-3b-instruct | $0.03 | $0.05 | 32768 | 32000 |
| novita/sao10k/l31-70b-euryale-v2.2 | $1.48 | $1.48 | 8192 | 8192 |
| novita/qwen/qwen3-embedding-0.6b | $0.07 | $0 | 32768 | 32768 |
| novita/qwen/qwen3-embedding-8b | $0.07 | $0 | 32768 | 4096 |
| novita/baai/bge-m3 | $0.01 | $0.01 | 8192 | 96000 |
| novita/qwen/qwen3-reranker-8b | $0.05 | $0.05 | 32768 | 4096 |
| novita/baai/bge-reranker-v2-m3 | $0.01 | $0.01 | 8000 | 8000 |
| replicate/openai/gpt-5 | $1.25 | $10 | nan | nan |
| replicateopenai/gpt-oss-20b | $0.09 | $0.36 | nan | nan |
| replicate/anthropic/claude-4.5-haiku | $1 | $5 | nan | nan |
| replicate/ibm-granite/granite-3.3-8b-instruct | $0.03 | $0.25 | nan | nan |
| replicate/openai/gpt-4o | $2.5 | $10 | nan | nan |
| replicate/openai/o4-mini | $1 | $4 | nan | nan |
| replicate/openai/o1-mini | $1.1 | $4.4 | nan | nan |
| replicate/openai/o1 | $15 | $60 | nan | nan |
| replicate/openai/gpt-4o-mini | $0.15 | $0.6 | nan | nan |
| replicate/qwen/qwen3-235b-a22b-instruct-2507 | $0.26 | $1.06 | nan | nan |
| replicate/anthropic/claude-4-sonnet | $3 | $15 | nan | nan |
| replicate/deepseek-ai/deepseek-v3 | $1.45 | $1.45 | 65536 | 8192 |
| replicate/anthropic/claude-3.7-sonnet | $3 | $15 | nan | nan |
| replicate/anthropic/claude-3.5-haiku | $1 | $5 | nan | nan |
| replicate/anthropic/claude-3.5-sonnet | $3.75 | $18.75 | nan | nan |
| replicate/google/gemini-3-pro | $2 | $12 | nan | nan |
| replicate/anthropic/claude-4.5-sonnet | $3 | $15 | nan | nan |
| replicate/openai/gpt-4.1 | $2 | $8 | nan | nan |
| replicate/openai/gpt-4.1-nano | $0.1 | $0.4 | nan | nan |
| replicate/openai/gpt-4.1-mini | $0.4 | $1.6 | nan | nan |
| replicate/openai/gpt-5-nano | $0.05 | $0.4 | nan | nan |
| replicate/openai/gpt-5-mini | $0.25 | $2 | nan | nan |
| replicate/google/gemini-2.5-flash | $2.5 | $2.5 | nan | nan |
| replicate/openai/gpt-oss-120b | $0.18 | $0.72 | nan | nan |
| replicate/deepseek-ai/deepseek-v3.1 | $0.67 | $2.02 | 163840 | 163840 |
| replicate/xai/grok-4 | $7.2 | $36 | nan | nan |
| replicate/deepseek-ai/deepseek-r1 | $3.75 | $10 | 65536 | 8192 |
| azure_ai/claude-opus-4-5 | $5 | $25 | 200000 | 64000 |
| novita/kwaipilot/kat-coder-pro | $0.3 | $1.2 | 256000 | 128000 |
| novita/qwen/qwen3-omni-30b-a3b-thinking | $0.25 | $0.97 | 65536 | 16384 |
| novita/qwen/qwen3-omni-30b-a3b-instruct | $0.25 | $0.97 | 65536 | 16384 |
| cerebras/zai-glm-4.7 | $2.25 | $2.75 | 128000 | 128000 |
| gpt-5.2-codex | $1.75 | $14 | 272000 | 128000 |
| openrouter/openai/gpt-5.2-codex | $1.75 | $14 | 272000 | 128000 |
| azure/gpt-5.2-codex | $1.75 | $14 | 272000 | 128000 |
| dall-e-2 | -- | -- | nan | nan |
| dall-e-3 | -- | -- | nan | nan |
| gemini-2.5-computer-use-preview-10-2025 | $1.25 | $10 | 128000 | 64000 |
| chatgpt/gpt-5.2-codex | -- | -- | 128000 | 128000 |
| chatgpt/gpt-5.2 | -- | -- | 128000 | 64000 |
| chatgpt/gpt-5.1-codex-max | -- | -- | 128000 | 128000 |
| chatgpt/gpt-5.1-codex-mini | -- | -- | 128000 | 64000 |
| sarvam/sarvam-m | $0 | $0 | 8192 | 32000 |
| gmi/anthropic/claude-opus-4.5 | $5 | $25 | 409600 | 32000 |
| gmi/anthropic/claude-sonnet-4.5 | $3 | $15 | 409600 | 32000 |
| gmi/anthropic/claude-sonnet-4 | $3 | $15 | 409600 | 32000 |
| gmi/anthropic/claude-opus-4 | $15 | $75 | 409600 | 32000 |
| gmi/openai/gpt-5.2 | $1.75 | $14 | 409600 | 32000 |
| gmi/openai/gpt-5.1 | $1.25 | $10 | 409600 | 32000 |
| gmi/openai/gpt-5 | $1.25 | $10 | 409600 | 32000 |
| gmi/openai/gpt-4o | $2.5 | $10 | 131072 | 16384 |
| gmi/openai/gpt-4o-mini | $0.15 | $0.6 | 131072 | 16384 |
| gmi/deepseek-ai/DeepSeek-V3.2 | $0.28 | $0.4 | 163840 | 16384 |
| gmi/deepseek-ai/DeepSeek-V3-0324 | $0.28 | $0.88 | 163840 | 16384 |
| gmi/google/gemini-3-pro-preview | $2 | $12 | 1.04858e+06 | 65536 |
| gmi/google/gemini-3-flash-preview | $0.5 | $3 | 1.04858e+06 | 65536 |
| gmi/moonshotai/Kimi-K2-Thinking | $0.8 | $1.2 | 262144 | 16384 |
| gmi/MiniMaxAI/MiniMax-M2.1 | $0.3 | $1.2 | 196608 | 16384 |
| gmi/Qwen/Qwen3-VL-235B-A22B-Instruct-FP8 | $0.3 | $1.4 | 262144 | 16384 |
| gmi/zai-org/GLM-4.7-FP8 | $0.4 | $2 | 202752 | 16384 |
| gpt-audio | $2.5 | $10 | 128000 | 16384 |
| gpt-audio-2025-08-28 | $2.5 | $10 | 128000 | 16384 |
| gpt-audio-mini | $0.6 | $2.4 | 128000 | 16384 |
| gpt-audio-mini-2025-10-06 | $0.6 | $2.4 | 128000 | 16384 |
| gpt-audio-mini-2025-12-15 | $0.6 | $2.4 | 128000 | 16384 |
| deepseek-v3-2-251201 | $0 | $0 | 98304 | 32768 |
| glm-4-7-251222 | $0 | $0 | 204800 | 131072 |
| kimi-k2-thinking-251104 | $0 | $0 | 229376 | 32768 |
| gemini-robotics-er-1.5-preview | $0.3 | $2.5 | 1.04858e+06 | 65535 |
| gemini/gemini-robotics-er-1.5-preview | $0.3 | $2.5 | 1.04858e+06 | 65535 |
| openrouter/xiaomi/mimo-v2-flash | $0.1 | $0.3 | 262144 | 16384 |
| openrouter/z-ai/glm-4.7 | $0.4 | $1.5 | 202752 | 64000 |
| openrouter/z-ai/glm-4.7-flash | $0.07 | $0.4 | 200000 | 32000 |
| openrouter/minimax/minimax-m2.1 | $0.27 | $1.2 | 204000 | 64000 |
| amazon.nova-2-pro-preview-20251202-v1:0 | $2.19 | $17.5 | 1e+06 | 64000 |
| apac.amazon.nova-2-pro-preview-20251202-v1:0 | $2.19 | $17.5 | 1e+06 | 64000 |
| eu.amazon.nova-2-pro-preview-20251202-v1:0 | $2.19 | $17.5 | 1e+06 | 64000 |
| us.amazon.nova-2-pro-preview-20251202-v1:0 | $2.19 | $17.5 | 1e+06 | 64000 |
| openrouter/moonshotai/kimi-k2.5 | $0.6 | $3 | 262144 | 262144 |
| azure_ai/model_router | $0.14 | $0 | nan | nan |
| moonshot/kimi-k2.5 | $0.6 | $3 | 262144 | 262144 |
| together_ai/zai-org/GLM-4.7 | $0.45 | $2 | 200000 | 200000 |
| together_ai/moonshotai/Kimi-K2.5 | $0.5 | $2.8 | 256000 | 256000 |
| deep-research-pro-preview-12-2025 | $2 | $12 | 65536 | 32768 |
| gemini/deep-research-pro-preview-12-2025 | $2 | $12 | 65536 | 32768 |
| vertex_ai/deep-research-pro-preview-12-2025 | $2 | $12 | 65536 | 32768 |
| anthropic.claude-opus-4-6-v1:0 | $5 | $25 | 200000 | 128000 |
| global.anthropic.claude-opus-4-6-v1:0 | $5 | $25 | 200000 | 128000 |
| us.anthropic.claude-opus-4-6-v1:0 | $5.5 | $27.5 | 1e+06 | 128000 |
| eu.anthropic.claude-opus-4-6-v1:0 | $5.5 | $27.5 | 200000 | 128000 |
| apac.anthropic.claude-opus-4-6-v1:0 | $5.5 | $27.5 | 200000 | 128000 |
| azure_ai/claude-opus-4-6 | $5 | $25 | 200000 | 128000 |
| claude-opus-4-6 | $5 | $25 | 1e+06 | 128000 |
| claude-opus-4-6-20260205 | $5 | $25 | 1e+06 | 128000 |
| openrouter/qwen/qwen3-235b-a22b-2507 | $0.07 | $0.1 | 262144 | 262144 |
| openrouter/qwen/qwen3-235b-a22b-thinking-2507 | $0.11 | $0.6 | 262144 | 262144 |
| vertex_ai/claude-opus-4-6 | $5 | $25 | 1e+06 | 128000 |
| anthropic.claude-opus-4-6-v1 | $5 | $25 | 1e+06 | 128000 |
| global.anthropic.claude-opus-4-6-v1 | $5 | $25 | 1e+06 | 128000 |
| us.anthropic.claude-opus-4-6-v1 | $5.5 | $27.5 | 1e+06 | 128000 |
| eu.anthropic.claude-opus-4-6-v1 | $5.5 | $27.5 | 1e+06 | 128000 |
| apac.anthropic.claude-opus-4-6-v1 | $5.5 | $27.5 | 1e+06 | 128000 |
| au.anthropic.claude-opus-4-6-v1:0 | $5.5 | $27.5 | 200000 | 128000 |
| us/claude-opus-4-6 | $5.5 | $27.5 | 200000 | 128000 |
| us/claude-opus-4-6-20260205 | $5.5 | $27.5 | 200000 | 128000 |
| elevenlabs/eleven_v3 | -- | -- | nan | nan |
| elevenlabs/eleven_multilingual_v2 | -- | -- | nan | nan |
| fast/claude-opus-4-6 | $30 | $150 | 1e+06 | 128000 |
| fast/us/claude-opus-4-6 | $30 | $150 | 200000 | 128000 |
| fast/claude-opus-4-6-20260205 | $30 | $150 | 1e+06 | 128000 |
| vercel_ai_gateway/anthropic/claude-3-5-sonnet | $3 | $15 | 200000 | 8192 |
| vercel_ai_gateway/anthropic/claude-3-5-sonnet-20241022 | $3 | $15 | 200000 | 8192 |
| vercel_ai_gateway/anthropic/claude-3-7-sonnet | $3 | $15 | 200000 | 64000 |
| vercel_ai_gateway/anthropic/claude-haiku-4.5 | $1 | $5 | 200000 | 64000 |
| vercel_ai_gateway/anthropic/claude-opus-4 | $15 | $75 | 200000 | 32000 |
| vercel_ai_gateway/anthropic/claude-opus-4.1 | $15 | $75 | 200000 | 32000 |
| vercel_ai_gateway/anthropic/claude-opus-4.5 | $5 | $25 | 200000 | 64000 |
| vercel_ai_gateway/anthropic/claude-opus-4.6 | $5 | $25 | 200000 | 64000 |
| vercel_ai_gateway/anthropic/claude-sonnet-4 | $3 | $15 | 200000 | 64000 |
| vercel_ai_gateway/anthropic/claude-sonnet-4.5 | $3 | $15 | 1e+06 | 64000 |
| azure_ai/kimi-k2.5 | $0.6 | $3 | 262144 | 262144 |
| bedrock/ap-northeast-1/moonshotai.kimi-k2-thinking | $0.73 | $3.03 | 262144 | 262144 |
| bedrock/moonshotai.kimi-k2-thinking | $0.73 | $3.03 | 262144 | 262144 |
| bedrock/moonshotai.kimi-k2.5 | $0.6 | $3.03 | 262144 | 262144 |
| bedrock/ap-south-1/moonshotai.kimi-k2-thinking | $0.71 | $2.94 | 262144 | 262144 |
| bedrock/sa-east-1/moonshotai.kimi-k2-thinking | $0.73 | $3.03 | 262144 | 262144 |
| bedrock/us-east-1/moonshotai.kimi-k2-thinking | $0.6 | $2.5 | 262144 | 262144 |
| bedrock/us-east-2/moonshotai.kimi-k2-thinking | $0.6 | $2.5 | 262144 | 262144 |
| bedrock/us-west-2/moonshotai.kimi-k2-thinking | $0.6 | $2.5 | 262144 | 262144 |
| perplexity/preset/pro-search | -- | -- | nan | nan |
| perplexity/openai/gpt-4o | -- | -- | nan | nan |
| perplexity/openai/gpt-4o-mini | -- | -- | nan | nan |
| perplexity/openai/gpt-5.2 | -- | -- | nan | nan |
| perplexity/anthropic/claude-3-5-sonnet-20241022 | -- | -- | nan | nan |
| perplexity/anthropic/claude-3-5-haiku-20241022 | -- | -- | nan | nan |
| perplexity/google/gemini-2.0-flash-exp | -- | -- | nan | nan |
| perplexity/google/gemini-2.0-flash-thinking-exp | -- | -- | nan | nan |
| perplexity/xai/grok-2-1212 | -- | -- | nan | nan |
| perplexity/xai/grok-2-vision-1212 | -- | -- | nan | nan |
| dashscope/qwen3-max | -- | -- | 258048 | 65536 |
| minimax/MiniMax-M2.5 | $0.3 | $1.2 | 1e+06 | 8192 |
| minimax/MiniMax-M2.5-lightning | $0.3 | $2.4 | 1e+06 | 8192 |
| vertex_ai/zai-org/glm-5-maas | $1 | $3.2 | 200000 | 128000 |
| bedrock/ap-northeast-1/deepseek.v3.2 | $0.74 | $2.22 | 163840 | 163840 |
| bedrock/ap-northeast-1/minimax.minimax-m2.1 | $0.36 | $1.44 | 196000 | 8192 |
| bedrock/ap-northeast-1/moonshotai.kimi-k2.5 | $0.72 | $3.6 | 262144 | 262144 |
| bedrock/ap-northeast-1/qwen.qwen3-coder-next | $0.6 | $1.44 | 262144 | 8192 |
| bedrock/ap-south-1/deepseek.v3.2 | $0.74 | $2.22 | 163840 | 163840 |
| bedrock/ap-south-1/minimax.minimax-m2.1 | $0.36 | $1.44 | 196000 | 8192 |
| bedrock/ap-south-1/moonshotai.kimi-k2.5 | $0.72 | $3.6 | 262144 | 262144 |
| bedrock/ap-south-1/qwen.qwen3-coder-next | $0.6 | $1.44 | 262144 | 8192 |
| bedrock/ap-southeast-3/deepseek.v3.2 | $0.74 | $2.22 | 163840 | 163840 |
| bedrock/ap-southeast-3/minimax.minimax-m2.1 | $0.36 | $1.44 | 196000 | 8192 |
| bedrock/ap-southeast-3/moonshotai.kimi-k2.5 | $0.72 | $3.6 | 262144 | 262144 |
| bedrock/ap-southeast-3/qwen.qwen3-coder-next | $0.6 | $1.44 | 262144 | 8192 |
| bedrock/eu-north-1/deepseek.v3.2 | $0.74 | $2.22 | 163840 | 163840 |
| bedrock/eu-north-1/minimax.minimax-m2.1 | $0.36 | $1.44 | 196000 | 8192 |
| bedrock/eu-north-1/moonshotai.kimi-k2.5 | $0.72 | $3.6 | 262144 | 262144 |
| bedrock/eu-central-1/minimax.minimax-m2.1 | $0.36 | $1.44 | 196000 | 8192 |
| bedrock/eu-central-1/qwen.qwen3-coder-next | $0.6 | $1.44 | 262144 | 8192 |
| bedrock/eu-west-1/minimax.minimax-m2.1 | $0.36 | $1.44 | 196000 | 8192 |
| bedrock/eu-west-1/qwen.qwen3-coder-next | $0.6 | $1.44 | 262144 | 8192 |
| bedrock/eu-west-2/minimax.minimax-m2.1 | $0.47 | $1.86 | 196000 | 8192 |
| bedrock/eu-west-2/qwen.qwen3-coder-next | $0.78 | $1.86 | 262144 | 8192 |
| bedrock/eu-south-1/minimax.minimax-m2.1 | $0.36 | $1.44 | 196000 | 8192 |
| bedrock/eu-south-1/qwen.qwen3-coder-next | $0.6 | $1.44 | 262144 | 8192 |
| bedrock/sa-east-1/deepseek.v3.2 | $0.74 | $2.22 | 163840 | 163840 |
| bedrock/sa-east-1/minimax.minimax-m2.1 | $0.36 | $1.44 | 196000 | 8192 |
| bedrock/sa-east-1/moonshotai.kimi-k2.5 | $0.72 | $3.6 | 262144 | 262144 |
| bedrock/sa-east-1/qwen.qwen3-coder-next | $0.6 | $1.44 | 262144 | 8192 |
| bedrock/us-east-1/deepseek.v3.2 | $0.62 | $1.85 | 163840 | 163840 |
| bedrock/us-east-1/minimax.minimax-m2.1 | $0.3 | $1.2 | 196000 | 8192 |
| bedrock/us-east-1/moonshotai.kimi-k2.5 | $0.6 | $3 | 262144 | 262144 |
| bedrock/us-east-1/qwen.qwen3-coder-next | $0.5 | $1.2 | 262144 | 8192 |
| bedrock/us-east-2/deepseek.v3.2 | $0.62 | $1.85 | 163840 | 163840 |
| bedrock/us-east-2/minimax.minimax-m2.1 | $0.3 | $1.2 | 196000 | 8192 |
| bedrock/us-east-2/moonshotai.kimi-k2.5 | $0.6 | $3 | 262144 | 262144 |
| bedrock/us-east-2/qwen.qwen3-coder-next | $0.5 | $1.2 | 262144 | 8192 |
| bedrock/us-west-2/deepseek.v3.2 | $0.62 | $1.85 | 163840 | 163840 |
| bedrock/us-west-2/minimax.minimax-m2.1 | $0.3 | $1.2 | 196000 | 8192 |
| bedrock/us-west-2/moonshotai.kimi-k2.5 | $0.6 | $3 | 262144 | 262144 |
| bedrock/us-west-2/qwen.qwen3-coder-next | $0.5 | $1.2 | 262144 | 8192 |
| deepseek.v3.2 | $0.62 | $1.85 | 163840 | 163840 |
| minimax.minimax-m2.1 | $0.3 | $1.2 | 196000 | 8192 |
| moonshotai.kimi-k2.5 | $0.6 | $3 | 262144 | 262144 |
| nvidia.nemotron-nano-3-30b | $0.06 | $0.24 | 262144 | 8192 |
| qwen.qwen3-coder-next | $0.5 | $1.2 | 262144 | 8192 |
| us.deepseek.v3.2 | $0.62 | $1.85 | 163840 | 163840 |
| eu.deepseek.v3.2 | $0.74 | $2.22 | 163840 | 163840 |
| zai.glm-4.7 | $0.6 | $2.2 | 200000 | 128000 |
| openai/sora-2-pro-high-res | -- | -- | nan | nan |
| tts-1-1106 | -- | -- | nan | nan |
| tts-1-hd-1106 | -- | -- | nan | nan |
| gpt-4o-mini-tts-2025-03-20 | $2.5 | $10 | nan | nan |
| gpt-4o-mini-tts-2025-12-15 | $2.5 | $10 | nan | nan |
| gpt-4o-mini-transcribe-2025-03-20 | $1.25 | $5 | 16000 | 2000 |
| gpt-4o-mini-transcribe-2025-12-15 | $1.25 | $5 | 16000 | 2000 |
| gpt-5-search-api | $1.25 | $10 | 272000 | 128000 |
| gpt-5-search-api-2025-10-14 | $1.25 | $10 | 272000 | 128000 |
| gpt-realtime-mini-2025-10-06 | $0.6 | $2.4 | 128000 | 4096 |
| gpt-realtime-mini-2025-12-15 | $0.6 | $2.4 | 128000 | 4096 |
| sora-2 | -- | -- | nan | nan |
| sora-2-pro | -- | -- | nan | nan |
| sora-2-pro-high-res | -- | -- | nan | nan |
| chatgpt-image-latest | $5 | -- | nan | nan |
| gemini-2.0-flash-exp-image-generation | $0 | $0 | 32768 | 32768 |
| gemini/gemini-2.0-flash-exp-image-generation | $0 | $0 | 32768 | 32768 |
| gemini/gemini-2.0-flash-lite-001 | $0.08 | $0.3 | 1.04858e+06 | 8192 |
| gemini-2.5-flash-native-audio-latest | $0.3 | $2.5 | 1.04858e+06 | 8192 |
| gemini-2.5-flash-native-audio-preview-09-2025 | $0.3 | $2.5 | 1.04858e+06 | 8192 |
| gemini-2.5-flash-native-audio-preview-12-2025 | $0.3 | $2.5 | 1.04858e+06 | 8192 |
| gemini/gemini-2.5-flash-native-audio-latest | $0.3 | $2.5 | 1.04858e+06 | 8192 |
| gemini/gemini-2.5-flash-native-audio-preview-09-2025 | $0.3 | $2.5 | 1.04858e+06 | 8192 |
| gemini/gemini-2.5-flash-native-audio-preview-12-2025 | $0.3 | $2.5 | 1.04858e+06 | 8192 |
| gemini-2.5-flash-preview-tts | $0.3 | $2.5 | nan | nan |
| gemini-pro-latest | $1.25 | $10 | 1.04858e+06 | 65535 |
| gemini/gemini-pro-latest | $1.25 | $10 | 1.04858e+06 | 65535 |
| gemini-exp-1206 | $0.3 | $2.5 | 1.04858e+06 | 65535 |
| au.anthropic.claude-opus-4-6-v1 | $5.5 | $27.5 | 1e+06 | 128000 |
| fireworks_ai/accounts/fireworks/models/kimi-k2p5 | $0.6 | $3 | 262144 | 262144 |
| anthropic.claude-sonnet-4-6 | $3 | $15 | 1e+06 | 64000 |
| global.anthropic.claude-sonnet-4-6 | $3 | $15 | 1e+06 | 64000 |
| us.anthropic.claude-sonnet-4-6 | $3.3 | $16.5 | 1e+06 | 64000 |
| eu.anthropic.claude-sonnet-4-6 | $3.3 | $16.5 | 1e+06 | 64000 |
| apac.anthropic.claude-sonnet-4-6 | $3.3 | $16.5 | 200000 | 64000 |
| azure_ai/claude-sonnet-4-6 | $3 | $15 | 1e+06 | 64000 |
| claude-sonnet-4-6 | $3 | $15 | 1e+06 | 64000 |
| github_copilot/claude-opus-4.6-fast | -- | -- | 128000 | 16000 |
| github_copilot/gpt-5.3-codex | -- | -- | 128000 | 128000 |
| vertex_ai/claude-opus-4-6@default | $5 | $25 | 1e+06 | 128000 |
| vertex_ai/claude-sonnet-4-6 | $3 | $15 | 1e+06 | 64000 |
| vertex_ai/claude-sonnet-4-6@default | $3 | $15 | 1e+06 | 64000 |
| us/claude-sonnet-4-6 | $3.3 | $16.5 | 200000 | 64000 |
| mistral/devstral-small-latest | $0.1 | $0.3 | 256000 | 256000 |
| mistral/devstral-latest | $0.4 | $2 | 256000 | 256000 |
| mistral/devstral-medium-latest | $0.4 | $2 | 256000 | 256000 |
| duckduckgo/search | -- | -- | nan | nan |
| gemini-3.1-pro-preview | $2 | $12 | 1.04858e+06 | 65536 |
| gemini-3.1-pro-preview-customtools | $2 | $12 | 1.04858e+06 | 65536 |
| vertex_ai/gemini-3.1-pro-preview | $2 | $12 | 1.04858e+06 | 65536 |
| vertex_ai/gemini-3.1-pro-preview-customtools | $2 | $12 | 1.04858e+06 | 65536 |
| gemini/gemini-3.1-pro-preview | $2 | $12 | 1.04858e+06 | 65536 |
| gemini/gemini-3.1-pro-preview-customtools | $2 | $12 | 1.04858e+06 | 65536 |
| fireworks_ai/accounts/fireworks/models/glm-4p7 | $0.6 | $2.2 | 202800 | 202800 |
| fireworks_ai/accounts/fireworks/models/minimax-m2p1 | $0.3 | $1.2 | 204800 | 204800 |
| fireworks_ai/glm-4p7 | $0.6 | $2.2 | 202800 | 202800 |
| fireworks_ai/kimi-k2p5 | $0.6 | $3 | 262144 | 262144 |
| fireworks_ai/minimax-m2p1 | $0.3 | $1.2 | 204800 | 204800 |
| openrouter/minimax/minimax-m2.5 | $0.3 | $1.1 | 196608 | 65536 |
| gpt-5.3-codex | $1.75 | $14 | 272000 | 128000 |
| groq/openai/gpt-oss-safeguard-20b | $0.08 | $0.3 | 131072 | 65536 |
| perplexity/preset/fast-search | -- | -- | nan | nan |
| perplexity/preset/deep-research | -- | -- | nan | nan |
| perplexity/preset/advanced-deep-research | -- | -- | nan | nan |
| perplexity/openai/gpt-5.1 | -- | -- | nan | nan |
| perplexity/openai/gpt-5-mini | -- | -- | nan | nan |
| perplexity/anthropic/claude-opus-4-6 | -- | -- | nan | nan |
| perplexity/anthropic/claude-opus-4-5 | -- | -- | nan | nan |
| perplexity/anthropic/claude-sonnet-4-5 | -- | -- | nan | nan |
| perplexity/anthropic/claude-haiku-4-5 | -- | -- | nan | nan |
| perplexity/google/gemini-3-pro-preview | -- | -- | nan | nan |
| perplexity/google/gemini-3-flash-preview | -- | -- | nan | nan |
| perplexity/google/gemini-2.5-pro | -- | -- | nan | nan |
| perplexity/google/gemini-2.5-flash | -- | -- | nan | nan |
| perplexity/xai/grok-4-1-fast-non-reasoning | -- | -- | nan | nan |
| perplexity/perplexity/sonar | -- | -- | nan | nan |
| azure/gpt-audio-1.5-2026-02-23 | $2.5 | $10 | 128000 | 16384 |
| azure/gpt-realtime-1.5-2026-02-23 | $4 | $16 | 32000 | 4096 |
| azure/gpt-5.3-codex | $1.75 | $14 | 272000 | 128000 |
| gemini-3.1-flash-image-preview | $0.5 | $3 | 65536 | 32768 |
| gpt-audio-1.5 | $2.5 | $10 | 128000 | 16384 |
| gpt-realtime-1.5 | $4 | $16 | 32000 | 4096 |
| openrouter/anthropic/claude-opus-4.6 | $5 | $25 | 1e+06 | 128000 |
| openrouter/openrouter/auto | $0 | $0 | 2e+06 | nan |
| openrouter/openrouter/free | $0 | $0 | 200000 | nan |
| openrouter/openrouter/bodybuilder | $0 | $0 | 128000 | nan |
| vertex_ai/gemini-3.1-flash-image-preview | $0.5 | $3 | 65536 | 32768 |
| dashscope/qwen3-vl-plus | -- | -- | 260096 | 32768 |
| dashscope/qwen3.5-plus | -- | -- | 991808 | 65536 |
| mistral/magistral-medium-1-2-2509 | $2 | $5 | 40000 | 40000 |
| mistral/magistral-small-1-2-2509 | $0.5 | $1.5 | 40000 | 40000 |
| mistral/mistral-large-2512 | $0.5 | $1.5 | 262144 | 262144 |
| mistral/mistral-medium-3-1-2508 | $0.4 | $2 | 131072 | 131072 |
| mistral/mistral-small-3-2-2506 | $0.06 | $0.18 | 131072 | 131072 |
| mistral/ministral-3-3b-2512 | $0.1 | $0.1 | 131072 | 131072 |
| mistral/ministral-3-8b-2512 | $0.15 | $0.15 | 262144 | 262144 |
| mistral/ministral-3-14b-2512 | $0.2 | $0.2 | 262144 | 262144 |
| dashscope/qwen3-max-2026-01-23 | -- | -- | 258048 | 65536 |
| dashscope/qwen3-next-80b-a3b-instruct | $0.15 | $1.2 | 262144 | 65536 |
| dashscope/qwen3-next-80b-a3b-thinking | $0.15 | $1.2 | 262144 | 65536 |
| dashscope/qwen3-vl-235b-a22b-instruct | $0.4 | $1.6 | 131072 | 32768 |
| dashscope/qwen3-vl-235b-a22b-thinking | $0.4 | $4 | 131072 | 32768 |
| dashscope/qwen3-vl-32b-instruct | $0.16 | $0.64 | 131072 | 32768 |
| dashscope/qwen3-vl-32b-thinking | $0.16 | $2.87 | 131072 | 32768 |
| gemini-3.1-flash-lite-preview | $0.25 | $1.5 | 1.04858e+06 | 65536 |
| gemini/gemini-3.1-flash-lite-preview | $0.25 | $1.5 | 1.04858e+06 | 65536 |
| nebius/deepseek-ai/DeepSeek-R1 | $0.8 | $2.4 | 128000 | 128000 |
| nebius/deepseek-ai/DeepSeek-R1-0528 | $0.8 | $2.4 | 164000 | 164000 |
| nebius/deepseek-ai/DeepSeek-R1-Distill-Llama-70B | $0.25 | $0.75 | 128000 | 128000 |
| nebius/deepseek-ai/DeepSeek-V3 | $0.5 | $1.5 | 128000 | 128000 |
| nebius/deepseek-ai/DeepSeek-V3-0324 | $0.5 | $1.5 | 128000 | 128000 |
| nebius/google/gemma-3-27b-it | $0.06 | $0.2 | 128000 | 128000 |
| nebius/meta-llama/Llama-3.3-70B-Instruct | $0.13 | $0.4 | 128000 | 128000 |
| nebius/meta-llama/Llama-Guard-3-8B | $0.02 | $0.06 | 128000 | 128000 |
| nebius/meta-llama/Meta-Llama-3.1-8B-Instruct | $0.02 | $0.06 | 128000 | 128000 |
| nebius/meta-llama/Meta-Llama-3.1-70B-Instruct | $0.13 | $0.4 | 128000 | 128000 |
| nebius/meta-llama/Meta-Llama-3.1-405B-Instruct | $1 | $3 | 128000 | 128000 |
| nebius/mistralai/Mistral-Nemo-Instruct-2407 | $0.04 | $0.12 | 128000 | 128000 |
| nebius/NousResearch/Hermes-3-Llama-3.1-405B | $1 | $3 | 128000 | 128000 |
| nebius/nvidia/Llama-3.1-Nemotron-Ultra-253B-v1 | $0.6 | $1.8 | 128000 | 128000 |
| nebius/nvidia/Llama-3.3-Nemotron-Super-49B-v1 | $0.1 | $0.4 | 131072 | 131072 |
| nebius/Qwen/Qwen3-235B-A22B | $0.2 | $0.6 | 262144 | 262144 |
| nebius/Qwen/Qwen3-32B | $0.1 | $0.3 | 32768 | 32768 |
| nebius/Qwen/Qwen3-30B-A3B | $0.1 | $0.3 | 32768 | 32768 |
| nebius/Qwen/Qwen3-14B | $0.08 | $0.24 | 32768 | 32768 |
| nebius/Qwen/Qwen3-4B | $0.08 | $0.24 | 32768 | 32768 |
| nebius/Qwen/QwQ-32B | $0.15 | $0.45 | 32768 | 32768 |
| nebius/Qwen/Qwen2.5-72B-Instruct | $0.13 | $0.4 | 128000 | 128000 |
| nebius/Qwen/Qwen2.5-32B-Instruct | $0.06 | $0.2 | 128000 | 128000 |
| nebius/Qwen/Qwen2.5-Coder-7B | $0.01 | $0.03 | 32768 | 32768 |
| nebius/Qwen/Qwen2.5-VL-72B-Instruct | $0.13 | $0.4 | 131072 | 131072 |
| nebius/Qwen/Qwen2-VL-72B-Instruct | $0.13 | $0.4 | 131072 | 131072 |
| nebius/Qwen/Qwen2-VL-7B-Instruct | $0.02 | $0.06 | 131072 | 131072 |
| nebius/BAAI/bge-en-icl | $0.01 | $0 | 32768 | nan |
| nebius/BAAI/bge-multilingual-gemma2 | $0.01 | $0 | 8192 | nan |
| nebius/intfloat/e5-mistral-7b-instruct | $0.01 | $0 | 32768 | nan |
| openrouter/anthropic/claude-sonnet-4.6 | $3 | $15 | 1e+06 | 128000 |
| openrouter/google/gemini-3.1-pro-preview | $2 | $12 | 1.04858e+06 | 65536 |
| openrouter/openai/gpt-5.1-codex-max | $1.25 | $10 | 400000 | 128000 |
| openrouter/qwen/qwen3-coder-plus | $1 | $5 | 997952 | 65536 |
| openrouter/z-ai/glm-5 | $0.8 | $2.56 | 202752 | 128000 |
| perplexity/pplx-embed-v1-0.6b | $0 | $0 | 32768 | nan |
| perplexity/pplx-embed-v1-4b | $0.03 | $0 | 32768 | nan |
| vertex_ai/gemini-3.1-flash-lite-preview | $0.25 | $1.5 | 1.04858e+06 | 65536 |
| zai/glm-5 | $1 | $3.2 | 200000 | 128000 |
| zai/glm-5-code | $1.2 | $5 | 200000 | 128000 |
| gpt-5.3-chat-latest | $1.75 | $14 | 128000 | 16384 |
| mistral.devstral-2-123b | $0.4 | $2 | 256000 | 8192 |
| together_ai/Qwen/Qwen3.5-397B-A17B | $0.6 | $3.6 | 262144 | nan |
| zai.glm-4.7-flash | $0.07 | $0.4 | 200000 | 128000 |
| au.anthropic.claude-sonnet-4-6 | $3.3 | $16.5 | 1e+06 | 64000 |
| azure_ai/mistral-document-ai-2512 | -- | -- | nan | nan |
| gpt-5.4 | $2.5 | $15 | 1.05e+06 | 128000 |
| gpt-5.4-2026-03-05 | $2.5 | $15 | 1.05e+06 | 128000 |
| bedrock_mantle/openai.gpt-oss-120b | $0.15 | $0.6 | 131072 | 32768 |
| bedrock_mantle/openai.gpt-oss-20b | $0.08 | $0.3 | 131072 | 32768 |
| bedrock_mantle/openai.gpt-oss-safeguard-120b | $0.15 | $0.6 | 131072 | 65536 |
| bedrock_mantle/openai.gpt-oss-safeguard-20b | $0.08 | $0.3 | 131072 | 65536 |
| chatgpt/gpt-5.4 | -- | -- | 1.05e+06 | 128000 |
| chatgpt/gpt-5.4-pro | -- | -- | 1.05e+06 | 128000 |
| chatgpt/gpt-5.3-codex | -- | -- | 128000 | 128000 |
| chatgpt/gpt-5.3-codex-spark | -- | -- | 128000 | 128000 |
| chatgpt/gpt-5.3-instant | -- | -- | 128000 | 64000 |
| chatgpt/gpt-5.3-chat-latest | -- | -- | 128000 | 64000 |
| gpt-5.4-pro | $30 | $180 | 1.05e+06 | 128000 |
| gpt-5.4-pro-2026-03-05 | $30 | $180 | 1.05e+06 | 128000 |
| azure_ai/grok-4-1-fast-non-reasoning | $0.2 | $0.5 | 131072 | 131072 |
| azure_ai/grok-4-1-fast-reasoning | $0.2 | $0.5 | 131072 | 131072 |
| serper/search | -- | -- | nan | nan |
| gemini/gemini-3.1-flash-image-preview | $0.25 | $1.5 | 65536 | 32768 |
| azure/gpt-5.3-chat | $1.75 | $14 | 128000 | 16384 |
| azure/gpt-5.4 | $2.5 | $15 | 1.05e+06 | 128000 |
| azure/gpt-5.4-2026-03-05 | $2.5 | $15 | 1.05e+06 | 128000 |
| azure/gpt-5.4-pro | $30 | $180 | 1.05e+06 | 128000 |
| azure/gpt-5.4-pro-2026-03-05 | $30 | $180 | 1.05e+06 | 128000 |
| black_forest_labs/flux-kontext-pro | -- | -- | nan | nan |
| black_forest_labs/flux-kontext-max | -- | -- | nan | nan |
| black_forest_labs/flux-pro-1.0-fill | -- | -- | nan | nan |
| black_forest_labs/flux-pro-1.0-expand | -- | -- | nan | nan |
| black_forest_labs/flux-pro-1.1 | -- | -- | nan | nan |
| black_forest_labs/flux-pro-1.1-ultra | -- | -- | nan | nan |
| black_forest_labs/flux-dev | -- | -- | nan | nan |
| black_forest_labs/flux-pro | -- | -- | nan | nan |
| gemini-embedding-2-preview | $0.2 | $0 | 8192 | nan |
| vertex_ai/gemini-embedding-2-preview | $0.2 | $0 | 8192 | nan |
| gemini/gemini-embedding-2-preview | $0.2 | $0 | 8192 | nan |
| openrouter/qwen/qwen3.5-35b-a3b | $0.25 | $2 | 262144 | 65536 |
| openrouter/qwen/qwen3.5-27b | $0.3 | $2.4 | 262144 | 65536 |
| openrouter/qwen/qwen3.5-122b-a10b | $0.4 | $2 | 262144 | 65536 |
| openrouter/qwen/qwen3.5-flash-02-23 | $0.1 | $0.4 | 1e+06 | 65536 |
| openrouter/qwen/qwen3.5-plus-02-15 | $0.4 | $2.4 | 1e+06 | 65536 |
| openrouter/qwen/qwen3.5-397b-a17b | $0.6 | $3.6 | 262144 | 65536 |
| xai/grok-4.20-multi-agent-beta-0309 | $2 | $6 | 2e+06 | 2e+06 |
| xai/grok-4.20-beta-0309-reasoning | $2 | $6 | 2e+06 | 2e+06 |
| xai/grok-4.20-beta-0309-non-reasoning | $2 | $6 | 2e+06 | 2e+06 |
| azure/gpt-5.4-mini | $0.75 | $4.5 | 1.05e+06 | 128000 |
| azure/gpt-5.4-nano | $0.2 | $1.25 | 1.05e+06 | 128000 |
| volcengine/doubao-seed-2-0-pro-260215 | -- | -- | 256000 | 128000 |
| volcengine/doubao-seed-2-0-lite-260215 | -- | -- | 256000 | 128000 |
| volcengine/doubao-seed-2-0-mini-260215 | -- | -- | 256000 | 128000 |
| volcengine/doubao-seed-2-0-code-preview-260215 | -- | -- | 256000 | 128000 |
| gpt-5.4-mini | $0.75 | $4.5 | 272000 | 128000 |
| gpt-5.4-nano | $0.2 | $1.25 | 272000 | 128000 |
| gemini/lyria-3-clip-preview | $0 | $0 | 131072 | 8192 |
| gemini/lyria-3-pro-preview | $0 | $0 | 131072 | 8192 |
| gemini-3.1-flash-live-preview | $0.75 | $4.5 | 131072 | 65536 |
| gemini/gemini-3.1-flash-live-preview | $0.75 | $4.5 | 131072 | 65536 |
| bedrock/us-gov-east-1/anthropic.claude-haiku-4-5-20251001-v1:0 | $1.2 | $6 | 200000 | 64000 |
| bedrock/us-gov-west-1/anthropic.claude-haiku-4-5-20251001-v1:0 | $1.2 | $6 | 200000 | 64000 |
| us.amazon.nova-canvas-v1:0 | -- | -- | 2600 | nan |
| bedrock/ap-northeast-1/minimax.minimax-m2.5 | $0.36 | $1.44 | 1e+06 | 8192 |
| bedrock/ap-south-1/minimax.minimax-m2.5 | $0.36 | $1.44 | 1e+06 | 8192 |
| bedrock/ap-southeast-2/minimax.minimax-m2.5 | $0.31 | $1.24 | 1e+06 | 8192 |
| bedrock/ap-southeast-3/minimax.minimax-m2.5 | $0.36 | $1.44 | 1e+06 | 8192 |
| bedrock/eu-north-1/minimax.minimax-m2.5 | $0.36 | $1.44 | 1e+06 | 8192 |
| bedrock/eu-central-1/minimax.minimax-m2.5 | $0.36 | $1.44 | 1e+06 | 8192 |
| bedrock/eu-west-1/minimax.minimax-m2.5 | $0.36 | $1.44 | 1e+06 | 8192 |
| bedrock/eu-west-2/minimax.minimax-m2.5 | $0.47 | $1.86 | 1e+06 | 8192 |
| bedrock/eu-south-1/minimax.minimax-m2.5 | $0.36 | $1.44 | 1e+06 | 8192 |
| bedrock/sa-east-1/minimax.minimax-m2.5 | $0.36 | $1.44 | 1e+06 | 8192 |
| bedrock/us-east-1/minimax.minimax-m2.5 | $0.3 | $1.2 | 1e+06 | 8192 |
| bedrock/us-east-2/minimax.minimax-m2.5 | $0.3 | $1.2 | 1e+06 | 8192 |
| bedrock/us-west-2/minimax.minimax-m2.5 | $0.3 | $1.2 | 1e+06 | 8192 |
| minimax.minimax-m2.5 | $0.3 | $1.2 | 1e+06 | 8192 |
| nvidia.nemotron-super-3-120b | $0.15 | $0.65 | 256000 | 32768 |
| zai.glm-5 | $1 | $3.2 | 200000 | 128000 |
| oci/cohere.command-a-reasoning-08-2025 | $1.56 | $1.56 | 256000 | 4000 |
| oci/cohere.command-a-vision-07-2025 | $1.56 | $1.56 | 128000 | 4000 |
| oci/cohere.command-a-translate-08-2025 | $0.09 | $0.09 | 256000 | 4000 |
| oci/cohere.command-r-08-2024 | $0.15 | $0.15 | 128000 | 4000 |
| oci/cohere.command-r-plus-08-2024 | $1.56 | $1.56 | 128000 | 4000 |
| oci/meta.llama-3.2-11b-vision-instruct | $2 | $2 | 128000 | 4000 |
| oci/meta.llama-3.1-70b-instruct | $0.72 | $0.72 | 128000 | 4000 |
| oci/meta.llama-3.3-70b-instruct-fp8-dynamic | $0.72 | $0.72 | 128000 | 4000 |
| oci/xai.grok-4-fast | $5 | $25 | 131072 | 131072 |
| oci/xai.grok-4.1-fast | $5 | $25 | 131072 | 131072 |
| oci/xai.grok-4.20 | $3 | $15 | 131072 | 131072 |
| oci/xai.grok-4.20-multi-agent | $3 | $15 | 131072 | 131072 |
| oci/xai.grok-code-fast-1 | $5 | $25 | 131072 | 131072 |
| oci/google.gemini-2.5-pro | $1.25 | $10 | 1.04858e+06 | 65536 |
| oci/google.gemini-2.5-flash | $0.15 | $0.6 | 1.04858e+06 | 65536 |
| oci/google.gemini-2.5-flash-lite | $0.08 | $0.3 | 1.04858e+06 | 65536 |
| oci/cohere.embed-english-v3.0 | $0.1 | $0 | 512 | nan |
| oci/cohere.embed-english-light-v3.0 | $0.1 | $0 | 512 | nan |
| oci/cohere.embed-multilingual-v3.0 | $0.1 | $0 | 512 | nan |
| oci/cohere.embed-multilingual-light-v3.0 | $0.1 | $0 | 512 | nan |
| oci/cohere.embed-english-image-v3.0 | $0.1 | $0 | 512 | nan |
| oci/cohere.embed-english-light-image-v3.0 | $0.1 | $0 | 512 | nan |
| oci/cohere.embed-multilingual-light-image-v3.0 | $0.1 | $0 | 512 | nan |
| oci/cohere.embed-v4.0 | $0.12 | $0 | 128000 | nan |
| vertex_ai/claude-haiku-4-5 | $1 | $5 | 200000 | 8192 |
| bedrock/us-gov-east-1/anthropic.claude-sonnet-4-5-20250929-v1:0 | $3.3 | $16.5 | 200000 | 8192 |
| bedrock/us-gov-west-1/anthropic.claude-sonnet-4-5-20250929-v1:0 | $3.3 | $16.5 | 200000 | 8192 |
| us-gov.anthropic.claude-sonnet-4-5-20250929-v1:0 | $3.3 | $16.5 | 200000 | 64000 |
| baseten/MiniMaxAI/MiniMax-M2.5 | $0.3 | $1.2 | nan | nan |
| baseten/nvidia/Nemotron-120B-A12B | $0.3 | $0.75 | nan | nan |
| baseten/zai-org/GLM-5 | $0.95 | $3.15 | nan | nan |
| baseten/zai-org/GLM-4.7 | $0.6 | $2.2 | nan | nan |
| baseten/zai-org/GLM-4.6 | $0.6 | $2.2 | nan | nan |
| baseten/moonshotai/Kimi-K2.5 | $0.6 | $3 | nan | nan |
| baseten/moonshotai/Kimi-K2-Thinking | $0.6 | $2.5 | nan | nan |
| baseten/moonshotai/Kimi-K2-Instruct-0905 | $0.6 | $2.5 | nan | nan |
| baseten/openai/gpt-oss-120b | $0.1 | $0.5 | nan | nan |
| baseten/deepseek-ai/DeepSeek-V3.1 | $0.5 | $1.5 | nan | nan |
| baseten/deepseek-ai/DeepSeek-V3-0324 | $0.77 | $0.77 | nan | nan |
| gemini/veo-3.1-lite-generate-preview | -- | -- | 1024 | nan |
| wandb/moonshotai/Kimi-K2.5 | $0.6 | $3 | 262144 | 262144 |
| wandb/MiniMaxAI/MiniMax-M2.5 | $0.3 | $1.2 | 197000 | 197000 |
| anthropic.claude-opus-4-7 | $5 | $25 | 1e+06 | 128000 |
| global.anthropic.claude-opus-4-7 | $5 | $25 | 1e+06 | 128000 |
| us.anthropic.claude-opus-4-7 | $5.5 | $27.5 | 1e+06 | 128000 |
| eu.anthropic.claude-opus-4-7 | $5.5 | $27.5 | 1e+06 | 128000 |
| au.anthropic.claude-opus-4-7 | $5.5 | $27.5 | 1e+06 | 128000 |
| azure_ai/claude-opus-4-7 | $5 | $25 | 200000 | 128000 |
| claude-opus-4-7 | $5 | $25 | 1e+06 | 128000 |
| claude-opus-4-7-20260416 | $5 | $25 | 1e+06 | 128000 |
| perplexity/anthropic/claude-opus-4-7 | -- | -- | nan | nan |
| vertex_ai/claude-opus-4-7 | $5 | $25 | 1e+06 | 128000 |
| vertex_ai/claude-opus-4-7@default | $5 | $25 | 1e+06 | 128000 |
| openrouter/google/gemini-3.1-flash-lite-preview | $0.25 | $1.5 | 1.04858e+06 | 65536 |
| xai/grok-4.20-0309-reasoning | $2 | $6 | 2e+06 | 2e+06 |
| anthropic.claude-mythos-preview | $0 | $0 | 1e+06 | 128000 |
| gpt-5.5 | $5 | $30 | 1.05e+06 | 128000 |
| moonshot/kimi-k2.6 | $0.95 | $4 | 262144 | 262144 |
| openrouter/anthropic/claude-opus-4.7 | $5 | $25 | 1e+06 | 128000 |
| dashscope/qwen-image-2.0 | -- | -- | nan | nan |
| dashscope/qwen-image-2.0-pro | -- | -- | nan | nan |
| gpt-5.5-2026-04-23 | $5 | $30 | 1.05e+06 | 128000 |
| gpt-5.5-pro | $30 | $180 | 1.05e+06 | 128000 |
| gpt-5.5-pro-2026-04-23 | $30 | $180 | 1.05e+06 | 128000 |
| azure/gpt-5.5 | $5 | $30 | 1.05e+06 | 128000 |
| azure/gpt-5.5-2026-04-23 | $5 | $30 | 1.05e+06 | 128000 |
| azure/gpt-5.5-pro | $30 | $180 | 1.05e+06 | 128000 |
| azure/gpt-5.5-pro-2026-04-23 | $30 | $180 | 1.05e+06 | 128000 |
| azure/gpt-5.4-mini-2026-03-17 | $0.75 | $4.5 | 1.05e+06 | 128000 |
| azure/gpt-5.4-nano-2026-03-17 | $0.2 | $1.25 | 1.05e+06 | 128000 |
| gemini-embedding-2 | $0.2 | $0 | 8192 | nan |
| vertex_ai/gemini-embedding-2 | $0.2 | $0 | 8192 | nan |
| gemini/gemini-embedding-2 | $0.2 | $0 | 8192 | nan |
| gpt-5.4-mini-2026-03-17 | $0.75 | $4.5 | 272000 | 128000 |
| gpt-5.4-nano-2026-03-17 | $0.2 | $1.25 | 272000 | 128000 |
| bedrock/us-east-1/zai.glm-5 | $1 | $3.2 | 200000 | 128000 |
| bedrock/us-west-2/zai.glm-5 | $1 | $3.2 | 200000 | 128000 |
| azure/gpt-image-2 | $5 | $10 | nan | nan |
| azure/gpt-image-2-2026-04-21 | $5 | $10 | nan | nan |
| gpt-image-2 | $5 | $10 | nan | nan |
| gpt-image-2-2026-04-21 | $5 | $10 | nan | nan |
| crusoe/deepseek-ai/DeepSeek-R1-0528 | $3 | $7 | 163840 | 163840 |
| crusoe/deepseek-ai/DeepSeek-V3-0324 | $1.5 | $1.5 | 163840 | 163840 |
| crusoe/google/gemma-3-12b-it | $0.1 | $0.1 | 131072 | 131072 |
| crusoe/meta-llama/Llama-3.3-70B-Instruct | $0.2 | $0.2 | 131072 | 131072 |
| crusoe/moonshotai/Kimi-K2-Thinking | $2.5 | $2.5 | 262144 | 262144 |
| crusoe/openai/gpt-oss-120b | $0.8 | $0.8 | 131072 | 131072 |
| crusoe/Qwen/Qwen3-235B-A22B-Instruct-2507 | $3 | $3 | 262144 | 262144 |
| vertex_ai/xai/grok-4.1-fast-non-reasoning | $0.2 | $0.5 | 2e+06 | 2e+06 |
| vertex_ai/xai/grok-4.1-fast-reasoning | $0.2 | $0.5 | 2e+06 | 2e+06 |
| vertex_ai/xai/grok-4.20-non-reasoning | $2 | $6 | 2e+06 | 2e+06 |
| vertex_ai/xai/grok-4.20-reasoning | $2 | $6 | 2e+06 | 2e+06 |
| openai/gpt-4 | $30 | $60 | 8192 | 4096 |
| openai/gpt-4o | $2.5 | $10 | 128000 | 16384 |
| openai/gpt-4o-audio-preview | $2.5 | $10 | 128000 | 16384 |
| openai/gpt-4o-audio-preview-2024-10-01 | $2.5 | $10 | 128000 | 16384 |
| openai/gpt-4o-mini | $0.15 | $0.6 | 128000 | 16384 |
| openai/gpt-4o-mini-2024-07-18 | $0.15 | $0.6 | 128000 | 16384 |
| openai/o1-mini | $1.1 | $4.4 | 128000 | 65536 |
| openai/o1-mini-2024-09-12 | $3 | $12 | 128000 | 65536 |
| openai/o1-preview | $15 | $60 | 128000 | 32768 |
| openai/o1-preview-2024-09-12 | $15 | $60 | 128000 | 32768 |
| openai/chatgpt-4o-latest | $5 | $15 | 128000 | 4096 |
| openai/gpt-4o-2024-05-13 | $5 | $15 | 128000 | 4096 |
| openai/gpt-4o-2024-08-06 | $2.5 | $10 | 128000 | 16384 |
| openai/gpt-4-turbo-preview | $10 | $30 | 128000 | 4096 |
| openai/gpt-4-0314 | $30 | $60 | 8192 | 4096 |
| openai/gpt-4-0613 | $30 | $60 | 8192 | 4096 |
| openai/gpt-4-32k | $60 | $120 | 32768 | 4096 |
| openai/gpt-4-32k-0314 | $60 | $120 | 32768 | 4096 |
| openai/gpt-4-32k-0613 | $60 | $120 | 32768 | 4096 |
| openai/gpt-4-turbo | $10 | $30 | 128000 | 4096 |
| openai/gpt-4-turbo-2024-04-09 | $10 | $30 | 128000 | 4096 |
| openai/gpt-4-1106-preview | $10 | $30 | 128000 | 4096 |
| openai/gpt-4-0125-preview | $10 | $30 | 128000 | 4096 |
| openai/gpt-4-vision-preview | $10 | $30 | 128000 | 4096 |
| openai/gpt-4-1106-vision-preview | $10 | $30 | 128000 | 4096 |
| openai/gpt-3.5-turbo | $0.5 | $1.5 | 16385 | 4096 |
| openai/gpt-3.5-turbo-0301 | $1.5 | $2 | 4097 | 4096 |
| openai/gpt-3.5-turbo-0613 | $1.5 | $2 | 4097 | 4096 |
| openai/gpt-3.5-turbo-1106 | $1 | $2 | 16385 | 4096 |
| openai/gpt-3.5-turbo-0125 | $0.5 | $1.5 | 16385 | 4096 |
| openai/gpt-3.5-turbo-16k | $3 | $4 | 16385 | 4096 |
| openai/gpt-3.5-turbo-16k-0613 | $3 | $4 | 16385 | 4096 |
| openai/ft:gpt-3.5-turbo | $3 | $6 | 16385 | 4096 |
| openai/ft:gpt-3.5-turbo-0125 | $3 | $6 | 16385 | 4096 |
| openai/ft:gpt-3.5-turbo-1106 | $3 | $6 | 16385 | 4096 |
| openai/ft:gpt-3.5-turbo-0613 | $3 | $6 | 4096 | 4096 |
| openai/ft:gpt-4-0613 | $30 | $60 | 8192 | 4096 |
| openai/ft:gpt-4o-2024-08-06 | $3.75 | $15 | 128000 | 16384 |
| openai/ft:gpt-4o-mini-2024-07-18 | $0.3 | $1.2 | 128000 | 16384 |
| openai/text-embedding-3-large | $0.13 | $0 | 8191 | nan |
| openai/text-embedding-3-small | $0.02 | $0 | 8191 | nan |
| openai/text-embedding-ada-002 | $0.1 | $0 | 8191 | nan |
| openai/text-embedding-ada-002-v2 | $0.1 | $0 | 8191 | nan |
| openai/text-moderation-stable | $0 | $0 | 32768 | 0 |
| openai/text-moderation-007 | $0 | $0 | 32768 | 0 |
| openai/text-moderation-latest | $0 | $0 | 32768 | 0 |
| openai/256-x-256/dall-e-2 | -- | -- | nan | nan |
| openai/512-x-512/dall-e-2 | -- | -- | nan | nan |
| openai/1024-x-1024/dall-e-2 | -- | -- | nan | nan |
| openai/hd/1024-x-1792/dall-e-3 | -- | -- | nan | nan |
| openai/hd/1792-x-1024/dall-e-3 | -- | -- | nan | nan |
| openai/hd/1024-x-1024/dall-e-3 | -- | -- | nan | nan |
| openai/standard/1024-x-1792/dall-e-3 | -- | -- | nan | nan |
| openai/standard/1792-x-1024/dall-e-3 | -- | -- | nan | nan |
| openai/standard/1024-x-1024/dall-e-3 | -- | -- | nan | nan |
| openai/whisper-1 | -- | -- | nan | nan |
| openai/tts-1 | -- | -- | nan | nan |
| openai/tts-1-hd | -- | -- | nan | nan |
| openai/gpt-4o-2024-11-20 | $2.5 | $10 | 128000 | 16384 |
| openai/ft:gpt-4o-2024-11-20 | $3.75 | $15 | 128000 | 16384 |
| openai/omni-moderation-latest | $0 | $0 | 32768 | 0 |
| openai/omni-moderation-latest-intents | $0 | $0 | 32768 | 0 |
| openai/omni-moderation-2024-09-26 | $0 | $0 | 32768 | 0 |
| openai/gpt-4o-audio-preview-2024-12-17 | $2.5 | $10 | 128000 | 16384 |
| openai/gpt-4o-mini-audio-preview-2024-12-17 | $0.15 | $0.6 | 128000 | 16384 |
| openai/o1 | $15 | $60 | 200000 | 100000 |
| openai/o1-2024-12-17 | $15 | $60 | 200000 | 100000 |
| openai/gpt-4o-realtime-preview-2024-10-01 | $5 | $20 | 128000 | 4096 |
| openai/gpt-4o-realtime-preview | $5 | $20 | 128000 | 4096 |
| openai/gpt-4o-realtime-preview-2024-12-17 | $5 | $20 | 128000 | 4096 |
| openai/gpt-4o-mini-realtime-preview | $0.6 | $2.4 | 128000 | 4096 |
| openai/gpt-4o-mini-realtime-preview-2024-12-17 | $0.6 | $2.4 | 128000 | 4096 |
| openai/o3-mini | $1.1 | $4.4 | 200000 | 100000 |
| openai/o3-mini-2025-01-31 | $1.1 | $4.4 | 200000 | 100000 |
| openai/gpt-4.5-preview | $75 | $150 | 128000 | 16384 |
| openai/gpt-4.5-preview-2025-02-27 | $75 | $150 | 128000 | 16384 |
| openai/o1-pro | $150 | $600 | 200000 | 100000 |
| openai/o1-pro-2025-03-19 | $150 | $600 | 200000 | 100000 |
| openai/gpt-4o-search-preview-2025-03-11 | $2.5 | $10 | 128000 | 16384 |
| openai/gpt-4o-search-preview | $2.5 | $10 | 128000 | 16384 |
| openai/gpt-4o-mini-search-preview-2025-03-11 | $0.15 | $0.6 | 128000 | 16384 |
| openai/gpt-4o-mini-search-preview | $0.15 | $0.6 | 128000 | 16384 |
| openai/gpt-4.1 | $2 | $8 | 1.04758e+06 | 32768 |
| openai/gpt-4.1-2025-04-14 | $2 | $8 | 1.04758e+06 | 32768 |
| openai/gpt-4.1-mini | $0.4 | $1.6 | 1.04758e+06 | 32768 |
| openai/gpt-4.1-mini-2025-04-14 | $0.4 | $1.6 | 1.04758e+06 | 32768 |
| openai/gpt-4.1-nano | $0.1 | $0.4 | 1.04758e+06 | 32768 |
| openai/gpt-4.1-nano-2025-04-14 | $0.1 | $0.4 | 1.04758e+06 | 32768 |
| openai/o3 | $2 | $8 | 200000 | 100000 |
| openai/o3-2025-04-16 | $2 | $8 | 200000 | 100000 |
| openai/o4-mini | $1.1 | $4.4 | 200000 | 100000 |
| openai/o4-mini-2025-04-16 | $1.1 | $4.4 | 200000 | 100000 |
| openai/gpt-image-1 | $5 | -- | nan | nan |
| openai/low/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
| openai/medium/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
| openai/high/1024-x-1024/gpt-image-1 | -- | -- | nan | nan |
| openai/low/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
| openai/medium/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
| openai/high/1024-x-1536/gpt-image-1 | -- | -- | nan | nan |
| openai/low/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
| openai/medium/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
| openai/high/1536-x-1024/gpt-image-1 | -- | -- | nan | nan |
| openai/gpt-4o-transcribe | $2.5 | $10 | 16000 | 2000 |
| openai/gpt-4o-mini-transcribe | $1.25 | $5 | 16000 | 2000 |
| openai/gpt-4o-mini-tts | $2.5 | $10 | nan | nan |
| openai/gpt-4o-mini-audio-preview | $0.15 | $0.6 | 128000 | 16384 |
| openai/codex-mini-latest | $1.5 | $6 | 200000 | 100000 |
| openai/gpt-4o-audio-preview-2025-06-03 | $2.5 | $10 | 128000 | 16384 |
| openai/o3-pro | $20 | $80 | 200000 | 100000 |
| openai/o3-pro-2025-06-10 | $20 | $80 | 200000 | 100000 |
| openai/o3-deep-research | $10 | $40 | 200000 | 100000 |
| openai/o3-deep-research-2025-06-26 | $10 | $40 | 200000 | 100000 |
| openai/o4-mini-deep-research | $2 | $8 | 200000 | 100000 |
| openai/o4-mini-deep-research-2025-06-26 | $2 | $8 | 200000 | 100000 |
| openai/gpt-4o-realtime-preview-2025-06-03 | $5 | $20 | 128000 | 4096 |
| openai/gpt-5 | $1.25 | $10 | 272000 | 128000 |
| openai/gpt-5-mini | $0.25 | $2 | 272000 | 128000 |
| openai/gpt-5-nano | $0.05 | $0.4 | 272000 | 128000 |
| openai/gpt-5-chat | $1.25 | $10 | 128000 | 16384 |
| openai/gpt-5-chat-latest | $1.25 | $10 | 128000 | 16384 |
| openai/gpt-5-2025-08-07 | $1.25 | $10 | 272000 | 128000 |
| openai/gpt-5-mini-2025-08-07 | $0.25 | $2 | 272000 | 128000 |
| openai/gpt-5-nano-2025-08-07 | $0.05 | $0.4 | 272000 | 128000 |
| openai/gpt-realtime | $4 | $16 | 32000 | 4096 |
| openai/gpt-realtime-2025-08-28 | $4 | $16 | 32000 | 4096 |
| openai/gpt-5-codex | $1.25 | $10 | 272000 | 128000 |
| openai/gpt-5-pro | $15 | $120 | 128000 | 272000 |
| openai/gpt-image-1-mini | $2 | -- | nan | nan |
| openai/gpt-realtime-mini | $0.6 | $2.4 | 128000 | 4096 |
| openai/low/1024-x-1024/gpt-image-1-mini | -- | -- | nan | nan |
| openai/low/1024-x-1536/gpt-image-1-mini | -- | -- | nan | nan |
| openai/low/1536-x-1024/gpt-image-1-mini | -- | -- | nan | nan |
| openai/medium/1024-x-1024/gpt-image-1-mini | -- | -- | nan | nan |
| openai/medium/1024-x-1536/gpt-image-1-mini | -- | -- | nan | nan |
| openai/medium/1536-x-1024/gpt-image-1-mini | -- | -- | nan | nan |
| openai/gpt-5-pro-2025-10-06 | $15 | $120 | 128000 | 272000 |
| openai/gpt-5.1 | $1.25 | $10 | 272000 | 128000 |
| openai/gpt-5.1-2025-11-13 | $1.25 | $10 | 272000 | 128000 |
| openai/gpt-5.1-chat-latest | $1.25 | $10 | 128000 | 16384 |
| openai/gpt-5.1-codex | $1.25 | $10 | 272000 | 128000 |
| openai/gpt-5.1-codex-mini | $0.25 | $2 | 272000 | 128000 |
| openai/ft:gpt-4.1-2025-04-14 | $3 | $12 | 1.04758e+06 | 32768 |
| openai/ft:gpt-4.1-mini-2025-04-14 | $0.8 | $3.2 | 1.04758e+06 | 32768 |
| openai/ft:gpt-4.1-nano-2025-04-14 | $0.2 | $0.8 | 1.04758e+06 | 32768 |
| openai/ft:o4-mini-2025-04-16 | $4 | $16 | 200000 | 100000 |
| openai/gpt-5.1-codex-max | $1.25 | $10 | 272000 | 128000 |
| openai/gpt-5.2 | $1.75 | $14 | 272000 | 128000 |
| openai/gpt-5.2-2025-12-11 | $1.75 | $14 | 272000 | 128000 |
| openai/gpt-5.2-chat-latest | $1.75 | $14 | 128000 | 16384 |
| openai/gpt-5.2-pro | $21 | $168 | 272000 | 128000 |
| openai/gpt-5.2-pro-2025-12-11 | $21 | $168 | 272000 | 128000 |
| openai/gpt-4o-transcribe-diarize | $2.5 | $10 | 16000 | 2000 |
| openai/gpt-image-1.5 | $5 | $10 | nan | nan |
| openai/gpt-image-1.5-2025-12-16 | $5 | $10 | nan | nan |
| openai/low/1024-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| openai/low/1024-x-1536/gpt-image-1.5 | -- | -- | nan | nan |
| openai/low/1536-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| openai/medium/1024-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| openai/medium/1024-x-1536/gpt-image-1.5 | -- | -- | nan | nan |
| openai/medium/1536-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| openai/high/1024-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| openai/high/1024-x-1536/gpt-image-1.5 | -- | -- | nan | nan |
| openai/high/1536-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| openai/standard/1024-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| openai/standard/1024-x-1536/gpt-image-1.5 | -- | -- | nan | nan |
| openai/standard/1536-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| openai/1024-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| openai/1024-x-1536/gpt-image-1.5 | -- | -- | nan | nan |
| openai/1536-x-1024/gpt-image-1.5 | -- | -- | nan | nan |
| openai/low/1024-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| openai/low/1024-x-1536/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| openai/low/1536-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| openai/medium/1024-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| openai/medium/1024-x-1536/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| openai/medium/1536-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| openai/high/1024-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| openai/high/1024-x-1536/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| openai/high/1536-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| openai/standard/1024-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| openai/standard/1024-x-1536/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| openai/standard/1536-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| openai/1024-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| openai/1024-x-1536/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| openai/1536-x-1024/gpt-image-1.5-2025-12-16 | -- | -- | nan | nan |
| openai/gpt-5.2-codex | $1.75 | $14 | 272000 | 128000 |
| openai/dall-e-2 | -- | -- | nan | nan |
| openai/dall-e-3 | -- | -- | nan | nan |
| openai/gpt-audio | $2.5 | $10 | 128000 | 16384 |
| openai/gpt-audio-2025-08-28 | $2.5 | $10 | 128000 | 16384 |
| openai/gpt-audio-mini | $0.6 | $2.4 | 128000 | 16384 |
| openai/gpt-audio-mini-2025-10-06 | $0.6 | $2.4 | 128000 | 16384 |
| openai/gpt-audio-mini-2025-12-15 | $0.6 | $2.4 | 128000 | 16384 |
| openai/tts-1-1106 | -- | -- | nan | nan |
| openai/tts-1-hd-1106 | -- | -- | nan | nan |
| openai/gpt-4o-mini-tts-2025-03-20 | $2.5 | $10 | nan | nan |
| openai/gpt-4o-mini-tts-2025-12-15 | $2.5 | $10 | nan | nan |
| openai/gpt-4o-mini-transcribe-2025-03-20 | $1.25 | $5 | 16000 | 2000 |
| openai/gpt-4o-mini-transcribe-2025-12-15 | $1.25 | $5 | 16000 | 2000 |
| openai/gpt-5-search-api | $1.25 | $10 | 272000 | 128000 |
| openai/gpt-5-search-api-2025-10-14 | $1.25 | $10 | 272000 | 128000 |
| openai/gpt-realtime-mini-2025-10-06 | $0.6 | $2.4 | 128000 | 4096 |
| openai/gpt-realtime-mini-2025-12-15 | $0.6 | $2.4 | 128000 | 4096 |
| openai/chatgpt-image-latest | $5 | -- | nan | nan |
| openai/gpt-5.3-codex | $1.75 | $14 | 272000 | 128000 |
| openai/gpt-audio-1.5 | $2.5 | $10 | 128000 | 16384 |
| openai/gpt-realtime-1.5 | $4 | $16 | 32000 | 4096 |
| openai/gpt-5.3-chat-latest | $1.75 | $14 | 128000 | 16384 |
| openai/gpt-5.4 | $2.5 | $15 | 1.05e+06 | 128000 |
| openai/gpt-5.4-2026-03-05 | $2.5 | $15 | 1.05e+06 | 128000 |
| openai/gpt-5.4-pro | $30 | $180 | 1.05e+06 | 128000 |
| openai/gpt-5.4-pro-2026-03-05 | $30 | $180 | 1.05e+06 | 128000 |
| openai/gpt-5.4-mini | $0.75 | $4.5 | 272000 | 128000 |
| openai/gpt-5.4-nano | $0.2 | $1.25 | 272000 | 128000 |
| openai/gpt-5.5 | $5 | $30 | 1.05e+06 | 128000 |
| openai/gpt-5.5-2026-04-23 | $5 | $30 | 1.05e+06 | 128000 |
| openai/gpt-5.5-pro | $30 | $180 | 1.05e+06 | 128000 |
| openai/gpt-5.5-pro-2026-04-23 | $30 | $180 | 1.05e+06 | 128000 |
| openai/gpt-5.4-mini-2026-03-17 | $0.75 | $4.5 | 272000 | 128000 |
| openai/gpt-5.4-nano-2026-03-17 | $0.2 | $1.25 | 272000 | 128000 |
| openai/gpt-image-2 | $5 | $10 | nan | nan |
| openai/gpt-image-2-2026-04-21 | $5 | $10 | nan | nan |
| xai/grok-4.3 | $1.25 | $2.5 | 1e+06 | 1e+06 |
| xai/grok-4.3-latest | $1.25 | $2.5 | 1e+06 | 1e+06 |
| sambanova/MiniMax-M2.7 | $0.3 | $1.2 | 204800 | 131072 |
| openrouter/qwen/qwen3.6-plus | $0.32 | $1.95 | 1e+06 | 65536 |
| gpt-realtime-2 | $4 | $16 | 32000 | 4096 |
| openai/gpt-realtime-2 | $4 | $16 | 32000 | 4096 |
| jp.anthropic.claude-sonnet-4-6 | $3.3 | $16.5 | 1e+06 | 64000 |
| azure_ai/gpt-5.4 | $2.5 | $15 | 1.05e+06 | 128000 |
| azure_ai/gpt-5.4-2026-03-05 | $2.5 | $15 | 1.05e+06 | 128000 |
| azure_ai/gpt-5.4-pro | $30 | $180 | 1.05e+06 | 128000 |
| azure_ai/gpt-5.4-pro-2026-03-05 | $30 | $180 | 1.05e+06 | 128000 |
| azure_ai/gpt-5.4-mini | $0.75 | $4.5 | 400000 | 128000 |
| azure_ai/gpt-5.4-mini-2026-03-17 | $0.75 | $4.5 | 400000 | 128000 |
| azure_ai/gpt-5.4-nano | $0.2 | $1.25 | 400000 | 128000 |
| azure_ai/gpt-5.4-nano-2026-03-17 | $0.2 | $1.25 | 400000 | 128000 |
| vertex_ai/gemini-3.5-flash | $1.5 | $9 | 1.04858e+06 | 65535 |
| gemini/gemini-3.5-flash | $1.5 | $9 | 1.04858e+06 | 65535 |
| gemini-3.5-flash | $1.5 | $9 | 1.04858e+06 | 65535 |
| fireworks_ai/accounts/fireworks/models/glm-5p1 | $1.4 | $4.4 | 202800 | 202800 |
| fireworks_ai/glm-5p1 | $1.4 | $4.4 | 202800 | 202800 |
| gemini-3.1-flash-lite | $0.25 | $1.5 | 1.04858e+06 | 65536 |
| gemini/gemini-3.1-flash-lite | $0.25 | $1.5 | 1.04858e+06 | 65536 |
| mistral/ministral-8b-2512 | $0.15 | $0.15 | 262144 | 262144 |
| openrouter/google/gemini-3.1-flash-lite | $0.25 | $1.5 | 1.04858e+06 | 65536 |
| openrouter/xiaomi/mimo-v2.5-pro | $1 | $3 | 1.04858e+06 | 16384 |
| openrouter/xiaomi/mimo-v2.5 | $0.4 | $2 | 1.04858e+06 | 131072 |
| reducto/parse-legacy | -- | -- | nan | nan |
| reducto/parse-v3 | -- | -- | nan | nan |
| vertex_ai/gemini-3.1-flash-lite | $0.25 | $1.5 | 1.04858e+06 | 65536 |
License
TokenCost is released under the MIT License.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file tokencostauto-0.1.634.tar.gz.
File metadata
- Download URL: tokencostauto-0.1.634.tar.gz
- Upload date:
- Size: 234.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6555961c7da0a97825e77a2eeb24b1524e0b3d8bbe15e467b7bc2ac6bbe2026b
|
|
| MD5 |
02a33678c25f2642fa2ae97e47ac2924
|
|
| BLAKE2b-256 |
9bfad7ca7fed8365866a9073431543cdc1f17ab9eab759eca688c5e5f466f5cf
|
Provenance
The following attestation bundles were made for tokencostauto-0.1.634.tar.gz:
Publisher:
daily-price-update-and-publish.yml on madpin/tokencostauto
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
tokencostauto-0.1.634.tar.gz -
Subject digest:
6555961c7da0a97825e77a2eeb24b1524e0b3d8bbe15e467b7bc2ac6bbe2026b - Sigstore transparency entry: 1609957477
- Sigstore integration time:
-
Permalink:
madpin/tokencostauto@24d4f440d704092cdf73c0f3cca7f9bd5e15fd4d -
Branch / Tag:
refs/heads/main - Owner: https://github.com/madpin
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
daily-price-update-and-publish.yml@24d4f440d704092cdf73c0f3cca7f9bd5e15fd4d -
Trigger Event:
schedule
-
Statement type:
File details
Details for the file tokencostauto-0.1.634-py3-none-any.whl.
File metadata
- Download URL: tokencostauto-0.1.634-py3-none-any.whl
- Upload date:
- Size: 154.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
826c0161562c0732e1ffdbee03b6441e143e4ea06f00fd06e2fedc22d5330145
|
|
| MD5 |
b5124d77ebaacdf914f362d762502257
|
|
| BLAKE2b-256 |
94608a03bd454ebad8327551960d1cd3e5af6450899fac075590592e7e4e83e0
|
Provenance
The following attestation bundles were made for tokencostauto-0.1.634-py3-none-any.whl:
Publisher:
daily-price-update-and-publish.yml on madpin/tokencostauto
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
tokencostauto-0.1.634-py3-none-any.whl -
Subject digest:
826c0161562c0732e1ffdbee03b6441e143e4ea06f00fd06e2fedc22d5330145 - Sigstore transparency entry: 1609957601
- Sigstore integration time:
-
Permalink:
madpin/tokencostauto@24d4f440d704092cdf73c0f3cca7f9bd5e15fd4d -
Branch / Tag:
refs/heads/main - Owner: https://github.com/madpin
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
daily-price-update-and-publish.yml@24d4f440d704092cdf73c0f3cca7f9bd5e15fd4d -
Trigger Event:
schedule
-
Statement type: