bedrock-quota

A CLI-based dashboard to see your AWS Bedrock model quotas and usage.

[Screenshots: main dashboard; model detail, 7 days]

The AWS console shows quota limits on one page and CloudWatch usage on another — nothing joins them per model. bedrock-quota pulls both and shows headroom per model, per region, so you know where you stand before you scale. Read-only, no dashboard to maintain.

Features

  • Know your headroom — RPM, TPM, TPD quotas next to last-day usage, per model.
  • Spot throttle risk early — ⚠ flagged when 7-day P90 TPM crosses 80% of quota.
  • Every variant in one table — on-demand, cross-region, and global cross-region inference profiles.
  • Drill into a model — per-period stats, stacked token charts, hourly heatmap, per-inference-profile breakdown.
  • Multi-region view — each region loads independently, shown as a separate section.
  • Keyboard-first — works over SSH, 5-minute metric cache.
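The throttle-risk flag described above can be sketched roughly as follows. This is an illustration only, not the tool's actual code: the function names, the nearest-rank percentile method, the `>=` comparison, and the sample values are all assumptions.

```python
import math


def percentile(values, pct):
    """Nearest-rank percentile: the smallest sample with at least pct% of samples at or below it."""
    if not values:
        raise ValueError("percentile of an empty sequence")
    ordered = sorted(values)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def throttle_risk(tpm_samples, tpm_quota, threshold=0.80):
    """Return (flagged, p90, utilisation) for per-minute token totals against a TPM quota."""
    p90 = percentile(tpm_samples, 90)
    utilisation = p90 / tpm_quota
    # Assumption: "crosses 80%" is read as "at or above 80%".
    return utilisation >= threshold, p90, utilisation


# Hypothetical per-minute token totals over the window, and a hypothetical quota.
samples = [120_000, 150_000, 90_000, 170_000, 165_000,
           80_000, 175_000, 60_000, 110_000, 168_000]
flagged, p90, util = throttle_risk(samples, tpm_quota=200_000)
print(flagged, p90, round(util, 2))
```

Using P90 rather than the maximum keeps a single burst minute from raising the flag, while still reacting when a sustained share of minutes runs close to the quota.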

Install

uvx bedrock-quota                # try without installing (recommended)
uv tool install bedrock-quota    # or install as a uv tool
pipx install bedrock-quota       # or via pipx
pip install bedrock-quota        # or plain pip

Quickstart

AWS_PROFILE=my-sso-profile AWS_DEFAULT_REGION=us-east-1 bedrock-quota

Credentials are resolved the same way as in any other AWS CLI tool: AWS_PROFILE, SSO, environment variables, or an attached instance role.

IAM permissions

Most developers already have enough read-only access through their SSO or dev IAM role and can skip this section. If you need to scope a dedicated policy, this is the minimum:

{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Action": [
        "bedrock:ListFoundationModels",
        "bedrock:ListInferenceProfiles",
        "bedrock:ListTagsForResource",
        "cloudwatch:GetMetricData",
        "cloudwatch:ListMetrics",
        "pricing:GetProducts",
        "servicequotas:ListServiceQuotas"
      ],
      "Resource": "*"
    }
  ]
}

Keyboard shortcuts

Key      Action
g        Switch / add region
p        Switch provider
r        Refresh data
Enter    Open model detail screen
Escape   Back / close
Ctrl+Q   Quit

Model detail

Press Enter on any row to open a per-model detail screen with stats (P50/P90/Avg/Max for TPD, RPM, TPM, latency), token charts, an hourly heatmap, and a per-inference-profile breakdown — for time periods Last 24h, Today, 7d, 14d, 30d, This Month, Last Month.
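As a rough illustration of how the named periods could map to query windows, here is a minimal sketch. It assumes UTC day and month boundaries and rolling windows for 7d/14d/30d; the README does not say how the tool defines them, and `period_range` is a hypothetical helper.

```python
from datetime import datetime, timedelta, timezone


def period_range(name, now):
    """Return (start, end) for a named period relative to `now` (timezone-aware)."""
    midnight = now.replace(hour=0, minute=0, second=0, microsecond=0)
    month_start = midnight.replace(day=1)
    if name == "Last 24h":
        return now - timedelta(hours=24), now
    if name == "Today":
        return midnight, now
    if name in ("7d", "14d", "30d"):
        # Assumption: rolling window ending now, not calendar-aligned days.
        return now - timedelta(days=int(name[:-1])), now
    if name == "This Month":
        return month_start, now
    if name == "Last Month":
        # Step back one day from the 1st, then snap to the 1st of that month.
        prev_start = (month_start - timedelta(days=1)).replace(day=1)
        return prev_start, month_start
    raise ValueError(f"unknown period: {name!r}")


now = datetime(2025, 3, 15, 10, 30, tzinfo=timezone.utc)
for label in ("Last 24h", "Today", "7d", "This Month", "Last Month"):
    start, end = period_range(label, now)
    print(f"{label:<11} {start.isoformat()} .. {end.isoformat()}")
```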

[Screenshot: model detail, last 24h]

Troubleshooting

  • No usage data — CloudWatch only records metrics for models you've actually invoked. Check that AWS_DEFAULT_REGION matches the region you're calling Bedrock in.
  • Slow first load — the first run queries CloudWatch for every model across every time period; subsequent loads hit the 5-minute cache. Press r to force-refresh.
  • Credentials error — run aws configure, aws sso login, or set AWS_ACCESS_KEY_ID / AWS_SECRET_ACCESS_KEY / AWS_DEFAULT_REGION.
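The 5-minute cache and the force-refresh on r behave like a small time-to-live cache. The sketch below shows the general pattern under that assumption; `TTLCache` and its parameters are hypothetical names, not the tool's internals.

```python
import time


class TTLCache:
    """Keep loaded values for `ttl_seconds`; reload on expiry or when forced."""

    def __init__(self, ttl_seconds=300, clock=time.monotonic):
        self._ttl = ttl_seconds
        self._clock = clock          # injectable so the cache can be tested without sleeping
        self._entries = {}           # key -> (stored_at, value)

    def get(self, key, loader, force=False):
        now = self._clock()
        entry = self._entries.get(key)
        if not force and entry is not None and now - entry[0] < self._ttl:
            return entry[1]          # still fresh: skip the expensive query
        value = loader()
        self._entries[key] = (now, value)
        return value


cache = TTLCache(ttl_seconds=300)
# The loader stands in for a slow CloudWatch query; the key is a hypothetical (region, period) pair.
first = cache.get(("us-east-1", "7d"), lambda: {"tpm_p90": 170_000})
again = cache.get(("us-east-1", "7d"), lambda: {"tpm_p90": 0})   # served from cache
print(first is again)
```

Passing `force=True` corresponds to a manual refresh: the loader runs again and the stored timestamp is reset.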

Implementation notes

Usage data is queried from CloudWatch (AWS/Bedrock namespace, ModelId dimension):

  • InputTokenCount + OutputTokenCount → TPM/TPD
  • Invocations → RPM
  • InvocationLatency → latency
  • InvocationThrottles / InvocationClientErrors / InvocationServerErrors → error counts

Quotas come from AWS Service Quotas. Model IDs are discovered from ListFoundationModels, ListInferenceProfiles, and observed CloudWatch dimensions, so cross-region variants like us.anthropic.claude-sonnet-4-6 match their quota automatically.

Per-token costs shown in the detail screen are list prices from the AWS Pricing API (pricing:GetProducts); they exclude Savings Plans and batch-inference discounts. If pricing:GetProducts is not granted, cost rows render without prices and the status bar shows a note.
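To make the token-metric mapping concrete, here is one way the per-minute total could be requested from CloudWatch. This is a sketch and not necessarily how bedrock-quota builds its queries: `token_queries` is a hypothetical helper, and the 60-second period and the Sum statistic are assumptions. The dictionaries follow the MetricDataQueries shape accepted by the CloudWatch GetMetricData API.

```python
def token_queries(model_id, period_seconds=60):
    """Build GetMetricData queries that sum input and output tokens per period."""

    def stat(query_id, metric_name):
        return {
            "Id": query_id,  # must start with a lowercase letter
            "MetricStat": {
                "Metric": {
                    "Namespace": "AWS/Bedrock",
                    "MetricName": metric_name,
                    "Dimensions": [{"Name": "ModelId", "Value": model_id}],
                },
                "Period": period_seconds,
                "Stat": "Sum",
            },
            "ReturnData": False,  # only the combined series is returned
        }

    return [
        stat("in_tok", "InputTokenCount"),
        stat("out_tok", "OutputTokenCount"),
        {
            "Id": "total_tok",
            "Expression": "in_tok + out_tok",  # metric math over the two series
            "Label": "TotalTokens",
            "ReturnData": True,
        },
    ]


queries = token_queries("us.anthropic.claude-sonnet-4-6")

# With boto3 installed and credentials configured, the call would look like this
# (not executed here):
#   boto3.client("cloudwatch").get_metric_data(
#       MetricDataQueries=queries, StartTime=start, EndTime=end)
print([q["Id"] for q in queries])
```

With a 60-second period each returned datapoint is a tokens-per-minute value; a daily total (TPD) could use the same queries with a period of 86400 seconds.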

Development

git clone https://github.com/Kelet-ai/bedrock-quota
cd bedrock-quota
uv sync
uv run bedrock-quota

License

MIT

Download files

Source Distribution

bedrock_quota-1.1.1.tar.gz (34.8 kB view details)

Uploaded Source

Built Distribution

bedrock_quota-1.1.1-py3-none-any.whl (34.0 kB view details)

Uploaded Python 3

File details

Details for the file bedrock_quota-1.1.1.tar.gz.

File metadata

  • Download URL: bedrock_quota-1.1.1.tar.gz
  • Upload date:
  • Size: 34.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.14

File hashes

Hashes for bedrock_quota-1.1.1.tar.gz
Algorithm     Hash digest
SHA256        ddef66ccd8096f76b7f6144c609b80442bdfd16112587e76c14e3264c14e55a7
MD5           c6d48e758f6847937b248f61570f8c82
BLAKE2b-256   1d3f34342d538010f25e9041fe0df5ff643a598234840fe6a689e0bf8161fc9c

File details

Details for the file bedrock_quota-1.1.1-py3-none-any.whl.

File metadata

  • Download URL: bedrock_quota-1.1.1-py3-none-any.whl
  • Upload date:
  • Size: 34.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.14

File hashes

Hashes for bedrock_quota-1.1.1-py3-none-any.whl
Algorithm     Hash digest
SHA256        d48537dbf8a96d9c67949bac9af44b59448e15d1292968e016779f2cc1c96347
MD5           4fa892fc4e1a82c4d1333af6042b7b9c
BLAKE2b-256   20ec4075543dba9a8a8a9f673e76cacfe6257cd24058c4eb6a07540f7fb95e8a
