Rate limiting library backed by DynamoDB with token bucket algorithm

These details have been verified by PyPI

Project links

Owner

ZeroAE

GitHub Statistics

These details have not been verified by PyPI

Project description

zae-limiter

A rate limiting library backed by DynamoDB using the token bucket algorithm.

Features

Token Bucket Algorithm: Precise rate limiting with configurable burst capacity
Multiple Limits: Track requests per minute, tokens per minute, etc. in a single call
Hierarchical Entities: Two-level hierarchy (project → API keys) with cascade mode
Atomic Transactions: Multi-key updates via DynamoDB TransactWriteItems
Rollback on Exception: Automatic rollback if your code throws
Stored Limits: Configure per-entity limits in DynamoDB
Usage Analytics: Lambda aggregator for hourly/daily usage snapshots
Async + Sync APIs: First-class async support with sync wrapper

Installation

pip install zae-limiter

Or using uv:

uv pip install zae-limiter

Quick Start

1. Deploy Infrastructure

Using CLI (recommended):

zae-limiter deploy --table-name rate_limits --region us-east-1

Or get template for manual deployment:

zae-limiter cfn-template > template.yaml
aws cloudformation deploy --template-file template.yaml --stack-name zae-limiter

Or auto-create in code (development):

from zae_limiter import RateLimiter

limiter = RateLimiter(
    table_name="rate_limits",
    region="us-east-1",
    create_stack=True,  # auto-create CloudFormation stack
)

2. Use in Code

from zae_limiter import RateLimiter, Limit

# Initialize the limiter (stack must already exist)
limiter = RateLimiter(
    table_name="rate_limits",
    region="us-east-1",
)

# Acquire rate limit capacity
async with limiter.acquire(
    entity_id="api-key-123",
    resource="gpt-4",
    limits=[
        Limit.per_minute("rpm", 100),       # 100 requests/minute
        Limit.per_minute("tpm", 10_000),    # 10k tokens/minute
    ],
    consume={"rpm": 1, "tpm": 500},  # estimate 500 tokens
) as lease:
    response = await call_llm()

    # Reconcile actual token usage (can go negative)
    actual_tokens = response.usage.total_tokens
    await lease.adjust(tpm=actual_tokens - 500)

# On success: consumption is committed
# On exception: consumption is rolled back

Local Development

For DynamoDB Local, auto-creation uses direct table creation (not CloudFormation):

limiter = RateLimiter(
    table_name="rate_limits",
    endpoint_url="http://localhost:8000",
    create_table=True,  # Creates table directly (CloudFormation skipped)
)

Usage

Basic Rate Limiting

from zae_limiter import RateLimiter, Limit, RateLimitExceeded

limiter = RateLimiter(table_name="rate_limits")

try:
    async with limiter.acquire(
        entity_id="user-123",
        resource="api",
        limits=[Limit.per_minute("requests", 100)],
        consume={"requests": 1},
    ) as lease:
        await do_work()
except RateLimitExceeded as e:
    # Exception includes ALL limit statuses (passed and failed)
    print(f"Retry after {e.retry_after_seconds:.1f}s")

    # For API responses
    return JSONResponse(
        status_code=429,
        content=e.as_dict(),
        headers={"Retry-After": e.retry_after_header},
    )

Hierarchical Rate Limits (Cascade)

# Create parent (project) and child (API key)
await limiter.create_entity(entity_id="proj-1", name="Production")
await limiter.create_entity(entity_id="key-abc", parent_id="proj-1")

# Cascade mode: consume from both key AND project
async with limiter.acquire(
    entity_id="key-abc",
    resource="gpt-4",
    limits=[
        Limit.per_minute("tpm", 10_000),  # per-key limit
    ],
    consume={"tpm": 500},
    cascade=True,  # also applies to parent
) as lease:
    await call_api()

Burst Capacity

# Allow burst of 15k tokens, but sustain only 10k/minute
limits = [
    Limit.per_minute("tpm", 10_000, burst=15_000),
]

Stored Limits

# Store custom limits for premium users
await limiter.set_limits(
    entity_id="user-premium",
    limits=[
        Limit.per_minute("rpm", 500),
        Limit.per_minute("tpm", 50_000, burst=75_000),
    ],
)

# Use stored limits (falls back to defaults if not stored)
async with limiter.acquire(
    entity_id="user-premium",
    resource="gpt-4",
    limits=[Limit.per_minute("rpm", 100)],  # default
    consume={"rpm": 1},
    use_stored_limits=True,
) as lease:
    ...

LLM Token Estimation + Reconciliation

async with limiter.acquire(
    entity_id="key-abc",
    resource="gpt-4",
    limits=[
        Limit.per_minute("rpm", 100),
        Limit.per_minute("tpm", 10_000),
    ],
    consume={"rpm": 1, "tpm": 500},  # estimate
) as lease:
    response = await llm.complete(prompt)
    actual = response.usage.total_tokens

    # Adjust without throwing (can go negative)
    await lease.adjust(tpm=actual - 500)

Check Capacity Before Expensive Operations

# Check available capacity
available = await limiter.available(
    entity_id="key-abc",
    resource="gpt-4",
    limits=[Limit.per_minute("tpm", 10_000)],
)
print(f"Available tokens: {available['tpm']}")

# Check when capacity will be available
if available["tpm"] < needed_tokens:
    wait = await limiter.time_until_available(
        entity_id="key-abc",
        resource="gpt-4",
        limits=[Limit.per_minute("tpm", 10_000)],
        needed={"tpm": needed_tokens},
    )
    raise RetryAfter(seconds=wait)

Synchronous API

from zae_limiter import SyncRateLimiter, Limit

limiter = SyncRateLimiter(table_name="rate_limits")

with limiter.acquire(
    entity_id="key-abc",
    resource="api",
    limits=[Limit.per_minute("rpm", 100)],
    consume={"rpm": 1},
) as lease:
    response = call_api()
    lease.adjust(tokens=response.token_count)

Failure Modes

from zae_limiter import RateLimiter, FailureMode

# Fail closed (default): reject requests if DynamoDB unavailable
limiter = RateLimiter(
    table_name="rate_limits",
    failure_mode=FailureMode.FAIL_CLOSED,
)

# Fail open: allow requests if DynamoDB unavailable
limiter = RateLimiter(
    table_name="rate_limits",
    failure_mode=FailureMode.FAIL_OPEN,
)

# Override per-call
async with limiter.acquire(
    ...,
    failure_mode=FailureMode.FAIL_OPEN,
):
    ...

Exception Details

When a rate limit is exceeded, RateLimitExceeded includes full details:

try:
    async with limiter.acquire(...):
        ...
except RateLimitExceeded as e:
    # All limits that were checked
    for status in e.statuses:
        print(f"{status.limit_name}: {status.available}/{status.limit.capacity}")
        print(f"  exceeded: {status.exceeded}")
        print(f"  retry_after: {status.retry_after_seconds}s")

    # Just the violations
    for v in e.violations:
        print(f"Exceeded: {v.limit_name}")

    # Just the passed limits
    for p in e.passed:
        print(f"Passed: {p.limit_name}")

    # Primary bottleneck
    print(f"Bottleneck: {e.primary_violation.limit_name}")
    print(f"Retry after: {e.retry_after_seconds}s")

    # For HTTP responses
    response_body = e.as_dict()
    retry_header = e.retry_after_header

Infrastructure

Deploy with CloudFormation

# Export the template from the installed package
zae-limiter cfn-template > template.yaml

# Deploy the DynamoDB table and Lambda aggregator
aws cloudformation deploy \
    --template-file template.yaml \
    --stack-name zae-limiter \
    --parameter-overrides \
        TableName=rate_limits \
        SnapshotRetentionDays=90 \
    --capabilities CAPABILITY_NAMED_IAM

Automatic Lambda Deployment

The zae-limiter deploy CLI command automatically handles Lambda deployment:

# Deploy stack with Lambda aggregator (automatic)
zae-limiter deploy --table-name rate_limits --region us-east-1

# The CLI automatically:
# 1. Creates CloudFormation stack with DynamoDB table and Lambda function
# 2. Builds Lambda deployment package from installed library
# 3. Deploys Lambda code via AWS Lambda API (~30KB, no S3 required)

To deploy without the Lambda aggregator:

zae-limiter deploy --table-name rate_limits --no-aggregator

Local Development with DynamoDB Local

# Start DynamoDB Local
docker run -p 8000:8000 amazon/dynamodb-local

# Use endpoint_url
limiter = RateLimiter(
    table_name="rate_limits",
    endpoint_url="http://localhost:8000",
    create_table=True,
)

Development

Setup

# Clone repository
git clone https://github.com/zeroae/zae-limiter.git
cd zae-limiter

# Using uv
uv venv
source .venv/bin/activate
uv pip install -e ".[dev]"

# Using conda
conda create -n zae-limiter python=3.12
conda activate zae-limiter
pip install -e ".[dev]"

Run Tests

# Run all tests
pytest

# Run with coverage
pytest --cov=zae_limiter --cov-report=html

# Run specific test file
pytest tests/test_limiter.py -v

Code Quality

# Format and lint
ruff check --fix .
ruff format .

# Type checking
mypy src/zae_limiter

Architecture

DynamoDB Schema (Single Table)

Record Type	PK	SK
Entity metadata	`ENTITY#{id}`	`#META`
Bucket	`ENTITY#{id}`	`#BUCKET#{resource}#{limit_name}`
Limit config	`ENTITY#{id}`	`#LIMIT#{resource}#{limit_name}`
Usage snapshot	`ENTITY#{id}`	`#USAGE#{resource}#{window_key}`

Indexes:

GSI1: Parent → Children lookup (PARENT#{id} → CHILD#{id})
GSI2: Resource aggregation (RESOURCE#{name} → buckets/usage)

Token Bucket Implementation

All values stored as millitokens (×1000) for precision
Refill rate stored as fraction (amount/period) to avoid floating point
Supports negative buckets for post-hoc reconciliation
Uses DynamoDB transactions for multi-key atomicity

License

MIT

Project details

These details have been verified by PyPI

Project links

Owner

ZeroAE

GitHub Statistics

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.10.1

Feb 23, 2026

0.10.0

Feb 23, 2026

0.9.1

Feb 11, 2026

0.9.0

Feb 11, 2026

0.8.1

Feb 2, 2026

0.8.0

Feb 2, 2026

0.7.0

Jan 29, 2026

0.6.1

Jan 27, 2026

0.6.0

Jan 27, 2026

0.5.0

Jan 25, 2026

0.4.1

Jan 21, 2026

0.4.0

Jan 19, 2026

0.3.0

Jan 15, 2026

0.2.0

Jan 14, 2026

This version

0.1.0

Jan 10, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

zae_limiter-0.1.0.tar.gz (198.6 kB view details)

Uploaded Jan 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

zae_limiter-0.1.0-py3-none-any.whl (48.2 kB view details)

Uploaded Jan 10, 2026 Python 3

File details

Details for the file zae_limiter-0.1.0.tar.gz.

File metadata

Download URL: zae_limiter-0.1.0.tar.gz
Upload date: Jan 10, 2026
Size: 198.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for zae_limiter-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`f050ae155998fbb592159710ffee6087766802f540d9f7a871b0e51ae1efb250`
MD5	`df0bf0860908f443a684057e92fb847f`
BLAKE2b-256	`9440378328b98d76b5641921d79096d0a4b04af4fda62a3b030d6947b5866ac2`

See more details on using hashes here.

Provenance

The following attestation bundles were made for zae_limiter-0.1.0.tar.gz:

Publisher: release.yml on zeroae/zae-limiter

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: zae_limiter-0.1.0.tar.gz
- Subject digest: f050ae155998fbb592159710ffee6087766802f540d9f7a871b0e51ae1efb250
- Sigstore transparency entry: 813372698
- Sigstore integration time: Jan 10, 2026
Source repository:
- Permalink: zeroae/zae-limiter@28f29fff94a196c97acf4d6366ded202f00efe08
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/zeroae
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@28f29fff94a196c97acf4d6366ded202f00efe08
- Trigger Event: push

File details

Details for the file zae_limiter-0.1.0-py3-none-any.whl.

File metadata

Download URL: zae_limiter-0.1.0-py3-none-any.whl
Upload date: Jan 10, 2026
Size: 48.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for zae_limiter-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d7ae1b0a19d8ddf607d7eb02a458a95bdeba5650bd0b2f13085a2b8df6aa256b`
MD5	`a95b6752a5bdcdc45ff1c1a7f22a3765`
BLAKE2b-256	`ab610a1ef4bb3a9312317fa3c9a5574a6b4ed6ecb1488cf824cb15af6b2a9cc1`

See more details on using hashes here.

Provenance

The following attestation bundles were made for zae_limiter-0.1.0-py3-none-any.whl:

Publisher: release.yml on zeroae/zae-limiter

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: zae_limiter-0.1.0-py3-none-any.whl
- Subject digest: d7ae1b0a19d8ddf607d7eb02a458a95bdeba5650bd0b2f13085a2b8df6aa256b
- Sigstore transparency entry: 813372699
- Sigstore integration time: Jan 10, 2026
Source repository:
- Permalink: zeroae/zae-limiter@28f29fff94a196c97acf4d6366ded202f00efe08
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/zeroae
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@28f29fff94a196c97acf4d6366ded202f00efe08
- Trigger Event: push

zae-limiter 0.1.0

Navigation

Verified details

Project links

Owner

GitHub Statistics

Unverified details

Meta

Classifiers

Project description

zae-limiter

Features

Installation

Quick Start

1. Deploy Infrastructure

2. Use in Code

Local Development

Usage

Basic Rate Limiting

Hierarchical Rate Limits (Cascade)

Burst Capacity

Stored Limits

LLM Token Estimation + Reconciliation

Check Capacity Before Expensive Operations

Synchronous API

Failure Modes

Exception Details

Infrastructure

Deploy with CloudFormation

Automatic Lambda Deployment

Local Development with DynamoDB Local

Development

Setup

Run Tests

Code Quality

Architecture

DynamoDB Schema (Single Table)

Token Bucket Implementation

License

Project details

Verified details

Project links

Owner

GitHub Statistics

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance