Probe any HTTP API for its rate limits, burst ceiling, and full-scrape time.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Jartan

These details have not been verified by PyPI

Project description

Sonde

Probe any HTTP API for its rate limits, burst ceiling, and full-scrape time. Provider-pluggable, safe by default.

Install

Requires Python 3.12+.

pip install sonde

From source:

git clone https://github.com/Jartan-LLC/sonde.git
cd sonde
pip install -e .

For Docker, see Docker below.

Quick Start

Probe the Roblox asset-owners endpoint:

export ROBLOX_COOKIE="your_roblosecurity_cookie"
sonde asset-owners --asset-id 20573078 --total-items 1470000

Probe GitHub stargazers:

export GITHUB_TOKEN="ghp_..."
sonde github-stargazers --owner anthropics --repo anthropic-sdk-python --total-items 5000

Anonymous probing (no auth) works too -- you'll just hit lower rate limits:

sonde github-stargazers --owner torvalds --repo linux --total-items 190000

Results are written to sonde_report.json by default:

sonde asset-owners --asset-id 20573078 --output my_report.json

How It Works

Sonde runs five phases against the target endpoint, then combines the measurements into a safe rate estimate.

Phase	What it does
Sanity	One request. Validates auth, reads rate-limit response headers (e.g. `x-ratelimit-limit`, `x-ratelimit-remaining`), and records items-per-page for the scrape-time estimate.
Sequential	Fires back-to-back requests (up to `--seq-cap`, default 150) until the first 429 or the cap. Measures baseline throughput and how many requests the API allows before throttling.
Burst	Fires N truly-concurrent requests (default sizes: 10, 20, 40, 80) via httpx on a single asyncio event loop. After the first throttled burst, measures the recovery window -- how long until requests succeed again -- via adaptive geometric backoff.
Sweep	Drains the rate-limit bucket, then paces requests at progressively faster intervals (default: 8s down to 0.15s) to find the fastest sustainable interval from empty. Skipped by default when authoritative rate-limit headers are present (override with `--force-sweep`).
Estimate	Combines all measurements into a recommended request interval and, if a total item count is known, a wall-clock full-scrape estimate.

How the estimate is produced

The estimate phase uses a priority ladder to determine the safe rate:

Authoritative headers -- If the API returned x-ratelimit-limit and a window, use those directly (e.g. 100 requests per 60s).
Swept floor -- If the sweep found a fastest sustainable interval, use that.
Token-bucket inference -- If burst results show a clean burst size and a measured recovery window, infer the bucket rate.
Sequential fallback -- Use the observed sequential throughput before the first 429.
No-throttle fallback -- If nothing ever throttled, no ceiling was found, so fall back to a conservative fraction of the measured sequential throughput.

Every rung applies the safety margin (default 80%, configurable with --margin) -- the recommended pace is ~25% slower than the measured ceiling. Rung 5 has no measured ceiling, so it applies an extra 0.5 factor on top (~40% of observed throughput at the default margin).

Endpoints

asset-owners

Roblox inventory.roblox.com/v2/assets/{id}/owners -- paginated list of owners of a collectible asset.

Option	Required	Default	Description
`--asset-id`	Yes	--	Asset ID to probe (e.g. `20573078`)
`--sort-order`	No	Asc	`Asc` or `Desc`
`--page-size`	No	100	Items per page (capped at 100)
`--total-items`	No	None	Known total owners, for wall-clock estimate

Auth: Set ROBLOX_COOKIE (legacy web-session) and/or ROBLOX_BEARER (Open Cloud) environment variables.

github-stargazers

GitHub api.github.com/repos/{owner}/{repo}/stargazers -- users who starred a repository.

Option	Required	Default	Description
`--owner`	Yes	--	Repository owner/org (e.g. `anthropics`)
`--repo`	Yes	--	Repository name (e.g. `anthropic-sdk-python`)
`--page-size`	No	100	Items per page (capped at 100)
`--total-items`	No	None	Known stargazer count, for wall-clock estimate

Auth: Set GITHUB_TOKEN environment variable. Without it, you get the anonymous rate limit (60 requests/hour).

Adding an Endpoint

Create a new module in src/sonde/endpoints/.
Subclass Endpoint and implement build_request(cursor) and parse_page(response).
Decorate with @register and set a unique name (becomes the CLI subcommand).
Override _make_provider() to return the appropriate Provider (or use the generic one for standard 200/429 + IETF headers).
Optionally implement total_items() for scrape-time estimates, add_arguments() / from_args() for CLI options, and extra_headers() for endpoint-specific headers.
If the endpoint is paginated, call add_pagination_args(parser, page_max=cls.MAX_PAGE) in add_arguments() and pagination_from_args(args, page_max=cls.MAX_PAGE) in from_args() so it gets the shared --page-size / --total-items flags (clamped to your endpoint's cap).
Import the new module in src/sonde/endpoints/__init__.py so it registers on package load.

Minimal example:

from sonde import Endpoint, RequestSpec, PageResult, register

@register
class MyEndpoint(Endpoint):
    name = "my-endpoint"
    help = "one-line description for --help"

    def build_request(self, cursor):
        return RequestSpec(url="https://api.example.com/items", params={"page": cursor or 1})

    def parse_page(self, response):
        data = response.json()
        return PageResult(count=len(data["items"]), next_cursor=data.get("next_page"))

CLI Reference

Common options shared by all endpoints:

Option	Default	Description
`--max-requests`	1200	Hard global cap across all phases (safety budget)
`--seq-cap`	150	Max sequential requests before stopping
`--skip-burst`	off	Skip the concurrent burst phase
`--burst-sizes`	`10,20,40,80`	Comma-separated list of concurrent burst sizes
`--burst-cooldown`	60.0	Fallback seconds between bursts if the recovery window can't be measured
`--recovery-step`	0.25	Initial poll delay when measuring the throttle window (grows geometrically)
`--recovery-max`	90.0	Give up measuring the window after this many seconds
`--recovery-polls`	15	Max polls during recovery measurement
`--skip-sweep`	off	Skip the sustained-interval sweep phase
`--force-sweep`	off	Run the sweep even when authoritative rate-limit headers are present
`--sweep-intervals`	`8,5,3,2,1.2,0.6,0.3,0.15`	Inter-request intervals (seconds) to test, slow to fast
`--sweep-count`	20	Paced requests per interval after draining
`--sweep-drain`	500	Cap on rapid requests used to empty the bucket before each interval
`--sweep-tolerance`	0.1	Max fraction of 429s for an interval to count as sustainable
`--margin`	0.8	Safety margin: pace at 80% of the measured max rate (0.8 = 25% slower than ceiling)
`--output`	`sonde_report.json`	Path for the JSON report (use `-` for stdout)
`-v` / `--verbose`	off	Show per-request detail (sets log level to DEBUG)
`-q` / `--quiet`	off	Only show warnings and errors (sets log level to WARNING)
`--log-format`	`plain`	Log line format: `plain` (message-only) or `json` (structured)

-v and -q are mutually exclusive. Logs always go to stderr; the report goes to --output.

Exit codes: 0 success, 2 precondition failure (bad arguments, unwritable --output, or the endpoint returned no usable response), 1 unexpected crash, 130 interrupted.

Piping and machine-readable output

Use --output - to write the JSON report to stdout instead of a file. Combine with -q to suppress INFO-level log noise on stderr:

sonde asset-owners --asset-id 20573078 --output - -q | jq .estimate

Use --log-format json for structured log lines on stderr (keys: timestamp, level, logger, message, plus exc on error lines), useful for log aggregators or CI pipelines:

sonde asset-owners --asset-id 20573078 --log-format json 2>sonde.log

Docker

Build:

docker build -t sonde .

Run (mount current directory so the report lands on the host):

docker run --rm -v "$(pwd):/data" -e ROBLOX_COOKIE sonde \
    asset-owners --asset-id 20573078 --total-items 1470000

docker run --rm -v "$(pwd):/data" -e GITHUB_TOKEN sonde \
    github-stargazers --owner anthropics --repo anthropic-sdk-python --total-items 5000

The container writes sonde_report.json to /data (the mounted volume).

Development

pip install -e '.[dev]'

Run tests and linting:

pytest
ruff check .
ruff format --check .

License

MIT

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Jartan

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.0

Jul 2, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sonde-0.1.0.tar.gz (45.1 kB view details)

Uploaded Jul 2, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

sonde-0.1.0-py3-none-any.whl (30.4 kB view details)

Uploaded Jul 2, 2026 Python 3

File details

Details for the file sonde-0.1.0.tar.gz.

File metadata

Download URL: sonde-0.1.0.tar.gz
Upload date: Jul 2, 2026
Size: 45.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for sonde-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`1441eab577e4f662afa481eb62d9a720789e5a66b8fb3de0984d4f29aeb7a815`
MD5	`7707c40e683b324404e02d4a4e89b19a`
BLAKE2b-256	`c27703250a2621acec9658f4049d5cd9439e4afe350dff5c9d5dff26f0eac4f2`

See more details on using hashes here.

Provenance

The following attestation bundles were made for sonde-0.1.0.tar.gz:

Publisher: publish-pypi.yml on Jartan-LLC/sonde

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: sonde-0.1.0.tar.gz
- Subject digest: 1441eab577e4f662afa481eb62d9a720789e5a66b8fb3de0984d4f29aeb7a815
- Sigstore transparency entry: 2048590995
- Sigstore integration time: Jul 2, 2026
Source repository:
- Permalink: Jartan-LLC/sonde@4f616ac2736b71adbc4553239679ea4109ebb216
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/Jartan-LLC
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@4f616ac2736b71adbc4553239679ea4109ebb216
- Trigger Event: push

File details

Details for the file sonde-0.1.0-py3-none-any.whl.

File metadata

Download URL: sonde-0.1.0-py3-none-any.whl
Upload date: Jul 2, 2026
Size: 30.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for sonde-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6a86eb535e3f33134164b3fa2d3235fb497442c817f01545da637e0a38c11d19`
MD5	`cfe32b84b9a7cabf686582e79a9a5a2c`
BLAKE2b-256	`39e5f654c564a1aa219a6404cff3c0cb0ecd1b54a478b2b8a77f0824eb2272ac`

See more details on using hashes here.

Provenance

The following attestation bundles were made for sonde-0.1.0-py3-none-any.whl:

Publisher: publish-pypi.yml on Jartan-LLC/sonde

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: sonde-0.1.0-py3-none-any.whl
- Subject digest: 6a86eb535e3f33134164b3fa2d3235fb497442c817f01545da637e0a38c11d19
- Sigstore transparency entry: 2048591011
- Sigstore integration time: Jul 2, 2026
Source repository:
- Permalink: Jartan-LLC/sonde@4f616ac2736b71adbc4553239679ea4109ebb216
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/Jartan-LLC
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@4f616ac2736b71adbc4553239679ea4109ebb216
- Trigger Event: push

sonde 0.1.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Sonde

Install

Quick Start

How It Works

How the estimate is produced

Endpoints

asset-owners

github-stargazers

Adding an Endpoint

CLI Reference

Piping and machine-readable output

Docker

Development

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance