strot

Discover API endpoints with natural language

These details have not been verified by PyPI

Project links

Repository

Project description

Reverse-engineer any website's internal API with natural language

The Problem

Modern websites load data through hidden AJAX calls:

Data isn't in the HTML - it's fetched via internal APIs
You have to manually inspect network requests
Complex pagination logic buried in JavaScript
APIs change endpoints and parameters frequently

The Solution

Strot analyzes websites and discovers their internal API calls for you.

What Strot actually does:

Analyzes the webpage to understand what data you want
Captures the AJAX request that fetches that data
Detects all parameters - pagination (limit/offset, cursors, page numbers) AND dynamic parameters (sorting, filtering, search)
Generates extraction code to parse the JSON response
Returns a Source that replicates the website's own API calls with full parameter control

🐳 Try It Now - No Setup Required!

Get the full Strot experience (Web UI + API) instantly:

Using Patchright browser

docker run -d --name browser-server -p 5678:5678 synacktra/patchright-headless-server
STROT_ANTHROPIC_API_KEY=sk-ant-apiXXXXXX \
  docker compose -f https://raw.githubusercontent.com/vertexcover-io/strot/refs/heads/main/docker-compose.yml up

Or, using Steel browser

STROT_BROWSER_WS_URL=wss://connect.steel.dev?apiKey=ste-XXXXXX STROT_ANTHROPIC_API_KEY=sk-ant-apiXXXXXX \
  docker compose -f https://raw.githubusercontent.com/vertexcover-io/strot/refs/heads/main/docker-compose.yml up

Then visit:

🌐 Web Dashboard: http://localhost:3000 - Visual interface for analyzing websites
🔌 REST API: http://localhost:1337 - Programmatic access

🐍 Python Library

Installation

pip install strot

Or using uv:

uv pip install strot

Example 1: Reverse-Engineer Review API

import strot
import asyncio
from pydantic import BaseModel

class Review(BaseModel):
    title: str | None = None
    username: str | None = None
    rating: float | None = None
    comment: str | None = None
    date: str | None = None

async def get_reviews():
    # Strot discovers the internal API that loads reviews
    source = await strot.analyze(
        url="https://www.getcleanpeople.com/product/fresh-clean-laundry-detergent/",
        query="Customer reviews with ratings and comments",
        output_schema=Review,
        code_executor="e2b" # Defaults to "unsafe"
    )

    # Use the same API call the website uses, with full parameter control
    async for reviews in source.generate_data(limit=500, offset=100, sortBy="date desc"):
        for review in reviews:
            print(f"{review['rating']}⭐ by {review['username']}: {review['comment']}")

if __name__ == "__main__":
    asyncio.run(get_reviews())

Example 2: Capture Product Listing API

import strot
import asyncio
from pydantic import BaseModel

class Product(BaseModel):
    name: str | None = None
    price: float | None = None
    rating: float | None = None
    availability: str | None = None
    description: str | None = None

async def get_products():
    # Strot finds the AJAX endpoint that loads product listings
    source = await strot.analyze(
        url="https://blinkit.com/cn/fresh-vegetables/cid/1487/1489",
        query="Listed products with names and prices.",
        output_schema=Product
    )

    # Set code executor of your choice into source
    source.set_code_executor("e2b")

    # Access all products with custom filtering and sorting
    async for products in source.generate_data(limit=100, offset=0, category="vegetables", sortBy="price"):
        for product in products:
            print(f"{product['name']}: ${product['price']} ({product['rating']}⭐)")

if __name__ == "__main__":
    asyncio.run(get_products())

🔌 MCP Server

Strot provides an MCP (Model Context Protocol) server that enables MCP clients like Claude to analyze websites and discover their internal APIs directly.

Setup

git clone https://github.com/vertexcover-io/strot.git
cd strot && uv sync --group mcp

Set environment variables:

export STROT_ANTHROPIC_API_KEY=sk-ant-apiXXXXXX
export STROT_BROWSER_MODE_OR_WS_URL=headless  # or 'headed' or ws://browser-url

If you want to use a hosted browser, you could do the following:

Run a headless server inside docker

docker run -d --name browser-server -p 5678:5678 synacktra/patchright-headless-server
export STROT_BROWSER_MODE_OR_WS_URL=ws://localhost:5678/patchright

Or, connect to a remote browser server:

export STROT_BROWSER_MODE_OR_WS_URL=wss://...

Usage

$ uv run strotmcp
Usage: strotmcp COMMAND

╭─ Commands ─────────────────────────────────────────────╮
│ http             Start an http server.                 │
│ sse              Start an sse server.                  │
│ stdio            Start a stdio server.                 │
│ streamable-http  Start a streamable http server.       │
╰────────────────────────────────────────────────────────╯

With Claude Code

stdio transport

claude mcp add strot-mcp "uv run strotmcp stdio" --transport stdio \
  --env STROT_ANTHROPIC_API_KEY=$STROT_ANTHROPIC_API_KEY STROT_BROWSER_MODE_OR_WS_URL=$STROT_BROWSER_MODE_OR_WS_URL

sse, http, streamable-http transports

Run the MCP server with the desired transport in separate terminal:

uv run strotmcp sse --port 8888
# uv run strotmcp http --host 0.0.0.0
# uv run strotmcp streamable-http --host 127.0.0.1 --port 8777

Add the MCP server to Claude:

claude mcp add strot-mcp http://127.0.0.1:8888/sse --transport sse

🧪 Evaluation

Test and validate Strot's analysis accuracy across different websites and individual components. The evaluation system supports five input types:

existing jobs (evaluate completed jobs by ID),
new jobs (create and evaluate full analysis tasks),
request detection (test URL and query combinations),
parameter detection (test individual request parameter analysis - both pagination and dynamic parameters),
structured extraction (test response parsing with specific schemas).

It compares actual results against expected outcomes, tracking metrics like source URL matching, pagination key detection, dynamic parameter detection, and entity count accuracy. All evaluations and their detailed analysis steps are stored in Airtable.

Setup

git clone https://github.com/vertexcover-io/strot.git
cd strot && uv sync --group eval

Setup Airtable

Create a new Airtable base:
- Go to Workspaces
- Click on Create button on your desired workspace
- Select Build an app on your own
- Copy the Base ID from the URL (e.g. appXXXXXXXXXXXXXX)
Create a Personal Access Token:
- Go to /create/tokens
- Add the following scopes:
  - data.records:read
  - data.records:write
  - schema.bases:read
  - schema.bases:write
- Give access to the base you created in step 1
- Press Create token and copy the token (e.g. patXXXXXXXXXXXXXX)

Set environment variables:

export STROT_AIRTABLE_BASE_ID=appXXXXXXXXXXXXXX
export STROT_AIRTABLE_TOKEN=patXXXXXXXXXXXXXX

# set the following if you wish to evaluate separate tasks
export STROT_ANTHROPIC_API_KEY=sk-ant-apiXXXXXX

Note: Required tables are automatically created with proper schema when you run evaluations.

Usage

$ uv run stroteval
Usage: stroteval [OPTIONS]

Evaluate multiple job-based or task-based inputs from a file or stdin.

╭─ Parameters ──────────────────────────────────────────────────────────────────╮
│ --file  -f  Path to the JSON/JSONL file. If not provided, reads from stdin.   │
╰───────────────────────────────────────────────────────────────────────────────╯

Make sure the API server is running If you're evaluating job-based inputs.

echo '[
  {
    "job_id": "existing-job-uuid",
    "expected_source": "https://api.example.com/reviews",
    "expected_pagination_keys": ["cursor", "limit"],
    "expected_dynamic_keys": ["sortBy", "category"],
    "expected_entity_count": 243
  },
  {
    "site_url": "https://example.com/category/abc",
    "query": "Listed products with name and prices",
    "expected_source": "https://api.example.com/products",
    "expected_pagination_keys": ["limit", "offset"],
    "expected_dynamic_keys": ["category", "sortBy"],
    "expected_entity_count": 100
  },
  {
    "request": {
      "method": "GET",
      "url": "https://example.com/api/products",
      "type": "ajax",
      "queries": {"page": "2", "limit": "50", "sortBy": "price", "category": "electronics"}
    },
    "expected_pagination_keys": ["page", "limit"],
    "expected_dynamic_keys": ["sortBy", "category"]
  },
  {
    "response": {
      "value": "",
      "request": {
        "method": "GET",
        "url": "https://example.com/api/reviews",
        "type": "ajax"
      },
      "preprocessor": {"element_selector": ".reviews-container"}
    },
    "output_schema_file": "review_schema.json",
    "expected_entity_count": 10
  }
]' | uv run stroteval

🆘 Need Help?

💬 GitHub Discussions - Ask questions
🐛 Report Issues - Found a bug?

📄 License

MIT License - Use it however you want!

Made with ❤️ by Vertexcover Labs

Project details

These details have not been verified by PyPI

Project links

Repository

Release history Release notifications | RSS feed

This version

1.4.0

Sep 17, 2025

1.3.0

Sep 11, 2025

1.2.0

Aug 28, 2025

1.1.0

Aug 21, 2025

1.0.0

Aug 13, 2025

0.1.4

Aug 5, 2025

0.1.3

Aug 5, 2025

0.1.2

Aug 5, 2025

0.1.1

Aug 5, 2025

0.1.0

Aug 5, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

strot-1.4.0.tar.gz (41.4 kB view details)

Uploaded Sep 17, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

strot-1.4.0-py3-none-any.whl (55.5 kB view details)

Uploaded Sep 17, 2025 Python 3

File details

Details for the file strot-1.4.0.tar.gz.

File metadata

Download URL: strot-1.4.0.tar.gz
Upload date: Sep 17, 2025
Size: 41.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.17

File hashes

Hashes for strot-1.4.0.tar.gz
Algorithm	Hash digest
SHA256	`5fb8f1cab7c7cba8593088054529d4e3145ec8a82b9c21259b5e6d442c565ce6`
MD5	`d1146485cccb223fe113fb92ea32b909`
BLAKE2b-256	`6cd14ef585a4e6df0c257085fc6b011c5d8173caf378efc3856efab570d25dc5`

See more details on using hashes here.

File details

Details for the file strot-1.4.0-py3-none-any.whl.

File metadata

Download URL: strot-1.4.0-py3-none-any.whl
Upload date: Sep 17, 2025
Size: 55.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.17

File hashes

Hashes for strot-1.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`759a97e3f905ce77251da666bd1cc510e6b9ee1b9db000e79439800a366a47be`
MD5	`2887fa9b5d10e9174af3e3e48f64e366`
BLAKE2b-256	`d2f6a9155ebaad1c0a02b2d8dfbe261967175d2aba3ff22a6858ad7f6c4b8254`

See more details on using hashes here.

strot 1.4.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

The Problem

The Solution

🐳 Try It Now - No Setup Required!

🐍 Python Library

Installation

Example 1: Reverse-Engineer Review API

Example 2: Capture Product Listing API

🔌 MCP Server

Setup

Usage

With Claude Code

🧪 Evaluation

Setup

Setup Airtable

Usage

🆘 Need Help?

📄 License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes