Static analysis tool that maps Python codebases into navigable flows

These details have not been verified by PyPI

Project description

Cartograph

Static analysis for Python codebases. Discovers entry points, builds cross-file call graphs, outputs structured context for LLMs.

pip install cartograph-code
carto scan ./sentry/src

Scan complete.
  4,415 modules · 30,926 functions · 788 entry points · 37,659 resolved calls

Discovered Flows:

   1. issues (142 entry points: 89 discovered, 53 signal_handler)
      Deepest: process_organization_mapping → 22 functions

   2. integrations (98 entry points: 98 discovered)
      Deepest: setup_intent_succeeded → 18 functions

   3. monitors (67 entry points: 67 discovered)
      Deepest: _process_checkin → 57 functions
   ...

Sentry uses @instrumented_task and @cell_silo_endpoint — custom decorators no static analyzer has seen before. Cartograph found 788 entry points without a single line of Sentry-specific code.

How It Works

Entry Point Discovery

Most code analysis tools hardcode decorator names: "if you see @app.get, it's a route." This breaks the moment a codebase uses custom wrappers.

Cartograph takes a different approach. After building the call graph, it looks at the graph topology:

A function is an entry point if:

Zero incoming edges — no project code calls it
Has outgoing edges — it does something
Has a decorator — a framework registered it

This works on any framework, any custom decorator, without configuration. Framework detectors (FastAPI, Flask, Django, Celery) still exist — they add rich labels like "GET /api/users" — but they're optional. The topology does the discovery.

Codebase	Detector-only	+ Topology	Notes
Sentry	52	788	`@instrumented_task`, `@cell_silo_endpoint`
Dagster	0	255	`@public`, `@job_cli.command` — zero detectors exist
Polar	328	600	FastAPI routes + `@actor`, `@cli.command`
Prefect	183	396	FastAPI routes + `@flow`, `@task`

Type Resolution

Resolving receiver.method() requires knowing the type of receiver. The pipeline tries these sources in order:

self + MRO — self.method() walks the class hierarchy
Imports — from .service import user_service; user_service.get()
Parameter annotations — def f(session: AsyncSession): session.execute()
Local assignments — x = Foo() or x = Foo.create_delegation() (factory classmethods)
Return types — x = get_user() where get_user() -> User
ORM patterns — Model.objects.filter()

The factory classmethod resolution is why Sentry works. Sentry's service layer: action_service = ActionService.create_delegation() — 36 service singletons, all using this pattern. Without recognizing that Foo.create_delegation() returns type Foo, every service call resolves to nothing.

Conditional Branches

Call edges carry their condition. When get_claim_info calls ResourceNotFound three times under different conditions, the graph preserves each one:

get_claim_info → ResourceNotFound  [condition: not seat]
get_claim_info → ResourceNotFound  [condition: else]
get_claim_info → ResourceNotFound  [condition: not organization]

Cache

carto scan parses everything and writes to .cartograph/ (JSON). Every subsequent command reads from cache. On Sentry (30K functions): first scan ~30s, every command after <0.5s.

Commands

carto scan ./project              # parse + cache + show flows
carto entries                     # list entry points (no path needed after scan)
carto entries --type api_route    # filter by type
carto search "checkout"           # find functions by name
carto trace "deploy" --depth 3    # call tree with branches
carto callers "execute_run"       # reverse lookup
carto summary                     # stats

Pipe to LLMs

carto context outputs structured markdown to stdout. Pipe it to whatever LLM you already use:

carto context | claude "what does this codebase do"
carto context "deploy" | claude "explain the deploy flow"
carto context "checkout" | gh copilot explain

Prefect's raw codebase: ~9M tokens. carto context output: ~8K tokens. The LLM gets every entry point, domain grouping, top callers, and package structure in one page.

Built-in LLM (optional)

export CARTOGRAPH_LLM_PROVIDER=claude   # or openai, ollama
export ANTHROPIC_API_KEY=sk-ant-...     # or OPENAI_API_KEY
carto explain                           # whole codebase
carto explain "checkout"                # specific flow

Web Viewer

carto serve ./project --port 3333

ELK.js layout engine, draggable nodes, branch visualization on edges, condition labels. Click any entry point to render its DAG.

Tested Against

Project	Framework	Functions	Entry Points	Resolved Edges
Sentry	Django + Celery (custom)	30,926	788	37,659
Polar	FastAPI	6,350	600	6,327
Prefect	FastAPI + custom	6,280	396	2,821
Dagster	Custom framework	11,533	255	6,919
paperless-ngx	Django + Celery	1,559	26	1,099

122 tests passing.

Design Decisions

Why stdlib ast instead of tree-sitter? Python-only for now. ast gives us the full parse tree with zero dependencies. Tree-sitter is the migration path for multi-language support.

Why topology for entry points? Because hardcoding @app.get means maintaining a list that's always incomplete. The graph already knows which functions are roots. Use the structure, not the annotations.

Why not use LSP? LSP requires a running language server and gives you one symbol at a time. We need the whole graph at once for topology analysis. Different tool for a different job.

Why pipe instead of built-in LLM? Developers already have their LLM. We're a context generator, not an LLM wrapper. The built-in explain command exists for convenience, but carto context | claude is the primary workflow.

Resolution ceiling: ~65% of project-internal calls resolve on complex codebases. The remaining are calls to external packages (Django ORM, stdlib, sentry_sdk) that no project-level static analyzer can resolve. The entry point discovery doesn't depend on resolution — it's pure graph topology.

Roadmap

Blast radius — carto impact --diff HEAD~1 → which entry points does this PR affect
MCP server — Cartograph as a tool for Claude Code / Cursor (no piping needed)
Tree-sitter → Java (Spring Boot), Go, TypeScript
CI integration — GitHub Action that comments blast radius on PRs

License

MIT

LLMs guess. Cartograph proves.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.3

Apr 18, 2026

0.1.2

Apr 18, 2026

0.1.1

Apr 18, 2026

0.1.0

Apr 18, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cartograph_code-0.1.3.tar.gz (49.6 kB view details)

Uploaded Apr 18, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

cartograph_code-0.1.3-py3-none-any.whl (59.3 kB view details)

Uploaded Apr 18, 2026 Python 3

File details

Details for the file cartograph_code-0.1.3.tar.gz.

File metadata

Download URL: cartograph_code-0.1.3.tar.gz
Upload date: Apr 18, 2026
Size: 49.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for cartograph_code-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`4e4c6f74eb5c4285b60eb6ba5eb6a1e329fb6bd04388fb3692ffc2954ef3af72`
MD5	`ab6130c5f46081e7f61bbb9864f38525`
BLAKE2b-256	`a1deb754264ebb179ab0fdf6353bf6962a213ae787e02f1816a83ac61976da1d`

See more details on using hashes here.

File details

Details for the file cartograph_code-0.1.3-py3-none-any.whl.

File metadata

Download URL: cartograph_code-0.1.3-py3-none-any.whl
Upload date: Apr 18, 2026
Size: 59.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for cartograph_code-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`530e771460426fad6f1537f2b6087a44fb969858970a83b93587ccb650753568`
MD5	`c980ee6e2d6b39e9d318db7d7200e97a`
BLAKE2b-256	`5341d3e54f13569b50c002bad079539b88d5c7c41eb816a59860bbfbf24a5045`

See more details on using hashes here.

cartograph-code 0.1.3

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Cartograph

How It Works

Entry Point Discovery

Type Resolution

Conditional Branches

Cache

Commands

Pipe to LLMs

Built-in LLM (optional)

Web Viewer

Tested Against

Design Decisions

Roadmap

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes