Skip to main content

Static code analysis + knowledge-graph for AI coding agents — cut token costs by 89–93 %

Project description

Aethvion Project Mapper

Static code analysis + knowledge-graph for AI coding agents.
Give your AI the map it needs — before it starts writing code.


Why it exists

AI coding agents (Claude Code, Cursor, Copilot, etc.) read your entire codebase on every task. That's expensive, slow, and often inaccurate because context windows fill up before the agent sees the relevant files.

Project Mapper scans your codebase once, builds a structured knowledge graph of every module, class, function, and their relationships, and lets agents query only what they need — in milliseconds, at a fraction of the token cost.


Benchmark numbers

Measured on the Django 5.1.x source (521 k lines · 2,917 Python files · 11,988 entities · Windows 11 · no LLM enrichment).

Token reduction (measured)

Query type Without PM With PM Reduction
Context query ("authentication middleware security") ~13,141 tokens (5 files) 1,448 tokens 89 %
Impact query (Model class, 2-hop) ~54,412 tokens (2 core files) 4,020 tokens 93 %

The without-PM baseline assumes the agent already knows which files to read. On a first encounter with a codebase it would read 10–20 files, pushing token cost to 25 000–60 000 per query.

Query latency (measured)

Query Latency
Context query 79–101 ms
Impact query 11–57 ms

Session startup — entity map load (measured)

After a scan, agents load the full entity map at session start.

Method Load time
File-by-file, cold OS cache (restart / idle) 52,048 ms
File-by-file, warm OS cache (right after scan) ~700 ms
Snapshot (single pre-built file) 145 ms

The snapshot is built automatically at the end of each scan (82 ms to build, 9.8 MB for 11,988 entities). It is validated before every load and rebuilt automatically when entity files change. 358× faster than a cold start.

Financial impact at scale (modelled)

Modelled from the measured 89–93 % per-query token reduction. Assumes 10 tasks/dev/day, 8 turns/task, Claude Sonnet pricing. See the cost calculator for your own numbers.

Team size Monthly AI coding cost (est.) Savings with PM
Solo developer $80 $74
10-person team $2,400 $2,230
100-person team $48,000 $44,600
Enterprise (1,000 devs) $480,000 $446,400

What it does

  1. Static scan — walks your project, extracts every module / class / function via AST analysis. No AI needed for this step.
  2. Knowledge graph — stores entities + relationships (imports, calls, extends, depends_on, …) in a local JSON database.
  3. Agent queries — 7 MCP tools that agents call instead of reading raw files:
Tool What it answers
pm_context "What should I know before touching the auth system?"
pm_impact "What breaks if I change UserService?"
pm_path "How does RateLimiter connect to the payment flow?"
pm_contribute "Record that I added rate limiting to endpoint X"
pm_stats "What's already indexed in this database?"
pm_delta "What changed since the last scan?"
pm_scan "Scan this project directory right now"

Quick start

HTTP API

# Install
pip install aethvion-project-mapper

# Start server
uvicorn server:app --port 7474

# Scan your project
curl -X POST http://localhost:7474/api/project-mapper/scan \
  -H "Content-Type: application/json" \
  -d '{"project_root": "/path/to/your/project", "enrich": false}'

# Query context for a task
curl -X POST http://localhost:7474/api/project-mapper/query/context \
  -H "Content-Type: application/json" \
  -d '{"q": "add rate limiting to auth endpoints", "detail_level": "medium"}'

Docs at http://localhost:7474/docs

Docker

docker compose up
# Server running at http://localhost:7474

Mount your projects:

# docker-compose.yml — set PROJECTS_DIR to your code root
PROJECTS_DIR=/home/you/code docker compose up

MCP stdio (Claude Code / Cursor / Antigravity)

Claude Code — add to ~/.claude/settings.json:

{
  "mcpServers": {
    "project-mapper": {
      "type": "stdio",
      "command": "python",
      "args": [
        "-m", "project_mapper.mcp_server",
        "--db", "my_project",
        "--project-root", "/absolute/path/to/project"
      ],
      "cwd": "/absolute/path/to/Aethvion-ProjectMapper"
    }
  }
}

Cursor — add to .cursor/mcp.json:

{
  "mcpServers": {
    "project-mapper": {
      "command": "python",
      "args": ["-m", "project_mapper.mcp_server", "--db", "my_project"],
      "cwd": "/absolute/path/to/Aethvion-ProjectMapper",
      "env": { "PM_PROJECT_ROOT": "/absolute/path/to/project" }
    }
  }
}

Antigravity (Google) — add to ~/.gemini/antigravity/mcp_config.json:

{
  "mcpServers": {
    "project-mapper": {
      "type": "stdio",
      "command": "python",
      "args": [
        "-m", "project_mapper.mcp_server",
        "--db", "my_project",
        "--project-root", "C:/absolute/path/to/project"
      ],
      "cwd": "C:/absolute/path/to/Aethvion-ProjectMapper"
    }
  }
}

All three agents use the same mcpServers format — only the config file location differs. Restart the agent after editing the config.


Configuration

Variable Default Description
PM_DATA_DIR ~/.aethvion_pm/data Root directory for all databases
PM_LOG_LEVEL INFO Log level: DEBUG / INFO / WARNING / ERROR
PM_DB_NAME default MCP server: database name
PM_DB_PATH (unset) MCP server: explicit database path
PM_PROJECT_ROOT (unset) MCP server: default project root for pm_scan

Project structure

project_mapper/
├── config.py          — DATA_DIR config
├── routes.py          — FastAPI router (/api/project-mapper/*)
├── scanner.py         — Async background scan engine
├── ingestor.py        — CodeAnalysis → AethvionDB entities
├── code_analyzer.py   — Python AST extractor
├── query.py           — Impact / context / shortest-path algorithms
├── cleanup.py         — Incremental scan maintenance
├── delta.py           — Filesystem diff (no DB writes)
├── mcp_tools.py       — 7 MCP tool schemas + handlers
├── mcp_server.py      — JSON-RPC 2.0 stdio MCP server
└── db/
    ├── entity_schema.py   — Entity data model + validation
    ├── entity_writer.py   — Create / update / delete entities
    ├── name_index.py      — Thread-safe name → ID index
    ├── file_manifest.py   — File ↔ entity provenance tracking
    ├── snapshot.py        — Fast-load snapshot cache (358× speedup)
    └── db_registry.py     — Named database registry
server.py              — FastAPI app entry point

Incremental scanning

Subsequent scans only process files whose SHA-256 hash has changed since the last run. On a 10,000-file repo that's been scanned before, incremental mode typically processes < 1 % of files — scan time drops from ~60 s to < 2 s.

# Full scan (first time or force refresh)
curl -X POST .../scan -d '{"project_root": "...", "incremental": false}'

# Incremental scan (default — only changed files)
curl -X POST .../scan -d '{"project_root": "..."}'

License

Open-source core: GNU AGPL v3
Free to use, modify, and self-host. Network use requires open-sourcing your modifications.

Commercial license: COMMERCIAL_LICENSE.md
Available for teams that need a proprietary license, SLA, or integration support.


Contributing

Pull requests are welcome. By submitting a PR you agree to the Contributor License Agreement (§2).

git clone https://github.com/Aethvion/Aethvion-ProjectMapper
cd Aethvion-ProjectMapper
pip install -e ".[dev]"
pytest

Built with care by the Aethvion team.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aethvion_project_mapper-1.0.0.tar.gz (84.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

aethvion_project_mapper-1.0.0-py3-none-any.whl (94.3 kB view details)

Uploaded Python 3

File details

Details for the file aethvion_project_mapper-1.0.0.tar.gz.

File metadata

  • Download URL: aethvion_project_mapper-1.0.0.tar.gz
  • Upload date:
  • Size: 84.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.11

File hashes

Hashes for aethvion_project_mapper-1.0.0.tar.gz
Algorithm Hash digest
SHA256 fa750b973f501ddba464c3835cee0b20609867f18fb86fa0653f6c7d4695a14a
MD5 2de24dd4f607576f5acce119df42fe11
BLAKE2b-256 98c2825b04226cb99664d936a61800a1161e895b23b6ee6b8462995ef972f6e0

See more details on using hashes here.

File details

Details for the file aethvion_project_mapper-1.0.0-py3-none-any.whl.

File metadata

File hashes

Hashes for aethvion_project_mapper-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c9be23d377ccdd68d8cb8868f7125d650cfe021112b7e9a27225b2fd9f5465e5
MD5 aa4ad7ed20e387ad96b77e05ba892c6e
BLAKE2b-256 58ad6ba3adde9a24f10eee4ad181aaca784b606edaede8337e7e556b94a165c4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page