CLI for generating multi-level code architecture graphs
Project description
CodeMarp
Multi-level Code Architecture and Relationship Mapping
Understand a codebase like a map — zoom in, zoom out, follow the flow.
The problem
Large codebases are hard to navigate. You open a file and you're already lost. You don't know what calls what, where things live, or what actually happens inside a function.
Documentation is outdated. Diagrams don't exist. The only way to understand the code is to read all of it.
CodeMarp takes a different approach: it builds the map for you.
What CodeMarp does
Given a Python or TypeScript repository, CodeMarp maps the codebase at multiple zoom levels:
| Level | What you see |
|---|---|
| High | Module and package architecture — how things are organised |
| Mid | Function relationships — who calls what |
| Low | Control flow inside a function — what actually happens |
Think of it as Google Maps for your codebase. Built entirely from static analysis — no runtime, no instrumentation.
Zoom out to see the city. Zoom in to see the streets. Zoom in further to see the building layout.
Graph Levels & Guarantees
CodeMarp produces three views of your codebase. Each view has a different goal and level of precision.
High-level (architecture)
- Shows module and package relationships
- Built from import statements and grouping
- Best-effort approximation of architecture
- May be incomplete or sparse depending on project layout
Mid-level (function relationships)
- Conservative call graph
- Resolves calls using:
- same-module functions
- imported symbols (
from x import y) - imported modules (
import x; x.y()) - unique module-level functions
- Avoids ambiguous resolution paths such as dynamic dispatch and unresolved method calls
- Prioritises precision over recall
- fewer false positives
- may miss valid edges
Each mid-level call edge includes a resolution reason:
same_moduleimported_symbolimported_moduleunique_global
Use --debug-resolution to inspect how edges were resolved.
Low-level (control flow)
- Function-level control flow graph (CFG)
- Shows branching, merging, and execution structure
- Structural only — does not model runtime behaviour
- Currently Python-only
Quickstart
Run CodeMarp on a Python or TypeScript repository:
codemarp analyze src --out out
This generates:
out/
high_level.mmd
mid_level.mmd
graph.json
To zoom into one Python function:
codemarp analyze src --view low --focus codemarp.cli.main:analyze_command --out out
Output
Full, trace, reverse, and module views produce:
out/
high_level.mmd # architecture graph
mid_level.mmd # function call graph
mid_level.json # function graph data
graph.json # full graph data for tooling
Low view produces:
out/
high_level.mmd # architecture graph
low_level.mmd # control flow
low_level.json # control-flow graph data
graph.json # full graph data for tooling
- Mermaid (
.mmd) — renders in GitHub, VS Code, Mermaid Live Editor - JSON — for tooling, integrations, and future UI
Install
For local development in this repo:
uv sync --extra dev
To install in a project environment:
uv pip install codemarp
Or with pip:
pip install codemarp
Or with Homebrew:
brew tap haddyadnan/forge
brew install codemarp
To install the CLI directly from source with uv:
uv tool install git+https://github.com/haddyadnan/codemarp.git
For a specific release tag:
uv tool install git+https://github.com/haddyadnan/codemarp.git@v0.3.0
Usage
Supported languages
| Language | Support |
|---|---|
| Python | Tree-sitter parser by default, AST parser available with --parser-engine ast |
| TypeScript | Tree-sitter parser for .ts and .tsx files |
JavaScript support is planned.
Analyse a repo
codemarp analyze path/to/repo --out out
Point at the folder that contains your top-level package:
# flat layout: mypackage/ is at root
codemarp analyze .
# src layout: mypackage/ is inside src/
codemarp analyze src
Views
Full (default)
See the entire graph — architecture + all function relationships.
codemarp analyze path/to/repo --view full --out out
Trace — what does this function call?
Follow a function forward through the call graph.
codemarp analyze path/to/repo \
--view trace \
--focus package.module:function_name \
--max-depth 3 \
--out out
Reverse — what calls this function?
Find every path that leads to a function.
codemarp analyze path/to/repo \
--view reverse \
--focus package.module:function_name \
--max-depth 3 \
--out out
Module — zoom into one module
Show the functions and internal relationships for a single module.
codemarp analyze path/to/repo \
--view module \
--module package.module_name \
--out out
Low — what happens inside this function?
Build a control-flow graph for one function.
codemarp analyze path/to/repo \
--view low \
--focus package.module:function_name \
--out out
Debug call resolution
Use --debug-resolution to print why mid-level call edges were resolved. This is useful when checking whether a call edge came from an imported symbol, a local function, or another resolver path.
codemarp analyze path/to/repo --out out --debug-resolution
The debug output is printed to stdout, so you can pipe it into a file:
codemarp analyze path/to/repo --out out --debug-resolution > debug.txt
The generated graph files are still written to the directory passed with --out.
Typical workflow
1. codemarp analyze src --view full
→ understand the overall structure
2. codemarp analyze src --view module --module mypackage.core
→ inspect one area
3. codemarp analyze src --view trace --focus mypackage.core:run --max-depth 3
→ follow a specific entrypoint
4. codemarp analyze src --view low --focus mypackage.core:run
→ zoom into the logic
Viewing the output
Mermaid files render automatically in:
- GitHub — renders Mermaid blocks directly in Markdown files
- Mermaid Live Editor — paste and share
- VS Code — with the Mermaid Preview extension
Yes, GitHub renders Mermaid blocks directly in Markdown, so the sample diagrams below should display in the repo README.
Sample output
These examples were generated from the CodeMarp codebase itself.
High-level architecture
Command:
codemarp analyze src --view full --out samples/codemarp_full_out
Excerpt from samples/codemarp_full_out/high_level.mmd:
flowchart LR
codemarp_analyzers["codemarp.analyzers"]
codemarp_cli["codemarp.cli"]
codemarp_contracts(["codemarp.contracts"])
codemarp_errors(["codemarp.errors"])
codemarp_exporters["codemarp.exporters"]
codemarp_graph["codemarp.graph"]
codemarp_parser["codemarp.parser"]
codemarp_pipeline["codemarp.pipeline"]
codemarp_views["codemarp.views"]
codemarp_analyzers -->|imports| codemarp_graph
codemarp_analyzers -->|imports| codemarp_parser
codemarp_analyzers -->|imports| codemarp_contracts
codemarp_cli -->|imports| codemarp_analyzers
codemarp_cli -->|imports| codemarp_errors
codemarp_cli -->|imports| codemarp_pipeline
codemarp_exporters -->|imports| codemarp_graph
codemarp_exporters -->|imports| codemarp_analyzers
codemarp_graph -->|imports| codemarp_contracts
codemarp_parser -->|imports| codemarp_errors
codemarp_parser -->|imports| codemarp_pipeline
codemarp_pipeline -->|imports| codemarp_graph
codemarp_pipeline -->|imports| codemarp_views
codemarp_pipeline -->|imports| codemarp_analyzers
codemarp_pipeline -->|imports| codemarp_parser
codemarp_pipeline -->|imports| codemarp_exporters
codemarp_views -->|imports| codemarp_errors
codemarp_views -->|imports| codemarp_graph
This is the zoomed-out package view: which parts of the project depend on which others.
Focused function trace
Command:
codemarp analyze src --view trace --focus codemarp.cli.main:analyze_command --out samples/codemarp_trace_out
Excerpt from samples/codemarp_trace_out/mid_level.mmd:
flowchart LR
codemarp_graph_models_GraphBundle_function_by_id["codemarp.graph.models:GraphBundle.function_by_id"]
codemarp_views_subgraph_build_function_subgraph["codemarp.views.subgraph:build_function_subgraph"]
codemarp_views_subgraph__filter_edges_for_nodes["codemarp.views.subgraph:_filter_edges_for_nodes"]
codemarp_views_subgraph__modules_for_functions["codemarp.views.subgraph:_modules_for_functions"]
codemarp_views_trace__build_call_adjacency["codemarp.views.trace:_build_call_adjacency"]
codemarp_views_trace_trace_functions_forward["codemarp.views.trace:trace_functions_forward"]
codemarp_views_trace_trace_function_view["codemarp.views.trace:trace_function_view"]
codemarp_views_trace__validate_entrypoint["codemarp.views.trace:_validate_entrypoint"]
codemarp_views_subgraph_build_function_subgraph -->|calls| codemarp_views_subgraph__modules_for_functions
codemarp_views_subgraph_build_function_subgraph -->|calls| codemarp_views_subgraph__filter_edges_for_nodes
codemarp_views_trace_trace_functions_forward -->|calls| codemarp_views_trace__validate_entrypoint
codemarp_views_trace_trace_functions_forward -->|calls| codemarp_views_trace__build_call_adjacency
codemarp_views_trace_trace_function_view -->|calls| codemarp_views_trace__validate_entrypoint
codemarp_views_trace_trace_function_view -->|calls| codemarp_views_trace_trace_functions_forward
codemarp_views_trace_trace_function_view -->|calls| codemarp_views_subgraph_build_function_subgraph
codemarp_views_trace__validate_entrypoint -->|calls| codemarp_graph_models_GraphBundle_function_by_id
This is the focused mid-level view: starting from one function, you can follow the call chain it reaches.
Low-level control flow
Command:
codemarp analyze src --view low --focus codemarp.parser.python.low_level:find_function_node --out samples/codemarp_low_out
Excerpt from samples/codemarp_low_out/low_level.mmd:
flowchart TD
n1(["Start"]):::terminal
n2["'Resolve a low-level f...'"]:::statement
n3["(module_id, target_name) = parse_..."]:::statement
n4["For"]:::statement
n5["Loop Body"]:::statement
n6["current_module_id = module_id_fro..."]:::statement
n7{"current_module_id != module_id"}:::decision
n8["continue"]:::statement
n9["Merge"]:::merge
n10["code = file_path.read_text(...)"]:::statement
n11["tree = ast.parse(...)"]:::statement
n12["node = _find_function_in_tree(...)"]:::statement
n13{"node is None"}:::decision
n14(["Raise"]):::terminal
n15["Merge"]:::merge
n16["normalized_function_id = f'{modul..."]:::statement
n17(["Return"]):::terminal
n18["After Loop"]:::merge
n19(["Raise"]):::terminal
n20(["End"]):::terminal
n1 --> n2
n2 --> n3
n3 --> n4
n4 -->|Iterate| n5
n5 --> n6
n6 --> n7
n7 -->|True| n8
n8 --> n9
n7 -->|False| n9
n9 --> n10
n10 --> n11
n11 --> n12
n12 --> n13
n13 -->|True| n14
n13 -->|False| n15
n15 --> n16
n16 --> n17
n17 -->|Next| n4
n4 -->|Exit| n18
n18 --> n19
n19 --> n20
classDef decision fill:#fff3cd,stroke:#d4a017,color:#000;
classDef statement fill:#f0f0f0,stroke:#aaa,color:#333;
classDef terminal fill:#fde8e8,stroke:#c0392b,color:#000;
classDef merge fill:#eef2ff,stroke:#7c8fdb,color:#000;
classDef start fill:#e8f5e9,stroke:#2e7d32,color:#000;
This is the zoomed-in function view: branches, merges, and return flow inside a single function.
Known limitations
CodeMarp is static analysis — it reads your code without running it.
| Limitation | Workaround |
|---|---|
| Relative imports may produce sparse high-level graphs | Use --view module or --view trace instead |
Method calls (self.method()) are conservatively handled |
Some valid edges may be missing, but false positives are reduced |
| Dynamic dispatch is not tracked | Results reflect static structure only |
| TypeScript support is first-pass | Function/import/call facts are extracted, but some language forms are still omitted |
| Low-level CFG is Python-only | Use high, mid, trace, reverse, or module views for TypeScript |
| Large full graphs can be hard to read | Use focused views — trace, module, reverse |
These are honest limitations, not bugs. Focused views exist precisely because full graphs on real codebases get noisy.
Roadmap
- Better call resolution (method dispatch, aliases)
- Graph filtering and noise reduction
- JavaScript support through tree-sitter
- Broader TypeScript coverage
- Language-neutral low-level CFG
- Interactive web UI
Philosophy
Useful before perfect. CodeMarp is not exhaustive. It is correct for common cases and honest about where it isn't.
Readable before complete. A graph you can understand is more valuable than a graph that shows everything.
Static first. No runtime instrumentation. No code execution. Analysis runs anywhere.
Status
- Python and TypeScript support
- Tree-sitter default parser
- Python AST fallback available with
--parser-engine ast - CLI-first
- v0.3.x — early but usable on real codebases
Name
CodeMarp — sounds like "code map", because that's what it produces.
Built to answer the question every developer asks when they open an unfamiliar codebase: where do I even start?
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file codemarp-0.3.0.tar.gz.
File metadata
- Download URL: codemarp-0.3.0.tar.gz
- Upload date:
- Size: 33.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.19
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8770c0fb08710c296ce2bdf7b479a7e05d50e477f3a22cb947ca3f07c99d0df3
|
|
| MD5 |
4771d94c10ec1f6106e9a8dd728bdda5
|
|
| BLAKE2b-256 |
0747680c8ac01180001684bb9445207e071f08e932e9a19a75a1d6f1aad3c942
|
File details
Details for the file codemarp-0.3.0-py3-none-any.whl.
File metadata
- Download URL: codemarp-0.3.0-py3-none-any.whl
- Upload date:
- Size: 33.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.19
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9522fe761d686f354a9028cac16baeccaac743305bd24b4a5b409b38ab313025
|
|
| MD5 |
e1a7b3298f612dbb6a90a8a99b46cff0
|
|
| BLAKE2b-256 |
01d261607ae4432bfcfffe0fc9bc594ab28bff26cfed0ba7174d535f8eefd66f
|