Multi-level code architecture and relationship mapping

Project description

CodeMarp

Multi-level Code Architecture and Relationship Mapping

Understand a codebase like a map — zoom in, zoom out, follow the flow.

The problem

Large codebases are hard to navigate. You open a file and you're already lost. You don't know what calls what, where things live, or what actually happens inside a function.

Documentation is outdated. Diagrams don't exist. The only way to understand the code is to read all of it.

CodeMarp takes a different approach: it builds the map for you.

What CodeMarp does

Given a Python repository, CodeMarp gives you three zoom levels:

Level	What you see
High	Module and package architecture — how things are organised
Mid	Function relationships — who calls what
Low	Control flow inside a function — what actually happens

Think of it as Google Maps for your codebase. Built entirely from static analysis — no runtime, no instrumentation.

Zoom out to see the city. Zoom in to see the streets. Zoom in further to see the building layout.

Graph Levels & Guarantees

CodeMarp produces three views of your codebase. Each view has a different goal and level of precision.

High-level (architecture)

Shows module and package relationships
Built from import statements and grouping
Best-effort approximation of architecture
May be incomplete or sparse depending on project layout

Mid-level (function relationships)

Conservative call graph
Resolves calls using:
- same-module functions
- imported symbols (from x import y)
- imported modules (import x; x.y())
- unique module-level functions
Avoids ambiguous resolution paths such as dynamic dispatch and unresolved method calls
Prioritises precision over recall
- fewer false positives
- may miss valid edges

Each mid-level call edge includes a resolution reason::

same_module
imported_symbol
imported_module
unique_global

Use --debug-resolution to inspect how edges were resolved.

Low-level (control flow)

Function-level control flow graph (CFG)
Shows branching, merging, and execution structure
Structural only — does not model runtime behaviour

Quickstart

Run CodeMarp on a Python repository:

codemarp analyze src --out out

This generates:

out/
  high_level.mmd
  mid_level.mmd
  graph.json

To zoom into one function:

codemarp analyze src --view low --focus codemarp.cli.main:analyze_command --out out

Output

Every analysis produces:

out/
  high_level.mmd    # architecture graph
  mid_level.mmd     # function call graph
  low_level.mmd     # control flow (when using --view low)
  graph.json        # full graph data for tooling

Mermaid (.mmd) — renders in GitHub, VS Code, Mermaid Live Editor
JSON — for tooling, integrations, and future UI

Install

For local development in this repo:

uv sync --extra dev

To install in a project environment:

uv pip install codemarp

Or with pip:

pip install codemarp

Usage

Analyse a repo

codemarp analyze path/to/repo --out out

Point at the folder that contains your top-level package:

# flat layout: mypackage/ is at root
codemarp analyze .

# src layout: mypackage/ is inside src/
codemarp analyze src

Views

Full (default)

See the entire graph — architecture + all function relationships.

codemarp analyze path/to/repo --view full --out out

Trace — what does this function call?

Follow a function forward through the call graph.

codemarp analyze path/to/repo \
  --view trace \
  --focus package.module:function_name \
  --max-depth 3 \
  --out out

Reverse — what calls this function?

Find every path that leads to a function.

codemarp analyze path/to/repo \
  --view reverse \
  --focus package.module:function_name \
  --max-depth 3 \
  --out out

Module — zoom into one module

Show the functions and internal relationships for a single module.

codemarp analyze path/to/repo \
  --view module \
  --module package.module_name \
  --out out

Low — what happens inside this function?

Build a control-flow graph for one function.

codemarp analyze path/to/repo \
  --view low \
  --focus package.module:function_name \
  --out out

Debug call resolution

Use --debug-resolution to print why mid-level call edges were resolved. This is useful when checking whether a call edge came from an imported symbol, a local function, or another resolver path.

codemarp analyze path/to/repo --out out --debug-resolution

The debug output is printed to stdout, so you can pipe it into a file:

codemarp analyze path/to/repo --out out --debug-resolution > debug.txt

The generated graph files are still written to the directory passed with --out.

Typical workflow

1. codemarp analyze src --view full
   → understand the overall structure

2. codemarp analyze src --view module --module mypackage.core
   → inspect one area

3. codemarp analyze src --view trace --focus mypackage.core:run --max-depth 3
   → follow a specific entrypoint

4. codemarp analyze src --view low --focus mypackage.core:run
   → zoom into the logic

Viewing the output

Mermaid files render automatically in:

GitHub — renders Mermaid blocks directly in Markdown files
Mermaid Live Editor — paste and share
VS Code — with the Mermaid Preview extension

Yes, GitHub renders Mermaid blocks directly in Markdown, so the sample diagrams below should display in the repo README.

Sample output

These examples were generated from the CodeMarp codebase itself.

High-level architecture

Command:

codemarp analyze src --view full --out samples/codemarp_full_out

Excerpt from samples/codemarp_full_out/high_level.mmd:

flowchart LR
    codemarp_analyzers["codemarp.analyzers"]
    codemarp_cli["codemarp.cli"]
    codemarp_errors(["codemarp.errors"])
    codemarp_exporters["codemarp.exporters"]
    codemarp_graph["codemarp.graph"]
    codemarp_parser["codemarp.parser"]
    codemarp_pipeline["codemarp.pipeline"]
    codemarp_views["codemarp.views"]
    codemarp_analyzers -->|imports| codemarp_graph
    codemarp_analyzers -->|imports| codemarp_parser
    codemarp_cli -->|imports| codemarp_analyzers
    codemarp_cli -->|imports| codemarp_errors
    codemarp_cli -->|imports| codemarp_pipeline
    codemarp_exporters -->|imports| codemarp_graph
    codemarp_exporters -->|imports| codemarp_analyzers
    codemarp_parser -->|imports| codemarp_errors
    codemarp_parser -->|imports| codemarp_graph
    codemarp_pipeline -->|imports| codemarp_graph
    codemarp_pipeline -->|imports| codemarp_views
    codemarp_pipeline -->|imports| codemarp_analyzers
    codemarp_pipeline -->|imports| codemarp_parser
    codemarp_pipeline -->|imports| codemarp_exporters
    codemarp_views -->|imports| codemarp_errors
    codemarp_views -->|imports| codemarp_graph

This is the zoomed-out package view: which parts of the project depend on which others.

Focused function trace

Command:

codemarp analyze src --focus codemarp.cli.main:analyze_command --out out

Excerpt from samples/codemarp_trace_out/mid_level.mmd:

flowchart LR
    codemarp_graph_models_GraphBundle_function_by_id["codemarp.graph.models:GraphBundle.function_by_id"]
    codemarp_views_subgraph_build_function_subgraph["codemarp.views.subgraph:build_function_subgraph"]
    codemarp_views_subgraph__filter_edges_for_nodes["codemarp.views.subgraph:_filter_edges_for_nodes"]
    codemarp_views_subgraph__modules_for_functions["codemarp.views.subgraph:_modules_for_functions"]
    codemarp_views_trace__build_call_adjacency["codemarp.views.trace:_build_call_adjacency"]
    codemarp_views_trace_trace_functions_forward["codemarp.views.trace:trace_functions_forward"]
    codemarp_views_trace_trace_function_view["codemarp.views.trace:trace_function_view"]
    codemarp_views_trace__validate_entrypoint["codemarp.views.trace:_validate_entrypoint"]
    codemarp_views_subgraph_build_function_subgraph -->|calls| codemarp_views_subgraph__modules_for_functions
    codemarp_views_subgraph_build_function_subgraph -->|calls| codemarp_views_subgraph__filter_edges_for_nodes
    codemarp_views_trace_trace_functions_forward -->|calls| codemarp_views_trace__validate_entrypoint
    codemarp_views_trace_trace_functions_forward -->|calls| codemarp_views_trace__build_call_adjacency
    codemarp_views_trace_trace_function_view -->|calls| codemarp_views_trace__validate_entrypoint
    codemarp_views_trace_trace_function_view -->|calls| codemarp_views_trace_trace_functions_forward
    codemarp_views_trace_trace_function_view -->|calls| codemarp_views_subgraph_build_function_subgraph
    codemarp_views_trace__validate_entrypoint -->|calls| codemarp_graph_models_GraphBundle_function_by_id

This is the focused mid-level view: starting from one function, you can follow the call chain it reaches.

Low-level control flow

Command:

codemarp analyze src --view low --focus codemarp.parser.python_parser:get_source --out out

Excerpt from samples/codemarp_low_out/low_level.mmd:

flowchart TD
    n1(["Start"]):::terminal
    n2{"encoding is None"}:::decision
    n3["get_best_encoding(...)"]:::statement
    n4["Merge"]:::merge
    n5{"errors is None"}:::decision
    n6["Assign"]:::statement
    n7["Merge"]:::merge
    n8(["Return"]):::terminal
    n9(["End"]):::terminal
    n1 --> n2
    n2 -->|True| n3
    n3 --> n4
    n2 -->|False| n4
    n4 --> n5
    n5 -->|True| n6
    n6 --> n7
    n5 -->|False| n7
    n7 --> n8
    n8 --> n9

    classDef decision fill:#fff3cd,stroke:#d4a017,color:#000;
    classDef statement fill:#f0f0f0,stroke:#aaa,color:#333;
    classDef terminal fill:#fde8e8,stroke:#c0392b,color:#000;
    classDef merge fill:#eef2ff,stroke:#7c8fdb,color:#000;
    classDef start fill:#e8f5e9,stroke:#2e7d32,color:#000;

This is the zoomed-in function view: branches, merges, and return flow inside a single function.

Known limitations

CodeMarp is static analysis — it reads your code without running it.

Limitation	Workaround
Relative imports may produce sparse high-level graphs	Use `--view module` or `--view trace` instead
Method calls (`self.method()`) are conservatively handled	Some valid edges may be missing, but false positives are reduced
Dynamic dispatch is not tracked	Results reflect static structure only
Large full graphs can be hard to read	Use focused views — `trace`, `module`, `reverse`

These are honest limitations, not bugs. Focused views exist precisely because full graphs on real codebases get noisy.

Roadmap

Better call resolution (method dispatch, aliases)
Graph filtering and noise reduction
Tree-sitter migration → multi-language support
JavaScript / TypeScript support
Interactive web UI

Philosophy

Useful before perfect. CodeMarp v0.1 is not exhaustive. It is correct for common cases and honest about where it isn't.

Readable before complete. A graph you can understand is more valuable than a graph that shows everything.

Static first. No runtime instrumentation. No code execution. Analysis runs anywhere.

Status

Python only (AST-based, tree-sitter planned)
CLI-first
v0.1.x — early but usable on real codebases

Name

CodeMarp — sounds like "code map", because that's what it produces.

Built to answer the question every developer asks when they open an unfamiliar codebase: where do I even start?

Project details

Release history Release notifications | RSS feed

0.3.2

Apr 20, 2026

0.3.1

Apr 20, 2026

0.3.0

Apr 19, 2026

0.2.0

Apr 19, 2026

This version

0.1.3

Apr 18, 2026

0.1.2

Apr 16, 2026

0.1.1

Apr 16, 2026

0.1.0

Apr 12, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

codemarp-0.1.3.tar.gz (29.5 kB view details)

Uploaded Apr 18, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

codemarp-0.1.3-py3-none-any.whl (24.4 kB view details)

Uploaded Apr 18, 2026 Python 3

File details

Details for the file codemarp-0.1.3.tar.gz.

File metadata

Download URL: codemarp-0.1.3.tar.gz
Upload date: Apr 18, 2026
Size: 29.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.19

File hashes

Hashes for codemarp-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`fd51118d7a74d8eac2d327973318e5a85d9b513109dc78abfea531669b9e149b`
MD5	`8d1f979fe5b7d2a959220c9acd6d1d2d`
BLAKE2b-256	`cb6bdef99a3d9a188cbcd5d37734d98f81f903d3f8eec3bd34b720abbb45f204`

See more details on using hashes here.

File details

Details for the file codemarp-0.1.3-py3-none-any.whl.

File metadata

Download URL: codemarp-0.1.3-py3-none-any.whl
Upload date: Apr 18, 2026
Size: 24.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.19

File hashes

Hashes for codemarp-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f0ed583fbf276f044e5f2a1b7c09e33889e224494ab4f7dfe544cb89904cedda`
MD5	`173146dd6b57f32cb50db1b3e98b5c05`
BLAKE2b-256	`7a6a6e9fb4ecd3e8d29d1d91aca6d9ab87327ee1563b85aa472d478b1c95e645`

See more details on using hashes here.

codemarp 0.1.3

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

CodeMarp

The problem

What CodeMarp does

Graph Levels & Guarantees

High-level (architecture)

Mid-level (function relationships)

Low-level (control flow)

Quickstart

Output

Install

Usage

Analyse a repo

Views

Full (default)

Trace — what does this function call?

Reverse — what calls this function?

Module — zoom into one module

Low — what happens inside this function?

Debug call resolution

Typical workflow

Viewing the output

Sample output

High-level architecture

Focused function trace

Low-level control flow

Known limitations

Roadmap

Philosophy

Status

Name

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes