A tool for visualizing and analyzing code repositories

These details have not been verified by PyPI

Project links

Project description

Code Cartographer

Deep Multilayer Static Analyzer for Python Projects

As a masochist, I am never satisfied with a project until I reach perfection.
To me, perfection is so far beyond running correctly or achieving 0 problems across all files in a directory.
Although I've never actually achieved the elusive "perfect" finish line in any project ever, so I can't be sure about the definition. It's also just what happens when you're experimenting. When you're deep in research, you're iterating in a vacuum; by the time something works, you've rewritten it five times. Therefore, I unsurprisingly constantly have at least a few iterations of the same project on my local machine that I'm actively making more refinements to at all times.
This can cause confusion (shocker!), especially because I am reluctant to push or publish incomplete or "inadequate" code. I also have a fear of being perceived—and what's more vulnerable than my code?

"For when your code is too chaotic for flake8 and too personal for git push."

Just like that, a vicious cycle is born.
An unproductive, deeply confusing, memory-consuming vicious cycle.

Unfortunately the cycle is much harder to follow when there are dozens of moving parts in dozens of subfolders, each with (dozens) of lengthy scripts. You could feed your directory setup as context in an attempt to gain some clarity, but that's a gamble that backfires 9 times out of 10. Has any LLM ever in all of human history actually internalized any tree structure to assist you in reorganizing a repo? Yeah, I didn't think so. Not for me either. So if a vicious cycle is now my daily routine, at a certain point I decided to give myself the illusion of respite, however brief. This has given me solace at least once. Enjoy!

Code Cartographer

"If Git is for branches, this is for forks of forks."

Features

Full file and definition level metadata
Class/function blocks, line counts, docstrings, decorators, async flags, calls, type hints
Function/class SHA-256 hashes
Detects variants, clones, and partial rewrites across versions
Cyclomatic complexity & maintainability index analysis (via radon)
Flags "at-risk" code with CC > 10 or MI < 65
Auto-generated LLM refactor prompts
Variant grouping, inline diffs, rewrite guidance
Internal dependency graph
Outputs a Graphviz .dot of all intra-project imports
Markdown summary
Skimmable digest with risk flags and structure
Interactive Dashboard
Visual analysis of code complexity, variants, and dependencies
CLI flexibility
Exclusion patterns, Git SHA tagging, output formatting, Markdown/Graphviz toggles

Setup & Installation

Prerequisites

Python 3.8+
(Optional) Graphviz for dependency visualization
(Optional) jq for JSON manipulation

Installation

Create Virtual Environment

python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

Install Dependencies

pip install -r requirements.txt

Required Files Structure

your-project/
├── code_analyzer_engine.py     # Core analysis engine
├── code_variant_analyzer.py    # Variant detection and merging
├── analyze_codebase.sh        # Main automation script
├── templates/
│   └── dashboard.html.j2      # Dashboard template
└── requirements.txt           # Dependencies

Usage Guide

Quick Start

The simplest way to analyze any project:

./analyze_codebase.sh --project-dir /path/to/your/project

This will:

Run deep code analysis
Detect code variants
Generate comparison reports
Create an interactive dashboard

Advanced Usage

1. Deep Code Analysis

python code_analyzer_engine.py \
  -d /path/to/project \
  --markdown summary.md \
  --graphviz deps.dot \
  --exclude "tests/.*" "build/.*"

2. Code Variant Analysis

# Compare variants
python code_variant_analyzer.py compare \
    --summary-dir ./analysis \
    --diffs-dir ./diffs \
    --summary-csv ./summary.csv \
    --profile balanced

# Merge similar code
python code_variant_analyzer.py merge \
    --summary-dir ./analysis \
    --output-dir ./merged \
    --format both

# Generate dashboard
python code_variant_analyzer.py dashboard \
    --summary-csv ./summary.csv \
    --diffs-dir ./diffs \
    --template-dir ./templates \
    --output ./dashboard.html

3. Comparing Multiple Versions

python deep_code_analyzer.py -d ./version-A -o version-A.json
python deep_code_analyzer.py -d ./version-B -o version-B.json

# Merge summaries (requires jq)
jq -n \
  --argfile A version-A.json \
  --argfile B version-B.json \
  '{ "version-A": $A, "version-B": $B }' > combined_summary.json

Customization Options

Matching Profiles

--profile strict      # 90% similarity required
--profile balanced   # 70% similarity required
--profile lenient    # 50% similarity required

Output Formats

--format python     # Python files only
--format markdown   # Markdown documentation
--format both      # Both formats

Output Structure

After analysis, you'll find:

analyzed-project/
├── code_analysis/           
│   ├── analysis.json       # Complete analysis data
│   ├── analysis.md         # Human-readable summary
│   ├── dependencies.dot    # Dependency graph
│   ├── diffs/             # Code variant differences
│   └── dashboard.html      # Interactive dashboard
└── merged_code/           # Merged variant implementations

Dashboard Features

Overview metrics and trends
Code variant analysis
Complexity distribution
Dependency visualization
Documentation coverage

Key Metrics

Total files and trends
Code variant count
Average complexity
Documentation coverage
High-complexity files
Most referenced modules
External dependencies

Best Practices

1. Regular Analysis

# Add to your CI/CD pipeline
./analyze_codebase.sh --project-dir . --output-dir ./analysis-$(date +%Y%m%d)

2. Large Projects

# Analyze specific directories
./analyze_codebase.sh \
    --project-dir ./src \
    --exclude "tests/.*,docs/.*,*.pyc,__pycache__/.*"

3. Memory Management

For very large projects:

# Analyze in chunks
for dir in src/*/; do
    ./analyze_codebase.sh --project-dir "$dir" --output-dir "analysis-$(basename $dir)"
done

Integration Examples

GitHub Actions

name: Code Analysis
on: [push, pull_request]

jobs:
  analyze:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v2
      - uses: actions/setup-python@v2
      - name: Run Analysis
        run: |
          python -m venv .venv
          source .venv/bin/activate
          pip install -r requirements.txt
          ./analyze_codebase.sh --project-dir .
      - uses: actions/upload-artifact@v2
        with:
          name: code-analysis
          path: code_analysis/

Pre-commit Hook

#!/bin/bash
# .git/hooks/pre-commit

./analyze_codebase.sh --project-dir . --output-dir ./latest-analysis
if [ $? -ne 0 ]; then
    echo "Code analysis failed - please review"
    exit 1
fi

Troubleshooting

Memory Issues

# Reduce analysis scope
./analyze_codebase.sh \
    --project-dir . \
    --exclude "tests/.*,docs/.*,*.pyc,__pycache__/.*,migrations/.*"

Slow Analysis

# Focus on specific directories
./analyze_codebase.sh --project-dir ./src/core

Dashboard Issues

Ensure all JavaScript dependencies are accessible
Check browser console for errors
Verify JSON data structure matches template expectations

Author Notes

This tool exists to reconcile broken, duplicated, or ghost-forked Python projects. It helps you detect what's salvageable, refactor what's duplicated, and visualize the mess you made.

Whether you're dealing with:

Fragmented directories
Local edits lost to time
Abandoned branches and reanimated scripts

This is for you. Or at least, for the version of you that still wants to fix it.

"Structured remorse for unstructured code."

License

MIT License. See LICENSE file for details.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.3.0

Mar 5, 2026

0.2.1

Apr 25, 2025

0.1.7

Apr 24, 2025

0.1.6

Apr 24, 2025

0.1.5

Apr 24, 2025

0.1.4

Apr 24, 2025

0.1.3

Apr 24, 2025

This version

0.1.1

Apr 24, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

code_cartographer-0.1.1.tar.gz (20.6 kB view details)

Uploaded Apr 24, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

code_cartographer-0.1.1-py3-none-any.whl (5.9 kB view details)

Uploaded Apr 24, 2025 Python 3

File details

Details for the file code_cartographer-0.1.1.tar.gz.

File metadata

Download URL: code_cartographer-0.1.1.tar.gz
Upload date: Apr 24, 2025
Size: 20.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.8.18

File hashes

Hashes for code_cartographer-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`044c5b5eb781a3870972d55ffb2272e8883e961edbfaf043636282169db60134`
MD5	`eebbe6d06865895b34b4f5385f5c7050`
BLAKE2b-256	`71c68ce9fa684daa2ccb3272d819b6940a021984dfbee077d515cfc850d1d631`

See more details on using hashes here.

File details

Details for the file code_cartographer-0.1.1-py3-none-any.whl.

File metadata

Download URL: code_cartographer-0.1.1-py3-none-any.whl
Upload date: Apr 24, 2025
Size: 5.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.8.18

File hashes

Hashes for code_cartographer-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`eba64576d48ac4e74a20d90276365924b60875cb7c29bfc494c582b179ef1bbc`
MD5	`aaf712d9d55f6bd27b7743ef83886525`
BLAKE2b-256	`95fc6989ce9062f810d5e81a99353484fd8a10cb9bb84695fdf2c3fe2b249d7f`

See more details on using hashes here.

code-cartographer 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Code Cartographer

Deep Multilayer Static Analyzer for Python Projects

Code Cartographer

Features

Setup & Installation

Prerequisites

Installation

Usage Guide

Quick Start

Advanced Usage

1. Deep Code Analysis

2. Code Variant Analysis

3. Comparing Multiple Versions

Customization Options

Matching Profiles

Output Formats

Output Structure

Dashboard Features

Key Metrics

Best Practices

1. Regular Analysis

2. Large Projects

3. Memory Management

Integration Examples

GitHub Actions

Pre-commit Hook

Troubleshooting

Memory Issues

Slow Analysis

Dashboard Issues

Author Notes

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes