Skip to main content

Autonomous codebase understanding agent — clone a repo, get an architecture report.

Project description

CodeNarrator

Autonomous codebase understanding agent.

Give it a Git URL and it produces an interactive HTML dependency graph plus a natural-language architecture summary of the codebase. Everything runs locally via Ollama — no API keys, no cloud calls.

Prerequisites

  • Python 3.10+
  • Ollama installed and running
  • A model pulled: ollama pull qwen2.5-coder:7b

Installation

pip install codenarrator-ai

Usage

from codenarrator import analyze

result = analyze("https://github.com/user/repo")

result.show()                  # opens the HTML report in your browser
result.to_html("report.html")  # save the HTML to a file
result.to_json("graph.json")   # export the dependency graph as JSON

Deeper exploration (30 steps instead of the default 20):

result = analyze("https://github.com/user/repo", depth="deep")

Private repositories:

result = analyze(
    "https://github.com/user/private-repo",
    api_key="your-github-token",
)

Result object

Property / Method Description
result.show() Open the HTML report in your browser
result.to_html(path) Save the HTML report to a file
result.to_json(path) Export dependency graph as JSON
result.dependency_graph List of internal edges
result.explored_files Files the agent explored
result.architecture_summary AI-generated architecture summary

Configuration

Environment variables:

Variable Purpose Default
CODENARRATOR_DATA_DIR Where clones, cache, and reports go ~/codenarrator/
OLLAMA_HOST Ollama server URL http://localhost:11434
OLLAMA_MODEL Model to use qwen2.5-coder:7b

Output

Reports and clones live under ~/codenarrator/ by default:

~/codenarrator/
├── repos/      ← cloned repositories
├── cache/      ← analysis cache (keyed by repo + git HEAD)
└── reports/    ← generated HTML reports

Supported languages

Language Full dependency graph Import extraction
Python
TypeScript
JavaScript
Java
Go
Rust
C / C++

How it works

Two layers:

Deterministic layer — file scanning, import extraction, internal-edge resolution, dependency graph computation. Pure Python, no LLM involved. Always produces a complete graph from static analysis alone.

Agentic layer — a local LLM (Qwen2.5-Coder 7B by default) explores the codebase via five tool calls:

  • read_file — inspect a source file
  • follow_import — jump to a file imported by something already read
  • search_for_pattern — regex search across the repo
  • mark_architecture_insight — record a finding
  • stop_analysis — finish the run

The model produces a natural-language architecture summary alongside the deterministic graph. If the LLM times out or returns junk, the report still renders from the deterministic layer.

Local development

For contributors only. End users should use pip install codenarrator-ai above.

git clone https://github.com/sharwariakre/CodeNarrator
cd CodeNarrator
pip install -e .  # for local development

# Start Ollama (separate terminal)
ollama serve
ollama pull qwen2.5-coder:7b

Optional web UI (FastAPI backend + React frontend):

# Terminal 1
cd backend && uvicorn app.main:app --reload

# Terminal 2
cd frontend && npm install && npm run dev

Run the test suite:

cd backend && python -m pytest tests/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

codenarrator_ai-0.1.2.tar.gz (42.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

codenarrator_ai-0.1.2-py3-none-any.whl (46.8 kB view details)

Uploaded Python 3

File details

Details for the file codenarrator_ai-0.1.2.tar.gz.

File metadata

  • Download URL: codenarrator_ai-0.1.2.tar.gz
  • Upload date:
  • Size: 42.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for codenarrator_ai-0.1.2.tar.gz
Algorithm Hash digest
SHA256 39a0194c3bda4b3ea16412f600e0f11eba342296a215bd80c4f7be68464c5037
MD5 6ed71d2289f714c19f2e3ec33d95e1e2
BLAKE2b-256 7419ef3b06c11dafa31ef7f31e703d6ec4f9cb4caa372aaf141ea3c2359b35f0

See more details on using hashes here.

File details

Details for the file codenarrator_ai-0.1.2-py3-none-any.whl.

File metadata

File hashes

Hashes for codenarrator_ai-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 0911244564b5f7941e34d9143b1786ecf2eefa1a6bf4e21b9aac5af51a830865
MD5 35f128f1617f4b0c559c723c20849c1c
BLAKE2b-256 f99ca69fd03f48d6574dc81976a1d93be249cc038f0735479497fe6b500b8cb6

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page