Your repository mentor - AI-guided codebase analysis with psychology-driven design

These details have not been verified by PyPI

Project description

Lantern

Lighting your way through the code forest.

Lantern Hero Image

Lantern is your CLI mentor that turns complex repositories into a step-by-step narrative.

Understand codebases faster with AI-guided architecture scans, planned learning paths, and human-readable guides.

Speaks Your Language: Complex logic is hard enough. Lantern explains code in your native language (Chinese, Japanese, Spanish, etc.) while keeping technical terms precise.

✨ Highlights


🧠 Cognitive Load Reduction	Psychology-based chunking (Miller's Law) breaks analysis into digestible batches
🌐 Native Language Output	Technical docs in your mother tongue—Chinese, Japanese, Spanish, and more
📊 Auto-Generated Diagrams	Mermaid flowcharts + sequence diagrams automatically created for every module
💡 Concept Extraction	Key mental models: authentication flow, caching strategy, retry mechanisms
Local & Private	Supports Ollama for 100% local analysis—safe for enterprise codebases

Why Lantern exists

Understanding a new codebase is hard. You face:

Not knowing which file to start with
Outdated or missing documentation
Hidden architectural dependencies
Needing to read dozens of files to understand one concept

Modern codebases often contain AI-generated code that works but lacks documentation—making comprehension even harder.

Most AI tools help you write or refactor code. Lantern's goal is different:

Lantern helps you understand code—whether written by humans or AI.

Use Cases

Scenario	How Lantern Helps
👤 New Hire Onboarding	Rapidly understand complex legacy systems without tribal knowledge
🔧 Pre-Refactoring Analysis	Assess impact scope before making changes
⚠️ Technical Debt Assessment	Identify high-risk modules and hidden dependencies
🏗️ Architecture Decision Support	Make better design choices with clear system visibility
🔍 Code Review Preparation	Understand unfamiliar code before reviewing PRs

Core Design & Features

🧠 Psychology-Driven Design

Designed for human comprehension, not machines. Lantern uses psychological principles:

Chunking (Miller's Law): Analyzes batches of ~3 related files to prevent cognitive overload
Scaffolding: Generates a plan first for human review, building understanding step-by-step
Human-First Output: Explains "Why" and "How", focusing on comprehension over data

🔄 Dual-Perspective Analysis

Bottom-up (file-by-file details) + Top-down (architecture overview) = complete understanding from any angle.

🔌 Flexible Backends

Choose between local privacy (Ollama), cloud power (OpenRouter/OpenAI), or agent-based workflows (CLI tools). Lantern automatically detects backend type and uses the appropriate analysis workflow.

🤖 Agentic Modes

Independently upgrade the planning and synthesis stages to LLM-powered agents:

--planning-mode agentic: Uses AgenticPlanner to generate LLM-enhanced batch hints and learning objectives on top of the static dependency graph.
--synthesis-mode agentic: Uses AgenticSynthesizer (LangGraph) to write Markdown documentation files directly via file tools instead of structured JSON parsing.

🔁 LangGraph Workflow Orchestration

Use --workflow to run the full pipeline as a LangGraph StateGraph with checkpoint-based resumption:

Enables pause and resume via --resume <thread-id>
Conditional routing based on quality gates
Human-in-the-loop interrupt support

✏️ Human-in-the-Loop

Review and edit lantern_plan.md before execution. You control what gets analyzed and how.

What Lantern Does

One command. Full documentation.

lantern run

Lantern analyzes your repository and generates a complete documentation repository:

Lantern Input & Output

Input

path to repo

Output

.lantern/output/
├── en/
│   ├── top_down/                    # 📖 High-level guides
│   │   ├── OVERVIEW.md             # Project vision & scope
│   │   ├── ARCHITECTURE.md         # System design + Mermaid dependency graphs
│   │   ├── CONCEPTS.md             # Key concepts (auth flow, caching, retry)
│   │   └── GETTING_STARTED.md      # Onboarding guide + Mermaid sequence diagrams
│   │
│   └── bottom_up/                   # 📝 File-by-file analysis
│       └── src/                     # Mirrors your repo structure
│           ├── kernel/
│           │   ├── scheduler.py.md  # Detailed breakdown
│           │   └── events.py.md
│           └── api/
│               └── routes.py.md
│
└── zh-TW/                           # 🌐 Native language version
    └── (same structure)

How It Maintains Quality

Internally, Lantern uses batch-based analysis for quality control:

Files are analyzed in small batches (1-3 related files)
Each batch builds on context from previous batches
This ensures traceability and consistent reasoning

You don't need to manage this—just run lantern run and let it work.

Visual Flow Reconstruction

Lantern automatically generates Mermaid diagrams for every analyzed file:

Architecture Diagrams

Show module dependencies in ARCHITECTURE.md:

graph LR
    API --> Auth
    API --> Models
    Auth --> Database
    Models --> Database

Sequence Diagrams & Per-File Flows: Lantern also generates request/response sequence diagrams and per-file flow diagrams showing internal logic.

No manual work needed—diagrams are generated automatically by analyzing your code structure.

Quick Start

Prerequisites

Lantern supports multiple backend options. See Backend Configuration for detailed setup instructions:

OpenAI (Recommended) - Cost-effective, production-ready
Ollama (Free & Private) - Run locally without API calls
OpenRouter - Access multiple providers (Claude, Gemini, etc.)
CLI Tool - Leverage agent capabilities (file tools, code execution)

Installation

pip install repo-lantern

Simple Mode (Recommended)

# Run in current directory (outputs to .lantern/)
lantern run

# Specify input and output
lantern run --repo ~/projects/my-app --output ~/docs/my-app-docs

# Use specific language
lantern run --lang zh-TW  # Traditional Chinese

# Skip the cost confirmation prompt
lantern run --yes

Lantern will show you a cost estimate before starting. The default backend is OpenAI, but you can configure it in .lantern/lantern.toml:

[backend]
type = "openai"              # or "ollama", "openrouter"
openai_model = "gpt-4o-mini" # fast and cheap for production
# openai_model = "gpt-4o"    # higher quality option

Advanced Mode

For reviewing the analysis plan before execution:

# Step 1: Initialize
lantern init --repo /path/to/repo
# Re-initialize and overwrite existing config
lantern init --repo /path/to/repo --overwrite

# Step 2: Generate plan (review lantern_plan.md)
lantern plan

# Step 3: Execute analysis with optional flags
lantern run
lantern run --planning-mode agentic    # LLM-enhanced planning
lantern run --synthesis-mode agentic   # LangGraph-powered synthesis
lantern run --workflow                 # Full LangGraph workflow orchestration
lantern run --workflow --resume <thread-id>  # Resume from checkpoint

All `lantern run` Options

Flag	Default	Description
`--repo`	`.`	Repository path to analyze
`--output`	`.lantern`	Output directory
`--lang`	`en`	Output language (e.g., `zh-TW`, `ja`)
`--yes` / `-y`	false	Skip cost confirmation prompt
`--planning-mode`	`agentic`	`static` (topological) or `agentic` (LLM-enhanced)
`--synthesis-mode`	`agentic`	`batch` (rule-based) or `agentic` (LLM-powered)
`--workflow`	false	Use LangGraph workflow orchestration
`--resume`	—	Resume from checkpoint with given thread ID

Configuration

Language settings

You can set your preferred output language (e.g., Traditional Chinese, Japanese) to lower the cognitive barrier even further.

Option A: Command line

lantern run --lang zh-TW

LangSmith Observability (Optional)

Lantern integrates with LangSmith for tracing and debugging LLM calls across the entire pipeline.

Enable it in .lantern/lantern.toml:

[langsmith]
enabled = true
project = "repo-lantern"            # Project name in LangSmith dashboard
# endpoint = "https://api.smith.langchain.com"
# api_key_env = "LANGCHAIN_API_KEY"

Set your API key:

export LANGCHAIN_API_KEY="ls__..."

When enabled, Lantern prints LangSmith tracing: ON (project=repo-lantern) at startup and all LangChain/LangGraph calls are traced.

Backend Configuration

Lantern supports multiple LLM backends with easy configuration:

OpenAI (Recommended for Production) ⭐

# .lantern/lantern.toml
[backend]
type = "openai"
openai_model = "gpt-4o-mini"  # Fast and cheap
# openai_model = "gpt-4o"     # Higher quality

Set your API key:

export OPENAI_API_KEY="sk-..."

Pricing (as of 2025):

gpt-4o-mini: $0.15/1M input, $0.60/1M output
gpt-4o: $2.50/1M input, $10/1M output

Ollama (Local Models)

[backend]
type = "ollama"
ollama_model = "qwen2.5:14b"  # or llama3, mistral, etc.

OpenRouter (Multi-Model Access)

[backend]
type = "openrouter"
openrouter_model = "openai/gpt-4o-mini"  # or anthropic/claude-sonnet-4, etc.

Set your API key:

export OPENROUTER_API_KEY="sk-or-v1-..."

CLI Tool (Agent-Based Workflow) 🤖

[backend]
type = "cli"
cli_command = "codex exec"  # or "llm -m gpt-4o-mini", "claude", etc.
cli_model_name = "cli"

How it works:

Lantern detects CLI backends and automatically switches to agent-based workflow
Prompts instruct the agent to write Markdown files directly using file tools
Agents leverage their native capabilities (code execution, file operations, etc.)
No JSON parsing required - agents write documentation files directly

Supported CLI tools:

codex exec - OpenAI Codex with agent capabilities
llm -m <model> - Simon Willison's LLM tool
claude - Anthropic Claude CLI
Any custom CLI that accepts stdin and outputs to stdout

Example workflow:

# Install a CLI tool (example: llm)
pip install llm

# Configure Lantern to use it
echo '[backend]
type = "cli"
cli_command = "llm -m gpt-4o-mini"
cli_model_name = "gpt-4o-mini"' > .lantern/lantern.toml

# Run analysis
lantern run

Cost Estimation

Before execution, Lantern fetches real-time pricing and shows you:

Estimated input/output tokens
Projected cost (USD)
Confirmation prompt

Local models (Ollama) show $0.00 cost.

Roadmap

LangGraph Workflow Orchestration: Full StateGraph with checkpoint-based resumption (--workflow, --resume).
Agentic Planning & Synthesis: LLM-enhanced planning (--planning-mode agentic) and synthesis (--synthesis-mode agentic).
LangSmith Observability: Tracing integration for debugging LLM calls.
Execution Trace Mode: Collect call graphs via unit tests for dynamic analysis.
Memory Cross-talk: Enhanced reasoning across batch boundaries.
Multi-language Static Analysis: Go, Rust, and Java support.
VSCode Extension: Integrated progress tracking.

Contributing

PRs are welcome! Help us build the ultimate tool for code understanding.

License

MIT

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.3

Feb 24, 2026

This version

0.1.2

Feb 24, 2026

0.1.1

Feb 16, 2026

0.1.0

Feb 14, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

repo_lantern-0.1.2.tar.gz (80.6 kB view details)

Uploaded Feb 24, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

repo_lantern-0.1.2-py3-none-any.whl (101.6 kB view details)

Uploaded Feb 24, 2026 Python 3

File details

Details for the file repo_lantern-0.1.2.tar.gz.

File metadata

Download URL: repo_lantern-0.1.2.tar.gz
Upload date: Feb 24, 2026
Size: 80.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for repo_lantern-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`09411165555e2217dfa0c1448bd7ef6763ca856b69d254654125437da1ec53c6`
MD5	`a0da7a7377e3a02a19322eaf36794489`
BLAKE2b-256	`4e27dbe12dbc437dc8598adf7f9c693c3569c6ba76d6407276ca4966c5528210`

See more details on using hashes here.

Provenance

The following attestation bundles were made for repo_lantern-0.1.2.tar.gz:

Publisher: release.yml on abc1199281/repo-lantern

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: repo_lantern-0.1.2.tar.gz
- Subject digest: 09411165555e2217dfa0c1448bd7ef6763ca856b69d254654125437da1ec53c6
- Sigstore transparency entry: 984698367
- Sigstore integration time: Feb 24, 2026
Source repository:
- Permalink: abc1199281/repo-lantern@25b94b9b4339da0274b68694291147abc2fd57cc
- Branch / Tag: refs/tags/v0.1.2
- Owner: https://github.com/abc1199281
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@25b94b9b4339da0274b68694291147abc2fd57cc
- Trigger Event: push

File details

Details for the file repo_lantern-0.1.2-py3-none-any.whl.

File metadata

Download URL: repo_lantern-0.1.2-py3-none-any.whl
Upload date: Feb 24, 2026
Size: 101.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for repo_lantern-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5d660a2f47270abf0ab04028ded7f2fe283ea974c434c03e01532f50ab93599e`
MD5	`b0d711aee109bb0a5aa158a478803c78`
BLAKE2b-256	`4ae24068c48325f13e2455d3fb2b36bab5d087a9475687e1f936f7dbf68cbab3`

See more details on using hashes here.

Provenance

The following attestation bundles were made for repo_lantern-0.1.2-py3-none-any.whl:

Publisher: release.yml on abc1199281/repo-lantern

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: repo_lantern-0.1.2-py3-none-any.whl
- Subject digest: 5d660a2f47270abf0ab04028ded7f2fe283ea974c434c03e01532f50ab93599e
- Sigstore transparency entry: 984698371
- Sigstore integration time: Feb 24, 2026
Source repository:
- Permalink: abc1199281/repo-lantern@25b94b9b4339da0274b68694291147abc2fd57cc
- Branch / Tag: refs/tags/v0.1.2
- Owner: https://github.com/abc1199281
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@25b94b9b4339da0274b68694291147abc2fd57cc
- Trigger Event: push

repo-lantern 0.1.2

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Lantern

✨ Highlights

Why Lantern exists

Use Cases

Core Design & Features

🧠 Psychology-Driven Design

🔄 Dual-Perspective Analysis

🔌 Flexible Backends

🤖 Agentic Modes

🔁 LangGraph Workflow Orchestration

✏️ Human-in-the-Loop

What Lantern Does

Input

Output

How It Maintains Quality

Visual Flow Reconstruction

Architecture Diagrams

Quick Start

Prerequisites

Installation

Simple Mode (Recommended)

Advanced Mode

All lantern run Options

Configuration

Language settings

LangSmith Observability (Optional)

Backend Configuration

OpenAI (Recommended for Production) ⭐

Ollama (Local Models)

OpenRouter (Multi-Model Access)

CLI Tool (Agent-Based Workflow) 🤖

Cost Estimation

Roadmap

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

All `lantern run` Options