
DecipherCode

Python 3.10+ · MIT License

Give your legacy code a voice.

DecipherCode is a CLI tool that uses LLMs to analyze legacy codebases and generate comprehensive documentation. Point it at any project directory (or GitHub URL) and get instant architecture analysis, README generation, diagrams, and git archaeology reports.

Works with any OpenAI-compatible API: OpenAI, Ollama, Azure OpenAI, Anthropic (via proxy), and more.

Features

  • :mag: Full Repo Analysis - Detect languages, frameworks, architecture patterns, APIs, database models, environment variables, and dead code
  • :page_facing_up: README Generation - Generate a professional README.md complete with badges, setup instructions, and API documentation
  • :triangular_ruler: Architecture Diagrams - Produce Mermaid or GraphViz DOT diagrams showing components, data flow, and module dependencies
  • :scroll: Git Archaeology - Analyze commit history to find contributors, tech debt hotspots, and project evolution narrative
  • :speech_balloon: Interactive Q&A - Ask natural language questions about any codebase and get precise, context-aware answers
  • :electric_plug: LLM-Agnostic - Configure once via environment variables; works with OpenAI, Azure OpenAI, Anthropic, and Ollama
  • :white_check_mark: Repository Health Auditor - decipher practices scores a Python project against 8 categories (project structure, testing, quality tooling, CI/CD, licensing, release readiness, dependency hygiene, documentation) with a reproducible fail/warn severity model

Quick Start

Installation

Install from source:
git clone https://github.com/boricles/deciphercode.git
cd deciphercode
pip install -e ".[dev]"

Configuration

DecipherCode supports multiple LLM providers. Configure via environment variables:

# OpenAI
export DECIPHER_API_BASE="https://api.openai.com/v1"
export DECIPHER_API_KEY="sk-..."
export DECIPHER_MODEL="gpt-4o"

# Ollama (default, no key needed)
export DECIPHER_API_BASE="http://localhost:11434/v1"
export DECIPHER_API_KEY="ollama"
export DECIPHER_MODEL="llama3"

# Azure OpenAI (auto-detected from URL)
export DECIPHER_API_BASE="https://your-resource.cognitiveservices.azure.com"
export DECIPHER_API_KEY="your-azure-key"
export DECIPHER_MODEL="your-deployment-name"
export DECIPHER_API_VERSION="2024-10-21"    # optional, this is the default

# Anthropic via proxy
export DECIPHER_API_PROVIDER="anthropic"
export DECIPHER_API_BASE="http://localhost:8080"
export DECIPHER_API_KEY="your-anthropic-key"
export DECIPHER_MODEL="claude-opus-4-6"

Provider auto-detection: If your DECIPHER_API_BASE URL contains azure.com, the Azure OpenAI client is used automatically. Set DECIPHER_API_PROVIDER explicitly to force a specific provider (openai, azure, or anthropic).

Usage

# Full codebase analysis
decipher scan ./my-legacy-app

# Generate a README
decipher readme ./my-legacy-app -o README.md

# Architecture diagrams (Mermaid or DOT)
decipher diagram ./my-legacy-app --format mermaid
decipher diagram ./my-legacy-app --format dot -o architecture.dot

# Git archaeology report
decipher history ./my-legacy-app

# Ask a question
decipher ask ./my-legacy-app "How does the auth flow work?"

# Interactive Q&A session
decipher ask ./my-legacy-app

# Scan from a GitHub URL
decipher scan https://github.com/user/repo

# Export analysis as JSON
decipher scan ./my-legacy-app --json -o analysis.json

# Verbose mode for debugging
decipher -v scan ./my-legacy-app

Practices Auditor

Audit any Python repository against software-development best practices:

# Default audit (terminal output when TTY, markdown in CI)
decipher practices /path/to/repo

# JSON report to file
decipher practices . --format json -o report.json

# Fail CI on warnings (not just failures)
decipher practices . --strict

# Run only specific checkers
decipher practices . --only testing,quality_gates

The auditor produces a structured report with per-category scores (0-100) and prioritized recommendations. Exit codes: 0 = pass, 1 = fail (or warn with --strict), 2 = input error.
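The exit-code contract can be modeled as a tiny function. This is a minimal sketch under stated assumptions — findings are simplified to severity strings, and the function name is hypothetical:

```python
def practices_exit_code(findings: list[str], strict: bool = False) -> int:
    """Map audit findings to the documented exit codes.

    0 = pass, 1 = fail (or warn when --strict is set); exit code 2 is
    reserved for input errors and not modeled here. Each finding is a
    severity string: 'fail' or 'warn'.
    """
    if "fail" in findings:
        return 1
    if strict and "warn" in findings:
        return 1
    return 0

print(practices_exit_code(["warn"]))               # 0: warnings pass by default
print(practices_exit_code(["warn"], strict=True))  # 1: --strict promotes warnings
```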

Commands

| Command | Description |
| --- | --- |
| decipher scan &lt;target&gt; | Full analysis: languages, architecture, APIs, dead code, and more |
| decipher readme &lt;target&gt; | Generate a professional README.md |
| decipher diagram &lt;target&gt; | Generate Mermaid or GraphViz architecture diagrams |
| decipher history &lt;target&gt; | Git archaeology: contributors, hotspots, evolution timeline |
| decipher ask &lt;target&gt; [question] | Ask questions about the codebase (interactive if no question given) |
| decipher practices &lt;target&gt; | Audit repository against best practices (8 checkers, fail/warn scoring) |

All commands accept a local directory path or a GitHub URL as the target.
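Target handling can be pictured as a simple dispatch. `is_github_url` is a hypothetical helper for illustration, not part of the CLI:

```python
def is_github_url(target: str) -> bool:
    """Return True when the target should be cloned rather than read locally."""
    return target.startswith(("https://github.com/", "http://github.com/"))

print(is_github_url("https://github.com/user/repo"))  # True
print(is_github_url("./my-legacy-app"))               # False
```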

Example Output

See the examples/ directory for sample outputs.

Project Structure

deciphercode/
├── decipher/
│   ├── __init__.py          # Package version
│   ├── cli.py               # Click CLI commands
│   ├── scanner.py           # Codebase scanning and file discovery
│   ├── analyzer.py          # LLM-powered code analysis
│   ├── readme_generator.py  # README.md generation
│   ├── archaeologist.py     # Git history analysis
│   ├── diagrammer.py        # Mermaid/DOT diagram generation
│   ├── interactive.py       # Interactive Q&A mode
│   ├── llm.py               # LLM client wrapper (OpenAI, Azure, Anthropic)
│   └── utils.py             # File reading, language detection, helpers
├── tests/                   # Test suite (72 tests)
├── examples/                # Sample outputs
├── pyproject.toml           # Project metadata and dependencies
├── LICENSE                  # MIT
└── README.md

How It Works

  1. Scan - Walks the directory tree, identifies source files, detects languages and frameworks, finds dependency files, maps config and environment variables, and identifies entry points.

  2. Analyze - Representative source files are sampled and sent to the LLM along with the project structure. The LLM identifies the architecture pattern, components, API routes, database models, dead code candidates, and key observations.

  3. Generate - Based on the analysis, DecipherCode can produce a README, architecture diagrams, or an archaeology report. Each output type uses a specialized prompt designed to produce accurate, well-structured results.

  4. Interactive - In Q&A mode, the full codebase context is loaded into the conversation, allowing you to ask natural language questions and get precise answers that reference specific files and functions.
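The language-detection part of the scan step can be sketched as an extension-based tally. This is a simplification — the real scanner also inspects frameworks, dependency files, and entry points, and the extension map here is an illustrative subset:

```python
from collections import Counter
from pathlib import PurePath

# Illustrative subset; the actual tool covers many more languages.
EXTENSIONS = {".py": "Python", ".js": "JavaScript", ".go": "Go"}

def tally_languages(files: list[str]) -> Counter:
    """Count source files per language by file extension."""
    counts: Counter = Counter()
    for name in files:
        lang = EXTENSIONS.get(PurePath(name).suffix)
        if lang:
            counts[lang] += 1
    return counts

print(tally_languages(["cli.py", "llm.py", "web/app.js", "README.md"]))
# Counter({'Python': 2, 'JavaScript': 1})
```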

Upgrading

See CHANGELOG.md for what's new and MIGRATION.md for breaking changes between releases.

Configuration Reference

| Environment Variable | Default | Description |
| --- | --- | --- |
| DECIPHER_API_BASE | http://localhost:11434/v1 | API base URL |
| DECIPHER_API_KEY | ollama | API key (use ollama for local Ollama) |
| DECIPHER_MODEL | llama3 | Model name (or Azure deployment name) |
| DECIPHER_API_PROVIDER | (auto-detected) | Force a provider: openai, azure, or anthropic |
| DECIPHER_API_VERSION | 2024-10-21 | Azure OpenAI API version |

Dependencies

Development

# Install with dev dependencies
pip install -e ".[dev]"

# Run tests
pytest

# Run tests with coverage
pytest --cov=decipher

# Lint
ruff check decipher/ tests/

Contributing

Contributions are welcome! Here's how to get involved:

  1. Report bugs - Open an issue with steps to reproduce
  2. Suggest features - Describe the use case and expected behavior in an issue
  3. Submit PRs - Fork the repo, create a feature branch, add tests, and open a pull request

Please keep PRs focused on a single change and ensure all tests pass before submitting.

Acknowledgments

This project was built with Claude Code by Anthropic as the primary development tool. Architecture design, implementation, testing, and documentation were developed through collaborative AI-assisted programming.

License

MIT License. See LICENSE for details.
