Your codebase, understood by any LLM. Generate a checked-in wiki and skill files from any repo — no cloud, no lock-in.

Project description

kiwiskil logo

kiwiskil

Your codebase, understood by any LLM.
Generate a checked-in wiki and skill files from any repo — no cloud, no lock-in.

Install · Quick start · CLAUDE.md snippet · CLI · Loading the skill · Configuration · Contributing

kiwiskil architecture diagram

kiwiskil generates a checked-in structural wiki and skill files from any codebase. It enables LLM agents to navigate code without reading source files — using a knowledge graph built from your repo and checked into git.

How it works

AST parsing extracts symbols, imports, and call graphs from your source files (deterministic, free) — Python, JS/TS, Go, Java, Ruby, and Rust
An LLM adds one-line descriptions and a system overview — via any LiteLLM provider, or your logged-in claude CLI with no API key (details)
A density-based grouper organises files into wiki pages by logical density, not directory structure
Relationships and impact are rendered inline — every page shows each symbol's callers, callees, and an "editing this affects…" blast radius (precomputed, deterministic), so an agent can trace and debug without reading source
A ranked repo map (PageRank over the call graph, fit to a token budget) leads INDEX.md and the skill file with the most load-bearing symbols first
A pre-commit hook keeps the wiki in sync — every commit includes updated wiki pages atomically
A skill file is generated at .indexer/skills/codebase.md, and init writes both CLAUDE.md and AGENTS.md navigation snippets so any LLM agent or assistant can use the map

The wiki is plain markdown checked into your repo, with OKF-style YAML frontmatter on every page. No cloud service, no search index, no running server, no lock-in — the map is files an agent reads directly.

Install

pip install kiwiskil

Quick start

# In any git repo
kiwiskil init       # creates .indexer.toml, installs pre-commit hook, writes CLAUDE.md + AGENTS.md
kiwiskil run        # generates wiki/ and .indexer/skills/codebase.md

On every subsequent commit, the pre-commit hook runs kiwiskil run --staged automatically — only changed files are re-indexed.

CLAUDE.md / AGENTS.md snippet

kiwiskil init writes this navigation snippet to both CLAUDE.md (for Claude Code) and AGENTS.md (the cross-tool convention read by Cursor, Codex, Gemini CLI, Copilot, and others) automatically. kiwiskil run --smart also restores either file if it goes missing. If you prefer to add it manually, paste it into the file in your repo root:

## Codebase Navigation

This repo is indexed with kiwiskil. Before reading any source file or answering any code question:

1. Load `.indexer/skills/codebase.md` as a skill — it contains the full navigation workflow.
2. Read `wiki/INDEX.md` for the system overview and module map.
3. Match the question to a wiki page, look up symbols there, and only read source when you know the exact file and line range.

Do not read source files speculatively. The wiki gives you structure and relationships in a fraction of the tokens.

- Wiki pages: `wiki/` — grouped by logical density, not directory structure
- Manifest: `.indexer/manifest.json` — maps every file to its wiki page and component IDs
- Component IDs: `relative/path.py::ClassName.method_name`

The snippet does three things:

Tells Claude to load the skill before doing anything — this is what makes the navigation workflow kick in
Points to wiki/INDEX.md as the first read, not random source files
Sets the rule: wiki first, source only when you know exactly where to look

CLI

kiwiskil init                    # set up config, hook, CLAUDE.md + AGENTS.md
kiwiskil run                     # smart incremental + deep enrichment (default)
kiwiskil run --skip-deep         # skip narrative/flows/constraints enrichment (faster)
kiwiskil run --force             # force full re-index of all files
kiwiskil run --staged            # incremental on staged files only (used by hook)
kiwiskil run --smart             # verify generated artifacts and repair only what's broken
kiwiskil run --smart --dry-run   # report drift without fixing
kiwiskil status                  # show last indexed commit, stale files, stats
kiwiskil hook install            # manually install pre-commit hook
kiwiskil hook remove             # remove pre-commit hook

Deep mode

By default, kiwiskil run performs a deep enrichment pass after structural indexing. This uses your configured LLM to generate:

System narrative — a plain-English overview of what the codebase does
Key request flows — end-to-end data flows across modules
Design constraints — per-module gotchas, invariants, and non-obvious rules

These appear in wiki/INDEX.md and in the skill file, giving agents richer context without reading source. Use --skip-deep to run structural-only indexing when speed matters.

Smart mode

kiwiskil run --smart is a verify-then-repair mode for the generated artifacts themselves. It checks for drift that the default incremental mode doesn't catch:

Missing or deleted wiki pages
Orphan wiki pages (no manifest entry points to them)
Missing .indexer/skills/codebase.md or wiki/INDEX.md
Dangling manifest entries for files that no longer exist
Untracked source files that were never indexed
Missing CLAUDE.md or AGENTS.md snippet, .gitignore cache entry, or pre-commit hook
Wiki pages missing deep-mode sections (when deep mode is enabled)

Smart mode also fills missing pieces, not just drift: on a fresh repo with no index yet, kiwiskil run --smart performs a full initial index of every tracked source file (it only bails when there are genuinely no indexable files). Never-indexed tracked files are picked up and indexed in the same pass.

Smart mode only re-LLMs the groups it has to. Pass --dry-run to see what's drifted (including "would do a full initial index of N files") without changing anything. With --dry-run, the command exits non-zero when drift is found and zero when clean — making it a drop-in CI drift-gate (see .github/workflows/kiwiskil.yml). The dry-run is deterministic and needs no LLM / API key.

Output

`wiki/INDEX.md`

Top-level map of the entire codebase — which wiki page covers which files, entry points for each group, system overview, and key request flows (when deep mode is enabled). It also carries:

Repo Map — symbols ranked by importance (PageRank over the call graph), fit to a token budget (map_tokens), most load-bearing first — read these to orient
Core abstractions — the highest-connectivity symbols (most callers + callees)

Every page is OKF-framed: YAML frontmatter with type, title, description, tags, timestamp, and resource; INDEX.md declares okf_version.

`wiki/<group>.md`

One page per logical folder cluster. Each page contains:

Modules — files covered
Key Symbols — functions, classes, methods with one-line descriptions
Symbol Relationships — per symbol: its Callers, its Calls, and Editing this affects (the transitive blast radius) — so you can trace and assess impact without reading source
Relationships — what this group calls, what calls it, what it imports
Entry Points — symbols with no callers (architectural roots)
Data Flows — end-to-end flows through this module (deep mode)
Design Constraints — invariants and non-obvious rules to respect (deep mode)

`.indexer/skills/codebase.md`

A skill file that teaches any LLM agent how to navigate your codebase. The skill file includes:

Codebase stats (symbol count, file count, index date, commit)
System overview and key request flows
A ranked Repo Map (most load-bearing symbols first, fit to a token budget)
Wiki page index with entry points
Critical constraints extracted per module
Step-by-step navigation workflow for agents
Component ID format reference and manifest lookup instructions

Loading the skill

The skill file lives at .indexer/skills/codebase.md after you run kiwiskil run. Load it into your agent once — it activates automatically on any codebase question.

Claude Code

# Global — available in every project
mkdir -p ~/.claude/skills/codebase
cp .indexer/skills/codebase.md ~/.claude/skills/codebase/SKILL.md

# Project-local — available in this repo only
mkdir -p .claude/skills/codebase
cp .indexer/skills/codebase.md .claude/skills/codebase/SKILL.md

Or reference it directly in CLAUDE.md (already done by kiwiskil init):

## Codebase Navigation
Load `.indexer/skills/codebase.md` as a skill before reading source files.

Cursor

Add the skill content to your .cursor/rules file or a *.mdc rule file:

# Append the skill as a project rule
cat .indexer/skills/codebase.md >> .cursor/rules/codebase.mdc

Or in Cursor Settings → Rules → Project Rules, paste the contents of .indexer/skills/codebase.md.

Windsurf

Add the skill to your project's Windsurf rules file:

# Append to existing rules, or create the file
cat .indexer/skills/codebase.md >> .windsurfrules

Windsurf loads .windsurfrules automatically for every conversation in the project.

GitHub Copilot (VS Code)

Add the skill as a custom instruction file:

mkdir -p .github
cat .indexer/skills/codebase.md >> .github/copilot-instructions.md

Copilot picks up .github/copilot-instructions.md automatically for workspace-scoped chat.

Zed

Paste the skill content into your project's .zed/assistant_instructions.md:

mkdir -p .zed
cat .indexer/skills/codebase.md >> .zed/assistant_instructions.md

Any other agent / MCP client

The skill file is plain markdown. Load it into your agent's context however it supports custom instructions — system prompt, context file, instruction file, or rules file. The skill activates on any codebase navigation question automatically.

Keeping the skill in sync

The pre-commit hook regenerates .indexer/skills/codebase.md on every commit. If you copy the file to a global location, re-copy it after each index run:

kiwiskil run && cp .indexer/skills/codebase.md ~/.claude/skills/codebase/SKILL.md

For project-local paths (.claude/skills/codebase/SKILL.md, .cursor/rules/, .windsurfrules), the file updates in-place automatically — no extra step needed.

Configuration

.indexer.toml is created by kiwiskil init and checked into the repo:

[llm]
provider = "anthropic/claude-sonnet-4-6"  # any LiteLLM-compatible model string
api_key_env = "ANTHROPIC_API_KEY"         # env var name, not the key itself; leave unset to use the logged-in `claude` CLI

[indexer]
wiki_dir = "wiki"
ignore = ["node_modules", ".venv", "dist", "build", "__pycache__", "*.test.*"]
max_tokens_per_batch = 8000
merge_threshold = 2       # min files under a folder before it becomes its own wiki page
map_tokens = 1024         # token budget for the ranked Repo Map spine in INDEX/skill

[hooks]
pre_commit = true
synthesize_commit_message = true
deep = true           # set false to skip --deep on commits (faster, structural only)

Any LiteLLM-compatible provider works: OpenAI, Anthropic, Gemini, Ollama, local models.

No API key? Use your logged-in Claude

If you have the claude CLI installed and signed in (Claude Code / a Claude subscription), kiwiskil works with no API key at all — just run kiwiskil run. When no key is found and the provider is Anthropic, kiwiskil shells out to your authenticated claude CLI for the enrichment calls. Credential priority:

An explicit API key (api_key_env / a known env var) → used directly.
Otherwise, Anthropic provider + claude on your PATH → your logged-in session (zero config, no key).
Neither → kiwiskil still emits a structural wiki (symbols, call graph, repo map, blast radius); only the LLM-written descriptions and the system-overview prose are skipped.

High-volume symbol descriptions use a fast model (Haiku); the few deep-mode prose calls use your configured model.

Commit message synthesis

When running as a pre-commit hook, kiwiskil synthesises a commit message from the code changes and prints it to stdout. You can use it, edit it, or ignore it — your choice.

Design principles

Structural facts only — wiki pages contain symbols, relationships, and entry points. No prose summaries, no architectural opinions. The client LLM draws its own conclusions.
Checked in, not served — the wiki is plain markdown in your repo. It travels with your code, is tracked by git, and is readable by humans and agents alike.
Incremental by default — git diff + content hash manifest means only changed files are re-processed on each commit.
Provider-agnostic — LiteLLM means you can use any model, local or cloud, without changing the tool.

Supported languages

Language	Status	Parser
Python	Supported	stdlib `ast`
JavaScript (`.js`, `.jsx`, `.mjs`, `.cjs`)	Supported	tree-sitter
TypeScript (`.ts`, `.tsx`)	Supported	tree-sitter
Go (`.go`)	Supported	tree-sitter
Java (`.java`)	Supported	tree-sitter
Ruby (`.rb`)	Supported	tree-sitter
Rust (`.rs`)	Supported	tree-sitter

Coverage varies by language (e.g. Go methods are emitted by their own name rather than attributed to a receiver type). A file whose grammar isn't installed degrades gracefully — it's skipped with a warning, never a crash.

License

MIT

Project details

Release history Release notifications | RSS feed

0.4.3

Jul 1, 2026

0.4.2

Jun 29, 2026

0.4.1

Jun 22, 2026

This version

0.4.0

Jun 21, 2026

0.3.3

Apr 14, 2026

0.3.2

Apr 14, 2026

0.3.1

Apr 10, 2026

0.3.0

Apr 9, 2026

0.2.0

Apr 9, 2026

0.1.0

Apr 9, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kiwiskil-0.4.0.tar.gz (67.1 kB view details)

Uploaded Jun 21, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

kiwiskil-0.4.0-py3-none-any.whl (52.2 kB view details)

Uploaded Jun 21, 2026 Python 3

File details

Details for the file kiwiskil-0.4.0.tar.gz.

File metadata

Download URL: kiwiskil-0.4.0.tar.gz
Upload date: Jun 21, 2026
Size: 67.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for kiwiskil-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`6eae871e0056783f55026ce6b610dc57a7e31bd694120af2a160960d7b0a26c0`
MD5	`c49f14e65325cc800233de24109afd66`
BLAKE2b-256	`f92cb3d128f5431266d7e68e25692a3f8d09ad226a76c3862c626307158d4314`

See more details on using hashes here.

File details

Details for the file kiwiskil-0.4.0-py3-none-any.whl.

File metadata

Download URL: kiwiskil-0.4.0-py3-none-any.whl
Upload date: Jun 21, 2026
Size: 52.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for kiwiskil-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1553c1d1843910b9d071f5efaa8a17635064e4ac29ff51e8a0fb2ce9ce335c4d`
MD5	`e221b65614eb97a172f51dd83f60b05b`
BLAKE2b-256	`16ea30db36e28484804c32222b7359ba1c5f61ad28a5b95a3cb4253c44b78c8a`

See more details on using hashes here.

kiwiskil 0.4.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

kiwiskil

How it works

Install

Quick start

CLAUDE.md / AGENTS.md snippet

CLI

Deep mode

Smart mode

Output

wiki/INDEX.md

wiki/<group>.md

.indexer/skills/codebase.md

Loading the skill

Claude Code

Cursor

Windsurf

GitHub Copilot (VS Code)

Zed

Any other agent / MCP client

Keeping the skill in sync

Configuration

No API key? Use your logged-in Claude

Commit message synthesis

Design principles

Supported languages

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`wiki/INDEX.md`

`wiki/<group>.md`

`.indexer/skills/codebase.md`