
Claude Code skill - turn any folder of code, docs, papers, images, or tweets into a queryable knowledge graph


graphify


A Claude Code skill. Type /graphify in Claude Code - it reads your files, builds a knowledge graph, and gives you back structure you didn't know was there. Understand a codebase faster. Find the "why" behind architectural decisions.

Fully multimodal. Drop in code, PDFs, markdown, screenshots, diagrams, whiteboard photos, even images in other languages - graphify uses Claude vision to extract concepts and relationships from all of it and connects them into one graph.

Andrej Karpathy keeps a /raw folder where he drops papers, tweets, screenshots, and notes. graphify turns exactly that kind of folder into a queryable graph: 71.5x fewer tokens per query than reading the raw files, persistent across sessions, and honest about what it found versus what it guessed.

/graphify .                        # works on any folder - your codebase, notes, papers, anything
graphify-out/
├── graph.html       interactive graph - click nodes, search, filter by community
├── GRAPH_REPORT.md  god nodes, surprising connections, suggested questions
├── graph.json       persistent graph - query weeks later without re-reading
└── cache/           SHA256 cache - re-runs only process changed files
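
The cache/ directory is what makes re-runs incremental: each file's SHA256 is recorded, and only files whose hash changed get re-processed. A minimal sketch of the idea (function names and cache shape here are illustrative, not graphify's internals):

```python
import hashlib
from pathlib import Path

def file_digest(path: Path) -> str:
    """SHA256 of the file's bytes - the cache key for change detection."""
    return hashlib.sha256(path.read_bytes()).hexdigest()

def changed_files(root: Path, cache: dict[str, str]) -> list[Path]:
    """Files whose content hash differs from the cached one; updates the cache."""
    changed = []
    for path in sorted(root.rglob("*")):
        if not path.is_file():
            continue
        digest = file_digest(path)
        if cache.get(str(path)) != digest:
            changed.append(path)
            cache[str(path)] = digest
    return changed
```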

Install

Requires: Claude Code and Python 3.10+

pip install graphifyy && graphify install

The PyPI package is temporarily named graphifyy while the graphify name is being reclaimed. The CLI and skill command are still graphify.

Then open Claude Code in any directory and type:

/graphify .
Manual install (curl)
mkdir -p ~/.claude/skills/graphify
curl -fsSL https://raw.githubusercontent.com/safishamsi/graphify/v1/skills/graphify/skill.md \
  > ~/.claude/skills/graphify/SKILL.md

Add to ~/.claude/CLAUDE.md:

- **graphify** (`~/.claude/skills/graphify/SKILL.md`) - any input to knowledge graph. Trigger: `/graphify`
When the user types `/graphify`, invoke the Skill tool with `skill: "graphify"` before doing anything else.

Usage

/graphify                          # run on current directory
/graphify ./raw                    # run on a specific folder
/graphify ./raw --mode deep        # more aggressive INFERRED edge extraction
/graphify ./raw --update           # re-extract only changed files, merge into existing graph
/graphify ./raw --obsidian         # also generate Obsidian vault (opt-in)

/graphify add https://arxiv.org/abs/1706.03762        # fetch a paper, save, update graph
/graphify add https://x.com/karpathy/status/...       # fetch a tweet

/graphify query "what connects attention to the optimizer?"
/graphify path "DigestAuth" "Response"
/graphify explain "SwinTransformer"
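
Because graph.json persists, you can also script against it directly instead of going through the skill. A stdlib-only sketch of what a path query does conceptually - breadth-first search over the edge list (graph.json's exact schema is graphify's own; the `(source, target)` pair shape below is an assumption):

```python
from collections import deque

def shortest_path(edges, start, goal):
    """BFS over an undirected edge list; returns the node path, or None."""
    adj: dict[str, list[str]] = {}
    for a, b in edges:
        adj.setdefault(a, []).append(b)
        adj.setdefault(b, []).append(a)
    prev = {start: None}          # node -> predecessor on the BFS tree
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            path = []             # walk predecessors back to the start
            while node is not None:
                path.append(node)
                node = prev[node]
            return path[::-1]
        for nxt in adj.get(node, []):
            if nxt not in prev:
                prev[nxt] = node
                queue.append(nxt)
    return None

# e.g. edges = [(e["source"], e["target"]) for e in json.load(f)["edges"]]
```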

/graphify ./raw --watch            # auto-sync graph as files change (code: instant, docs: notifies you)
/graphify ./raw --wiki             # build agent-crawlable wiki (index.md + article per community)
/graphify ./raw --svg              # export graph.svg
/graphify ./raw --graphml          # export graph.graphml (Gephi, yEd)
/graphify ./raw --neo4j            # generate cypher.txt for Neo4j
/graphify ./raw --mcp              # start MCP stdio server

graphify hook install              # git hooks - rebuilds graph on commit and branch switch
graphify claude install            # write graphify rules to local CLAUDE.md + install PreToolUse hook

Works with any mix of file types:

| Type | Extensions | Extraction |
| --- | --- | --- |
| Code | `.py` `.ts` `.js` `.go` `.rs` `.java` `.c` `.cpp` `.rb` `.cs` `.kt` `.scala` `.php` | AST via tree-sitter + call-graph pass + docstring/comment rationale |
| Docs | `.md` `.txt` `.rst` | Concepts + relationships + design rationale via Claude |
| Papers | `.pdf` | Citation mining + concept extraction |
| Images | `.png` `.jpg` `.webp` `.gif` | Claude vision - screenshots, diagrams, any language |

What you get

God nodes - highest-degree concepts (what everything connects through)

Surprising connections - ranked by composite score. Code-paper edges rank higher than code-code. Each result includes a plain-English why.

Suggested questions - 4-5 questions the graph is uniquely positioned to answer

The "why" - docstrings, inline comments (# NOTE:, # IMPORTANT:, # HACK:, # WHY:), and design rationale from docs are extracted as rationale_for nodes. Not just what the code does - why it was written that way.
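
The comment-tag convention is simple enough to reproduce yourself. A sketch of pulling tagged rationale comments out of source text (the regex and function name are illustrative, not graphify's actual extractor):

```python
import re

# Matches the documented rationale tags in inline comments.
RATIONALE = re.compile(r"#\s*(NOTE|IMPORTANT|HACK|WHY):\s*(.+)")

def extract_rationale(source: str) -> list[tuple[str, str]]:
    """Collect tagged inline comments as (tag, text) pairs."""
    return [(m.group(1), m.group(2).strip())
            for line in source.splitlines()
            if (m := RATIONALE.search(line))]
```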

Confidence scores - every INFERRED edge has a confidence_score (0.0-1.0). You know not just what was guessed but how confident the model was. EXTRACTED edges are always 1.0.
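
With provenance and confidence on every edge, auditing the guesses is a short script over graph.json. A sketch, assuming field names `edges`, `provenance`, and `confidence_score` (check your own graph.json for the real keys):

```python
import json
from pathlib import Path

def risky_inferred_edges(graph_path, threshold=0.7):
    """INFERRED edges below a confidence threshold - the guesses worth auditing."""
    graph = json.loads(Path(graph_path).read_text())
    return [e for e in graph["edges"]
            if e.get("provenance") == "INFERRED"
            and e.get("confidence_score", 0.0) < threshold]
```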

Semantic similarity edges - cross-file conceptual links that have no structural connection. Two functions solving the same problem without calling each other, a class in code and a concept in a paper describing the same algorithm.

Hyperedges - group relationships connecting 3+ nodes that pairwise edges can't express. All classes implementing a shared protocol, all functions in an auth flow, all concepts from a paper section forming one idea.
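
A hyperedge is just a relationship with a member set instead of a pair. An illustrative shape (not graphify's actual schema):

```python
# Hypothetical hyperedge records - a label shared by 3+ nodes at once.
hyperedges = [
    {"type": "implements_protocol",
     "members": ["JSONCodec", "MsgpackCodec", "YAMLCodec"],
     "label": "all implement the Codec protocol"},
]

def incident(hyperedges, node):
    """Hyperedges whose member set includes the given node."""
    return [h for h in hyperedges if node in h["members"]]
```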

Token benchmark - printed automatically after every run. On a mixed corpus (Karpathy repos + papers + images): 71.5x fewer tokens per query vs reading raw files.

Auto-sync (--watch) - run in a background terminal and the graph updates itself as your codebase changes. Code file saves trigger an instant rebuild (AST only, no LLM). Doc/image changes notify you to run --update for the LLM re-pass.

Git hooks (graphify hook install) - installs post-commit and post-checkout hooks. Graph rebuilds automatically after every commit and every branch switch. No background process needed.

Always-on for Claude (graphify claude install) - writes a CLAUDE.md section so Claude checks the graph before answering architecture questions, plus a .claude/settings.json PreToolUse hook that fires before every Glob/Grep call, reminding Claude to consult the graph before searching raw files.

Wiki (--wiki) - Wikipedia-style markdown articles per community and god node, with an index.md entry point. Point any agent at index.md and it can navigate the knowledge base by reading files instead of parsing JSON.

Every edge is tagged EXTRACTED, INFERRED, or AMBIGUOUS - you always know what was found vs guessed.

Worked examples

| Corpus | Files | Reduction | Output |
| --- | --- | --- | --- |
| Karpathy repos + 5 papers + 4 images | 52 | 71.5x | worked/karpathy-repos/ |
| graphify source + Transformer paper | 4 | 5.4x | worked/mixed-corpus/ |
| httpx (synthetic Python library) | 6 | ~1x | worked/httpx/ |

Token reduction scales with corpus size. Six files fit in a context window anyway, so the graph's value there is structural clarity, not compression. At 52 files (code + papers + images) you get 71x+. Each worked/ folder contains the raw input files and the actual output (GRAPH_REPORT.md, graph.json), so you can run it yourself and verify the numbers.

Tech stack

NetworkX + Leiden (graspologic) + tree-sitter + Claude + vis.js. No Neo4j required, no server, runs entirely locally.

Contributing

Worked examples are the most trust-building contribution. Run /graphify on a real corpus, save output to worked/{slug}/, write an honest review.md evaluating what the graph got right and wrong, submit a PR.

Extraction bugs - open an issue with the input file, the cache entry (graphify-out/cache/), and what was missed or invented.

See ARCHITECTURE.md for module responsibilities and how to add a language.
