Skip to main content

Codebase knowledge graph: polyglot indexer, semantic search, call-graph analysis

Project description

Descry

Polyglot codebase knowledge graph with call-graph analysis, semantic search, and SCIP integration. Built for AI coding agents (MCP), with CLI and Web UI interfaces.

Descry indexes your codebase into a knowledge graph of symbols (functions, classes, constants) and their relationships (calls, imports, defines). It supports Rust, Python, TypeScript, JavaScript, Svelte, Go, Java, and more — with type-aware resolution via SCIP for Rust and TypeScript.

⚠️ Disclaimer — please read

Descry was built with heavy assistance from AI coding agents. While the codebase has been through an internal security and correctness review, AI-authored code can still contain subtle defects, missed edge cases, or security assumptions that weren't obvious to the reviewer. Treat this tool as experimental pre-1.0 software: do not rely on it for safety-critical analysis, and do not point it at code or repositories you don't trust.

This tool is designed to run locally on your own machine. The web UI (descry-web) binds to 127.0.0.1 and is unauthenticated by design — the threat model assumes a single trusted user on the host.

  • Do not expose descry-web to a network. No reverse proxy, no public tunnel, no --host 0.0.0.0. If you need remote access, put it behind your own authenticated reverse proxy and understand that any browser tab on the host machine can still reach it via localhost.
  • Do not run Descry on untrusted repositories. Indexing a repository executes configuration from its .descry.toml, walks its file tree, and feeds its docstrings/source to your LLM (if you use the MCP server). The same trust boundary that applies to cargo build, npm install, and opening the repo in your IDE applies here.
  • Report security issues privately — see SECURITY.md.

Quick Start

# Install with all optional features
pip install descry-codegraph[all]

# Index your project
cd your-project
descry index

# Search for symbols
descry search authenticate

# Find callers of a function
descry callers validate_token

# One-step lookup (search + full context)
descry quick handle_request

Interfaces

Interface Command Use Case
CLI descry <command> Interactive terminal use
MCP Server descry-mcp AI coding agents (Claude Code, etc.)
Web UI descry-web Visual exploration at http://127.0.0.1:8787
Pi Extension See pi-extension/ Pi coding agent integration

MCP Setup

Claude Code

Add to your Claude Code MCP settings (.claude/settings.json or global):

{
  "mcpServers": {
    "descry": {
      "command": "descry-mcp",
      "args": []
    }
  }
}

Or with a specific Python path:

{
  "mcpServers": {
    "descry": {
      "command": "/path/to/venv/bin/descry-mcp",
      "args": []
    }
  }
}

Other MCP Hosts

Descry uses the standard MCP stdio transport. Any MCP-compatible host can spawn descry-mcp as a subprocess.

Tools

Descry provides 18 tools, available through all interfaces:

Tool Description
health Diagnostic check — version, graph status, feature availability
status Graph existence and freshness
ensure Ensure graph exists and is fresh (regenerates if stale)
index Regenerate graph, SCIP indices, and embeddings
search Search symbol names and docstrings (keyword + semantic)
semantic Pure semantic search using embeddings only
quick Find symbol and show full context in one step
callers Find all callers of a symbol
callees Find what a symbol calls
context Full context for a symbol — source, callers, callees, tests
flow Trace call flow from a starting symbol (forward/backward)
path Find shortest call path between two symbols
structure Show file structure — imports, classes, functions
flatten Show effective API of a class including inherited methods
impls Find all implementations of a trait/interface method
cross-lang Trace frontend API calls to backend handlers via OpenAPI
churn Find code churn hotspots (symbols, files, or co-change pairs)
evolution Track how a symbol has changed over time
changes Analyze change impact for a commit range

Configuration

Descry works zero-config by auto-detecting your project root (looks for .git, Cargo.toml, package.json, pyproject.toml). For customization, add a .descry.toml to your project root:

[project]
excluded_dirs = ["target", "node_modules", "dist", ".git", "__pycache__", "build", "vendor"]
max_stale_hours = 48

[features]
enable_scip = true        # Type-aware resolution (requires rust-analyzer or scip-typescript)
enable_embeddings = true   # Semantic search (requires sentence-transformers)

[embeddings]
model = "jinaai/jina-code-embeddings-0.5b"

[test_detection]
path_patterns = ["/tests/", "/test/", "/__tests__/"]
file_suffixes = ["_test.rs", ".test.ts", ".spec.ts", "_test.py"]

[code_files]
extensions = [".rs", ".py", ".ts", ".tsx", ".js", ".jsx", ".svelte", ".go", ".java"]

[git]
churn_exclusions = [".descry_cache/", "Cargo.lock", "package-lock.json"]
timeout = 30

[timeouts]
scip_minutes = 0       # 0 = unlimited
embedding_seconds = 60
query_ms = 4000

[query]
max_depth = 3
max_nodes = 100
max_children_per_level = 10
max_callers_shown = 15

[scip]
extra_args = ["--exclude-vendored-libraries"]
skip_crates = []         # Crate names to skip during SCIP indexing

[scip.rust]
toolchain = "1.92.0"     # Pin rust-analyzer version via rustup

[syntax.lang_map]
".svelte" = "svelte"
".proto" = "protobuf"

Environment Variables

Variable Default Description
DESCRY_LOG_LEVEL WARNING Log level (DEBUG, INFO, WARNING, ERROR)
DESCRY_CACHE_DIR .descry_cache/ Override cache directory location
DESCRY_NO_SCIP false Disable SCIP indexing
DESCRY_NO_EMBEDDINGS false Disable semantic search

Configuration precedence: defaults < .descry.toml < environment variables.

Language Support

Language Parsing SCIP (Type-Aware) Requirements
Rust Regex + AST Yes rust-analyzer via rustup
TypeScript Regex (+ Tree-sitter opt-in) Yes scip-typescript via npm; descry-codegraph[ast] for tree-sitter
Python Regex + AST Yes scip-python via npm
JavaScript Regex (+ Tree-sitter opt-in) descry-codegraph[ast] for tree-sitter
Svelte Regex
Go Regex
Java Regex

The tree-sitter TS/TSX/JS parser is currently extractor-only (symbol discovery) and ships behind the ast extra + [features] use_tree_sitter_ts = true in .descry.toml. It runs alongside the regex parser and is a stepping stone toward full AST-driven extraction in a future release.

SCIP provides precise call-graph resolution (resolving which specific function is called through traits, generics, etc.). Without SCIP, Descry falls back to regex-based name matching which handles most cases but may produce false positives on overloaded names.

Installation

Minimal (graph + CLI only)

pip install descry-codegraph

With MCP server

pip install descry-codegraph[mcp]

With Web UI

pip install descry-codegraph[web]

With semantic search

pip install descry-codegraph[embeddings]

Everything

pip install descry-codegraph[all]

Development

git clone https://github.com/06chaynes/descry.git
cd descry
just install    # Creates venv and installs with dev deps
just test       # Run tests
just lint       # Ruff linting
just fmt        # Ruff formatting

Requires uv and just.

How It Works

  1. Index — Descry walks your codebase, parses source files into an AST-like representation, and builds a graph of symbols and their relationships. If SCIP is available, it overlays type-aware call resolution.

  2. Cache — The graph is cached as JSON in .descry_cache/codebase_graph.json. Embeddings are cached separately. SCIP indices are cached per-crate/package.

  3. Query — All tools query the cached graph. Keyword search uses TF-IDF scoring. Semantic search uses sentence-transformer embeddings. Call-graph traversal follows edges in the graph.

  4. Freshnessensure checks graph age against max_stale_hours and regenerates if needed. The MCP server pre-warms the graph on startup.

Design Notes

Web UI is local-only (CORS + auth)

descry-web is designed as a single-user local development tool. It binds to 127.0.0.1 by default, allows cross-origin requests (allow_origins=["*"]), and does not require authentication. This is deliberate:

  • The UI is served by the same process that reads your repository; any authentication layer would be a shared-secret between your browser and your own terminal.
  • Path traversal and file-serving endpoints are hardened independently of CORS: /api/source enforces project-root containment, rejects non-regular files, caps size at 10 MiB, and refuses non-text content (with O_NOFOLLOW on the final open to defeat symlink swaps).
  • The reindex endpoints accept no path parameter; they always index the configured project root.

Do not expose descry-web to an untrusted network. If you need remote access, put it behind your own authenticated reverse proxy.

Versioning

Descry is pre-1.0. Minor version bumps (0.1.x0.2.x) may include breaking changes to the library API, graph schema, CLI, or MCP tool signatures. Patch releases (0.1.00.1.1) will not introduce breaking changes. Once we reach 1.0.0, the project will follow semver strictly.

The public library API in v0.1 is limited to descry.__version__. Submodules (descry.handlers, descry.query, etc.) are not considered stable API yet.

Further Reading

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

descry_codegraph-0.1.0.tar.gz (207.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

descry_codegraph-0.1.0-py3-none-any.whl (180.1 kB view details)

Uploaded Python 3

File details

Details for the file descry_codegraph-0.1.0.tar.gz.

File metadata

  • Download URL: descry_codegraph-0.1.0.tar.gz
  • Upload date:
  • Size: 207.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for descry_codegraph-0.1.0.tar.gz
Algorithm Hash digest
SHA256 cc0f748da4a8a3d26d4cdfbfabcb32b40833b2d411de0d2e70ef98e016032313
MD5 a854fa2f1d886caa86e51a64d129c494
BLAKE2b-256 5d55fcf134ecbd2d60cffe4551577a47b261aadce90176dd3dfd1e1599f0041f

See more details on using hashes here.

Provenance

The following attestation bundles were made for descry_codegraph-0.1.0.tar.gz:

Publisher: release.yml on 06chaynes/descry

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file descry_codegraph-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for descry_codegraph-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a65d5583458ca5b0eb8c9c67b90e86a039a57bb5e97a999902186f951deb358e
MD5 5b2314368f810c538202f6237d7496bc
BLAKE2b-256 8f34fcaf6477212a898f6962f0e1cb954e1da64d7228112d01d6a2fe03c09cdc

See more details on using hashes here.

Provenance

The following attestation bundles were made for descry_codegraph-0.1.0-py3-none-any.whl:

Publisher: release.yml on 06chaynes/descry

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page