Skip to main content

CodeScry local codebase index with MCP retrieval tools

Project description

CodeScry logo

CodeScry

CodeScry is a local codebase retrieval tool for coding agents. It indexes committed code from local git repos into a local SQLite database, then exposes ranked snippets through a CLI and MCP stdio server.

Install

Fast path:

curl -LsSf https://raw.githubusercontent.com/Zhachory1/codescry/main/scripts/install.sh | sh

The installer uses uv tool install codescry when uv is available, otherwise pipx install codescry. If neither uv nor pipx is installed, it bootstraps pipx with python3 -m pip --user.

If you prefer explicit installs:

pipx install codescry
# or, if uv is already installed
uv tool install codescry

Node users can run the npm wrapper after installing uv:

npx codescry doctor

The npm package is a thin wrapper around the Python package. It does not bundle local SQLite index data.

For development:

python -m venv .venv
source .venv/bin/activate
pip install -e '.[dev]'

Check local readiness:

codescry doctor

First success path

For a deterministic five-minute smoke test, see docs/getting-started.md.

Index this repo or another local git repo:

codescry index /path/to/git/repo

Query it:

codescry query "where is request retry handled" -k 5

Lookup a symbol:

codescry get-symbol RepoIndex --repo /path/to/git/repo

Discover and index every git repo under a root:

codescry index-root ~/code

Show indexed repos and freshness:

codescry status

MCP setup

Run the MCP server over stdio:

codescry serve

Agent config example:

{
  "mcpServers": {
    "codescry": {
      "type": "stdio",
      "command": "/Users/YOU/.local/bin/codescry",
      "args": ["--db", "/Users/YOU/.codescry/index.sqlite", "serve"],
      "env": {}
    }
  }
}

npm/npx config example:

{
  "mcpServers": {
    "codescry": {
      "type": "stdio",
      "command": "npx",
      "args": ["-y", "codescry", "--db", "/Users/YOU/.codescry/index.sqlite", "serve"],
      "env": {}
    }
  }
}

Use which codescry to find the absolute command path for your machine when using direct CLI installs.

Freshness hooks

Install hooks for one repo or a repo root:

codescry install-hooks /path/to/git/repo
codescry install-hooks ~/code --recursive

Hooks run best-effort after commit/merge:

codescry --db <db> reindex "$PWD"

They preserve the selected DB path and must not fail git commands.

Docs

  • docs/getting-started.md — install to first useful query.
  • docs/mcp-clients.md — MCP config examples.
  • docs/troubleshooting.md — common setup/query/freshness issues.
  • docs/cli-reference.md — command reference.
  • docs/output-schema.md — JSON fields.
  • docs/evals.md — eval authoring and gate.
  • docs/pilot.md — 5-engineer pilot measurement plan and local reporting commands.
  • docs/language-support.md — parser/regex/window support matrix.
  • docs/recipes.md — common operations.
  • docs/upgrade-uninstall.md — lifecycle commands.
  • docs/release.md — PyPI-first and npm-wrapper release flow.

Evals

The seed golden set lives in evals/golden.codescry.jsonl.

Run the eval gate:

codescry eval evals/golden.codescry.jsonl . -k 10 --fail-under 0.85

Pilot proof

Pilot task/activation/miss events are recorded in ~/.codescry/usage.jsonl without snippets. Passive query logging is opt-in with CODESCRY_ENABLE_USAGE_LOG=1. Use:

codescry pilot report

See docs/pilot.md for activation, timing, miss capture, and decision gates.

Retrieval behavior

  • Python functions/classes/methods get parser-backed symbol metadata.
  • TS/JS/Go/Java/Rust/C/C++/SQL get Tree-sitter parser-backed symbol metadata.
  • Other common declaration patterns get regex-backed symbol metadata.
  • get_symbol uses stored symbol metadata before search fallback.
  • Search blends vector, lexical, symbol, and path scores.
  • Results include stale/dirty flags.

Data boundary and safety

  • Default embeddings are local deterministic hash vectors.
  • Default configuration does not send source code to external APIs.
  • Index data is local SQLite derived data and can be deleted/rebuilt.
  • Files matching high-confidence secret patterns are skipped and prior chunks for those paths are removed.
  • Secret skipping is a best-effort local guardrail, not a guarantee. See SECURITY.md.

Current limits

  • Python uses stdlib AST parser chunks; TS/JS/Go/Java/Rust/C/C++/SQL use Tree-sitter parser chunks; other languages use regex-backed symbol hints plus line windows.
  • Local deterministic hash embeddings, not quality-tuned semantic embeddings.
  • SQLite storage scans/scoring in Python, no ANN/vector extension yet.
  • Freshness is committed-code freshness; dirty working-tree edits are reported but not indexed.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

codescry-0.2.6.tar.gz (57.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

codescry-0.2.6-py3-none-any.whl (40.4 kB view details)

Uploaded Python 3

File details

Details for the file codescry-0.2.6.tar.gz.

File metadata

  • Download URL: codescry-0.2.6.tar.gz
  • Upload date:
  • Size: 57.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.17 {"installer":{"name":"uv","version":"0.9.17","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for codescry-0.2.6.tar.gz
Algorithm Hash digest
SHA256 6c97c1e32337711d778bb6fbeb3b2e904ef0c8886ba7ffb8c4d786af9cccd7b1
MD5 2df03dc688fd28ae1abeb159ce326c7b
BLAKE2b-256 a8177604d5fcb988d165a61c87ce0157af93d38630fb8bf5518d26525d3b335f

See more details on using hashes here.

File details

Details for the file codescry-0.2.6-py3-none-any.whl.

File metadata

  • Download URL: codescry-0.2.6-py3-none-any.whl
  • Upload date:
  • Size: 40.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.17 {"installer":{"name":"uv","version":"0.9.17","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for codescry-0.2.6-py3-none-any.whl
Algorithm Hash digest
SHA256 bec1826b4c46400e7451cf9b05843d568ef5fe836eb6fd905904ee2cf4e008ee
MD5 5f8bee706e7e39e74fdc2f614f3965ce
BLAKE2b-256 1462d98f66e4c985527f2bcbe892ec6f2a7516580eb758ce76869bf3868b3b1a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page