Deterministic, token-efficient codebase packs and skeleton maps for AI review (Python)

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

bradsegal

These details have not been verified by PyPI

Project description

anatomize

Generate deterministic, token-efficient maps and review bundles for Python repositories.

anatomize has two complementary workflows:

Skeletons: structure-only “code maps” for navigation and architecture understanding.
Packs: single-file bundles (repomix-style) for external review, with filtering and slicing.

If you want the full guide (modes, slicing, config, determinism guarantees), see docs/GUIDE.md.

Installation

pip install anatomize

Quick Start (CLI)

Generate skeletons

# Generate skeleton output to .skeleton/ (default format: yaml)
anatomize generate ./src

# Choose resolution level
anatomize generate ./src --level hierarchy
anatomize generate ./src --level modules
anatomize generate ./src --level signatures

# Write multiple formats
anatomize generate ./src --format yaml --format json --format markdown --output .skeleton

Estimate tokens

anatomize estimate ./src --level modules

Validate (and fix) skeleton output

# Validate existing output directory against sources
anatomize validate .skeleton --source ./src

# Rewrite the skeleton output to match regenerated content (strict, atomic-ish replacement)
anatomize validate .skeleton --source ./src --fix

Pack a repository into an AI-friendly bundle

# If --format is omitted, it is inferred from --output when the extension is known
anatomize pack . --output codebase.jsonl
anatomize pack . --output codebase.md

# Full bundle
anatomize pack . --format markdown --output codebase.md

# Filter by globs
anatomize pack . --include "src/**" --ignore "**/__pycache__/**" --output src-only.md

# Forward dependency closure (entrypoint + everything it imports)
anatomize pack . --entry src/anatomize/cli.py --deps --output slice.md

# Reverse dependency closure (module + everything that imports it)
anatomize pack . --target src/anatomize/cli.py --reverse-deps --output importers.md

# Reverse + forward (importers plus what they import)
anatomize pack . --target src/anatomize/cli.py --reverse-deps --deps --output importers-and-deps.md

# Token-efficient Python compression (signatures/imports/constants)
anatomize pack . --compress --output compressed.md

# Make markdown robust to embedded ``` fences (default)
anatomize pack . --content-encoding fence-safe --output safe.md

# Maximum robustness (content is base64-encoded UTF-8)
anatomize pack . --content-encoding base64 --output safe.base64.md

# Split output into multiple files (markdown/plain only)
anatomize pack . --split-output 500kb --output codebase.md

# Hard cap output (bytes or tokens)
anatomize pack . --max-output 20_000t --output codebase.md

# Print a per-file content token tree to stdout
anatomize pack . --token-count-tree --output codebase.md

# JSONL (stream-friendly)
anatomize pack . --format jsonl --output codebase.jsonl

# Hybrid mode (skeleton-style summaries + selective fill)
# - defaults to JSONL when --mode hybrid is set
# - Python files default to summary; non-Python defaults to metadata-only
anatomize pack . --mode hybrid --output hybrid.jsonl

# Hybrid: include full content for a slice and fit within a hard token budget
anatomize pack . --mode hybrid --max-output 50_000t --fit-to-max-output \
  --content "src/pkg/**" --output hybrid.slice.jsonl

Reference-based usage slicing (requires Pyright language server):

anatomize pack . --target src/anatomize/cli.py --uses --slice-backend pyright --output uses.md

Python API

Generate skeletons in code

from anatomize import SkeletonGenerator
from anatomize.formats import OutputFormat, write_skeleton

gen = SkeletonGenerator(sources=["./src"])
skeleton = gen.generate(level="modules")

print("Modules:", skeleton.metadata.total_modules)
print("Classes:", skeleton.metadata.total_classes)
print("Functions:", skeleton.metadata.total_functions)
print("Estimated tokens:", skeleton.metadata.token_estimate)

write_skeleton(skeleton, ".skeleton", formats=[OutputFormat.YAML, OutputFormat.JSON])

Key exported objects

anatomize.SkeletonGenerator: orchestrates discovery + extraction.
anatomize.formats.write_skeleton: writes YAML/JSON/Markdown plus schemas and manifest.json.
anatomize.validation.validate_skeleton_dir: strict validator with optional fix.

Configuration (`.anatomize.yaml`)

The CLI can auto-discover .anatomize.yaml. Generation commands use config from the current working directory (or explicit --config). pack discovers config relative to the chosen ROOT when --config is not provided.

Minimal config:

sources:
  - src
output: .skeleton
level: modules
formats: [yaml, json, markdown]
exclude:
  - __pycache__/
  - "*.pyc"
symlinks: forbid # forbid|files|dirs|all
workers: 0 # 0 = auto

pack:
  format: markdown # markdown|plain|json|xml|jsonl
  mode: bundle # bundle|hybrid
  output: anatomize-pack.md # if the extension is known, it must match `format`
  include: []
  ignore: []
  ignore_files: []
  respect_standard_ignores: true
  symlinks: forbid # forbid|files|dirs|all
  max_file_bytes: 1000000
  workers: 0 # 0 = auto
  token_encoding: cl100k_base
  compress: false
  content_encoding: fence-safe # verbatim|fence-safe|base64 (markdown disallows verbatim)
  line_numbers: false
  no_structure: false
  no_files: false
  max_output: null # e.g. "500kb" or "20_000t"
  split_output: null # e.g. "500kb" or "20_000t"
  fit_to_max_output: false
  # Hybrid representation rules (repeatable patterns). Precedence: meta < summary < content.
  meta: []
  summary: []
  content: []
  summary_config:
    max_depth: 3
    max_keys: 200
    max_items: 200
    max_headings: 200
  python_roots: [] # defaults to ["src"] if present, else ["."]
  slice_backend: imports # imports|pyright
  uses_include_private: false
  pyright_langserver_cmd: "pyright-langserver --stdio"

Exclude patterns use gitignore-like semantics and are applied relative to each configured root.

Output artifacts

Skeleton output directory

write_skeleton(...) and anatomize generate ... --output DIR create:

hierarchy.yaml|json|md / modules.* / signatures.* depending on selected formats and level
schemas/*.json embedded with the package
manifest.json (SHA-256 per output file and metadata for validation)

Pack output file(s)

anatomize pack writes one or more files depending on splitting:

anatomize-pack.md (or .txt|.json|.xml)
if split: anatomize-pack.1.md, anatomize-pack.2.md, …

Each pack artifact starts with a lightweight, deterministic overview (and, if enabled, a structure tree) before file blocks/records.

Token reporting:

Artifact tokens: exact tokens of the written output file(s).
Content tokens: tokens of file contents only (useful for budgeting and “what’s expensive”).

Determinism and strictness

Deterministic ordering (paths and symbols sorted).
No timestamps in outputs.
Parse failures are hard failures (no partial output).
Validation is strict; --fix replaces output with regenerated content.

Development

python -m venv .venv
. .venv/bin/activate
python -m pip install -U pip
python -m pip install -e ".[dev]"

python -m ruff check .
python -m mypy -p anatomize
python -m pytest

Optional local benchmark:

.venv/bin/python scripts/bench_pack.py . --compress --workers 0

Tests

Tests are indexed via pytest markers in pyproject.toml and documented in tests/README.md:

unit: fast, isolated tests
integration: filesystem-level tests
e2e: CLI-level tests

License

MIT

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

bradsegal

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.2.1

Feb 1, 2026

0.2.0

Feb 1, 2026

This version

0.1.0

Jan 31, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

anatomize-0.1.0.tar.gz (82.9 kB view details)

Uploaded Jan 31, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

anatomize-0.1.0-py3-none-any.whl (78.8 kB view details)

Uploaded Jan 31, 2026 Python 3

File details

Details for the file anatomize-0.1.0.tar.gz.

File metadata

Download URL: anatomize-0.1.0.tar.gz
Upload date: Jan 31, 2026
Size: 82.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for anatomize-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`2807c814bbcfd4f1e8e97a8d44fa248251daaea37c365fb63095272a8a43b373`
MD5	`420b6d667a1bc36e25c0b5963d87d0c1`
BLAKE2b-256	`401805a2eaff8d1d8013570dcd7d95f582f6150206d7b0c4b5815f170d718fac`

See more details on using hashes here.

Provenance

The following attestation bundles were made for anatomize-0.1.0.tar.gz:

Publisher: publish-pypi.yml on BradSegal/anatomize

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: anatomize-0.1.0.tar.gz
- Subject digest: 2807c814bbcfd4f1e8e97a8d44fa248251daaea37c365fb63095272a8a43b373
- Sigstore transparency entry: 886806153
- Sigstore integration time: Jan 31, 2026
Source repository:
- Permalink: BradSegal/anatomize@fc0c0f24a461d70cb7f2bda07b2cb74ecd74364c
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/BradSegal
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@fc0c0f24a461d70cb7f2bda07b2cb74ecd74364c
- Trigger Event: push

File details

Details for the file anatomize-0.1.0-py3-none-any.whl.

File metadata

Download URL: anatomize-0.1.0-py3-none-any.whl
Upload date: Jan 31, 2026
Size: 78.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for anatomize-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d589655546321c864dfb12b38258597f968972402391fd7d9d6f3142f4ccec57`
MD5	`40bcea30d00de5203b48891e668493cc`
BLAKE2b-256	`4050f2733e6ea92e8a3d2c57c7eb1806590d619bf33459053e02b8cac8381a32`

See more details on using hashes here.

Provenance

The following attestation bundles were made for anatomize-0.1.0-py3-none-any.whl:

Publisher: publish-pypi.yml on BradSegal/anatomize

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: anatomize-0.1.0-py3-none-any.whl
- Subject digest: d589655546321c864dfb12b38258597f968972402391fd7d9d6f3142f4ccec57
- Sigstore transparency entry: 886806198
- Sigstore integration time: Jan 31, 2026
Source repository:
- Permalink: BradSegal/anatomize@fc0c0f24a461d70cb7f2bda07b2cb74ecd74364c
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/BradSegal
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@fc0c0f24a461d70cb7f2bda07b2cb74ecd74364c
- Trigger Event: push

anatomize 0.1.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

anatomize

Installation

Quick Start (CLI)

Generate skeletons

Estimate tokens

Validate (and fix) skeleton output

Pack a repository into an AI-friendly bundle

Python API

Generate skeletons in code

Key exported objects

Configuration (.anatomize.yaml)

Output artifacts

Skeleton output directory

Pack output file(s)

Determinism and strictness

Development

Tests

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Configuration (`.anatomize.yaml`)