Skip to main content

Uses AI to help understand repositories and their changes.

Project description

repo-guide

PyPI Changelog Tests License

Use AI to generate guides to code repositories.

NEO: Can you fly that thing?
TRINITY: Not yet. Tank, I need a pilot program for a military B-212 helicopter. Hurry!
[seconds later ...]
TRINITY: Let's go.

The Matrix (1999)

You can see the output of repo-guide on its own repository at https://wolfmanstout.github.io/repo-guide/. This is automatically generated and published after every release. As of 1/4/2025, this consumes under 25K tokens on each run and costs less than 1 cent on Gemini 1.5 Flash (in fact it's using Gemini 2.0 Experimental, which is currently free).

For more on why I built this, check out my blog post: Repo-guide: Mapping Code Repositories with AI.

NOTE: The guides generated by repo-guide are intended to be complementary to human-authored documentation, not replacements.

Installation

Install this tool using pip, pipx, or uv tool install, e.g.:

pip install repo-guide

By default it uses Gemini Flash as the AI model, which requires an API key. You can either set the LLM_GEMINI_KEY environment variable or install Simon Willison's LLM command-line tool and use llm keys set gemini to store your key in a configuration file.

DISCLAIMER: LLM API calls may cost money. Although this tool displays token counts and provides methods to limit token usage, you are ultimately responsible for any costs incurred, including costs that may be higher than expected due to bugs in this tool. Consider setting hard limits or other protections in your API accounts where possible.

Usage

This tool currently only supports Git repositories, with some additional features for GitHub repositories (e.g. links to files).

Typical usage:

repo-guide <path_to_cloned_repo_or_subdirectory>

This will create a generated_docs directory within the current directory, populate it with an AI-generated Markdown guide, then run a private MkDocs server at localhost:8000 to serve the docs. It will show a progress bar as it generates docs, including how many tokens the model has used (combining input + output). You can start viewing the docs immediately, and the page will automatically reload as new docs are generated.

If you kill the server and need to restart it later, by default it will reuse any previously-generated Markdown files, so you can simply rerun the same command. You can also add --no-resume to delete and regenerate the files, or --no-gen to explicitly disable doc generation (e.g. even if a new directory has been added). Alternatively, you can add --no-serve when building the guide and run MkDocs directly to host the files. You'll need to install MkDocs and the necessary dependencies (e.g. with uv tool install mkdocs --with mkdocs-material,bleach,bleach-allowlist) then run mkdocs serve -f generated_docs/mkdocs.yml. With this approach, you can also deploy the docs to other hosting platforms.

Here are some of the most common flags you may want to use:

  • --output-dir: Change where the generated docs are written.
  • -v or --verbose: Prints details on doc generation progress instead of a progress bar.
  • --model: Sets the LLM model to use. As of 1/1/2025, the default is Gemini 1.5 Flash, which costs under a dollar for 10 million tokens and has a 1 million token context window, making it a great fit. You can try other Gemini models such as gemini-2.0-flash-exp, or OpenAI models if OPENAI_API_KEY is set, as supported by simonw/llm and simonw/llm-gemini.
  • --token-budget: Sets an approximate token budget to avoid overspending. Tokens are counted after each LLM call, so the actual number may be higher.
  • --custom-instructions and --custom-instructions-file: Use either of these to append custom instructions to the system prompt. Let me know if you come up with something that significantly improves the general result quality!

For a full description of command line flags, run:

repo-guide --help

You can also use:

python -m repo_guide --help

Troubleshooting

If the command fails either due to an error or hitting the token budget, simply rerun the command and it will resume and retry (unless --no-resume is applied). Most common model errors (e.g. rate-limiting) should be automatically retried with exponential backoff. You can --ignore large generated or binary files that aren't automatically filtered out (the tool automatically respects .gitignore files and ignores files annotated in git ls-files --eol as non-text). If you still hit the model token limit, try setting --files-token-limit, which is applied per-directory.

LLMs are unpredictable, and the generated Markdown may contain errors and broken links. The system prompt tries to mitigate common issues, but they happen anyways. The only real fix to this will be better models, which will surely come soon.

Development

To contribute to this tool, use uv. The following command will establish the venv and run tests:

uv run pytest

To run repo-guide locally, use:

uv run repo-guide

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

repo_guide-0.5.0.tar.gz (68.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

repo_guide-0.5.0-py3-none-any.whl (15.8 kB view details)

Uploaded Python 3

File details

Details for the file repo_guide-0.5.0.tar.gz.

File metadata

  • Download URL: repo_guide-0.5.0.tar.gz
  • Upload date:
  • Size: 68.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for repo_guide-0.5.0.tar.gz
Algorithm Hash digest
SHA256 669311ca5f897a83a1a4e1eb485f0d542baea7020255bba8ffe3817e328db426
MD5 9f329a9333fd27a4ea7c8800068f0191
BLAKE2b-256 8c28973796812f3329544bab68fa074de4e244a457816a7dcbf275128c552f6a

See more details on using hashes here.

Provenance

The following attestation bundles were made for repo_guide-0.5.0.tar.gz:

Publisher: publish.yml on wolfmanstout/repo-guide

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file repo_guide-0.5.0-py3-none-any.whl.

File metadata

  • Download URL: repo_guide-0.5.0-py3-none-any.whl
  • Upload date:
  • Size: 15.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for repo_guide-0.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 61e091494da588e8204d33d18989ed1624ba562dba991d184fc41fea3d223d3d
MD5 945684ce566304a7777f971483e9ab1a
BLAKE2b-256 96d620be1e564432fbf2b50a59deaff460e9203b8d5d2e9c2c4782cf17ad3e28

See more details on using hashes here.

Provenance

The following attestation bundles were made for repo_guide-0.5.0-py3-none-any.whl:

Publisher: publish.yml on wolfmanstout/repo-guide

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page