Skip to main content

Uses AI to help understand repositories and their changes.

Project description

repo-guide

PyPI Changelog Tests License

Use AI to generate guides to code repositories.

NEO: Can you fly that thing?
TRINITY: Not yet. Tank, I need a pilot program for a military B-212 helicopter. Hurry!
[seconds later ...]
TRINITY: Let's go.

The Matrix (1999)

You can see the output of repo-guide on its own repository at https://wolfmanstout.github.io/repo-guide/. This is automatically generated and published after every release. As of 1/4/2025, this consumes under 25K tokens on each run and costs less than 1 cent on Gemini 1.5 Flash (in fact it's using Gemini 2.0 Experimental, which is currently free).

NOTE: The guides generated by repo-guide are intended to be complementary to human-authored documentation, not replacements.

Installation

Install this tool using pip, pipx, or uv tool install, e.g.:

pip install repo-guide

By default it uses Gemini Flash as the AI model, which requires an API key. You can either set the LLM_GEMINI_KEY environment variable or install Simon Willison's LLM command-line tool and use llm keys set gemini to store your key in a configuration file.

DISCLAIMER: LLM API calls may cost money. Although this tool displays token counts and provides methods to limit token usage, you are ultimately responsible for any costs incurred, including costs that may be higher than expected due to bugs in this tool. Consider setting hard limits or other protections in your API accounts where possible.

Usage

This tool currently only supports Git repositories, with some additional features for GitHub repositories (e.g. links to files).

Typical usage:

repo-guide <path_to_cloned_repo_or_subdirectory>

This will create a generated_docs directory within the current directory, populate it with an AI-generated Markdown guide, then run a private MkDocs server at localhost:8000 to serve the docs. It will show a progress bar as it generates docs, including how many tokens the model has used (combining input + output). You can start viewing the docs immediately, and the page will automatically reload as new docs are generated.

If you kill the server and need to restart it later, by default it will reuse any previously-generated Markdown files, so you can simply rerun the same command. You can also add --no-resume to delete and regenerate the files, or --no-gen to explicitly disable doc generation (e.g. even if a new directory has been added). Alternatively, you can add --no-serve when building the guide and run MkDocs directly to host the files. You'll need to install MkDocs and the necessary dependencies (e.g. with uv tool install mkdocs --with mkdocs-material,bleach,bleach-allowlist) then run mkdocs serve -f generated_docs/mkdocs.yml. With this approach, you can also deploy the docs to other hosting platforms.

Here are some of the most common flags you may want to use:

  • --output-dir: Change where the generated docs are written.
  • -v or --verbose: Prints details on doc generation progress instead of a progress bar.
  • --model: Sets the LLM model to use. As of 1/1/2025, the default is Gemini 1.5 Flash, which costs under a dollar for 10 million tokens and has a 1 million token context window, making it a great fit. You can try other Gemini models such as gemini-2.0-flash-exp, or OpenAI models if OPENAI_API_KEY is set, as supported by simonw/llm and simonw/llm-gemini.
  • --token-budget: Sets an approximate token budget to avoid overspending. Tokens are counted after each LLM call, so the actual number may be higher.
  • --custom-instructions and --custom-instructions-file: Use either of these to append custom instructions to the system prompt. Let me know if you come up with something that significantly improves the general result quality!

For a full description of command line flags, run:

repo-guide --help

You can also use:

python -m repo_guide --help

Troubleshooting

If the command fails either due to an error or hitting the token budget, simply rerun the command and it will resume and retry (unless --no-resume is applied). Most common model errors (e.g. rate-limiting) should be automatically retried with exponential backoff. You can --ignore large generated or binary files that aren't automatically filtered out (the tool automatically respects .gitignore files and ignores files annotated in git ls-files --eol as non-text). If you still hit the model token limit, try setting --files-token-limit, which is applied per-directory.

LLMs are unpredictable, and the generated Markdown may contain errors and broken links. The system prompt tries to mitigate common issues, but they happen anyways. The only real fix to this will be better models, which will surely come soon.

Development

To contribute to this tool, use uv. The following command will establish the venv and run tests:

uv run pytest

To run repo-guide locally, use:

uv run repo-guide

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

repo_guide-0.4.2.tar.gz (68.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

repo_guide-0.4.2-py3-none-any.whl (15.6 kB view details)

Uploaded Python 3

File details

Details for the file repo_guide-0.4.2.tar.gz.

File metadata

  • Download URL: repo_guide-0.4.2.tar.gz
  • Upload date:
  • Size: 68.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for repo_guide-0.4.2.tar.gz
Algorithm Hash digest
SHA256 b28bbf4b585438da5e1796561ff856a8f240e517be1fbc2b96f7d7667167eb0f
MD5 d6fc130175b9550eb0885a01424e1dac
BLAKE2b-256 b2b595c609eee9339491baaa728b0095a0db5ac718fdb01f680aa96d14d409f0

See more details on using hashes here.

Provenance

The following attestation bundles were made for repo_guide-0.4.2.tar.gz:

Publisher: publish.yml on wolfmanstout/repo-guide

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file repo_guide-0.4.2-py3-none-any.whl.

File metadata

  • Download URL: repo_guide-0.4.2-py3-none-any.whl
  • Upload date:
  • Size: 15.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for repo_guide-0.4.2-py3-none-any.whl
Algorithm Hash digest
SHA256 1c4b6bebdc42ad04bb7bd750f95ee8b73a4f094bc7e0c204bea8357fcb680e33
MD5 b4a260a67b65eaebf037a076e878256d
BLAKE2b-256 6193c0dcd4afa773291a7251ac291064b05b44aae5340d61efce518e70ad6c5c

See more details on using hashes here.

Provenance

The following attestation bundles were made for repo_guide-0.4.2-py3-none-any.whl:

Publisher: publish.yml on wolfmanstout/repo-guide

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page