Uses AI to help understand repositories and their changes.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

wolfmanstout

These details have not been verified by PyPI

Project description

repo-guide

Use AI to generate guides to code repositories.

NEO: Can you fly that thing?
TRINITY: Not yet. Tank, I need a pilot program for a military B-212 helicopter. Hurry!
[seconds later ...]
TRINITY: Let's go.

— The Matrix (1999)

You can see the output of repo-guide on its own repository at https://wolfmanstout.github.io/repo-guide/. This is automatically generated and published after every release. As of 1/4/2025, this consumes under 25K tokens on each run and costs less than 1 cent on Gemini 1.5 Flash (in fact it's using Gemini 2.0 Experimental, which is currently free).

For more on why I built this, check out my blog post: Repo-guide: Mapping Code Repositories with AI.

NOTE: The guides generated by repo-guide are intended to be complementary to human-authored documentation, not replacements.

Installation

Install this tool using pip, pipx, or uv tool install, e.g.:

pip install repo-guide

By default it uses Gemini Flash as the AI model, which requires an API key. You can either set the LLM_GEMINI_KEY environment variable or install Simon Willison's LLM command-line tool and use llm keys set gemini to store your key in a configuration file.

DISCLAIMER: LLM API calls may cost money. Although this tool displays token counts and provides methods to limit token usage, you are ultimately responsible for any costs incurred, including costs that may be higher than expected due to bugs in this tool. Consider setting hard limits or other protections in your API accounts where possible.

Usage

This tool currently only supports Git repositories, with some additional features for GitHub repositories (e.g. links to files).

Typical usage:

repo-guide <path_to_cloned_repo_or_subdirectory>

This will create a generated_docs directory within the current directory, populate it with an AI-generated Markdown guide, then run a private MkDocs server at localhost:8000 to serve the docs. It will show a progress bar as it generates docs, including how many tokens the model has used (combining input + output). You can start viewing the docs immediately, and the page will automatically reload as new docs are generated.

If you kill the server and need to restart it later, by default it will reuse any previously-generated Markdown files, so you can simply rerun the same command. You can also add --no-resume to delete and regenerate the files, or --no-gen to explicitly disable doc generation (e.g. even if a new directory has been added). Alternatively, you can add --no-serve when building the guide and run MkDocs directly to host the files. You'll need to install MkDocs and the necessary dependencies (e.g. with uv tool install mkdocs --with mkdocs-material,bleach,bleach-allowlist) then run mkdocs serve -f generated_docs/mkdocs.yml. With this approach, you can also deploy the docs to other hosting platforms.

Here are some of the most common flags you may want to use:

--output-dir: Change where the generated docs are written.
-v or --verbose: Prints details on doc generation progress instead of a progress bar.
--model: Sets the LLM model to use. As of 1/1/2025, the default is Gemini 1.5 Flash, which costs under a dollar for 10 million tokens and has a 1 million token context window, making it a great fit. You can try other Gemini models such as gemini-2.0-flash-exp, or OpenAI models if OPENAI_API_KEY is set, as supported by simonw/llm and simonw/llm-gemini.
--token-budget: Sets an approximate token budget to avoid overspending. Tokens are counted after each LLM call, so the actual number may be higher.
--custom-instructions and --custom-instructions-file: Use either of these to append custom instructions to the system prompt. Let me know if you come up with something that significantly improves the general result quality!

For a full description of command line flags, run:

repo-guide --help

You can also use:

python -m repo_guide --help

Troubleshooting

If the command fails either due to an error or hitting the token budget, simply rerun the command and it will resume and retry (unless --no-resume is applied). Most common model errors (e.g. rate-limiting) should be automatically retried with exponential backoff. You can --ignore large generated or binary files that aren't automatically filtered out (the tool automatically respects .gitignore files and ignores files annotated in git ls-files --eol as non-text). If you still hit the model token limit, try setting --files-token-limit, which is applied per-directory.

LLMs are unpredictable, and the generated Markdown may contain errors and broken links. The system prompt tries to mitigate common issues, but they happen anyways. The only real fix to this will be better models, which will surely come soon.

Development

To contribute to this tool, use uv. The following command will establish the venv and run tests:

uv run pytest

To run repo-guide locally, use:

uv run repo-guide

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

wolfmanstout

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.7.0

Feb 6, 2025

0.6.1

Jan 19, 2025

0.6.0

Jan 19, 2025

This version

0.5.0

Jan 5, 2025

0.4.2

Jan 5, 2025

0.4.1

Jan 2, 2025

0.4.0

Jan 2, 2025

0.3.0

Jan 2, 2025

0.2.0

Dec 15, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

repo_guide-0.5.0.tar.gz (68.5 kB view details)

Uploaded Jan 5, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

repo_guide-0.5.0-py3-none-any.whl (15.8 kB view details)

Uploaded Jan 5, 2025 Python 3

File details

Details for the file repo_guide-0.5.0.tar.gz.

File metadata

Download URL: repo_guide-0.5.0.tar.gz
Upload date: Jan 5, 2025
Size: 68.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for repo_guide-0.5.0.tar.gz
Algorithm	Hash digest
SHA256	`669311ca5f897a83a1a4e1eb485f0d542baea7020255bba8ffe3817e328db426`
MD5	`9f329a9333fd27a4ea7c8800068f0191`
BLAKE2b-256	`8c28973796812f3329544bab68fa074de4e244a457816a7dcbf275128c552f6a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for repo_guide-0.5.0.tar.gz:

Publisher: publish.yml on wolfmanstout/repo-guide

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: repo_guide-0.5.0.tar.gz
- Subject digest: 669311ca5f897a83a1a4e1eb485f0d542baea7020255bba8ffe3817e328db426
- Sigstore transparency entry: 159897893
- Sigstore integration time: Jan 5, 2025
Source repository:
- Permalink: wolfmanstout/repo-guide@8f37a5307c5b79ee621235c3273d48637a09368e
- Branch / Tag: refs/tags/0.5.0
- Owner: https://github.com/wolfmanstout
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@8f37a5307c5b79ee621235c3273d48637a09368e
- Trigger Event: release

File details

Details for the file repo_guide-0.5.0-py3-none-any.whl.

File metadata

Download URL: repo_guide-0.5.0-py3-none-any.whl
Upload date: Jan 5, 2025
Size: 15.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for repo_guide-0.5.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`61e091494da588e8204d33d18989ed1624ba562dba991d184fc41fea3d223d3d`
MD5	`945684ce566304a7777f971483e9ab1a`
BLAKE2b-256	`96d620be1e564432fbf2b50a59deaff460e9203b8d5d2e9c2c4782cf17ad3e28`

See more details on using hashes here.

Provenance

The following attestation bundles were made for repo_guide-0.5.0-py3-none-any.whl:

Publisher: publish.yml on wolfmanstout/repo-guide

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: repo_guide-0.5.0-py3-none-any.whl
- Subject digest: 61e091494da588e8204d33d18989ed1624ba562dba991d184fc41fea3d223d3d
- Sigstore transparency entry: 159897895
- Sigstore integration time: Jan 5, 2025
Source repository:
- Permalink: wolfmanstout/repo-guide@8f37a5307c5b79ee621235c3273d48637a09368e
- Branch / Tag: refs/tags/0.5.0
- Owner: https://github.com/wolfmanstout
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@8f37a5307c5b79ee621235c3273d48637a09368e
- Trigger Event: release

repo-guide 0.5.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

repo-guide

Installation

Usage

Troubleshooting

Development

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance