A picky and eager Git hook runner.
Project description
[!NOTE] This project is in its infancy. Please try it out and report and help fix any issues or missing features, but expect a somewhat broken experience.
Breaking CLI changes is to be expected without notice between arbitrary versions until version 1.0 is released.
goose
🦆🧪💻
A picky and eager Git hook runner.
- Reproducible builds.
- Dynamic parallelism.
- Small file-system footprint.
Installation
Docker alias
alias goose='docker run --rm -it -v ${PWD}:/wd -v ~/.cache/goose-docker:/home/nonroot/.cache ghcr.io/antonagestam/goose:latest'
Via PyPI
pip install --require-venv git-goose
Features
- Smart parallelism schedules hooks across CPUs while avoiding concurrent writes.
- Deterministic environments by using ecosystem-specific lock files.
- Environments are shared across hooks.
- Self-contained definitions means there's no need to push tool-specific configuration upstream, or to maintain brittle mirroring schemes.
Parallelism
Goose takes care to keep your CPUs as busy as possible, optimizing to have the full suite of hooks finish as soon as possible. It does this by distributing units of work to all available processing cores.
Parameterized hooks, or hooks that take files as command line arguments, are divided to one unit of work per available core. Whenever a core becomes available for more work, a new unit is chosen for execution.
The scheduler takes care to never run more than one mutating hook on the same file. It
does this by taking into account hooks marked as read_only
and by comparing sets of
files a unit of work is assigned to. Two incompatible hooks can be simultaneously
working on two separate parts of the code-base.
Deterministic environments
Goose uses lock files to facilitate deterministic results across developer environments
and CI. You specify dependencies in goose.yaml
, and invoking goose run
will produce
the appropriate lock files under a .goose/
directory. The .goose/
directory is meant
to be checked into git, so that future invocations of goose run
can use the lock files
it contains to produce identical environments for hooks to run in.
Usage
Create a goose.yaml
file in the repository root.
environments:
- id: python
ecosystem:
language: python
version: "3.12"
dependencies:
- ruff
hooks:
- id: ruff
environment: python
command: ruff
args: [check, --force-exclude, --fix]
types: [python]
- id: ruff-format
environment: python
command: ruff
args: [format, --force-exclude]
types: [python]
Bootstrap environments, generate lock files, and install dependencies.
$ python -m goose upgrade
Run all hooks over all files.
$ python -m goose run --select=all
Commit configuration and lock files.
$ git add goose.yaml .goose
$ git commit -m 'Add goose configuration'
Upgrading hook versions
As pinning of hook versions is handled with lock files, there's no need to change configuration to upgrade hook dependency versions, instead you just run the upgrade command.
$ python -m goose upgrade
$ git add .goose
$ git commit -m 'Bump goose dependencies'
Example node hook
Goose currently supports Python and Node environments, here's an example using Prettier to format Markdown files.
environments:
- id: node
ecosystem:
language: node
version: "21.7.1"
dependencies:
- prettier
hooks:
- id: prettier
environment: node
command: prettier
types: [markdown]
args:
- --write
- --ignore-unknown
- --parser=markdown
- --print-width=88
- --prose-wrap=always
Read-only hooks
You will likely want to use a mix of pure linters, as well as formatters and
auto-fixers. Tools that don't mutate files can be more heavily parallelized by Goose,
because they can inspect overlapping sets of files simultaneously as other tools. To
enable this you set read_only: true
in hook configuration.
environments:
- id: python
ecosystem:
language: python
version: "3.12"
dependencies:
- pre-commit-hooks
hooks:
- id: check-case-conflict
environment: python
command: check-case-conflict
read_only: true
- id: check-merge-conflict
environment: python
command: check-merge-conflict
read_only: true
types: [text]
- id: python-debug-statements
environment: python
command: debug-statement-hook
read_only: true
types: [python]
- id: detect-private-key
environment: python
command: detect-private-key
read_only: true
types: [text]
- id: end-of-file-fixer
environment: python
command: end-of-file-fixer
types: [text]
- id: trailing-whitespace-fixer
environment: python
command: trailing-whitespace-fixer
types: [text]
Hooks that do not specify read_only: true
will never run simultaneously as other tools
over the same file.
Non-parameterized hooks
Some tools don't support passing files, or just work better if given the responsibility to parallelize work itself. One such tool is mypy. You can instruct goose to not pass filenames to a hook (and as a consequence, also not spawn multiple parallel jobs for this hook).
environments:
- id: mypy
ecosystem:
language: python
version: "3.12"
dependencies:
- mypy
hooks:
- id: mypy
environment: mypy
command: mypy
read_only: true
parameterize: false
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for git_goose-0.2.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b0ed467a123b01f1bcedc9685101ad3da8ab5801a1b60220d503f0b0b01c85de |
|
MD5 | 007cc4ee8aa43c4cea515320f51a36e7 |
|
BLAKE2b-256 | 324022971552de3150b91a08bc9a4ab301ab05b370e14f624caa82117790df91 |