Skip to main content

concurrent, pipelined, platform-agnostic Git utilities for managing a large number of Git repositories

Project description

GitMC -- concurrent asynchronous Git Utilities for operations on massive numbers of Git repos

DOI ci PyPI Download stats

Platform-independent (Linux/Mac/Windows) Git utilities, useful for managing large (100+) numbers of Git repos. Speed is an emphasis throughout, with concurrency via Python stdlib asyncio via asyncio.create_subprocess_exec and pipelining makes operations in effect 100x faster overall as the coroutines simultaneously wait for Git operations (particularly remote operations like "fetch" and "pull"). We have implemented individual concurrent subprocess timeout using asyncio.wait_for so that one Git operation hanging doesn't cause other Git operations to fail--this is good for when a Git login popup may go unnoticed by the human.

GitMC uses command-line Git because PyGit also requires command-line Git installed, and we don't need the advanced functionality.


Also see PyGit-bulk for managing large (100+) numbers of users / teams.

This repo contains a Git pre-commit script with explanation.

Install

Install Git in a way accessible from the command line line

  • Mac: brew install git
  • Linux: apt install git
  • Windows: command line Git.
python -m pip install -e .

Usage

gitbranch : Tells of any non-master branches under directory ~/code

python -m gitutils.email : list all contributor email addresses. To fix unwanted emails use Git-filter-repo

find_missing_file : find directories missing exact fullpath to file

find_matching_file : find directories matching exact fullpath to file

Sync large number of git repos

These assume numerous subdirectories under ~/code. They work very quickly for large numbers (100+) of repos.

  • gitstat check if any local repos have pending changes
  • gitpull Git pulls all repos (suggest gitfetch instead)
  • gitfetch Git fetches all repos, printing a summary of files changed on remote

Place an empty file .nogit in a subdirectory to skip it.

[optional] speedup with https pull

For public repos, to make the Git remote checking go at least twice as fast, and significantly reduce the computational burden when SSH is used for git push (as is recommended), consider the "pushInsteadOf" global Git config. To do this, when cloning a public repo (including ones you're a collaborator on), use git clone https://. This global SSH push config one-time does SSH push for HTTPS-cloned repos:

git config --global url."ssh://github.com/".pushInsteadOf https://github.com/

The pattern matching can be made for all sites by omitting github.com from the command above, or you can refine it for each site, or even for specific usernames by editing the command above. For private repos, simply clone with SSH as usual

Preview all changed Jekyll files

This is for a website made using Jekyll or Hugo:

ActOnChanged . -p

It shows web page previews of all pages changed locally--start the Jekyll or Hugo debug server first e.g. hugo serve

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gitutils-1.12.1.tar.gz (13.8 kB view details)

Uploaded Source

Built Distribution

gitutils-1.12.1-py3-none-any.whl (16.8 kB view details)

Uploaded Python 3

File details

Details for the file gitutils-1.12.1.tar.gz.

File metadata

  • Download URL: gitutils-1.12.1.tar.gz
  • Upload date:
  • Size: 13.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.7

File hashes

Hashes for gitutils-1.12.1.tar.gz
Algorithm Hash digest
SHA256 413d1fc33d3405dae0eb4b08f83d7d0dae7cb5368abe88025a8336c380be0389
MD5 f4d168c928ed240da930503daa799ced
BLAKE2b-256 851308407945a432aa54777696c1e8934b383a447f9e036334f9c2c8c7a4c11b

See more details on using hashes here.

File details

Details for the file gitutils-1.12.1-py3-none-any.whl.

File metadata

  • Download URL: gitutils-1.12.1-py3-none-any.whl
  • Upload date:
  • Size: 16.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.7

File hashes

Hashes for gitutils-1.12.1-py3-none-any.whl
Algorithm Hash digest
SHA256 9fd61b9328535316cedd5bea6b4633205d558a5ec94edb6836ecf2c69ef83b30
MD5 7b53da21bca74b89fdccf669c7f974f2
BLAKE2b-256 40e1c572fc4762fc1def4b7c3a09d48e8fb58810d0ec223c1698c1382a4bf5d6

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page