Skip to main content

Metadata extraction tool for git repositories.

Project description

diffhouse

diffhouse is a git metadata extraction tool for Python that retrieves high-quality repository information such as commit history, branches, diffs and more.

Requirements

Git 2.20 and Python 3.10 or greater required.

Quick start

  1. Install diffhouse with pip:
pip install diffhouse
  1. Import the Repo class in Python:
from diffhouse import Repo
  1. Create a Repo instance with the git repository URL as an argument. Set blobs to True to load file-level diffs as well.

Note that blobs=True greatly increases processing time, as it requires a complete clone of the repository.

r = Repo(url='https://github.com/user/name.git', blobs=True)
  1. Access data through the following pandas DataFrames:
Table Description
Repo.commits Commit history.
Repo.branches Branch names.
Repo.tags Tag names.
Repo.diffs File-level changes. Available if blobs is True.

For a full list of objects and table columns, see the documentation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

diffhouse-0.2.5.tar.gz (8.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

diffhouse-0.2.5-py3-none-any.whl (8.7 kB view details)

Uploaded Python 3

File details

Details for the file diffhouse-0.2.5.tar.gz.

File metadata

  • Download URL: diffhouse-0.2.5.tar.gz
  • Upload date:
  • Size: 8.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for diffhouse-0.2.5.tar.gz
Algorithm Hash digest
SHA256 5fb53516984e6742f8657ed0930faba04e8e306783e0534f75c509182c799904
MD5 1ddb6e56c487a83fc77593ab37329cfd
BLAKE2b-256 8a73d71ceae2ff0cc99bab2df80f21752e7231ea9deee541239136b0a6c3ad4d

See more details on using hashes here.

File details

Details for the file diffhouse-0.2.5-py3-none-any.whl.

File metadata

  • Download URL: diffhouse-0.2.5-py3-none-any.whl
  • Upload date:
  • Size: 8.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for diffhouse-0.2.5-py3-none-any.whl
Algorithm Hash digest
SHA256 e71ea8834599ad84488a701bf8b830eb2ee0a43902204522f083690a759223d0
MD5 975179d8058dc9177a5803c61ba64f90
BLAKE2b-256 457f0973d56394b268148f1b8a4895af8a2fca99015f5c71410b649692e5312c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page