Skip to main content

Metadata extraction tool for git repositories.

Project description

diffhouse

diffhouse is a git metadata extraction tool for Python that retrieves high-quality repository information such as commit history, branches, diffs and more.

Requirements

Git 2.19 or greater.

Quick start

  1. Install diffhouse with pip:
pip install diffhouse
  1. Import the Repo class in Python:
from diffhouse import Repo
  1. Create a Repo instance with the git repository URL as an argument. Set blobs to True to load file-level diffs as well.

Note that blobs=True greatly increases processing time, as it requires a complete clone of the repository.

r = Repo(url='https://github.com/user/name.git', blobs=True)
  1. Access data through the following pandas DataFrames:
Table Description
Repo.commits Commit history.
Repo.branches Branch names.
Repo.tags Tag names.
Repo.diffs File-level changes. Available if blobs is True.

For a full list of objects and table columns, see the documentation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

diffhouse-0.2.3.tar.gz (8.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

diffhouse-0.2.3-py3-none-any.whl (8.1 kB view details)

Uploaded Python 3

File details

Details for the file diffhouse-0.2.3.tar.gz.

File metadata

  • Download URL: diffhouse-0.2.3.tar.gz
  • Upload date:
  • Size: 8.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for diffhouse-0.2.3.tar.gz
Algorithm Hash digest
SHA256 471679da1ec94c9186b322335cb0075bde34a326302ce8c97a68e936b9c56ea6
MD5 e33aa6321f0bb0ea814a25388da1b365
BLAKE2b-256 f3fe9c94c689c703d82f4ed78d3d8aa80174d038a41c33f1b6e25cf03014a4c3

See more details on using hashes here.

File details

Details for the file diffhouse-0.2.3-py3-none-any.whl.

File metadata

  • Download URL: diffhouse-0.2.3-py3-none-any.whl
  • Upload date:
  • Size: 8.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for diffhouse-0.2.3-py3-none-any.whl
Algorithm Hash digest
SHA256 0c31a323f7f6f58fe829af289411ebdc97a983107b80d3b544a305c2a665cb17
MD5 6ff3349f0a69ca840aa95762b82a02c5
BLAKE2b-256 3d66b7f09314eaba9e25e9cbc3e3076a07242fa73db167eef0e5e43f6691096e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page