Skip to main content

Metadata extraction tool for git repositories.

Project description

diffhouse

diffhouse is a git metadata extraction tool for Python that retrieves high-quality repository information such as commit history, branches, diffs and more.

Requirements

Git 2.19 or greater.

Quick start

  1. Install diffhouse with pip:
pip install diffhouse
  1. Import the Repo class in Python:
from diffhouse import Repo
  1. Create a Repo instance with the git repository URL as an argument. Set blobs to True to load file-level diffs as well.

Note that blobs=True greatly increases processing time, as it requires a complete clone of the repository.

r = Repo(url='https://github.com/user/name.git', blobs=True)
  1. Access data through the following pandas DataFrames:
Table Description
Repo.commits Commit history.
Repo.branches Branch names.
Repo.tags Tag names.
Repo.diffs File-level changes. Available if blobs is True.

For a full list of objects and table columns, see the documentation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

diffhouse-0.2.1.tar.gz (7.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

diffhouse-0.2.1-py3-none-any.whl (7.7 kB view details)

Uploaded Python 3

File details

Details for the file diffhouse-0.2.1.tar.gz.

File metadata

  • Download URL: diffhouse-0.2.1.tar.gz
  • Upload date:
  • Size: 7.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for diffhouse-0.2.1.tar.gz
Algorithm Hash digest
SHA256 c9f1fc835c4af28aa8bcfb01815756a87af4ba66e481f77ae77dd8abb0a32a03
MD5 3f527b295b60ba9f77e26d29753f9929
BLAKE2b-256 19d1299cf69969e307fa070bfb2a94f25e825536422a24c64cf9dbb9180e864f

See more details on using hashes here.

File details

Details for the file diffhouse-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: diffhouse-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 7.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for diffhouse-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 3940d193c401a0bd508c40bd0a6777a3524d4ea6703804aed6479364af8a0e6f
MD5 dde5e3468d96d111f8b13199f82ef002
BLAKE2b-256 61a41917ff3f7a290dd5ce8af8c538bfa21d4dc3c202f57735690dad476e374b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page