Skip to main content

Metadata extraction tool for git repositories.

Project description

diffhouse

diffhouse is a git metadata extraction tool for Python that retrieves high-quality repository information such as commit history, branches, diffs and more.

Requirements

Git 2.19 or greater.

Quick start

  1. Install diffhouse with pip:
pip install diffhouse
  1. Import the Repo class in Python:
from diffhouse import Repo
  1. Create a Repo instance with the git repository URL as an argument. Set blobs to True to load file-level diffs as well.

Note that blobs=True greatly increases processing time, as it requires a complete clone of the repository.

r = Repo(url='https://github.com/user/name.git', blobs=True)
  1. Access data through the following pandas DataFrames:
Table Description
Repo.commits Commit history.
Repo.branches Branch names.
Repo.tags Tag names.
Repo.diffs File-level changes. Available if blobs is True.

For a full list of objects and table columns, see the documentation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

diffhouse-0.2.2.tar.gz (8.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

diffhouse-0.2.2-py3-none-any.whl (7.7 kB view details)

Uploaded Python 3

File details

Details for the file diffhouse-0.2.2.tar.gz.

File metadata

  • Download URL: diffhouse-0.2.2.tar.gz
  • Upload date:
  • Size: 8.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for diffhouse-0.2.2.tar.gz
Algorithm Hash digest
SHA256 7595132119a5c6a3a637bb82fe7823270c504d6db7f64b9776bc5ca53de65623
MD5 0570051fd4b49c28623478fed0f3d7f9
BLAKE2b-256 ad1ee79daa5707a0b7e805c030f82a6e04fd2d878ea9befbbe8371cfdb3a4336

See more details on using hashes here.

File details

Details for the file diffhouse-0.2.2-py3-none-any.whl.

File metadata

  • Download URL: diffhouse-0.2.2-py3-none-any.whl
  • Upload date:
  • Size: 7.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for diffhouse-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 8609913abe13e9fbf07dd7d7fe29e3773a9408a0015983a7414eb5178c1b91a9
MD5 8e1ca271a07bc857b256cde410bf7e37
BLAKE2b-256 38545de1826e18ce7c788b07019c806a75aef17bbdf4df866f2a6669861208e9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page