Skip to main content

A utility for interacting with data from git repositories as Pandas dataframes

Project description

![license]( [![Coverage Status](]( ![travis status]( [![PyPI version](]( ![downloads](

![Cumulative Blame](

A simple set of wrappers around gitpython for creating pandas dataframes out of git data. The project is centered around two primary objects:

  • Repository
  • ProjectDirectory

A Repository object contains a single git repo, and is used to interact with it. A ProjectDirectory references a directory in your filesystem which may have in it multiple git repositories. The subdirectories are all walked to find any child repos, and any analysis is aggregated up from all of those into a single output (pandas dataframe).

Current functionality includes:

  • Commit history with extension and directory filtering
  • Edited files history with extension and directory filtering
  • Blame with extension and directory filtering
  • Branches
  • Tags
  • ProjectDirectory-level general information table
  • Approximate bus factor
  • Cumulative Blame as a time series
  • profile analysis via GitHubProfile object
  • Plotting helpers in utilities module
  • Punchcard dataframe and plotting utility
  • Filewise blame
  • File owner approximation
  • Estimation of hours spent per project or per author across projects

Please see examples for more detailed usage. The image above is generated using the repository object’s cumulative blame function on stravalib.


Git-pandas supports python 2.7+ and 3.3+. To install use:

pip install git-pandas


We are looking for contributors, so if you are interested, please review our contributor guidelines in, which includes some proposed starter issues, or if you have an idea of your own, send us a pull request.

Projects Using Git-Pandas


This is BSD licensed (see

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for git-pandas, version 1.2.0
Filename, size File type Python version Upload date Hashes
Filename, size git-pandas-1.2.0.tar.gz (20.1 kB) File type Source Python version None Upload date Hashes View

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring DigiCert DigiCert EV certificate Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page