Skip to main content

A utility for interacting with data from git repositories as Pandas dataframes

Project description

![license](https://img.shields.io/pypi/l/Django.svg) [![Coverage Status](https://coveralls.io/repos/wdm0006/git-pandas/badge.svg?branch=master&service=github)](https://coveralls.io/github/wdm0006/git-pandas?branch=master) ![travis status](https://travis-ci.org/wdm0006/git-pandas.svg?branch=master) [![PyPI version](https://badge.fury.io/py/git-pandas.svg)](https://badge.fury.io/py/git-pandas) ![downloads](https://img.shields.io/pypi/dm/git-pandas.svg)

![Cumulative Blame Stravalib](https://raw.githubusercontent.com/wdm0006/git-pandas/master/examples/img/stravalib_cumulative_blame.png)

A simple set of wrappers around gitpython for creating pandas dataframes out of git data. The project is centered around two primary objects:

  • Repository

  • ProjectDirectory

A Repository object contains a single git repo, and is used to interact with it. A ProjectDirectory references a directory in your filesystem which may have in it multiple git repositories. The subdirectories are all walked to find any child repos, and any analysis is aggregated up from all of those into a single output (pandas dataframe).

This is a pre-v1.0.0 project, so the interfaces and functionality may change.

Current functionality includes:

  • Commit history with extension and directory filtering

  • Edited files history with extension and directory filtering

  • Blame with extension and directory filtering

  • Branches

  • Tags

  • ProjectDirectory-level general information table

  • Approximate bus factor

  • Cumulative Blame as a time series

Please see examples for more detailed usage. The image above is generated using the repository object’s cumulative blame function on stravalib.

Installation

Git-pandas supports python 2.7+ and 3.3+. To install use:

pip install git-pandas

Documentation

Docs can be found here: [http://wdm0006.github.io/git-pandas/](http://wdm0006.github.io/git-pandas/)

Contributing

We are looking for contributors, so if you are interested, please review our contributor guidelines in CONTRIBUTING.md, which includes some proposed starter issues, or if you have an idea of your own, send us a pull request.

License

This is BSD licensed (see LICENSE.md)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

git-pandas-0.0.6.tar.gz (13.8 kB view details)

Uploaded Source

File details

Details for the file git-pandas-0.0.6.tar.gz.

File metadata

  • Download URL: git-pandas-0.0.6.tar.gz
  • Upload date:
  • Size: 13.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for git-pandas-0.0.6.tar.gz
Algorithm Hash digest
SHA256 6c43562d39fb2f198163800279779543f52e4ea01ddfea44990d746ba7c94192
MD5 ad14c75028689fbcc05e59b3fc4a1acd
BLAKE2b-256 906f8233018ee46ac94d00c2d80c4c16ebe7c3ea75a082bd6c226a99370ede0c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page