Skip to main content

Tool to compute the complexity of a Git contributions

Project description

Contribution Complexity

What is this?

This tool computes the complexity of a specified contribution to a git repository. A contribution is one or more commits specified by their commit hashes. Alternatively, if commit messages contain references to issue numbers, a contribution can be specified by a regular expression matching a certain set of commits.

The tool reports a contribution complexity on the scale low, moderate, medium, elevated, high. That value identifies weather a contribution was simple to make (value low) or if it consists of multiple intricate changes (value high) that were difficult to integrate into the system.

For example, the storage engine of Apache Cassandra (DBMS) was refactored for version 3 to better support certain concepts of the query language and to allow for future performance optimizations, see ticket CASSANDRA-8099 The corresponding commit modifies almost 50k lines in 645 files and contains many non-trivial changes. On the other hand a bug that prevented under certain circumstances streaming between cluster nodes was fixed with a quite tiny patch modifying 15 lines in two files.

For humans inspecting the two contributions it is quickly clear that the former contribution is way more complex to implement than the latter.

This tool is meant to automate the process of identification of contributions of various complexities either for inclusion in a CI setup or for research.

Installation

$ pip install contribution-complexity

Running

You can run the tool either by specifying a list of commits or by providing a regular expression that matches commit messages containing

$ contribcompl commits <path_to_repo> <commit_shas>...
$ contribcompl issue <path_to_repo> <issue_regex>...

For example,

$ git clone git@github.com:apache/Cassandra.git /tmp/cassandra
$ contribcompl commits /tmp/cassandra 021df085074b761f2b3539355ecfc4c237a54a76 2f1d6c7254342af98c2919bd74d37b9944c41a6b
ContributionComplexity.LOW
$ contribcompl issue /tmp/cassandra 'CASSANDRA-8099( |$)'
ContributionComplexity.HIGH

Calling from Code

from contribution_complexity.compute import find_commits_for_issue
from contribution_complexity.metrics import compute_contrib_compl


issue_re = "CASSANDRA-8099( |$)"
path_to_repo = "/tmp/cassandra"
commit_shas = find_commits_for_issue(path_to_repo, issue_re)
contribcompl = compute_contrib_compl(path_to_repo, commit_shas)
print(contribcompl)

Recreating the Experiment

Requirements

Run!

  • Set your Github API key in the Vagrantfile, i.e., replace <PUT_YOUR_KEY_HERE> on line 33 with your key.
  • Run vagrant up in this directory, which will bring up and configure a VM accordingly. It will automatically start the experiment recreation, which will take some hours to run.
  • Once done you have all results on the VM (log onto the machine with vagrant ssh) in the directory /vagrant/data/

The experiment is described in experiment/run_experiment.sh.

Attribution

The logo is adapted from a [flaticon icon](on https://www.flaticon.com/free-icon/puzzle_808497?term=contribution&page=1&position=16&page=1&position=16&related_id=808497&origin=search). Proper attribution to the original:

Icons made by mynamepong from www.flaticon.com

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

contribution-complexity-0.1.1.tar.gz (11.1 kB view details)

Uploaded Source

Built Distribution

contribution_complexity-0.1.1-py3-none-any.whl (10.2 kB view details)

Uploaded Python 3

File details

Details for the file contribution-complexity-0.1.1.tar.gz.

File metadata

  • Download URL: contribution-complexity-0.1.1.tar.gz
  • Upload date:
  • Size: 11.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.4 CPython/3.9.0 Darwin/18.7.0

File hashes

Hashes for contribution-complexity-0.1.1.tar.gz
Algorithm Hash digest
SHA256 c9c910bfa5095804a737159560235ac8a2b0fa819896a46d1181ca836fb37e77
MD5 bd4e39be2376568d900c44b1a8ea4a27
BLAKE2b-256 43248518c20db0971df7100fd7ad61c6a54eafc27cc4d1c6d19521dc99ed2b59

See more details on using hashes here.

File details

Details for the file contribution_complexity-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for contribution_complexity-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 c91576191136ddb6dab54a62afdfa5048d2c4b63c0f65e1f2019e899a1e244c7
MD5 c9b70654248eec829766e2a549863abc
BLAKE2b-256 df62b3f34226866246dd3a9da47bdd0b883c22e7e0eff50f250a316f447ba48e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page