An Open Source name disambiguation tool for version control systems

gambit

gambit is an Open Source name disambiguation tool for version control systems.

Download and installation

gambit is pure Python code. It has no platform-specific dependencies and thus works on all platforms. Assuming you are using pip, you can install the latest version of gambit by running:

> pip install gambit-disambig

This also installs the necessary dependencies. gambit depends on the python-Levenshtein package to compute Levenshtein distances for edited lines of code. On systems running Windows, automatically compiling this C-based module might fail during installation. In this case, unofficial Windows binaries can be found here, which might help you get started.
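To illustrate what a Levenshtein distance is, here is a minimal pure-Python sketch. It is not gambit's code and not the API of the python-Levenshtein package; the function names `levenshtein` and `similarity` are hypothetical and chosen for this example only. The C-based package computes the same quantity much faster.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions,
    or substitutions needed to turn string a into string b."""
    if len(a) < len(b):
        a, b = b, a  # iterate over the longer string, keep rows short
    # previous[j] holds the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty prefix of b
        for j, ch_b in enumerate(b, start=1):
            insert_cost = current[j - 1] + 1
            delete_cost = previous[j] + 1
            substitute_cost = previous[j - 1] + (ch_a != ch_b)
            current.append(min(insert_cost, delete_cost, substitute_cost))
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Hypothetical helper: distance normalised to [0, 1], 1.0 = identical."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest


if __name__ == "__main__":
    print(levenshtein("kitten", "sitting"))  # 3
    print(similarity("jane doe", "jane doe"))  # 1.0
```

The normalised variant is only one possible way to turn a distance into a similarity score; whether gambit uses this exact normalisation is not stated here.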

How to use gambit

After installation, we suggest checking out our tutorial, which details how to get started with gambit. We also provide detailed inline documentation that serves as a reference.

We have published some motivating results as well as details on the disambiguation algorithm and its hyperparameters in "gambit – An Open Source Name Disambiguation Tool for Version Control Systems". An earlier version of gambit, used to obtain the results shown in our paper, is archived on zenodo.org. Due to GDPR, we cannot publish the manually disambiguated ground-truth data on Zenodo. However, if you require this data for research purposes or to replicate our results, please feel free to contact us directly.

All functions of gambit have been tested on Ubuntu, macOS, and Windows.

How to cite gambit

@INPROCEEDINGS{gote2021gambit,
  author={Gote, Christoph and Zingg, Christian},
  booktitle={2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR)},
  title={gambit – An Open Source Name Disambiguation Tool for Version Control Systems},
  year={2021},
  pages={80-84},
  doi={10.1109/MSR52588.2021.00021}
}

License

This software is licensed under the GNU Affero General Public License v3 (AGPL-3.0).
