Tools to manage large BibTex libraries
CITeX is a set of tools for comparing and filtering very large reference databases.
CITeX searches through BibTeX files to search and remove duplicate entries. It does this by comparing parsed titles strings and for simple cases, it then groups by similarities computes scores and selects the best single citation from that group of duplicates.
This project was a part of HealthHack 2016 in Canberra.
Authors: Aqeel Akber, Michael Barson, Sam Blackwell, Zac Hatfield-Dodds, Andrea Parisi
CITeX is on PyPI, use python -m pip install --upgrade citex to install or update. If you do not have Python, download the latest version from https://python.org
Open a command prompt in the directory with your Bibtex files (see below).
Run the citex or citex-check command followed by the files to process.
See citex --help for details
For any number (one or more) of input files (collectively L) CITex outputs three files:
- dedupe - the best selection of the duplicates (B)
- dupes - the remaining duplicates (R)
- unique - originial unique citations (U)
For Endnote users BibTeX files can be exported.
- Go to the Edit > Output Styles > Style Manager menu
- select ‘BibTex export’
- close Style Manager
- select all items (ctrl-a)
- File > Export
- save file
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
|Filename, Size & Hash SHA256 Hash Help||File Type||Python Version||Upload Date|
(10.3 kB) Copy SHA256 Hash SHA256
|Wheel||3.5||Oct 16, 2016|
(5.9 kB) Copy SHA256 Hash SHA256
|Source||None||Oct 16, 2016|