Skip to main content

Tools to manage large BibTex libraries

Project description

CITeX is a set of tools for comparing and filtering very large reference databases.


CITeX searches through BibTeX files to search and remove duplicate entries. It does this by comparing parsed titles strings and for simple cases, it then groups by similarities computes scores and selects the best single citation from that group of duplicates.

This project was a part of HealthHack 2016 in Canberra.

Authors: Aqeel Akber, Michael Barson, Sam Blackwell, Zac Hatfield-Dodds, Andrea Parisi



CITeX is on PyPI, use python -m pip install --upgrade citex to install or update. If you do not have Python, download the latest version from


Open a command prompt in the directory with your Bibtex files (see below).

Run the citex or citex-check command followed by the files to process.

See citex --help for details

For any number (one or more) of input files (collectively L) CITex outputs three files:

  • dedupe - the best selection of the duplicates (B)
  • dupes - the remaining duplicates (R)
  • unique - originial unique citations (U)

BibTex files

For Endnote users BibTeX files can be exported.

In Endnote:

  • Go to the Edit > Output Styles > Style Manager menu
  • select ‘BibTex export’
  • close Style Manager
  • select all items (ctrl-a)
  • File > Export
  • save file

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
CITeX-0.2.2-py3-none-any.whl (10.3 kB) Copy SHA256 hash SHA256 Wheel 3.5 Oct 16, 2016
CITeX-0.2.2.tar.gz (5.9 kB) Copy SHA256 hash SHA256 Source None Oct 16, 2016

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page