Skip to main content
This is a pre-production deployment of Warehouse. Changes made here affect the production instance of PyPI (
Help us improve Python packaging - Donate today!

Tools to manage large BibTex libraries

Project Description

CITeX is a set of tools for comparing and filtering very large reference databases.


CITeX searches through BibTeX files to search and remove duplicate entries. It does this by comparing parsed titles strings and for simple cases, it then groups by similarities computes scores and selects the best single citation from that group of duplicates.

This project was a part of HealthHack 2016 in Canberra.

Authors: Aqeel Akber, Michael Barson, Sam Blackwell, Zac Hatfield-Dodds, Andrea Parisi


CITeX is on PyPI, use python -m pip install --upgrade citex to install or update. If you do not have Python, download the latest version from


Open a command prompt in the directory with your Bibtex files (see below).

Run the citex or citex-check command followed by the files to process.

See citex --help for details

For any number (one or more) of input files (collectively L) CITex outputs three files:

  • dedupe - the best selection of the duplicates (B)
  • dupes - the remaining duplicates (R)
  • unique - originial unique citations (U)

BibTex files

For Endnote users BibTeX files can be exported.

In Endnote:

  • Go to the Edit > Output Styles > Style Manager menu
  • select ‘BibTex export’
  • close Style Manager
  • select all items (ctrl-a)
  • File > Export
  • save file

Release History

This version
History Node


History Node


History Node


History Node


History Node


History Node


History Node


History Node


History Node


Download Files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, Size & Hash SHA256 Hash Help File Type Python Version Upload Date
(10.3 kB) Copy SHA256 Hash SHA256
Wheel 3.5 Oct 16, 2016
(5.9 kB) Copy SHA256 Hash SHA256
Source None Oct 16, 2016

Supported By

Elastic Elastic Search Pingdom Pingdom Monitoring Dyn Dyn DNS Sentry Sentry Error Logging CloudAMQP CloudAMQP RabbitMQ Heroku Heroku PaaS Kabu Creative Kabu Creative UX & Design Fastly Fastly CDN DigiCert DigiCert EV Certificate Google Google Cloud Servers