Skip to main content

A python library/command-line tool to quickly and automatically generate BibTeX data starting from the pdf file of a scientific publication.

Project description

pdf2bib

pdf2bib is a Python library to automatically extract bibliographic information from the .pdf file of a publication (or from a folder containing several .pdf files), and generate BibTeX entries. The pdf file can be either a paper published in a scientific journal (i.e. with a DOI associated to it), or an arxiv preprint. It uses the library pdf2doi to first find a valid identifier of the publication (i.e. either a DOI or an arxiv ID), and then uses this identifier to query public archives (e.g. http://dx.doi.org) to obtain bibliographic data.

pdf2bib can be used either from command line, or inside your python script or, only for Windows, directly from the right-click context menu of a pdf file or a folder.

Table of Contents

Installation

Use the package manager pip to install pdf2bib.

pip install pdf2bib==1.0rc2

Under Windows, it is also possible to add shortcuts to the right-click context menu.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

pdf2bib-1.0rc2-py3-none-any.whl (18.9 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page