Skip to main content

PDF Annotation Utils

Project description

# pdfannot

This package aims to create a two-way link between annotated pdf and excel data frame.

It allows you to :

- create an excel file containing each string annotated of the pdf in a column 'annot_text', along with its
annotation in a column 'content'.

- annotate a pdf given an excel file of the form described above.

It can be really useful for generating automatically annotated pdf documents with NLP models capable to
infer annotations from raw texts in a data frame.


### Prerequisites

fitz

### Installing

pip install pymupdf
(pipenv install pymupdf)

import fitz

### Authors

Arthur Renaud, Antoine Marullaz

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pdfannot-0.0.5.tar.gz (359.9 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page