
Redact PDF annotation metadata to control disclosure of personal data


pdf-slashannots – Redact PDF Annotation Metadata

Annotating a PDF leaves metadata alongside the annotation content, such as your name and the date and time at which you created and last modified each annotation. Anyone with access to the annotated PDF file can read and exploit this metadata.

Others could, for instance, infer how long you took to read through and comment on a document, or even on individual pages, and when you took breaks or did something else.
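
To make the risk concrete, here is a minimal sketch of how such an inference could work. It assumes the annotation timestamps have already been extracted from the PDF as strings in the PDF date format (D:YYYYMMDDHHmmSS followed by an optional timezone offset). The sample values and the helper names parse_pdf_date and gaps_between are made up for illustration and are not part of pdf-slashannots.

```python
import re
from datetime import datetime, timedelta, timezone

# PDF date strings look like D:YYYYMMDDHHmmSS+HH'mm' (trailing parts optional).
_PDF_DATE = re.compile(
    r"^D:(\d{4})(\d{2})?(\d{2})?(\d{2})?(\d{2})?(\d{2})?"
    r"(?:([Zz+-])(?:(\d{2})'?(?:(\d{2})'?)?)?)?$"
)


def parse_pdf_date(value):
    """Parse a PDF date string into a timezone-aware datetime."""
    match = _PDF_DATE.match(value)
    if match is None:
        raise ValueError(f"not a PDF date string: {value!r}")
    year, month, day, hour, minute, second, sign, off_h, off_m = match.groups()
    offset = timedelta(hours=int(off_h or 0), minutes=int(off_m or 0))
    if sign == "-":
        offset = -offset
    return datetime(
        int(year), int(month or 1), int(day or 1),
        int(hour or 0), int(minute or 0), int(second or 0),
        tzinfo=timezone(offset),
    )


def gaps_between(timestamps):
    """Return the time elapsed between consecutive annotations."""
    ordered = sorted(parse_pdf_date(t) for t in timestamps)
    return [later - earlier for earlier, later in zip(ordered, ordered[1:])]


# Made-up modification dates of three annotations in one document.
sample = [
    "D:20240305091500+01'00'",
    "D:20240305092730+01'00'",
    "D:20240305111000+01'00'",
]
gaps = gaps_between(sample)
for gap in gaps:
    print(gap)
```

For these sample values, the first gap is twelve and a half minutes and the second is well over an hour, which a reader of the file could interpret as a break.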

pdf-slashannots aims to redact that annotation metadata from PDFs so that you can pass them along without revealing your habits.

Installation

pdf-slashannots is available on PyPI:

python3 -m pip install pdf-slashannots

Usage

Simply run pdf-slashannots input.pdf redacted.pdf and you are done.

Okay, there is more. If you run the command without any options, all annotations are redacted equally and their dates are reduced to no precision at all, that is, set to January 1st, 1970. Check out pdf-slashannots --help for the more advanced features, such as:

  • specify a subset of annotation authors to redact
  • specify the precision to which the dates are reduced
  • also redact the authors' names

Or, if you prefer a graphical interface, there is also pdf-slashannots-gui, which is based on Python's Tkinter.
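
The idea of reducing dates to a given precision can be sketched as follows. This is only an illustration under stated assumptions, not the actual code of pdf-slashannots: the functions reduce_precision and to_pdf_date are hypothetical, and so are the precision names used here. Only the default behaviour (every date becomes January 1st, 1970) is taken from the description above. Consult pdf-slashannots --help for the options the tool really offers.

```python
from datetime import datetime, timezone

# Target date when no precision is kept at all (January 1st, 1970).
_EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def reduce_precision(moment, precision="none"):
    """Truncate a datetime so that nothing finer than `precision` survives.

    The precision names are hypothetical; the real tool may use others.
    """
    if precision == "none":
        # No precision at all: every date collapses to January 1st, 1970.
        return _EPOCH
    if precision == "year":
        return moment.replace(month=1, day=1, hour=0, minute=0, second=0, microsecond=0)
    if precision == "month":
        return moment.replace(day=1, hour=0, minute=0, second=0, microsecond=0)
    if precision == "day":
        return moment.replace(hour=0, minute=0, second=0, microsecond=0)
    if precision == "hour":
        return moment.replace(minute=0, second=0, microsecond=0)
    raise ValueError(f"unknown precision: {precision!r}")


def to_pdf_date(moment):
    """Format a timezone-aware datetime as a PDF date string in UTC."""
    return moment.astimezone(timezone.utc).strftime("D:%Y%m%d%H%M%SZ")


created = datetime(2024, 3, 5, 9, 27, 30, tzinfo=timezone.utc)
print(to_pdf_date(reduce_precision(created, "day")))
print(to_pdf_date(reduce_precision(created)))
```

The coarser the precision, the less the remaining timestamps reveal about when and for how long you worked on the document.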

