Skip to main content

Extracts email metadata and text from a PDF file

Project description

xmpdf

Extracts email metadata and text body from a PDF containing emails.

Installation

pip install xmpdf

Usage

from xmpdf import Xmpdf

ems = Xmpdf(pdf_file)
# print summary info about emails in PDF file
print(ems.info())
# process emails
for m in ems.emails:
    process(m)

Notes

  • The initial development of this package was funded in part by The Mellon Foundation’s “Email Archives: Building Capacity and Community” program.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xmpdf-0.5.2.tar.gz (5.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

xmpdf-0.5.2-py3-none-any.whl (5.9 kB view details)

Uploaded Python 3

File details

Details for the file xmpdf-0.5.2.tar.gz.

File metadata

  • Download URL: xmpdf-0.5.2.tar.gz
  • Upload date:
  • Size: 5.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/4.10.1 pkginfo/1.8.2 requests/2.27.1 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.9

File hashes

Hashes for xmpdf-0.5.2.tar.gz
Algorithm Hash digest
SHA256 98bda382a4c2fd9bb68d9e9630dc51bea294fe57d927fb3cc882f1227d64ead8
MD5 751c589a6313dbc34f1ae502767e3ae4
BLAKE2b-256 a63574ce2e6a49484c763f612e1b7c8d2b26c233bfa49a6ade8a64a143427a8e

See more details on using hashes here.

File details

Details for the file xmpdf-0.5.2-py3-none-any.whl.

File metadata

  • Download URL: xmpdf-0.5.2-py3-none-any.whl
  • Upload date:
  • Size: 5.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/4.10.1 pkginfo/1.8.2 requests/2.27.1 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.9

File hashes

Hashes for xmpdf-0.5.2-py3-none-any.whl
Algorithm Hash digest
SHA256 196a3e63b900d0a27dbd54d133d58c24e9584e1fda20b3e318fe5a2c4e80de49
MD5 8fe4d92afb0092e157bda05c8f89d43e
BLAKE2b-256 012139ffd62a18003186f020d1adebbc961bced93e7c145fbd4107ec935a9102

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page