Extracts email metadata and text from a PDF file

Project description

xmpdf

Extracts email metadata and text body from a PDF containing emails.

Installation

pip install xmpdf

Usage

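from xmpdf import Xmpdf

pdf_file = "emails.pdf"  # placeholder: path to a PDF containing emails
ems = Xmpdf(pdf_file)

# Print summary information about the emails in the PDF file
print(ems.info())

# Process each extracted email
for m in ems.emails:
    process(m)  # process() stands in for your own handling code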
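
The usage above leaves process() undefined, and this page does not document the type of the items in ems.emails. As a starting point, here is a minimal sketch of a process() that only inspects whatever it is given. The helper name describe() is hypothetical and is not part of xmpdf.

```python
from pprint import pformat


def describe(m):
    """Return a readable dump of one item without assuming its type."""
    if isinstance(m, dict):
        fields = m
    elif hasattr(m, "__dict__"):
        # Plain objects: show their instance attributes
        fields = vars(m)
    else:
        # Strings, tuples, etc.: fall back to the plain representation
        return repr(m)
    return pformat(dict(fields))


def process(m):
    # Hypothetical handler: print the item so its structure can be examined
    print(describe(m))
```

Running this once over ems.emails shows which fields are actually available before you write real handling code.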

Notes

  • The initial development of this package was funded in part by The Mellon Foundation’s “Email Archives: Building Capacity and Community” program.

Download files

Source Distribution

xmpdf-0.4.tar.gz (5.3 kB)

Uploaded Source

File details

Details for the file xmpdf-0.4.tar.gz.

File metadata

  • Download URL: xmpdf-0.4.tar.gz
  • Size: 5.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/4.10.1 pkginfo/1.8.2 requests/2.27.1 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.9

File hashes

Hashes for xmpdf-0.4.tar.gz

Algorithm    Hash digest
SHA256       9f9deaa1579f5c5a709b19d334e26aacc4b692d3d3b3c008436db60c3ac82a8d
MD5          90eb8789eb9292224936c1f2278e965e
BLAKE2b-256  d1478e21f36d736fb3c392195e39bd04c08259337bfa0e5a7b8a6537623acc60
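
To check a downloaded copy of the archive against the SHA256 digest listed above, something like the following sketch could be used. It relies only on the standard library's hashlib. The function names and the local file path are assumptions for illustration.

```python
import hashlib

# SHA256 digest listed for xmpdf-0.4.tar.gz
EXPECTED_SHA256 = "9f9deaa1579f5c5a709b19d334e26aacc4b692d3d3b3c008436db60c3ac82a8d"


def sha256_of(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected one."""
    return sha256_of(path) == expected.lower()
```

For example, matches_expected("xmpdf-0.4.tar.gz") returns True only if the file in the current directory (an assumed location) has the listed digest.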
