Extracts email metadata and text from a PDF file
Project description
xmpdf
Extracts email metadata and text body from a PDF containing emails.
Installation
pip install xmpdf
Usage
from xmpdf import Xmpdf
ems = Xmpdf(pdf_file)
# print summary info about emails in PDF file
print(ems.info())
# process emails (process is a placeholder for your own handler)
for m in ems.emails:
    process(m)
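The `process` call in the loop is left undefined. As a minimal sketch, assuming only that `ems.emails` is iterable as the loop suggests, a small wrapper can apply a handler to each message and record failures instead of stopping at the first bad email. The names `process_all` and `handler` are hypothetical and are not part of xmpdf.

```python
from typing import Any, Callable, Iterable, List, Tuple


def process_all(
    emails: Iterable[Any],
    handler: Callable[[Any], Any],
) -> Tuple[List[Any], List[Tuple[int, Exception]]]:
    """Apply handler to every message; collect results and failures.

    Hypothetical helper: it makes no assumption about what a message
    object looks like, only that the collection can be iterated.
    """
    results: List[Any] = []
    failures: List[Tuple[int, Exception]] = []
    for index, message in enumerate(emails):
        try:
            results.append(handler(message))
        except Exception as exc:
            # Keep the position of the failing message for later inspection.
            failures.append((index, exc))
    return results, failures
```

With the objects from the usage example, this could be called as `results, failures = process_all(ems.emails, process)`.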
Notes
- The initial development of this package was funded in part by The Mellon Foundation’s “Email Archives: Building Capacity and Community” program.
Download files
Source Distribution
xmpdf-0.4.tar.gz (5.3 kB)
File details
Details for the file xmpdf-0.4.tar.gz.
File metadata
- Download URL: xmpdf-0.4.tar.gz
- Upload date:
- Size: 5.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.1 importlib_metadata/4.10.1 pkginfo/1.8.2 requests/2.27.1 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.9
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 9f9deaa1579f5c5a709b19d334e26aacc4b692d3d3b3c008436db60c3ac82a8d |
| MD5 | 90eb8789eb9292224936c1f2278e965e |
| BLAKE2b-256 | d1478e21f36d736fb3c392195e39bd04c08259337bfa0e5a7b8a6537623acc60 |
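The published digests can be used to check a downloaded archive. The following is a minimal sketch using Python's standard `hashlib`; the helper names `sha256_of_file` and `matches_expected` are illustrative, and the local path to the archive is whatever location you downloaded it to.

```python
import hashlib

# SHA256 digest listed for xmpdf-0.4.tar.gz in the table above.
EXPECTED_SHA256 = "9f9deaa1579f5c5a709b19d334e26aacc4b692d3d3b3c008436db60c3ac82a8d"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops when read() returns empty bytes.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Compare a file's digest with the expected hex string."""
    return sha256_of_file(path) == expected.lower()
```

For example, `matches_expected("xmpdf-0.4.tar.gz")` should return `True` for an unmodified copy of the archive in the current directory.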