A tool for extracting email addresses from PDF and DOCX files, designed especially for resumes.

PyEmailExtractor

PyEmailExtractor extracts email addresses from PDF and DOCX files, with a focus on resumes. This can be useful for applications such as recruitment, data analysis, or contact management.

Features

  • Extract email addresses from PDF files.
  • Extract email addresses from DOCX files.
  • Designed specifically for resumes, with the aim of accurate email extraction.
  • Simple and easy-to-use.
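
To illustrate the general idea behind email extraction, here is a minimal sketch that pulls email-like strings out of text that has already been read from a document. It is not PyEmailExtractor's actual implementation: the pattern, the find_emails helper, and the sample text are all assumptions made for illustration, and the pattern is deliberately simple rather than a full address validator.

```python
import re

# Deliberately simple pattern; an assumption for illustration, not the
# pattern PyEmailExtractor uses internally.
EMAIL_PATTERN = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def find_emails(text):
    """Return unique email-like strings from text, in order of first appearance."""
    seen = set()
    results = []
    for match in EMAIL_PATTERN.findall(text):
        key = match.lower()  # treat addresses as case-insensitive when de-duplicating
        if key not in seen:
            seen.add(key)
            results.append(match)
    return results


# Hypothetical resume header line
sample = (
    "Jane Doe | jane.doe@example.com | +1 555 0100 | "
    "Alt: JANE.DOE@example.com, j.doe@mail.example.org"
)
print(find_emails(sample))
# ['jane.doe@example.com', 'j.doe@mail.example.org']
```

A real resume may split an address across lines or wrap it in a link, so a pattern like this is only a starting point.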

Installation

You can install PyEmailExtractor using pip:

pip install PyEmailExtractor

Example Usage

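from PyEmailExtractor import extract_emails

# Directory containing the resumes to scan (PDF and DOCX files)
resume_dir = "/home/username/Downloads/resumes"

# Named "resume_dir" to avoid shadowing Python's built-in dir()
list_emails = extract_emails(resume_dir)

print(list_emails)

The above example prints the extracted emails as a list.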
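
Once you have the list, a common next step is to clean it up and save it. The sketch below assumes that extract_emails returns a list of plain strings. The page only says the result is a list, so the element type is an assumption. The helper names normalize_emails and save_emails are hypothetical and not part of PyEmailExtractor, and a hard-coded list stands in for the real return value so the snippet runs on its own.

```python
import csv


def normalize_emails(emails):
    """Lowercase, strip, and de-duplicate while preserving first-seen order."""
    seen = set()
    cleaned = []
    for email in emails:
        value = email.strip().lower()
        if value and value not in seen:
            seen.add(value)
            cleaned.append(value)
    return cleaned


def save_emails(emails, csv_path):
    """Write one address per row under an 'email' header."""
    with open(csv_path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["email"])
        writer.writerows([email] for email in emails)


# Stand-in for the list returned by extract_emails(...); assumed to hold strings
list_emails = ["Jane.Doe@example.com", "jane.doe@example.com ", "sam@example.org"]

unique_emails = normalize_emails(list_emails)
print(unique_emails)
# ['jane.doe@example.com', 'sam@example.org']

# To write the result to disk:
# save_emails(unique_emails, "emails.csv")
```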

Requirements

PyEmailExtractor requires the following dependencies:

  • docx2txt==0.8
  • lxml==4.9.2
  • PyPDF2==3.0.1
  • python-docx==0.8.11
  • PyMuPDF==1.22.5
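
Because the dependencies are pinned to exact versions, it can help to check what is actually installed in your environment. The following is a sketch using the standard library's importlib.metadata (Python 3.8+). The check_pins helper and its get_version parameter are hypothetical names introduced here for illustration, not part of PyEmailExtractor.

```python
from importlib import metadata

# Pins copied from the requirements list above
PINNED = {
    "docx2txt": "0.8",
    "lxml": "4.9.2",
    "PyPDF2": "3.0.1",
    "python-docx": "0.8.11",
    "PyMuPDF": "1.22.5",
}


def check_pins(pins, get_version=metadata.version):
    """Map each package name to (wanted, installed, matches).

    installed is None when the package is not found.
    """
    report = {}
    for name, wanted in pins.items():
        try:
            installed = get_version(name)
        except metadata.PackageNotFoundError:
            installed = None
        report[name] = (wanted, installed, installed == wanted)
    return report


for name, (wanted, installed, ok) in check_pins(PINNED).items():
    status = "ok" if ok else "mismatch"
    print(f"{name}: wanted {wanted}, installed {installed} ({status})")
```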

Contributing

Contributions are welcome! If you encounter any issues or have suggestions for improvements, please open an issue on the GitHub repository.

License

PyEmailExtractor is released under the MIT License. See the LICENSE file for more details.

Download files

Source Distribution

PyEmailExtractor-0.2.tar.gz (2.8 kB view details)

Uploaded Source

File details

Details for the file PyEmailExtractor-0.2.tar.gz.

File metadata

  • Download URL: PyEmailExtractor-0.2.tar.gz
  • Size: 2.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.6

File hashes

Hashes for PyEmailExtractor-0.2.tar.gz
Algorithm Hash digest
SHA256 009247e95bafaa78cbe7d38c031fbd8028bc394cfba1c3ecc28864dd082a61a9
MD5 11b2cc9e09552704829e00d9ee6528be
BLAKE2b-256 1b3761aaed952be58d3add23d725d19cd8b6c1d57d43949d5e06d15152b28317
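
One way to use these digests is to compare the SHA256 value against a locally downloaded copy of the archive. The sketch below uses Python's standard hashlib module. The helper names sha256_of_file and matches_published_hash are hypothetical, and the commented usage line assumes the archive has been saved in the current directory under its published file name.

```python
import hashlib

# SHA256 digest listed above for PyEmailExtractor-0.2.tar.gz
EXPECTED_SHA256 = "009247e95bafaa78cbe7d38c031fbd8028bc394cfba1c3ecc28864dd082a61a9"


def sha256_of_file(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Return True when the file's SHA256 digest equals the expected value."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Usage, assuming the archive was downloaded to the current directory:
# print(matches_published_hash("PyEmailExtractor-0.2.tar.gz", EXPECTED_SHA256))
```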
