Skip to main content

Extract email addresses from given URL.

Project description

Extract emails from a given website

Requirements

  • Minimum Python3.6

  • requests

  • lxml

Installation

pip install extract_emails

Usage

from extract_emails import ExtractEmails

em = ExtractEmails(url, depth=None, print_log=False, ssl_verify=True, user_agent=None, request_delay=0.0)
emails = em.emails
  • url: str, ex: http://example.com

  • depth: int, depth of scan

  • print_log: boolean, print log or not

  • ssl_verify: boolean

  • user_agent: str

  • request_delay: float

ssl_verify - use to avoid errors like this: *exceeded with url: /api/v1/pods?watch=False (Caused by SSLError(SSLError(1, ‘[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:777)’),))*

user_agent - you can choose from several user agents: ie, msie, opera, chrome, google, firefox, safari, or random

request_delay - time delay between requests in seconds

Return list of emails.

Changelog

Version 3.0.1

  • Minimum Python version: 3.6

  • Remove fake_useragent library

  • Improve email extraction

  • Add time delay between requests

Version 2.0.0

  • Replaced BeautifulSoup to lxml

  • Improved regex for emails

  • Added different user agents

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

extract_emails-3.0.1.tar.gz (2.9 kB view hashes)

Uploaded Source

Built Distribution

extract_emails-3.0.1-py3-none-any.whl (5.0 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page