Skip to main content

Extract email addresses from given URL.

Project description

Extract emails from a given website

Requirements

  • Minimum Python3.6
  • requests
  • lxml

Installation

pip install extract_emails

Usage

from extract_emails import ExtractEmails

em = ExtractEmails(url, depth=None, print_log=False, ssl_verify=True, user_agent=None, request_delay=0.0)
emails = em.emails
  • url: str, ex: http://example.com
  • depth: int, depth of scan
  • print_log: boolean, print log or not
  • ssl_verify: boolean
  • user_agent: str
  • request_delay: float

ssl_verify - use to avoid errors like this: *exceeded with url: /api/v1/pods?watch=False (Caused by SSLError(SSLError(1, ‘[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:777)’),))*

user_agent - you can choose from several user agents: ie, msie, opera, chrome, google, firefox, safari, or random

request_delay - time delay between requests in seconds

Return list of emails.

Changelog

Version 3.0.4

  • Buf fixing

Version 3.0.3

  • Improve parser

Version 3.0.1

  • Minimum Python version: 3.6
  • Remove fake_useragent library
  • Improve email extraction
  • Add time delay between requests

Version 2.0.0

  • Replaced BeautifulSoup to lxml
  • Improved regex for emails
  • Added different user agents

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for extract-emails, version 3.0.4
Filename, size File type Python version Upload date Hashes
Filename, size extract_emails-3.0.4-py2-none-any.whl (10.3 kB) File type Wheel Python version py2 Upload date Hashes View hashes
Filename, size extract_emails-3.0.4.tar.gz (9.2 kB) File type Source Python version None Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page