Skip to main content
Join the official Python Developers Survey 2018 and win valuable prizes: Start the survey!

Extract email addresses from given URL.

Project description

Extract emails from a given website

Requirements

  • Python3
  • requests
  • lxml
  • fake-useragent

Installation

pip install extract_emails

Usage

from extract_emails import ExtractEmails

em = ExtractEmails(url, depth=None, print_log=False, ssl_verify=True, user_agent='random')
emails = em.emails
  • url: str, ex: http://example.com
  • depth: int, depth of scan
  • print_log: boolean, print log or not
  • ssl_verify: boolean
  • user_agent: str

ssl_verify - use to avoid errors like this: *exceeded with url: /api/v1/pods?watch=False (Caused by SSLError(SSLError(1, ‘[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:777)’),))*

user_agent - you can choose from several user agents: ie, msie, opera, chrome, google, firefox, safari, or random

Return list of emails.

Changelog

Version 2.0.0

  • Replaced BeautifulSoup to lxml
  • Improved regex for emails
  • Added different user agents

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
extract_emails-2.0.1.tar.gz (2.7 kB) Copy SHA256 hash SHA256 Source None Feb 11, 2018

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page