Extract email addresses from given URL.
Project description
Extract emails from a given website
Requirements
Minimum Python3.6
requests
lxml
Installation
pip install extract_emails
Usage
from extract_emails import ExtractEmails em = ExtractEmails(url, depth=None, print_log=False, ssl_verify=True, user_agent=None, request_delay=0.0) emails = em.emails
url: str, ex: http://example.com
depth: int, depth of scan
print_log: boolean, print log or not
ssl_verify: boolean
user_agent: str
request_delay: float
ssl_verify - use to avoid errors like this: *exceeded with url: /api/v1/pods?watch=False (Caused by SSLError(SSLError(1, ‘[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:777)’),))*
user_agent - you can choose from several user agents: ie, msie, opera, chrome, google, firefox, safari, or random
request_delay - time delay between requests in seconds
Return list of emails.
Changelog
Version 3.0.1
Minimum Python version: 3.6
Remove fake_useragent library
Improve email extraction
Add time delay between requests
Version 2.0.0
Replaced BeautifulSoup to lxml
Improved regex for emails
Added different user agents
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for extract_emails-3.0.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7f5b252bcdca96062deb46689030c4256aa984ecfa3c3525ac570e5a8830b491 |
|
MD5 | 28bc4b367c5d35455f6d1fb6a8ce1539 |
|
BLAKE2b-256 | 30567c0fe65f071f0a1757d50cbc1a94793fa67ac303f708ad3e2ca88dd0d260 |