Simple utility to extract email addresses from HTML, including obfuscated email addresses
Project description
The email_scraper module provides a function, scrape_emails, that extracts email addresses from HTML. It finds addresses in plain text and links, as well as addresses obfuscated with atob() or HTML entities.
Available on PyPI.
Usage
>>> from email_scraper import scrape_emails
>>> scrape_emails('<html><body><a href="mailto:hello@world.com">email me</a></body></html>')
{'hello@world.com'}
>>> scrape_emails('<a href="javascript:window.location.href=atob(\'bWFpbHRvOmVtYWlsQGV4YW1wbGUuY29t\')">E-Mail</a>')
{'email@example.com'}
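To illustrate the two obfuscation styles named in the description, here is a minimal sketch that uses only the Python standard library. It is not the email_scraper implementation: the helper name find_emails_sketch and both regular expressions are assumptions made for this example, and the address pattern is deliberately simplistic.

```python
import base64
import html
import re

# Simplified address pattern, for illustration only (an assumption, not the library's).
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
# Captures the Base64 payload inside atob('...') or atob("...").
ATOB_RE = re.compile(r"""atob\(\s*['"]([A-Za-z0-9+/=]+)['"]\s*\)""")


def find_emails_sketch(markup):
    """Hypothetical helper, not part of email_scraper."""
    # Decode HTML entities first, e.g. "&#64;" becomes "@".
    text = html.unescape(markup)
    candidates = [text]
    # Decode every atob() payload and search the decoded text as well.
    for payload in ATOB_RE.findall(text):
        try:
            candidates.append(base64.b64decode(payload).decode("utf-8"))
        except ValueError:
            # Invalid Base64 or non-UTF-8 bytes: skip this payload.
            continue
    found = set()
    for chunk in candidates:
        found.update(EMAIL_RE.findall(chunk))
    return found


print(find_emails_sketch("Contact: hello&#64;world&#46;com"))
# {'hello@world.com'}
```

Decoding entities before searching means a single pattern covers both the plain and the entity-encoded case. The atob() payloads are decoded separately because Base64 text contains no "@" for the pattern to find.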
Download files
Source Distribution
email-scraper-0.4.tar.gz (2.8 kB)
Built Distribution
Hashes for email_scraper-0.4-py2.py3-none-any.whl
Algorithm | Hash digest |
---|---|
SHA256 | af015e725c0ee0d7ef8d2d60bf3fb1ae9e1dfda70059a09106941f300efdea0f |
MD5 | c2f5b10a8e3a7f6ee380b3d0b1fb7b4b |
BLAKE2b-256 | 6eaa19891f399376dd0d69399b55404487ec3f6c245e231a09c3a018b37ea313 |