Skip to main content

Simple utility to extract email addresses from HTML, including obfuscated email addresses

Project description

The email_scraper module provides a simple method that extracts email addresses from HTML. It is able to find emails in plain text, links, atob() obfuscation and HTML entities obfuscation.

Available on PyPI.

https://github.com/kichik/email-scraper/workflows/Build/badge.svg https://badge.fury.io/py/email-scraper.svg

Usage

>>> from email_scraper import scrape_emails
>>> scrape_emails('<html><body><a href="mailto:hello@world.com">email me</a></body></html>')
{'hello@world.com'}
>>> scarpe_emails('<a href="javascript:window.location.href=atob(\'bWFpbHRvOmVtYWlsQGV4YW1wbGUuY29t\')">E-Mail</a>')
{'email@example.com'}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

email-scraper-0.5.tar.gz (2.8 kB view hashes)

Uploaded Source

Built Distribution

email_scraper-0.5-py2.py3-none-any.whl (4.0 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page