Extract email addresses and linkedin profiles from given URL.
Project description
Extract Emails
Extract emails and linkedins profiles from a given website
Support the project with BTC: bc1q0cxl5j3se0ufhr96h8x0zs8nz4t7h6krrxkd6l
Requirements
- Python >= 3.7
Installation
pip install extract_emails
Simple Usage
As library
from extract_emails.browsers.requests_browser import RequestsBrowser as Browser
from extract_emails import DefaultFilterAndEmailFactory as Factory
from extract_emails import DefaultWorker
browser = Browser()
url = 'https://en.wikipedia.org/'
factory = Factory(website_url=url, browser=browser)
worker = DefaultWorker(factory)
data = worker.get_data()
print(data)
"""
[
PageData(
website='https://en.wikipedia.org/',
page_url='https://en.wikipedia.org/Email_address',
data={'email': ['"John.Doe."@example.com', 'x@example.com']}
),
PageData(
website='https://en.wikipedia.org/',
page_url='https://en.wikipedia.org/Email_address2',
data={'email': ['"John.Doe2."@example.com', 'x2@example.com']}
),
]
"""
As CLI tool
$ extract-emails --help
$ extract-emails --url https://en.wikipedia.org/wiki/Email -of output.csv -d 1
$ cat output.csv
email,page,website
bob@b.org,https://en.wikipedia.org/wiki/Email,https://en.wikipedia.org/wiki/Email
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
extract-emails-5.2.0.tar.gz
(19.6 kB
view hashes)
Built Distribution
Close
Hashes for extract_emails-5.2.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | d837b8f60116c25910b4ee9b04dc28ddbe60a62968204527cfdd08f960c42d22 |
|
MD5 | 3820d51fd166a64b1285722fe651ea78 |
|
BLAKE2b-256 | e4de6bb0dc93356bf4179e34bf464b42a5a5c14942f632cba906f5730359b66e |