Extract email addresses and LinkedIn profiles from a given URL.
Extract Emails
Extract emails and LinkedIn profiles from a given website.
Support the project with BTC: bc1q0cxl5j3se0ufhr96h8x0zs8nz4t7h6krrxkd6l
Requirements
- Python >= 3.9
Installation
pip install extract_emails[all]
# or
pip install extract_emails[requests]
# or
pip install extract_emails[selenium]
Simple Usage
As a library
from pathlib import Path

from extract_emails import DefaultFilterAndEmailFactory as Factory
from extract_emails import DefaultWorker
from extract_emails.browsers.requests_browser import RequestsBrowser as Browser
from extract_emails.data_savers import CsvSaver

# Websites to crawl
websites = [
    "website1.com",
    "website2.com",
]

# One browser and one CSV saver (append mode) are shared by all websites
browser = Browser()
data_saver = CsvSaver(save_mode="a", output_path=Path("output.csv"))

# Collect data from each website and append it to output.csv
for website in websites:
    factory = Factory(
        website_url=website, browser=browser, depth=5, max_links_from_page=1
    )
    worker = DefaultWorker(factory)
    data = worker.get_data()
    data_saver.save(data)
As a CLI tool
$ extract-emails --help
$ extract-emails --url https://en.wikipedia.org/wiki/Email -of output.csv -d 1
$ cat output.csv
email,page,website
bob@b.org,https://en.wikipedia.org/wiki/Email,https://en.wikipedia.org/wiki/Email
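
Post-processing the CSV

The output above uses the columns `email`, `page`, and `website`, so it can be post-processed with the standard library alone. The following is a minimal sketch, not part of extract_emails: the helper name `group_emails_by_website` is hypothetical, and skipping repeated header rows is a precaution based on the assumption that appending to an existing file might write the header more than once.

```python
import csv
from collections import defaultdict
from pathlib import Path


def group_emails_by_website(csv_path):
    """Return {website: sorted unique emails} read from a CSV file.

    Hypothetical helper: it only assumes the `email` and `website`
    columns shown in the sample output.
    """
    grouped = defaultdict(set)
    with Path(csv_path).open(newline="", encoding="utf-8") as handle:
        for row in csv.DictReader(handle):
            email = (row.get("email") or "").strip().lower()
            website = (row.get("website") or "").strip()
            # Skip blank rows and any repeated header row (assumed possible
            # when several runs append to the same file).
            if not email or not website or email == "email":
                continue
            grouped[website].add(email)
    # Sort for stable, readable output.
    return {site: sorted(emails) for site, emails in grouped.items()}
```

Calling `group_emails_by_website("output.csv")` on the sample file above would return a single entry mapping `https://en.wikipedia.org/wiki/Email` to `["bob@b.org"]`. Lowercasing the addresses is a design choice for deduplication and can be dropped if case needs to be preserved.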