Extract email addresses and linkedin profiles from given URL.
Project description
Extract Emails
Extract emails and linkedins profiles from a given website
Support the project with BTC: bc1q0cxl5j3se0ufhr96h8x0zs8nz4t7h6krrxkd6l
Requirements
- Python >= 3.7
Installation
pip install extract_emails
Simple Usage
As library
from pathlib import Path
from extract_emails import DefaultFilterAndEmailFactory as Factory
from extract_emails import DefaultWorker
from extract_emails.browsers.requests_browser import RequestsBrowser as Browser
from extract_emails.data_savers import CsvSaver
websites = [
"website1.com",
"website2.com",
]
browser = Browser()
data_saver = CsvSaver(save_mode="a", output_path=Path("output.csv"))
for website in websites:
factory = Factory(
website_url=website, browser=browser, depth=5, max_links_from_page=1
)
worker = DefaultWorker(factory)
data = worker.get_data()
data_saver.save(data)
As CLI tool
$ extract-emails --help
$ extract-emails --url https://en.wikipedia.org/wiki/Email -of output.csv -d 1
$ cat output.csv
email,page,website
bob@b.org,https://en.wikipedia.org/wiki/Email,https://en.wikipedia.org/wiki/Email
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
extract-emails-5.3.0.tar.gz
(19.6 kB
view hashes)
Built Distribution
Close
Hashes for extract_emails-5.3.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 85447f1238af4aa900c632ee2de0aa962342df23047860cc144585a3b904588e |
|
MD5 | 9c64da425b055393745e2061eedcd171 |
|
BLAKE2b-256 | 7edd00cb27c80860570b4bd00447884a5b38b9249d26bf4cbbfacf02088e774f |