Extract email addresses and linkedin profiles from given URL.
Project description
Extract Emails
Extract emails and linkedins profiles from a given website
Support the project with BTC: bc1q0cxl5j3se0ufhr96h8x0zs8nz4t7h6krrxkd6l
Requirements
- Python >= 3.7
Installation
pip install extract_emails
Simple Usage
As library
from pathlib import Path
from extract_emails import DefaultFilterAndEmailFactory as Factory
from extract_emails import DefaultWorker
from extract_emails.browsers.requests_browser import RequestsBrowser as Browser
from extract_emails.data_savers import CsvSaver
websites = [
"website1.com",
"website2.com",
]
browser = Browser()
data_saver = CsvSaver(save_mode="a", output_path=Path("output.csv"))
for website in websites:
factory = Factory(
website_url=website, browser=browser, depth=5, max_links_from_page=1
)
worker = DefaultWorker(factory)
data = worker.get_data()
data_saver.save(data)
As CLI tool
$ extract-emails --help
$ extract-emails --url https://en.wikipedia.org/wiki/Email -of output.csv -d 1
$ cat output.csv
email,page,website
bob@b.org,https://en.wikipedia.org/wiki/Email,https://en.wikipedia.org/wiki/Email
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
extract-emails-5.3.1.tar.gz
(19.7 kB
view hashes)
Built Distribution
Close
Hashes for extract_emails-5.3.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | c1dae82c1d4734222795b67dac4c99da263070feb43e69cb2ea29bda4b9a3e38 |
|
MD5 | 3b47bf42672fc0032bec7408055fcc50 |
|
BLAKE2b-256 | 1c2e68953e1ac060914073c6ff13c025adc081deeb8b6bcae756121cca384fbb |