Extract email addresses and linkedin profiles from given URL.
Project description
Extract Emails
Extract emails and linkedins profiles from a given website
Support the project with BTC: bc1q0cxl5j3se0ufhr96h8x0zs8nz4t7h6krrxkd6l
Requirements
- Python >= 3.10
Installation
pip install extract_emails[all]
# or
pip install extract_emails[requests]
# or
pip install extract_emails[selenium]
Simple Usage
As library
from pathlib import Path
from extract_emails import DefaultFilterAndEmailFactory as Factory
from extract_emails import DefaultWorker
from extract_emails.browsers.requests_browser import RequestsBrowser as Browser
from extract_emails.data_savers import CsvSaver
websites = [
"website1.com",
"website2.com",
]
browser = Browser()
data_saver = CsvSaver(save_mode="a", output_path=Path("output.csv"))
for website in websites:
factory = Factory(
website_url=website, browser=browser, depth=5, max_links_from_page=1
)
worker = DefaultWorker(factory)
data = worker.get_data()
data_saver.save(data)
As CLI tool
$ extract-emails --help
$ extract-emails --url https://en.wikipedia.org/wiki/Email -of output.csv -d 1
$ cat output.csv
email,page,website
bob@b.org,https://en.wikipedia.org/wiki/Email,https://en.wikipedia.org/wiki/Email
By me a coffee
- USDT (TRC20): TXuYegp5L8Zf7wF2YRFjskZwdBxhRpvxBS
- BEP20: 0x4D51Db2B754eA83ce228F7de8EaEB93a88bdC965
- TON: UQA5quJljQz84RwzteN3uuKsdPTDee7a_GF5lgIgezA2oib5
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
extract_emails-5.3.4.tar.gz
(20.0 kB
view details)
Built Distribution
File details
Details for the file extract_emails-5.3.4.tar.gz
.
File metadata
- Download URL: extract_emails-5.3.4.tar.gz
- Upload date:
- Size: 20.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.3 CPython/3.12.3 Linux/6.9.3-arch1-1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 90a7c680028a582eda7501c79b27c0a2ad46268ae0b7a40bed31f942ae51debd |
|
MD5 | 9382691bd735de25f15f57ef95b9e56b |
|
BLAKE2b-256 | 0f5e0e23c79df27780dc516bad14fb6b8378780e7601dab1192638c77bab6c78 |
File details
Details for the file extract_emails-5.3.4-py3-none-any.whl
.
File metadata
- Download URL: extract_emails-5.3.4-py3-none-any.whl
- Upload date:
- Size: 31.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.3 CPython/3.12.3 Linux/6.9.3-arch1-1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f1c1745193d7b3ebc77f03c846290f027ab9a6aa39a117cdf931cf4d46227587 |
|
MD5 | b50884c012e4cefedde03ebe6e5749ed |
|
BLAKE2b-256 | e74e9375e91815e405837083ed16b55b214b11d7da4b12075104b00989e73e10 |