MailScout - A Python Business Email Finder and Email Validator
MailScout is a Python library designed for finding business email addresses and simple email validation.
It offers a range of tools for email validation, SMTP checks, and generating potential business email addresses based on provided names and common naming conventions.
Features
- Generate and find potential business email addresses based on provided names and common patterns.
- Check SMTP deliverability of email addresses.
- Detect catch-all domains.
- Normalize names to create email-friendly formats.
- Bulk email finder for multiple domains.
Installation
Install MailScout using pip:
pip install mailscout
Initialization
Initialize the Scout class
from mailscout import Scout
scout = Scout()
The Scout class is the core of the MailScout library, providing various functionalities for email finding, processing and validation. When initializing a Scout object, you can customize its behavior using several arguments:
Arguments
- check_variants (bool): If set to True, the Scout object will generate and check different variants of email addresses based on provided names. Defaults to True.
- check_prefixes (bool): Enables the checking of common email prefixes (like 'info', 'contact', etc.) when generating email addresses. This is useful for finding potential business emails. Defaults to True.
- check_catchall (bool): Determines whether the Scout object should check if a domain is a catch-all. A catch-all domain accepts emails sent to any address under that domain. Defaults to True.
- normalize (bool): If set to True, the Scout object will normalize names to create email-friendly formats. This is particularly useful for names with diacritics or special characters. Defaults to True.
- num_threads (int): Specifies the number of threads to use for concurrent email checking. Increasing the number of threads can speed up the process when checking a large number of emails. Defaults to 5.
- num_bulk_threads (int): Sets the number of threads for bulk email finding tasks. This is separate from num_threads to provide flexibility in handling large-scale operations. Defaults to 1.
- smtp_timeout (int): The timeout in seconds for the SMTP connection. This parameter is crucial to avoid long waits on unresponsive servers. Defaults to 2.
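As an illustration, the arguments listed above can be collected in a dictionary and passed to the constructor as keyword arguments. The values below are arbitrary examples rather than recommendations, and the import is guarded so the snippet also runs where MailScout is not installed.

```python
# Keyword arguments as documented above; several values differ from the
# defaults purely for illustration.
scout_options = {
    "check_variants": True,
    "check_prefixes": False,   # skip generic prefixes such as 'info'
    "check_catchall": True,
    "normalize": True,
    "num_threads": 10,         # more concurrent checks than the default of 5
    "num_bulk_threads": 2,
    "smtp_timeout": 5,         # wait longer than the default of 2 seconds
}

try:
    from mailscout import Scout
except ImportError:
    scout = None  # MailScout is not installed in this environment
else:
    scout = Scout(**scout_options)
```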
Usage
Find Business Emails with Names
MailScout generates combinations using the names you provide. These names should ideally belong to the same person, typically a first name and a last name.
To find business emails, use the **find_valid_emails** method.
Names can be a list of strings:
names = ["Batuhan", "Akyazı"]
# or, names = ["Batuhan Akyazı"]
domain = "example.com"
emails = scout.find_valid_emails(domain, names)
print(emails)
# ['b.akyazi@example.com']
You can also provide a list of lists containing strings to check on multiple people.
names = [["Jeff", "Winger"], ["Ben Cheng"], ["Łukas Nowicki"]]
domain = "microsoft.com"
emails = scout.find_valid_emails(domain, names)
print(emails)
# ['jeff@microsoft.com', 'ben.cheng@microsoft.com', 'bencheng@microsoft.com', 'ben@microsoft.com', 'lukas@microsoft.com']
Or simply a string.
names = "Jeffrey Tobias Winger"
domain = "microsoft.com"
emails = scout.find_valid_emails(domain, names)
print(emails)
# ['winger.tobias@microsoft.com']
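To give a feel for what "generating combinations" from names can mean, here is a minimal sketch that builds candidate addresses from name parts. The function `candidate_addresses` and its separator list are hypothetical and written for illustration only; MailScout's own pattern set may differ.

```python
from itertools import permutations, product


def candidate_addresses(names, domain, separators=("", ".", "_", "-")):
    """Build candidate addresses from name parts (hypothetical helper)."""
    # Accept both ["Ben", "Cheng"] and ["Ben Cheng"] by splitting on spaces.
    parts = [part.lower() for name in names for part in name.split()]
    local_parts = set(parts)  # single-part candidates such as "ben"
    for first, second in permutations(parts, 2):
        # Each part may appear in full or as its initial.
        for a, b in product((first, first[0]), (second, second[0])):
            for sep in separators:
                local_parts.add(f"{a}{sep}{b}")
    return sorted(f"{local}@{domain}" for local in local_parts)


candidates = candidate_addresses(["Batuhan", "Akyazi"], "example.com")
print("b.akyazi@example.com" in candidates)
```

Every candidate produced this way would still need to be checked for deliverability before being reported as valid.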
Find Business Emails with Common Prefixes
If you don't provide any names, MailScout falls back to brute-forcing common prefixes to find email addresses.
domain = "microsoft.com"
emails = scout.find_valid_emails(domain)
print(emails)
# ['support@microsoft.com', 'team@microsoft.com', 'marketing@microsoft.com', 'accounts@microsoft.com', 'help@microsoft.com', 'finance@microsoft.com', 'manager@microsoft.com', 'events@microsoft.com', 'community@microsoft.com', 'feedback@microsoft.com', 'dev@microsoft.com', 'developer@microsoft.com', 'status@microsoft.com', 'security@microsoft.com']
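The prefix fallback can be pictured as pairing a fixed list of generic mailbox names with the domain. The sketch below assumes a small illustrative prefix list; `COMMON_PREFIXES` and `prefix_candidates` are hypothetical names, and the list MailScout ships with is not reproduced here.

```python
# Illustrative subset only; not MailScout's actual prefix list.
COMMON_PREFIXES = ("info", "contact", "support", "help", "security")


def prefix_candidates(domain, prefixes=COMMON_PREFIXES):
    """Return one candidate address per generic prefix (hypothetical helper)."""
    return [f"{prefix}@{domain}" for prefix in prefixes]


print(prefix_candidates("example.com"))
```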
Find Business Emails in Bulk
To find valid email addresses in bulk for multiple domains and names, use the **find_valid_emails_bulk** method. This function takes a list of dictionaries, each containing a domain and optional names to check, and returns a list of dictionaries, each containing the domain, names, and a list of valid emails found.
You may think of each list item as a task and provide the data accordingly.
Here is an example of how to use this function:
email_data = [
    {"domain": "example.com", "names": ["John Doe"]},
    {"domain": "example2.com", "names": ["Jane Smith"]},
    {"domain": "example.com"}
]
valid_emails = scout.find_valid_emails_bulk(email_data)
print(valid_emails)
# [{'domain': 'example.com', 'names': ['John Doe'], 'valid_emails': ['j.doe@example.com']}, {'domain': 'example2.com', 'names': ['Jane Smith'], 'valid_emails': ['j.smith@example2.com', 'jane.smith@example2.com']}, {'domain': 'example.com', 'valid_emails': ['info@example.com']}]
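The task-per-dictionary idea can be sketched with a thread pool from the standard library. This is an assumption about how such a bulk runner could be structured, not MailScout's implementation: `find_bulk` is a hypothetical function, and `finder` stands in for any callable with the shape of find_valid_emails(domain, names).

```python
from concurrent.futures import ThreadPoolExecutor


def find_bulk(tasks, finder, max_workers=1):
    """Run finder(domain, names) for each task dict and attach the results."""
    def run(task):
        result = dict(task)  # copy so the caller's input is not mutated
        result["valid_emails"] = finder(task["domain"], task.get("names"))
        return result

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Executor.map yields results in the same order as the input tasks.
        return list(pool.map(run, tasks))


def fake_finder(domain, names=None):
    """Stand-in finder that performs no network access."""
    return [f"info@{domain}"]


print(find_bulk([{"domain": "example.com"}], fake_finder))
```

Keeping the output in input order makes it easy to match each result back to the task that produced it.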
Utility Methods
MailScout comes with a variety of utility methods for different tasks.
Check SMTP Deliverability (Email Validation)
To validate an email with MailScout, use the **check_smtp** method.
email = "batuhan@microsoft.com"
is_deliverable = scout.check_smtp(email)
print(f"Email {email} is deliverable: {is_deliverable}")
# Email batuhan@microsoft.com is deliverable: False
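For readers curious about what an SMTP deliverability check generally involves, the sketch below uses the standard library's smtplib to ask a mail server whether it would accept a recipient, without sending a message. It is not taken from MailScout's source: `smtp_probe` and `rcpt_accepted` are hypothetical names, the sender address is a placeholder, and the MX host must be supplied by the caller because MX lookup is outside the standard library.

```python
import smtplib


def rcpt_accepted(code):
    """Treat 2xx replies to RCPT TO (typically 250 or 251) as acceptance."""
    return 200 <= code < 300


def smtp_probe(email, mx_host, timeout=2, sender="probe@example.com"):
    """Ask mx_host whether it accepts mail for email; False on any failure."""
    try:
        with smtplib.SMTP(mx_host, 25, timeout=timeout) as server:
            server.ehlo_or_helo_if_needed()
            server.mail(sender)
            code, _message = server.rcpt(email)
            return rcpt_accepted(code)
    except (smtplib.SMTPException, OSError):
        # Covers timeouts, refused connections and protocol errors.
        return False
```

Many servers rate-limit or deliberately obscure answers to such probes, so a negative result does not prove that a mailbox is missing.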
Checking for Catch-All Domains
The check_email_catchall method can be used to determine if a given domain is configured as a catch-all. A catch-all domain is set up to accept emails sent to any address under that domain, even if the specific address does not exist.
domain = "example.com"
is_catchall = scout.check_email_catchall(domain)
print(f"Domain {domain} is catch-all: {is_catchall}")
# Domain example.com is catch-all: True
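A common way to detect a catch-all domain is to probe an address that almost certainly does not exist: if the server accepts it, it probably accepts anything. The sketch below illustrates that idea under the assumption that a deliverability check is passed in as `probe`; `looks_like_catchall` is a hypothetical helper, not MailScout's code.

```python
import secrets


def looks_like_catchall(domain, probe):
    """Return True if a random, presumably nonexistent address is accepted."""
    random_local = secrets.token_hex(10)  # 20 random hex characters
    return bool(probe(f"{random_local}@{domain}"))


# Stand-in probes that perform no network access:
print(looks_like_catchall("example.com", lambda address: True))
print(looks_like_catchall("example.com", lambda address: False))
```

On a catch-all domain, individual deliverability results carry little information, which is why checking this first is useful.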
Normalize Names into Email-friendly Format
To normalize a name for an email-friendly format, use the **normalize_name** method. This method converts a non-compliant name into a format that is acceptable for an email address. Here are some examples:
name1 = "Şule"
name2 = "Dzirżyterg"
normalized_name1 = scout.normalize_name(name1)
normalized_name2 = scout.normalize_name(name2)
print(normalized_name1)
# 'sule'
print(normalized_name2)
# 'dzirzyterg'
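The same kind of normalization can be approximated with the standard library's unicodedata module. This is a sketch under stated assumptions rather than MailScout's implementation: `normalize_name_sketch` is a hypothetical function, and the extra mapping covers only a few letters (such as Ł and ı) that Unicode decomposition leaves unchanged.

```python
import unicodedata

# Letters without a Unicode decomposition need an explicit mapping.
_EXTRA = str.maketrans({
    "ł": "l", "Ł": "l", "ı": "i", "ø": "o", "Ø": "o",
    "đ": "d", "Đ": "d", "ß": "ss",
})


def normalize_name_sketch(name):
    """Lowercase a name and reduce it to ASCII letters and digits."""
    mapped = name.translate(_EXTRA)
    # NFKD splits accented letters into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFKD", mapped)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return "".join(ch for ch in stripped.lower() if ch.isascii() and ch.isalnum())


print(normalize_name_sketch("Şule"))
print(normalize_name_sketch("Łukas"))
```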