Spam Doggy is a spam classifier that uses a Naive Bayes classifier to classify emails as spam or not spam.
Project description
Spam Doggy
Spam Doggy is a spam classifier that uses a Naive Bayes classifier to classify emails as spam or not spam. It is designed to be easy to use and efficient, leveraging the power of machine learning to accurately identify unwanted emails.
Features
- Utilizes a Naive Bayes classifier for efficient spam detection.
- Easy to train with custom datasets.
- Supports saving and loading models for quick predictions without retraining.
Installation
To use Spam Doggy, you need to have Python installed along with some additional dependencies. You can install the necessary packages using:
pip install -r requirements.txt
Usage
train model
from spam_doggy import SpamDoggy
# Load default training data
spam_doggy = SpamDoggy()
spam_doggy.set_default_training_data()
# Train the model
spam_doggy.train(print_report=True)
# Save the trained model and vectorizer
spam_doggy.save_model('vectorizer.joblib', 'classifier.joblib')
load model
# Create a new instance of SpamDoggy
spam_doggy = SpamDoggy()
# Load the pre-trained model and vectorizer
spam_doggy.load_model('vectorizer.joblib', 'classifier.joblib')
# Predict the class of a new email
email_text = "Congratulations! You've won a free cruise. Call now to claim your prize."
prediction = spam_doggy.predict(email_text)
print(f"The email is classified as: {prediction}")
custom dataset
If you want to train the model with your own dataset, ensure your data is in a pandas.DataFrame with two columns: Message (the email text) and Category (the label: "spam" or "not spam").
import pandas as pd
# Example custom data
data = {'Message': ["Free money", "Hi there, how are you?", "Limited time offer!"],
'Category': ["spam", "not spam", "spam"]}
custom_df = pd.DataFrame(data)
# Train with custom data
spam_doggy = SpamDoggy()
spam_doggy.set_train_data(custom_df)
spam_doggy.train(print_report=True)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for spam_doggy-1.0.1-py3-none-any.whl
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 | 88439c923760e7af25269d14e08ff3da33754e147c0fe70ab4d65f79069fc735 |
|
| MD5 | adc4f9f1c740d80f205fb3b9ae2d48b7 |
|
| BLAKE2b-256 | 3a7d5464d5ae6889f44f2f994be48f02e960b5b68b31578c7590d35e6740b4f5 |