Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

✨[FEATURE] Spam mail detector #423

Closed
Niraj1608 opened this issue Oct 23, 2024 · 1 comment
Closed

✨[FEATURE] Spam mail detector #423

Niraj1608 opened this issue Oct 23, 2024 · 1 comment

Comments

@Niraj1608
Copy link
Contributor

Niraj1608 commented Oct 23, 2024

Email Spam Detector is a machine learning model that predicts whether an email is spam or not. It utilizes Multinomial Naive Bayes algorithm and LogicalRegression , NLTK, Matplotlib, and both CountVectorizer and TfidfVectorizer for feature extraction .

Description

The Email Spam Classifier is designed to analyze the content of an email and determine whether it is likely to be a spam or legitimate email. It uses a machine learning approach to classify emails based on their textual features.

The project includes the following components:

Multinomial Naive Bayes: The model is trained using Multinomial Naive Bayes, a popular algorithm for text classification, to learn the patterns and characteristics of spam and non-spam emails.

NLTK: The Natural Language Toolkit (NLTK) is a Python library that provides various tools and resources for natural language processing. It is used for preprocessing the email text and extracting relevant features.

Matplotlib: Matplotlib is a plotting library in Python. It is used for visualizing the performance metrics and results of the classifier, such as accuracy, precision, and recall.

Feature Extraction with CountVectorizer and TfidfVectorizer: The project utilizes both CountVectorizer and TfidfVectorizer for feature extraction. CountVectorizer converts the text data into numerical features by counting the occurrences of words, while TfidfVectorizer converts the text data into numerical features based on term frequency-inverse document frequency.

Deployment and gui : with straeamlit

@Niraj1608
Copy link
Contributor Author

@suryanshsk assign me this issue :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants