SMS Spam Detection

Overview

SMS Spam Detection is a machine learning model that takes an SMS as input and predicts whether the message is a spam or not spam message. The model is built using Python and deployed on the web using Streamlit.

Technology Used

Python
Scikit-learn
Pandas
NumPy
Streamlit

Features

Data collection
Data cleaning and preprocessing
Exploratory Data Analysis
Model building and selection
Web deployment using Streamlit

Data Collection

The SMS Spam Collection dataset was collected from Kaggle, which contains over 5,500 SMS messages labeled as either spam or not spam. You can access the dataset from here

Data Cleaning and Preprocessing

The data was cleaned by handling null and duplicate values, and the "type" column was label-encoded. The data was then preprocessed by converting the text into tokens, removing special characters, stop words and punctuation, and stemming the data. The data was also converted to lowercase before preprocessing.

Exploratory Data Analysis

Exploratory Data Analysis was performed to gain insights into the dataset. The count of characters, words, and sentences was calculated for each message. The correlation between variables was also calculated, and visualizations were created using pyplots, bar charts, pie charts, 5 number summaries, and heatmaps. Word clouds were also created for spam and non-spam messages, and the most frequent words in spam texts were visualized.

Model Building and Selection

Multiple classifier models were tried, including NaiveBayes, random forest, KNN, decision tree, logistic regression, ExtraTreesClassifier, and SVC. The best classifier was chosen based on precision, with a precision of 100% achieved.

Web Deployment

The model was deployed on the web using Streamlit. The user interface has a simple input box where the user can input a message, and the model will predict whether it is spam or not spam.

Usage

To use the SMS Spam Detection model on your own machine, follow these steps:

Clone this repository.
Install the required Python packages using

pip install -r requirements.txt.

Run the model using

streamlit run app.py.

Visit localhost:8501 on your web browser to access the web app.

Contributions

Contributions to this project are welcome. If you find any issues or have any suggestions for improvement, please open an issue or a pull request on this repository.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.gitignore		.gitignore
README.md		README.md
app.py		app.py
model.pkl		model.pkl
requirements.txt		requirements.txt
sms-spam.csv		sms-spam.csv
sms-spam.ipynb		sms-spam.ipynb
vectorizer.pkl		vectorizer.pkl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SMS Spam Detection

Overview

Technology Used

Features

Data Collection

Data Cleaning and Preprocessing

Exploratory Data Analysis

Model Building and Selection

Web Deployment

Usage

Contributions

About

Releases

Packages

Languages

Chitranshu2411/SMS-SPAM-DETECTION-

Folders and files

Latest commit

History

Repository files navigation

SMS Spam Detection

Overview

Technology Used

Features

Data Collection

Data Cleaning and Preprocessing

Exploratory Data Analysis

Model Building and Selection

Web Deployment

Usage

Contributions

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages