Fake-News-Classification-Using-LSTM

In this project, we have used various Natural language processing techniques with LSTM model to classify fake news articles using Tensorflow and Sci-kit libraries from python.

Download dataset from kaggle https://www.kaggle.com/competitions/fake-news/data?select=train.csv

Implementation

Download dataset from data folder

Data Cleaning for Analysis: In this section, we will clean our dataset to do some analysis:
- Perform null value imputation.
- Remove stop words.
- Remove special characters.
- Drop unused rows and columns.
- Apply stemming.
Explorative Data Analysis: In this section, we will perform:
- Statistical Analysis of the text.
- Word Cloud Visualizations of text analysis.
Building a LSTM Classifier
- Data Preperation: In this section splitting of dataset into training and testing is done.
- Tokenizing the Dataset: One Hot Representation and post padding is applied to fix a sentence length to fix the input on the dataset.
- Training the model: Embedding layer , LSTM , Dense layers are added to Sequential model and binary cross entropy, adam optimer, accuracy as metrics are used to configure the model for training.
- Model Evaluation : Accuracy score and confusion matrix are evaluated on the test dataset.

Each of these steps contained in fake-news-classification-using-lstm.ipynb file. The file with the ipynb extension has the advantage of saving the state of the last run of that file and the screen output.

Thus, screen output can be seen without re-running the files. Files with the ipynb extension can be run using the jupyter notebook program. When running the codes, the sequence numbers in the filenames should be followed.

Because the output of almost every step is the prerequisite for the operation of the next step.

Adding new features

The overall accuracy of our trained model when classifying articles is 0.9074. It should be noted that this number represents perfect classification. In fact, other models that focus on binary classification of Fake news label may achieve higher accuracy of perfect classification.

In furter improvements we can train our model with Different Classification algorithms like Logsitic Regression, Naive Bayes, SVM, KNN, Random Forest and AdaBoost and based on the comparision analysis on performance metrics we can decide the best algorithm for Fake-news classification.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
fake-news-classification-using-lstm.ipynb		fake-news-classification-using-lstm.ipynb

Library	Task
Numpy	Mathematical Operations
Pandas	Data Analysis Tools
Matplotlib	Visualizations
Sklearn	Machine Learning Library
Tensorflow	Modelling
NLTK	NLP Library

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fake-News-Classification-Using-LSTM

Table of contents

Implementation

Adding new features

About

Releases

Packages

Languages

Vardhan503/Fake-News-Classification-Using-LSTM

Folders and files

Latest commit

History

Repository files navigation

Fake-News-Classification-Using-LSTM

Table of contents

Implementation

Adding new features

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages