Detecting Misinformation in News Using BERT and Natural Language Processing

Ankannagari Harshith Reddy; Tabitha Indupalli; Dinesh Ragipani; T.Dheeraj; Ch.Bhanu uday

doi:XX.XXX/IJARIIT-V12I2-1258

This paper is published in Volume-12, Issue-2, 2026

Paper Details
Abstract & PDF

Area

Machine Learning

Author

Ankannagari Harshith Reddy, Tabitha Indupalli, Dinesh Ragipani, T.Dheeraj, Ch.Bhanu uday

Org/Univ

Gokaraju Rangaraju Institute of Engineering and Technology, Telangana, India

Pub. Date

24 April, 2026

Paper ID

V12I2-1258

Publisher

IJARIIT

Edition

Volume-12, Issue-2, 2026

Keywords

Fake News, WELFake dataset, DistilBERT, Support Vector Machine, LightGBM, TF-IDF.

Citations

IEEE
Ankannagari Harshith Reddy, Tabitha Indupalli, Dinesh Ragipani, T.Dheeraj, Ch.Bhanu uday. Detecting Misinformation in News Using BERT and Natural Language Processing, International Journal of Advance Research, Ideas and Innovations in Technology, www.IJARIIT.com.

APA
Ankannagari Harshith Reddy, Tabitha Indupalli, Dinesh Ragipani, T.Dheeraj, Ch.Bhanu uday (2026). Detecting Misinformation in News Using BERT and Natural Language Processing. International Journal of Advance Research, Ideas and Innovations in Technology, 12(2) www.IJARIIT.com.

MLA
Ankannagari Harshith Reddy, Tabitha Indupalli, Dinesh Ragipani, T.Dheeraj, Ch.Bhanu uday. "Detecting Misinformation in News Using BERT and Natural Language Processing." International Journal of Advance Research, Ideas and Innovations in Technology 12.2 (2026). www.IJARIIT.com.

Give proper credits, use Citation.

Abstract

The widespread use of social media and online news platforms has made it easier for misinformation and fake news to spread rapidly. This creates serious challenges for individuals and organizations that rely on accurate information. To address this problem, this study proposes a fake news detection system that combines Natural Language Processing (NLP) techniques with both traditional machine learning and transformer-based models. The dataset used for the study is derived from the WELFake dataset, containing labeled news articles categorized as real or fake. Text preprocessing techniques such as tokenization, removal of noise, and normalization are applied to prepare the data. Traditional models like Support Vector Machine (SVM) and LightGBM use TF-IDF features to capture important word patterns, while DistilBERT is used to understand contextual meaning in text. The results show that transformer-based models achieve higher accuracy, while traditional models remain efficient and reliable. This hybrid approach improves the overall effectiveness of fake news detection systems.

All content is copyright protected.