This paper is published in Volume-9, Issue-5, 2023
Area
Machine Learning
Author
Thanniru Lakshman, Singarapu Sanjay Kumar, Ulligaddala Satish Kumar, Yenikepalli Sri Sekhar, Yellamati Suresh
Org/Univ
Vasireddy Venkatadri Institute of Technology, Guntur, Andhra Pradesh, India
Pub. Date
02 November, 2023
Paper ID
V9I5-1190
Publisher
Keywords
SMS, Machine Learning, Random Forest, Logistic Regression, NLP, Spam, Ham.

Citationsacebook

IEEE
Thanniru Lakshman, Singarapu Sanjay Kumar, Ulligaddala Satish Kumar, Yenikepalli Sri Sekhar, Yellamati Suresh. SMS spam detection in Machine Learning using Natural Language Processing, International Journal of Advance Research, Ideas and Innovations in Technology, www.IJARIIT.com.

APA
Thanniru Lakshman, Singarapu Sanjay Kumar, Ulligaddala Satish Kumar, Yenikepalli Sri Sekhar, Yellamati Suresh (2023). SMS spam detection in Machine Learning using Natural Language Processing. International Journal of Advance Research, Ideas and Innovations in Technology, 9(5) www.IJARIIT.com.

MLA
Thanniru Lakshman, Singarapu Sanjay Kumar, Ulligaddala Satish Kumar, Yenikepalli Sri Sekhar, Yellamati Suresh. "SMS spam detection in Machine Learning using Natural Language Processing." International Journal of Advance Research, Ideas and Innovations in Technology 9.5 (2023). www.IJARIIT.com.

Abstract

This paper presents the identification of Spam and ham messages using supervised machine learning algorithms Random forest Classifier, and Logistic Regression algorithms and Analyzes how each filter performs when detecting Ham and Spam. A spam message is a big issue in mobile communication to reduce this effective spam detection techniques should be built Preprocessing is done using the NLTK library with various Stemming Algorithms, Word clouds are used and tokenizing is also performed. The data set is divided into two categories for training and testing the classifiers . the results demonstrated that the performance of Random Forest is better than Logistic Regression. Random forest achieved a better accuracy of 97%.