Utilizing Machine Learning for Sentiment Analysis of IMDB Movie Review Data

Dahir, Ubaid Mohamed and Alkindy, Faisal Kevin (2023) Utilizing Machine Learning for Sentiment Analysis of IMDB Movie Review Data. International Journal of Engineering Trends and Technology, 71 (5). pp. 18-26. ISSN 22315381

[thumbnail of SU-PG-2023-0033.pdf] Text
SU-PG-2023-0033.pdf

Download (575kB)

Abstract

In this study, we focus on sentiment analysis, an essential technique in the rapidly evolving field of text analytics.
Ourapproach involves preprocessing the movie review text data using tokenization, lemmatization techniques, and feature extraction using Word of Bags and TF-IDF. We employ three popular machine learning methods, Logistic Regression, SVM, and Random Forest, to develop sentiment classification models. Our results show that logistic regression with the TF-IDF technique and default parameters outperforms the other models in terms of minimizing false positives, with an accuracy of 89.20%, a precision of 88.80%, recall of 89.80%, and an area under the receiver operating characteristics curve (AUC) of 89%. These findings have important implications for improving sentiment analysis and developing more accurate and effective text analytics tools, contributing to the novelty of the work in the journal fields.

Item Type: Article
Subjects: A General Works > AC Collections. Series. Collected works
Divisions: Faculty of Computing > Department of Computer Science
Depositing User: Center for Research and Development SIMAD University
Date Deposited: 20 Jan 2024 07:45
Last Modified: 20 Jan 2024 07:45
URI: https://repository.simad.edu.so/id/eprint/31

Actions (login required)

View Item
View Item