Dahir, Ubaid Mohamed and Alkindy, Faisal Kevin (2023) Utilizing Machine Learning for Sentiment Analysis of IMDB Movie Review Data. International Journal of Engineering Trends and Technology, 71 (5). pp. 18-26. ISSN 22315381
SU-PG-2023-0033.pdf
Download (575kB)
Abstract
In this study, we focus on sentiment analysis, an essential technique in the rapidly evolving field of text analytics.
Ourapproach involves preprocessing the movie review text data using tokenization, lemmatization techniques, and feature extraction using Word of Bags and TF-IDF. We employ three popular machine learning methods, Logistic Regression, SVM, and Random Forest, to develop sentiment classification models. Our results show that logistic regression with the TF-IDF technique and default parameters outperforms the other models in terms of minimizing false positives, with an accuracy of 89.20%, a precision of 88.80%, recall of 89.80%, and an area under the receiver operating characteristics curve (AUC) of 89%. These findings have important implications for improving sentiment analysis and developing more accurate and effective text analytics tools, contributing to the novelty of the work in the journal fields.
Item Type: | Article |
---|---|
Subjects: | A General Works > AC Collections. Series. Collected works |
Divisions: | Faculty of Computing > Department of Computer Science |
Depositing User: | Center for Research and Development SIMAD University |
Date Deposited: | 20 Jan 2024 07:45 |
Last Modified: | 20 Jan 2024 07:45 |
URI: | https://repository.simad.edu.so/id/eprint/31 |