Comparative Analysis of Machine Learning and Deep Learning Models for Sentiment Analysis in Somali Language

Abdirahman, Abdullahi Ahmed and Hashi, Abdirahman Osman and Dahir, Ubaid Mohamed and Elmi, Mohamed Abdirahman and Romo Rodriguez, Octavio Ernest (2023) Comparative Analysis of Machine Learning and Deep Learning Models for Sentiment Analysis in Somali Language. International Journal of Electrical and Electronics Engineering, 10 (7). pp. 41-52. ISSN 23488379

Full text not available from this repository.

Abstract

Understanding and analysing sentiment in user-generated content has become crucial with the increasing use of social media and online platforms. However, sentiment analysis in less-resourced languages like Somali poses unique challenges. This paper presents the performance of three ML algorithms (DTC, RFC, XGB) and two DL models (CNN, LSTM) in accurately classifying sentiment in Somali text. The CC100-Somali dataset, comprising 78M monolingual Somali texts from the Common crawl snapshots, is utilized for training and evaluation. The study employed rigorous evaluation techniques, including train-test splits and cross-validation, to assess classification accuracy and performance metrics. The results demonstrated that DTC achieved the highest accuracy among ML algorithms, 87.94%, while LSTM achieved the highest accuracy among DL models, 88.58%. This study's findings contribute to sentiment analysis in less-resourced languages, specifically Somali, and provide valuable insights into the performance of ML and DL techniques. Moreover, the study highlights the potential of leveraging both ML and DL approaches to analyze sentiment in Somali text effectively. The results

Item Type: Article
Subjects:
Divisions: Faculty of Computing > Department of Information Technology
Depositing User: Center for Research and Development SIMAD University
Date Deposited: 26 May 2024 07:15
Last Modified: 26 May 2024 07:15
URI: https://repository.simad.edu.so/id/eprint/125

Actions (login required)

View Item
View Item