Abdirahman, Abdullahi Ahmed and Hashi, Abdirahman Osman and Dahir, Ubaid Mohamed and Elmi, Mohamed Abdirahman and Romo Rodriguez, Octavio Ernest (2023) Comparative Analysis of Machine Learning and Deep Learning Models for Sentiment Analysis in Somali Language. International Journal of Electrical and Electronics Engineering, 10 (7). pp. 41-52. ISSN 23488379
Full text not available from this repository.Abstract
Understanding and analysing sentiment in user-generated content has become crucial with the increasing use of social media and online platforms. However, sentiment analysis in less-resourced languages like Somali poses unique challenges. This paper presents the performance of three ML algorithms (DTC, RFC, XGB) and two DL models (CNN, LSTM) in accurately classifying sentiment in Somali text. The CC100-Somali dataset, comprising 78M monolingual Somali texts from the Common crawl snapshots, is utilized for training and evaluation. The study employed rigorous evaluation techniques, including train-test splits and cross-validation, to assess classification accuracy and performance metrics. The results demonstrated that DTC achieved the highest accuracy among ML algorithms, 87.94%, while LSTM achieved the highest accuracy among DL models, 88.58%. This study's findings contribute to sentiment analysis in less-resourced languages, specifically Somali, and provide valuable insights into the performance of ML and DL techniques. Moreover, the study highlights the potential of leveraging both ML and DL approaches to analyze sentiment in Somali text effectively. The results
Item Type: | Article |
---|---|
Subjects: | |
Divisions: | Faculty of Computing > Department of Information Technology |
Depositing User: | Center for Research and Development SIMAD University |
Date Deposited: | 26 May 2024 07:15 |
Last Modified: | 26 May 2024 07:15 |
URI: | https://repository.simad.edu.so/id/eprint/125 |