Please use this identifier to cite or link to this item:
https://gnanaganga.inflibnet.ac.in:8443/jspui/handle/123456789/16658
Title: | Clickbait Detection for Amharic Language Using Deep Learning Techniques |
Authors: | Rajesh Sharma, R; Sungheetha, Akey; Haile, Mesfin Abebe; Kedir, Arefat Hyeredin; Rajasekaran, A; Charles Babu, G |
Keywords: | Amharic Language; Artificial Neural Networks; Clickbait Detection; Deep Learning Techniques; Machine Learning Techniques; Natural Language Processing; Social Media |
Issue Date: | 2024 |
Publisher: | Journal of Machine and Computing; AnaPub Publications |
Citation: | Vol. 4, No. 3; pp. 603-615 |
Abstract: | As a growing number of Ethiopians actively engage with the Internet and social media platforms, the incidence of clickbait is becoming a significant concern. Clickbait, which often uses enticing titles to tempt users into clicking, has become rampant for various reasons, including advertising and revenue generation. However, the Amharic language, spoken by a large population, lacks sufficient NLP resources for addressing this issue. In this study, the authors developed a machine learning model for detecting and classifying clickbait titles in the Amharic language. To facilitate this, the authors prepared the first Amharic clickbait dataset, which comprises 53,227 social media posts from well-known sites including Facebook, Twitter, and YouTube. As a baseline, the authors evaluated conventional machine learning methods, namely Random Forest (RF), Logistic Regression (LR), and Support Vector Machines (SVM), with TF-IDF and N-gram feature extraction. Subsequently, the authors investigated the efficacy of two word embedding techniques, word2vec and fastText, with Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), and Gated Recurrent Unit (GRU) deep learning algorithms. On the test data, the CNN model with fastText word embeddings performed best among the models, with 94.27% accuracy and a 94.24% F1 score. The study advances natural language processing for low-resource languages and offers insight into countering clickbait content in Amharic. ©2024 The Authors. Published by AnaPub Publications. |
URI: | https://doi.org/10.53759/7669/jmc202404058 https://gnanaganga.inflibnet.ac.in:8443/jspui/handle/123456789/16658 |
ISSN: | 2789-1801 |
Appears in Collections: | Journal Articles |
Files in This Item:
File | Size | Format | |
---|---|---|---|
JMC202404058.pdf | 755.43 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.