Sentiment analysis based- Hate speech detection of Amharic social media post and comments

Shita, Tsion

Full metadata record

DC Field	Value	Language
dc.contributor.author	Shita, Tsion	-
dc.date.accessioned	2025-07-01T12:23:45Z	-
dc.date.available	2025-07-01T12:23:45Z	-
dc.date.issued	2025-01	-
dc.identifier.uri	http://hdl.handle.net/123456789/8781	-
dc.description.abstract	Social media allows the user to post, comment and communicate freely. And this led to an increasing amount of online hate speech. Online hate speech has different offline repercussions, according to studies. In recent years, hate speech have led to internal violence, relocation, and human rights violations against specific social groups around the world. And Ethiopian societies are among the victims. To lessen the spread of hate speech, this study develops Amharic hate speech detection. The study's main goal is to create a model for detecting hate speech by taking into account sentiment analysis of the relevant datasets and proving a link between hate speech and sentiment analysis. Peacemakers can take action when hate speech comments are being circulated online by using an Amhariclanguage hate speech detection system. Additionally, it will assist owners of social media platforms by automatically reporting hate speech remarks before they are seen by a wider audience Comments were gathered from Facebook, TikTok, and YouTube channels in order to create a labeled large Amharic dataset. Following data cleaning, 79991 hate and hate-free annotated datasets along with their sentiment were obtained. To label the dataset as hate and hate-free, new annotation guidelines were created. Despite previous related work, recent and large dataset were collected and their sentiment were also considered. To construct the model, CNN and GRU deep learnings were used in conjunction with Word embedding features. Negative sentiment was revealed to be the source of hate speech content. And most of the hate free dataset were found to be having positive sentiment. Using datasets that have been annotated by humans as a hate and hate free, the GRU and CNN models demonstrated respective accuracies of 0.90 and 0.72. And when both hate and non-hate annotated datasets along with their sentiment were used in the hate speech detection model, the models' respective accuracies become 0.75 and 0.74. As a result, in both model GRU outperform CNN model, and the CNN approach shows good performance for the hate speech detection model that was developed by integrating sentiment analysis.	en_US
dc.language.iso	en	en_US
dc.publisher	St. Mary’s University	en_US
dc.subject	Deep Learning, Hate speech detection, Amharic post and comment dataset, sentiment analysis, Gated Recurrent Unit; Convolutional Neural Networks	en_US
dc.title	Sentiment analysis based- Hate speech detection of Amharic social media post and comments	en_US
dc.type	Thesis	en_US
Appears in Collections:	Master of computer science