Digital Repository

Lankan Hate Check - A Deep Learning Approach to The Detection & Categorization of Sinhala or Singlish Hate-Speech

Show simple item record

dc.contributor.author Sirimanne, Senesh
dc.date.accessioned 2024-03-14T04:34:14Z
dc.date.available 2024-03-14T04:34:14Z
dc.date.issued 2023
dc.identifier.citation Sirimanne, Senesh (2023) Lankan Hate Check - A Deep Learning Approach to The Detection & Categorization of Sinhala or Singlish Hate-Speech. BSc. Dissertation, Informatics Institute of Technology en_US
dc.identifier.issn 2019348
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/1902
dc.description.abstract With the increasing use of the internet and social media, the spread of hate speech has become a major concern. This research aims to address the issue of hate speech detection and categorization in the context of Sri Lanka, specifically for Sinhala and Singlish (a mix of Sinhala and English) languages. While the existing research in the domain of hate speech detection in the Sri Lankan region focuses on Sinhala or Singlish hate speech detection with machine learning, those solutions do not possess the ability to categorize hate speech. This system uses deep learning to detect and categorize both Sinhala and Singlish hate speech, which has not been addressed thus far. A novel approach is taken by the author to train seven LSTM models with binary classification to obtain the overall result of each of the seven models, which are used to produce the final result. Seven datasets were manually created for the purpose of training the models. A back-transliteration function is used to convert the Singlish text into Sinhala and then fed to the model. The idea was to find a complete solution that can automatically detect hate speech and warn the users to in order to reduce Singlish and Sinhala hate speech in Sri Lankan social media. Notably, this research looks into categorizing Sinhala and Singlish hate speech into racism, sexism and xenophobia, addressing the gap in existing research. en_US
dc.language.iso en en_US
dc.publisher IIT en_US
dc.subject Hate Speech Detection en_US
dc.subject Hate Speech Categorization en_US
dc.subject Sinhala & Singlish en_US
dc.title Lankan Hate Check - A Deep Learning Approach to The Detection & Categorization of Sinhala or Singlish Hate-Speech en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search


Advanced Search

Browse

My Account