Digital Repository

Biasblocker – a Hate Speech Detection System for Transliterated Sinhala-English Code-Mixed Language

dc.contributor.author Kodithuwakku, Hashini
dc.date.accessioned 2024-03-14T03:50:02Z
dc.date.available 2024-03-14T03:50:02Z
dc.date.issued 2023
dc.identifier.citation Kodithuwakku, Hashini (2023) Biasblocker – a Hate Speech Detection System for Transliterated Sinhala-English Code-Mixed Language. BSc. Dissertation, Informatics Institute of Technology en_US
dc.identifier.issn 2019750
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/1895
dc.description.abstract "This study proposes a novel system for identifying hate speech in transliterated, code-mixed Sinhala-English text. The intricacy of the language and the prevalence of code-mixed text on social media platforms make hate speech difficult to identify in such content. The proposed system uses two pre-trained transformer models to detect hate speech in Sinhala-English code-mixed text, which is first transliterated and then used to train a hate speech detection model. The approach consists of three components: a pre-processing module, a transliteration module, and a hate speech detection module. Together, these components process the input text, transliterate it into Sinhala, and classify it for hate speech content. The approach employs an aggregated Sinhala-English code-mixed dataset with hate speech annotations and uses a pre-trained transformer model to detect hate speech content. The proposed solution outperforms the existing benchmarks for identifying hate speech in Sinhala-English code-mixed language, achieving over 92% in precision, recall, and F1-score. The system can be readily adapted to other low-resource code-mixed languages to aid in identifying hate speech on social media sites." en_US
dc.language.iso en en_US
dc.publisher IIT en_US
dc.subject Transliteration en_US
dc.subject Hate Speech Detection en_US
dc.subject Sinhala-English Code-mixed Language en_US
dc.title Biasblocker – a Hate Speech Detection System for Transliterated Sinhala-English Code-Mixed Language en_US
dc.type Thesis en_US

