Digital Repository

Offensive Word Detection on the Images Intended to Mislead Bots

Show simple item record

dc.contributor.author Perera, Dimalka
dc.date.accessioned 2024-02-15T08:14:11Z
dc.date.available 2024-02-15T08:14:11Z
dc.date.issued 2023
dc.identifier.citation Perera, Dimalka (2023) Offensive Word Detection on the Images Intended to Mislead Bots . MSc. Dissertation, Informatics Institute of Technology en_US
dc.identifier.issn 20200204
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/1701
dc.description.abstract The research aims to detect offensive words on images written in the Sinhala language, particularly those covered with jargon characters. The author explains the problem domain, highlighting the prevalence of abusive language on social media and its manifestation in memes. The research defines the problem statement, emphasizing the need to address the loopholes utilized by "memers" to hide offensive content. The research aims to contribute to the body of knowledge by developing a robust system for detecting and flagging offensive words in Sinhala images. The research question and the novelty of the research in the domain are presented, along with the identified research gap and potential contributions. A system to detect crossed letters on images is proposed and designed and the system will also be able to extract the clean text from the image. Then an abusive language detection model will recognize the text as offensive or not offensive. A sequential CNN was implemented to detect the crossed letters and K-Nearest Neighbor, Random Forest and Support Vector Machine algorithms were used to detect offensive language in text format. Sequential CNN model trained on a limited dataset achieved an accuracy of 96% and KNearest Neighbor, Random Forest and Support Vector Machine algorithms have achieved an accuracy of 88% , 86% and 83% respectively. en_US
dc.language.iso en en_US
dc.publisher IIT en_US
dc.subject Machine Learning en_US
dc.subject Deep Learning en_US
dc.subject ResNet en_US
dc.title Offensive Word Detection on the Images Intended to Mislead Bots en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search


Advanced Search

Browse

My Account