| dc.description.abstract |
The rapid growth of digital communication platforms has intensified the spread of misinformation, posing significant social, political, and economic risks. This challenge is particularly severe in low-resource languages such as Sinhala, where the lack of advanced NLP tools and annotated datasets limits the development of reliable automated fake news detection systems. Addressing this gap, this research proposes an enhanced fake news detection framework tailored for the Sinhala language by integrating a fine-tuned transformer-based model with Explainable Artificial Intelligence (XAI) techniques to promote transparency and trust.
The study employs sinBERT, a pre-trained transformer model optimized for Sinhala linguistic patterns, which is further fine-tuned on a curated dataset of Sinhala news articles labeled across four categories: Credible, Partial, Uncertain, and Wrong. The data undergo rigorous preprocessing, including cleaning, tokenization, class balancing, and augmentation, to address noise and class imbalance. The model’s performance is evaluated using standard metrics such as accuracy, precision, recall, and F1 score, demonstrating strong predictive capability in identifying misleading news content.
To improve interpretability, an essential factor for user trust, the system incorporates XAI methods, specifically SHAP-based explanations, enabling users to visualize the words that most influence each prediction. This transparency helps bridge the gap between automated decision-making and human understanding.
Overall, this research contributes to the advancement of Sinhala NLP by providing a robust detection model, a valuable labeled dataset, and an interpretable decision-support framework. It highlights the potential of transformer architectures and XAI in combating misinformation in low-resource linguistic environments and sets a foundation for future improvements and multilingual extensions. |
en_US |