Digital Repository

Multilingual Math Word Problem Generation for Low Resource Languages

Show simple item record

dc.contributor.author Athapaththu, Dineth
dc.date.accessioned 2026-03-11T04:12:51Z
dc.date.available 2026-03-11T04:12:51Z
dc.date.issued 2025
dc.identifier.citation Athapaththu, Dineth (2025) Multilingual Math Word Problem Generation for Low Resource Languages. Msc. Dissertation, Informatics Institute of Technology en_US
dc.identifier.issn 20222022
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/2915
dc.description.abstract The manual creation of high-quality Math Word Problems presents a significant bottleneck for educators in low resource languages such as Sinhala and Tamil. While prior works have demonstrated the potential of multilingual models for this task, a systematic evaluation of advanced fine-tuning strategies to optimize performance remained a notable research gap. This thesis presents a comprehensive experimental framework to address this gap. A series of nine controlled experiments were conducted on multilingual sequence-to-sequence models, primarily mBART, to thoroughly evaluate and compare various training methodologies. A mixed-method evaluation was employed, combining automated metrics (BLEU, METEOR) with a structured qualitative human evaluation. The results prove that fine-tuning is a prerequisite for this task, as all models fail without it. The experiments further show that while a simple fine-tuning approach works, it fails to generalize across new problem domains and new languages. In contrast, advanced strategies like sequential transfer learning and multitask learning were validated as top-tier solutions that successfully overcome these critical limitations, producing high-quality and robust models. Sequential crosslingual transfer achieved the state-of-the-art performance. Finally, this research provides a clear, validated guide for optimizing MWP generation and offers a powerful new pathway for developing educational tools in underserved language communities. en_US
dc.language.iso en en_US
dc.subject AI-based Question Generation en_US
dc.subject Low-resource Languages en_US
dc.subject Natural Language Generation en_US
dc.title Multilingual Math Word Problem Generation for Low Resource Languages en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search


Advanced Search

Browse

My Account