| dc.description.abstract |
The manual creation of high-quality Math Word Problems (MWPs) presents a significant bottleneck for educators in low-resource languages such as Sinhala and Tamil. While prior work has demonstrated the potential of multilingual models for this task, a systematic evaluation of advanced fine-tuning strategies to optimize performance has remained a notable research gap.
This thesis presents a comprehensive experimental framework to address this gap. A series of nine controlled experiments was conducted on multilingual sequence-to-sequence models, primarily mBART, to evaluate and compare various training methodologies. A mixed-method evaluation was employed, combining automated metrics (BLEU, METEOR) with a structured qualitative human evaluation.
The results show that fine-tuning is a prerequisite for this task: without it, all evaluated models fail to generate usable problems. The experiments further show that while simple fine-tuning works within its training distribution, it fails to generalize to new problem domains and new languages. In contrast, advanced strategies such as sequential transfer learning and multitask learning overcome these limitations, producing high-quality, robust models, with sequential cross-lingual transfer achieving the strongest performance overall. Finally, this research provides a clear, validated guide for optimizing MWP generation and offers a practical pathway for developing educational tools in underserved language communities. |
en_US |