dc.contributor.author |
Hasalanka, Savinu |
|
dc.date.accessioned |
2025-06-16T06:48:46Z |
|
dc.date.available |
2025-06-16T06:48:46Z |
|
dc.date.issued |
2024 |
|
dc.identifier.citation |
Hasalanka, Savinu (2024) QText: Model-Free Reinforcement Learning Approach for Text Data Augmentation. BSc. Dissertation, Informatics Institute of Technology |
en_US |
dc.identifier.issn |
20200813 |
|
dc.identifier.uri |
http://dlib.iit.ac.lk/xmlui/handle/123456789/2577 |
|
dc.description.abstract |
Lack of rich datasets is a major concern in Artificial Intelligence (AI) and Machine learning (ML). Moreover, curating proper datasets is extremely time consuming and labour intensive. Data augmentation is a solid solution to this problem. However, finding optimum augmentation magnitudes is challenging. The author has explored a model-free reinforcement learning technique called QLearning to select the best magnitudes to a given set of text data augmentation techniques. Author has taken a novel approach by utilising a model-free architecture which is easy to scale compared to existing approaches such as neural networks. With this less computationally complex solution, the author was able to provide a high accuracy rate in less time. Tests were performed on various widely used as well as custom made datasets, and it was noted that the accuracy rose from 76.8% to 97.7% when the News Analysis dataset was augmented with the proposed solution. Also it needs to be highlighted that the data points were increased by 5 times that of the original number of data points. This shows that the solution proposed by the author has been successful in achieving better accuracies and more diverse datasets, thus achieving the ambition of the project. |
en_US |
dc.language.iso |
en |
en_US |
dc.subject |
Machine Learning |
en_US |
dc.subject |
Low Diversity Data |
en_US |
dc.subject |
Low Performing Models |
en_US |
dc.title |
QText: Model-Free Reinforcement Learning Approach for Text Data Augmentation |
en_US |
dc.type |
Thesis |
en_US |