Digital Repository

Topic Classification using Active Learning for Sinhala Language Documents

Show simple item record

dc.contributor.author Sameemdeen, Arshad
dc.contributor.author Selvanthan, Nikethan
dc.date.accessioned 2025-04-29T04:59:54Z
dc.date.available 2025-04-29T04:59:54Z
dc.date.issued 2021
dc.identifier.citation Sameemdeen, A. and Selvanthan, N. (2021) ‘Topic Classification using Active Learning for Sinhala Language Documents’, in 2021 Asian Conference on Innovation in Technology (ASIANCON). 2021 Asian Conference on Innovation in Technology (ASIANCON), pp. 1–5. Available at: https://doi.org/10.1109/ASIANCON51346.2021.9544739. en_US
dc.identifier.uri https://ieeexplore.ieee.org/document/9544739
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/2287
dc.description.abstract In the field of Classifying Text data, Text Classification and Topic Modeling plays the higher role. When compared between the two techniques, Text Classification provides outputs with higher accuracy level. Due to this Data analysts tend to move towards this technique. Text classification is also referred to as text categorization/tagging, and it is a task of categorizing text according to its specified class. Text classifiers can automatically examine a set of text and classify it under a pre-defined category according to the content of the set of text with the help of Natural Language Processing (NLP) [1]. As this is a Supervised learning, it requires a vast range of classified dataset to make the classification efficient. But when it comes to languages with scarcity of classified dataset such as Sinhala, it becomes a problem to train the model due to the insufficiency of the dataset. Thus, the author proposes a solution for performing Text classification using Active learning. This solution utilizes the available classified dataset, learns from this supervised model, and produces outcomes (Classified Text Data) with a high accuracy level. en_US
dc.language.iso en en_US
dc.publisher IEEE en_US
dc.subject Text Classification en_US
dc.subject Semi-Supervised Learning en_US
dc.subject Natural Language Processing en_US
dc.subject Active learning en_US
dc.title Topic Classification using Active Learning for Sinhala Language Documents en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search


Advanced Search

Browse

My Account