Digital Repository

Sinhala News Summarization

Show simple item record

dc.contributor.author Bandara, Pamudu
dc.date.accessioned 2024-03-22T08:01:44Z
dc.date.available 2024-03-22T08:01:44Z
dc.date.issued 2023
dc.identifier.citation Bandara, Pamudu (2023) Sinhala News Summarization. BSc. Dissertation, Informatics Institute of Technology en_US
dc.identifier.issn 2019198
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/1935
dc.description.abstract "In this modern era, an enormous amount of data is publicly available on the internet. However, nowadays people don’t have much time to read all those information because of their busy work schedules. To overcome that issues, automatic text summarization can be utilized to automatically summarize lengthy articles. Even though NLP text summarization has been evolving in past decades, few research attempts were conducted on the Sinhala text summarization domain since it is a low resource language. This research is dedicated to the Sinhala text summarization in order to create condensed summaries by excerpting the crucial segments from the source news article. In order to achieve this task TextRank algorithm which is a graph based unsupervised learning technique was used. In this research rather than using a statistical feature to identify the important and relevant information, deep learning techniques was adopted to identify the input text more accurately. To identify the semantic relationships SinBERT model and sentence transformer were adhered with the TextRank algorithm. The quantitative testing was performed by using ROUGE metrics and F1 score to evaluate the performance of the created solution. For the testing task, machine generated summaries were compared with the human created manual summaries based on the assumption that manual summaries are perfect. The created solution obtained better results for the F1 score when compared to the other Sinhala text summarization projects." en_US
dc.publisher en_US
dc.subject Automatic Text Summarization en_US
dc.subject Sinhala news Summarization en_US
dc.subject Extractive summarization en_US
dc.title Sinhala News Summarization en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search


Advanced Search

Browse

My Account