Digital Repository

Polylingo - A Short Utterance Based Automatic Sinhala Language Identification & Translation Tool

dc.contributor.author Arafath, Aysha Manal
dc.date.accessioned 2021-07-03T13:22:22Z
dc.date.available 2021-07-03T13:22:22Z
dc.date.issued 2020
dc.identifier.citation Arafath, Aysha Manal (2020) Polylingo - A Short Utterance Based Automatic Sinhala Language Identification & Translation Tool, BSc. Dissertation, Informatics Institute of Technology en_US
dc.identifier.other 2016381
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/517
dc.description.abstract Language Identification (LI) has become a popular research field in the past couple of years. It is the process of identifying the language spoken in an audio recording. Research has been done using different approaches to increase the accuracy of LI systems. Language identification also plays an important role in systems such as Automatic Speech Recognition (ASR) systems, so it has many uses. However, most research in this field focuses on commonly used languages, and low-resource languages tend to be left out of these benefits. Sinhala, which is spoken by over 16 million people, is still considered a low-resource language, as efforts have not been made to do research in this field and make resources public. Although some research has been done on Sinhala text, there are no publicly available corpora for research on Sinhala speech. PolyLingo is an approach to automatically identify the Sinhala language and translate it into other languages. A clean dataset will be built and made publicly available to aid future research in the field of Sinhala speech. A bidirectional Long Short-Term Memory (LSTM) network will be used to automatically identify the language within a short time frame. en_US
dc.subject Speech language identification en_US
dc.subject Natural language processing en_US
dc.subject Sinhala language en_US
dc.subject Bidirectional LSTM en_US
dc.title Polylingo - A Short Utterance Based Automatic Sinhala Language Identification & Translation Tool en_US
dc.type Thesis en_US

