Digital Repository

Multi - accent speech recognition for Tamil English mixed language system

Show simple item record

dc.contributor.author Mathusagar, Ragavan
dc.date.accessioned 2022-03-21T08:22:21Z
dc.date.available 2022-03-21T08:22:21Z
dc.date.issued 2021
dc.identifier.citation Mathusagar, Ragavan (2021) Multi - accent speech recognition for Tamil English mixed language system. MSc. Dissertation Informatics Institute of Technology en_US
dc.identifier.issn 20191292
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/1060
dc.description.abstract " Speech recognition has been a hot topic as the intelligent era has developed. Even though numerous automated speech recognition (ASR) programs have been available, a significant number of them do not support Tamil with full features. And when it comes to practical usability, just supporting only Tamil language words will not be an option since the English words are often used in the day-to-day conversation.But still considering a mixed language scenario will not totally satisfy the scenario without considering the different accent in language. So that the research has been done by considering both Tamil-English mixed language and accented speech. Data has been collected from two group of people with different Tamil accent (Sri Lankan Tamil , Indian Tamil) and the ASR models have been created using the open sourcetool “Kaldi”.“Mel Frequency Cepstral Coefficient”(MFCC) has been used for feature extracting and Hidden Markov Model (HMM) and Gaussian Mixture Model (GMM) based monophone and triphone models were created and tested. From the triphone model, the best model has been selected and used for the hybrid model creation by replacing the GMM with neural network. Model accuracy has been compared based on the WER and SER value for each model and also benchmarked with the previous systems. The results showedaccuracy improvement for the hybrid modelcompared to triphone models. Key words :Automatic speech recognition, Neural network , Acoustic models, Kaldi, Mixed language ASR" en_US
dc.language.iso en en_US
dc.title Multi - accent speech recognition for Tamil English mixed language system en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search


Advanced Search

Browse

My Account