Digital Repository

Topic modeling approach for the automation of candidate recruitment process in sinhala natural language

Show simple item record

dc.contributor.author D.D, Weerasinghe,
dc.date.accessioned 2023-08-03T05:01:13Z
dc.date.available 2023-08-03T05:01:13Z
dc.date.issued 2020
dc.identifier.citation Weerasinghe, D.D (2021) Topic modeling approach for the automation of candidate recruitment process in sinhala natural language. BEng. Dissertation, Informatics Institute of Technology en_US
dc.identifier.issn 2016262
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/1597
dc.description.abstract Word shortlisting simply refers to shortening the list. Shortlisting of curriculum vitaes is very time consuming and abstract work. There are number of software tools that can detect and shortlist resumes. Almost all of these tools are available for English language, but similar tools for Sinhala language is not yet available. There are many attempts of developing language dependent shortlisting tools for languages like Hindi, Chinese, French, Malayalam, Arabic, and English. Most of these tools outperforms the available language independent commercial shortlisting tools as well. Sinhala language being similar to these languages and also being the official language of Sri Lanka along with Tamil, the need of a comprehensive tool to reduce the time for the recruitment procedure is a timely need. Due to the complexity of the language itself the available language independent tools produces very poor results. This research’s main objective is to address the need of a classification tool for Sinhala language to detect and identify similarity of words and rank them according to the weight. A novel algorithm has been developed to detect words correctly and to preprocess then rank then corerctly. The proposed system mainly consists of two stages as text pre-processing and classifying (ranking). Sinhala language resources used in this project were taken from the Language Technology Research Laboratory of University of Colombo and Natural Language Processing Research group of University of Kelaniya. Testing and validations has been carried out by collecting random text samples govenement organizations. en_US
dc.language.iso en en_US
dc.publisher IIT en_US
dc.subject Sinhala, Natural language Processing, en_US
dc.subject text pre-processing Publications, en_US
dc.title Topic modeling approach for the automation of candidate recruitment process in sinhala natural language en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search


Advanced Search

Browse

My Account