Digital Repository

Personality Type Identification System using unstructured text in English based on Natural Language Processing and Machine Learning

dc.contributor.author De Seram, Edirisooriya Mohottige Pamodya Jayangani
dc.date.accessioned 2024-06-04T09:06:05Z
dc.date.available 2024-06-04T09:06:05Z
dc.date.issued 2023
dc.identifier.citation De Seram, Edirisooriya Mohottige Pamodya Jayangani (2023) Personality Type Identification System using unstructured text in English based on Natural Language Processing and Machine Learning. MSc. Dissertation, Informatics Institute of Technology en_US
dc.identifier.issn 20211035
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/2186
dc.description.abstract "Personality type identification helps people understand those around them. Its uses include understanding a life partner in order to build a successful marriage, selecting the most suitable candidates for a company, and understanding and exploring one's own capacities. Personality is a combination of a person’s behavior, feelings, motivation, and thought patterns. These characteristics take years to understand and identify in a person. A personality type identification system was proposed to speed up this process. In the study, the author identified that existing systems have a gap in identifying personality from unstructured text. To provide fast and accurate information, the author chose Natural Language Processing and Machine Learning techniques. The author used the Decision Tree algorithm, the K-Nearest Neighbours algorithm, Support Vector Machine, the Naive Bayes algorithm, the Logistic Regression algorithm, the Random Forest algorithm, the XGBoost model, and the LightGBM model. Considering the dataset analysis, the algorithms’ accuracy, and the evaluation metrics, the author developed a Voting classifier ensemble model. To improve accuracy, the author balanced the dataset. Had the original dataset been balanced, the author would have been able to implement a more accurate model." en_US
dc.language.iso en en_US
dc.subject Personality Type Identification System en_US
dc.subject Natural Language Processing en_US
dc.subject Machine Learning en_US
dc.title Personality Type Identification System using unstructured text in English based on Natural Language Processing and Machine Learning en_US
dc.type Thesis en_US
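
The abstract mentions two techniques that a short example may make concrete: balancing the dataset and combining several classifiers in a voting ensemble. The sketch below is illustrative only and is not the thesis's implementation. It assumes random oversampling as the balancing method and hard (majority) voting as the ensemble rule, neither of which the abstract specifies, and the function names `oversample` and `hard_vote` as well as the labels are hypothetical.

```python
import random
from collections import Counter


def oversample(texts, labels, seed=0):
    """Balance classes by randomly duplicating minority-class samples.

    Assumption: random oversampling; the abstract does not say which
    balancing method was used.
    """
    rng = random.Random(seed)
    by_label = {}
    for text, label in zip(texts, labels):
        by_label.setdefault(label, []).append(text)
    # Every class is grown to the size of the largest class.
    target = max(len(items) for items in by_label.values())
    out_texts, out_labels = [], []
    for label, items in by_label.items():
        extra = [rng.choice(items) for _ in range(target - len(items))]
        for text in items + extra:
            out_texts.append(text)
            out_labels.append(label)
    return out_texts, out_labels


def hard_vote(predictions_per_model):
    """Return the majority label per sample across several models.

    predictions_per_model holds one list of predicted labels per model,
    all of the same length. Ties go to the label that appears first
    among the models' votes for that sample.
    """
    n_samples = len(predictions_per_model[0])
    voted = []
    for i in range(n_samples):
        votes = [preds[i] for preds in predictions_per_model]
        voted.append(Counter(votes).most_common(1)[0][0])
    return voted


# Hypothetical predictions from three base models for three texts.
model_predictions = [
    ["type_a", "type_b", "type_a"],
    ["type_a", "type_b", "type_b"],
    ["type_b", "type_b", "type_a"],
]
ensemble = hard_vote(model_predictions)
```

In practice the per-model predictions would come from trained classifiers such as those listed in the abstract; here they are hard-coded so the voting step can be read in isolation.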

