Digital Repository

Risk Predict - A Machine Learning Pipeline For Localized Risk Prediction In Health Insurance

dc.contributor.author Welihinda, Madhusha
dc.date.accessioned 2023-01-13T10:05:53Z
dc.date.available 2023-01-13T10:05:53Z
dc.date.issued 2022
dc.identifier.citation Welihinda, Madhusha (2022) Risk Predict - A Machine Learning Pipeline For Localized Risk Prediction In Health Insurance. MSc. Dissertation, Informatics Institute of Technology en_US
dc.identifier.issn 2017246
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/1419
dc.description.abstract "In Sri Lanka, one of the main challenges faced by health insurance providers is identifying the future risk of policyholders in order to set competitive and affordable premiums. Risk differs from person to person, and careful identification of risk is a crucial part of the underwriting process. With the growth of insurance data and the use of machine learning algorithms, the underwriting process can be further improved through faster data processing and risk identification. This research aims to provide a framework coupled with a machine learning pipeline to predict the health insurance risk amount accurately. A real-world dataset containing five years of claims data was used to conduct the analysis. The research was carried out by training seven classification models and four regression models to develop a machine learning pipeline that first classifies risk and then predicts the risk amount. The experimental results showed that the proposed machine learning pipeline obtained acceptable results and that ensemble algorithms work well on health insurance data compared to other machine learning algorithms. In the classification part of the study, Random Forest and XGBoost achieved the best accuracies. The regression part of the study reveals that XGBoost performs well in predicting amounts for low-risk policyholders, and Random Forest generates better results for normal, high, and bad risk data after applying data augmentation techniques." en_US
dc.language.iso en en_US
dc.subject Health insurance risk en_US
dc.subject Underwriting process en_US
dc.subject Machine Learning en_US
dc.title Risk Predict - A Machine Learning Pipeline For Localized Risk Prediction In Health Insurance en_US
dc.type Thesis en_US

