Machine learning approach to predict mental distress of IT workforce in remote working environments

Gamage, Sanduni

Home
→
Dissertations & Thesis
→
MSc Business Analytics
→
2022
→
View Item

Machine learning approach to predict mental distress of IT workforce in remote working environments

Gamage, Sanduni

URI: http://dlib.iit.ac.lk/xmlui/handle/123456789/1472

Date: 2022

Abstract:

Abstract - When considering online workers, due to the emergence of the coronavirus pandemic prevailing in the world, employees have been restricted to work remotely for a prolonged period. All the working arrangements are now based at home than before. Since this has been novel to society, the impact caused by this crisis on people is unknown in the short or long term. Since various factors can cause mental distress among online workers, periodic screening for mental distresses such as anxiety, depression, and stress is necessary for health and well-being. The causes of mental distress are multifactorial. They include socio-demographic, biological, economic, environmental, occupational, and psychological aspects. This paper proposes a concept of a screening system to predict mental distress given the external features associated with individuals, using supervised machine learning approaches and identifying the employees prone to higher risk and referring them early to professional assistance. The study was conducted concerning the circumstances in a pandemic era considering COVID-19 as the case study. The study was done with remote IT workers in Sri Lanka who works as a part of a software development team. 481 professionals participated in the study and were selected based on selection criteria and appropriate encoding techniques were utilized to encode categorical variables where most important 25 features were detected among 60 features using feature selection. Finally, classification techniques such as Random Forest, SVM, XGBoost, CatBoost, decision tree, and Naïve Bayes were used for modeling by which the CatBoost algorithm in overall measures outperformed other algorithms with a predictive accuracy of 97.1%, precision of 97.4%, recall of 99.7%, and f1 measure is 98.5% and ROC/AUC score of the model is 99%. This prediction model is more efficient since it scored higher values for the f1 measure and ROC/AUC score. learning has been done on highly imbalanced data and higher interest is in deriving distressed individuals correctly. Therefore, with regard to the main performance evaluation methods of F1 score and ROC/AUC score CatBoost outperformed other models in both measures.

Show full item record