| dc.description.abstract |
As children's exposure to digital content grows, overstimulating media can tax their sensory abilities, producing cognitive overload and emotional distress, especially in children with sensory processing sensitivities. Manual annotation of overstimulating content is labour-intensive, inherently subjective, and does not scale to large video datasets. The SafeView project establishes an automated detection system for sensory overstimulation in videos, based on audio-visual characteristics that impose high cognitive demand, such as rapid scene transitions, loud effects, and multiple overlapping auditory elements, in order to support child-friendly media usage.
The SafeView system employs a machine learning pipeline to detect sensory overstimulation, utilizing a Random Forest model trained on a diverse set of audio-visual features extracted from 6-minute video segments. Audio features were derived using signal processing techniques, including sound classification with YAMNet to identify elements such as speech, music, and sound effects, and spectral analysis with Librosa. Visual features, including scene-change frequency and visual complexity, were extracted through computer vision methods using OpenCV for scene detection and frame analysis. Candidate models, including the Random Forest, were optimised with GroupKFold cross-validation to ensure robustness across diverse video content, enabling a scalable and automated approach to overstimulation detection.
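The evaluation scheme described here, GroupKFold cross-validation of a Random Forest, can be illustrated with a minimal scikit-learn sketch. This is not the authors' code: the feature columns, group assignment, and synthetic target below are assumptions for demonstration only. The key point is that grouping segments by their source video prevents segments of the same video from appearing in both the training and test folds.

```python
# Illustrative sketch (not the SafeView implementation): GroupKFold
# cross-validation of a Random Forest on synthetic audio-visual features.
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import GroupKFold

rng = np.random.default_rng(0)
n_segments = 120
# Hypothetical feature columns, e.g. scene-change rate, loudness, speech ratio.
X = rng.normal(size=(n_segments, 3))
# Hypothetical continuous overstimulation score as the regression target.
y = X @ np.array([0.6, 0.3, 0.1]) + rng.normal(scale=0.1, size=n_segments)
# Each group id stands in for one source video (4 segments per video here),
# so no video contributes to both the train and test side of a fold.
groups = np.repeat(np.arange(30), 4)

fold_scores = []
for train_idx, test_idx in GroupKFold(n_splits=5).split(X, y, groups):
    model = RandomForestRegressor(n_estimators=100, random_state=0)
    model.fit(X[train_idx], y[train_idx])
    fold_scores.append(model.score(X[test_idx], y[test_idx]))

print(f"mean R^2 across folds: {np.mean(fold_scores):.2f}")
```

Averaging the per-fold scores gives a robustness estimate across videos rather than across arbitrary segments, which is the motivation for grouped splitting in this setting.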
The Random Forest achieved a Quadratic Weighted Kappa (QWK) of 0.86, an RMSE of 0.62, and a Spearman's rank correlation coefficient of 0.82 for predicting overstimulation scores in SafeView. This performance surpassed both the SVR (QWK: 0.65, RMSE: 0.85) and XGBoost (QWK: 0.84, RMSE: 0.65) models. |
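The three reported metrics can be computed with standard libraries; the sketch below uses scikit-learn and SciPy on made-up ordinal labels (the scores shown are illustrative, not SafeView's data).

```python
# Illustrative sketch: computing QWK, RMSE, and Spearman's rho with
# scikit-learn and SciPy. Label values are invented for demonstration.
import numpy as np
from scipy.stats import spearmanr
from sklearn.metrics import cohen_kappa_score, mean_squared_error

y_true = np.array([0, 1, 2, 3, 4, 2, 3, 1])  # hypothetical annotated scores
y_pred = np.array([0, 1, 2, 4, 4, 2, 2, 1])  # hypothetical rounded predictions

# Quadratic Weighted Kappa: agreement on ordinal labels, penalising
# large disagreements quadratically more than near-misses.
qwk = cohen_kappa_score(y_true, y_pred, weights="quadratic")
# RMSE: typical magnitude of the prediction error.
rmse = np.sqrt(mean_squared_error(y_true, y_pred))
# Spearman's rank correlation: monotonic agreement between rankings.
rho, _ = spearmanr(y_true, y_pred)

print(f"QWK={qwk:.2f}  RMSE={rmse:.2f}  rho={rho:.2f}")
```

QWK suits this task because overstimulation scores are ordinal: predicting 4 when the label is 1 should cost more than predicting 2, which plain accuracy ignores.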
en_US |