Digital Repository

AutoCap: Captioning System for Images

dc.contributor.author De Silva, Vathila
dc.date.accessioned 2025-06-17T05:47:33Z
dc.date.available 2025-06-17T05:47:33Z
dc.date.issued 2024
dc.identifier.citation De Silva, Vathila (2024) AutoCap: Captioning System for Images. BSc. Dissertation, Informatics Institute of Technology en_US
dc.identifier.issn 20200765
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/2610
dc.description.abstract "With continuous advances in social technology, image, video, and textual data have grown rapidly. While audio captioning has been implemented effectively, image captioning still requires closer attention to accuracy, particularly in detecting small and overlapping objects and exotic fonts within images. Current image captioning systems (ICS) often overlook these elements, producing captions that are inaccurate and lack detail. Omitting small and overlapping objects and certain text styles from captions reduces the overall quality of a captioning system and risks conveying incorrect information to users. A new approach is proposed to address the problem of detecting small and overlapping objects within an image. The approach incorporates depth estimation into Convolutional Neural Networks (CNNs), with the aim of improving the precision of object detection and thereby generating more accurate and detailed image captions. This methodology aims to bridge the gap in current image captioning technologies and provide a more comprehensive understanding of the content within images. In evaluation, the model achieves 62% accuracy despite being trained on a small dataset. Captions produced using feature extraction, object detection, and depth estimation reach a cosine similarity of 66%. These results demonstrate the effectiveness and reliability of the model." en_US
dc.language.iso en en_US
dc.subject Convolutional Neural Networks (CNN) en_US
dc.subject Depth Estimation en_US
dc.subject Detail Enhancement en_US
dc.title AutoCap: Captioning System for Images en_US
dc.type Thesis en_US

