Digital Repository

A Survey on Image Captioning Using Object Detection and NLP

Show simple item record

dc.contributor.author De Silva, Vathila
dc.contributor.author Sumanathilaka, T.G.D.K.
dc.date.accessioned 2025-04-21T08:15:24Z
dc.date.available 2025-04-21T08:15:24Z
dc.date.issued 2024
dc.identifier.citation De Silva, V. and Sumanathilaka, T.G.D.K. (2024) ‘A Survey on Image Captioning Using Object Detection and NLP’, in 2024 4th International Conference on Advanced Research in Computing (ICARC). 2024 4th International Conference on Advanced Research in Computing (ICARC), pp. 270–275. Available at: https://doi.org/10.1109/ICARC61713.2024.10499755. en_US
dc.identifier.uri http://dlib.iit.ac.lk/xmlui/handle/123456789/2250
dc.description.abstract Recent years have seen the emergence of image captioning as a revolutionary technical development that seamlessly blends computer vision and natural language processing. The integration of various domains facilitates the production of captivating captions for pictures, promoting a more profound comprehension of visual material. This study offers a thorough analysis of the rapidly developing field of image captioning, examining its uses in various settings such as social media, online platforms, assistive technology, and content indexing. It emphasizes the critical importance of advanced techniques like You Only Look Once (YOLO) for accurate item detection and Natural Language Processing (NLP) for creating subtle captions. While NLP gives the resulting text a layer of contextual depth, YOLO guarantees precise object detection, which helps with caption accuracy. Using these cutting-edge methods boosts the overall efficacy of image captioning systems and represents a noteworthy trend in the literature. Examining the evolution of image captioning, the review paper highlights how important it is for multimodal comprehension. Image captioning is a transformative tool that enhances the readability of visual content and influences contemporary digital experiences in various contexts. The development of image captioning is summarized in this abstract, which also highlights the tendency to use cutting-edge techniques for more accurate and nuanced caption production. en_US
dc.language.iso en en_US
dc.publisher IEEE en_US
dc.subject Computer vision en_US
dc.subject Natural Language Processing en_US
dc.subject Object Detection en_US
dc.title A Survey on Image Captioning Using Object Detection and NLP en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search


Advanced Search

Browse

My Account